
An inhomogeneous Dirichlet theorem via shrinking targets

Published online by Cambridge University Press:  25 June 2019

Dmitry Kleinbock
Affiliation:
Brandeis University, Waltham, MA 02454-9110, USA
Nick Wadleigh
Affiliation:
Brandeis University, Waltham, MA 02454-9110, USA

Abstract

We give an integrability criterion on a real-valued non-increasing function $\unicode[STIX]{x1D713}$ guaranteeing that for almost all (or almost no) pairs $(A,\mathbf{b})$, where $A$ is a real $m\times n$ matrix and $\mathbf{b}\in \mathbb{R}^{m}$, the system

$$\begin{eqnarray}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert ^{m}<\unicode[STIX]{x1D713}(T),\quad \Vert \mathbf{q}\Vert ^{n}<T,\end{eqnarray}$$
is solvable in $\mathbf{p}\in \mathbb{Z}^{m}$, $\mathbf{q}\in \mathbb{Z}^{n}$ for all sufficiently large $T$. The proof consists of a reduction to a shrinking target problem on the space of grids in $\mathbb{R}^{m+n}$. We also comment on the homogeneous counterpart to this problem, whose $m=n=1$ case was recently solved, but whose general case remains open.

Type
Research Article
Copyright
© The Authors 2019 

1 Introduction and motivation

1.1 Homogeneous Diophantine approximation

Fix positive integers $m,n$ . Let $M_{m,n}$ denote the space of real $m\times n$ matrices. The starting point for the present paper is the following theorem, proved by Dirichlet in 1842.

Theorem 1.1 (Dirichlet’s theorem).

For any $A\in M_{m,n}$ and $T>1$ , there exist $\mathbf{p}\in \mathbb{Z}^{m}$ , $\mathbf{q}\in \mathbb{Z}^{n}\smallsetminus \{0\}$ such that

(1.1) $$\begin{eqnarray}\Vert A\mathbf{q}-\mathbf{p}\Vert ^{m}\leqslant \frac{1}{T}\quad \text{and}\quad \Vert \mathbf{q}\Vert ^{n}<T.\end{eqnarray}$$

Here and hereafter $\Vert \cdot \Vert$ stands for the supremum norm on $\mathbb{R}^{k}$, $k\in \mathbb{N}$. Informally speaking, a matrix $A$ represents a vector-valued function $\mathbf{q}\mapsto A\mathbf{q}$, and the above theorem asserts that one can choose a not-so-large non-zero integer vector $\mathbf{q}$ so that the output of that function is close to an integer vector. In the case $m=n=1$ the theorem just asserts that for any real number $\unicode[STIX]{x1D6FC}$ and $T>1$, one of the first $T$ multiples of $\unicode[STIX]{x1D6FC}$ lies within $1/T$ of an integer. Theorem 1.1 is the archetypal uniform Diophantine approximation result, so called because it guarantees a non-trivial integer solution for all $T$. A weaker form of approximation (sometimes called asymptotic approximation; see, for example, [Wal12, KL18]) guarantees that such a system is solvable for an unbounded set of $T$. For instance, Theorem 1.1 implies that (1.1) is solvable for an unbounded set of $T$, a fortiori. The following corollary, which follows trivially from this weaker statement, is the archetypal asymptotic result.

Corollary 1.2. For any $A\in M_{m,n}$ there exist infinitely many $\mathbf{q}\in \mathbb{Z}^{n}$ such that

(1.2) $$\begin{eqnarray}\Vert A\mathbf{q}-\mathbf{p}\Vert ^{m}<\frac{1}{\Vert \mathbf{q}\Vert ^{n}}\quad \text{for some}~\mathbf{p}\in \mathbb{Z}^{m}.\end{eqnarray}$$
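For $m=n=1$ the statement of Theorem 1.1 can be checked by brute force. The following Python sketch is only an illustration: the function names `dist_to_nearest_int` and `dirichlet_witness` are ours, $T$ is restricted to integers, and $\unicode[STIX]{x1D6FC}$ is taken rational so that exact arithmetic can be used. It searches for an integer $q$ with $1\leqslant q<T$ and $|\unicode[STIX]{x1D6FC}q-p|\leqslant 1/T$ for some integer $p$.

```python
from fractions import Fraction


def dist_to_nearest_int(x):
    """Distance from the rational number x to the nearest integer."""
    return abs(x - round(x))


def dirichlet_witness(alpha, T):
    """Return the least q with 1 <= q < T and dist(alpha*q, Z) <= 1/T, or None.

    alpha is a Fraction and T an integer > 1 (assumptions of this sketch);
    Theorem 1.1 with m = n = 1 guarantees that such a q exists.
    """
    for q in range(1, T):
        if dist_to_nearest_int(alpha * q) <= Fraction(1, T):
            return q
    return None


if __name__ == "__main__":
    alpha = Fraction(355, 113)  # hypothetical sample value
    for T in (2, 5, 10, 50):
        print(T, dirichlet_witness(alpha, T))
```

The case $\unicode[STIX]{x1D6FC}=1/2$, $T=2$ shows why the first inequality in (1.1) is not strict: the only admissible $q$ is $1$, and the distance from $1/2$ to the nearest integer is exactly $1/T$.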

Together the aforementioned results initiate the metric theory of Diophantine approximation, a field concerned with understanding sets of $A\in M_{m,n}$ which admit improvements to Theorem 1.1 and Corollary 1.2. This paper has been motivated by an observation that the sensible ‘first questions’ about the asymptotic set-up were settled long ago, while the analogous questions about the uniform set-up remain open. Let us start by reviewing what is known in the asymptotic set-up.

For a function $\unicode[STIX]{x1D713}:\mathbb{R}_{+}\rightarrow \mathbb{R}_{+}$, let us define $W_{m,n}(\unicode[STIX]{x1D713})$, the set of $\unicode[STIX]{x1D713}$-approximable matrices, to be the set of $A\in M_{m,n}$ for which there exist infinitely many $\mathbf{q}\in \mathbb{Z}^{n}$ such that

(1.3) $$\begin{eqnarray}\Vert A\mathbf{q}-\mathbf{p}\Vert ^{m}\leqslant \unicode[STIX]{x1D713}(\Vert \mathbf{q}\Vert ^{n})\quad \text{for some}~\mathbf{p}\in \mathbb{Z}^{m}.\end{eqnarray}$$

Throughout the paper we use the notation $\unicode[STIX]{x1D713}_{a}(x):=x^{-a}$ . Thus Corollary 1.2 asserts that $W_{m,n}(\unicode[STIX]{x1D713}_{1})=M_{m,n}$ , and in the above definition we have simply replaced $\unicode[STIX]{x1D713}_{1}(\Vert \mathbf{q}\Vert ^{n})$ in (1.2) with $\unicode[STIX]{x1D713}(\Vert \mathbf{q}\Vert ^{n})$ . Precise conditions for the Lebesgue measure of $W_{m,n}(\unicode[STIX]{x1D713})$ to be zero or full are given in the following theorem.

Theorem 1.3 (Khintchine–Groshev theorem [Gro38]).

Given a non-increasing $\unicode[STIX]{x1D713}$, the set $W_{m,n}(\unicode[STIX]{x1D713})$ has zero (respectively, full) measure if and only if the series $\sum _{k}\unicode[STIX]{x1D713}(k)$ converges (respectively, diverges).

See [Spr79] or [BDV06] for details, and also [KM99] for an alternative proof using dynamics on the space of lattices.

Questions related to similarly improving Theorem 1.1 were first addressed in two seminal papers [DS70, DS69] by Davenport and Schmidt. However, no zero–one law analogous to Theorem 1.3 has yet been proved in the set-up of uniform approximation for general $m,n\in \mathbb{N}$. Let us introduce the following definition: for a non-increasing function $\unicode[STIX]{x1D713}:[T_{0},\infty )\rightarrow \mathbb{R}_{+}$, where $T_{0}>1$ is fixed, say that $A\in M_{m,n}$ is $\unicode[STIX]{x1D713}$-Dirichlet, or $A\in D_{m,n}(\unicode[STIX]{x1D713})$, if the system

(1.4) $$\begin{eqnarray}\Vert A\mathbf{q}-\mathbf{p}\Vert ^{m}<\unicode[STIX]{x1D713}(T)\quad \text{and}\quad \Vert \mathbf{q}\Vert ^{n}<T\end{eqnarray}$$

has a non-trivial integer solution for all large enough  $T$ . In other words, we have replaced $\unicode[STIX]{x1D713}_{1}(T)$ in (1.1) with $\unicode[STIX]{x1D713}(T)$ , demanded the existence of non-trivial integer solutions for all $T$ except those belonging to a bounded set, and sharpened one of the inequalities in (1.1). The latter change, in particular, implies the following observation: for non-increasing  $\unicode[STIX]{x1D713}$ , membership in $D_{m,n}(\unicode[STIX]{x1D713})$ depends only on the solvability of the system (1.4) at integer values of  $T$ . (To show this it suffices to replace $T$ with $\lceil T\rceil$ and use the monotonicity of  $\unicode[STIX]{x1D713}$ .)
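To spell out this step (a short verification, using only that $\Vert \mathbf{q}\Vert ^{n}$ is an integer and that $\unicode[STIX]{x1D713}$ is non-increasing): suppose $(\mathbf{p},\mathbf{q})$ is a non-trivial solution of (1.4) with $T$ replaced by the integer $\lceil T\rceil$. Then

$$\begin{eqnarray}\Vert A\mathbf{q}-\mathbf{p}\Vert ^{m}<\unicode[STIX]{x1D713}(\lceil T\rceil )\leqslant \unicode[STIX]{x1D713}(T)\quad \text{and}\quad \Vert \mathbf{q}\Vert ^{n}\leqslant \lceil T\rceil -1<T,\end{eqnarray}$$

so the same $(\mathbf{p},\mathbf{q})$ solves (1.4) at $T$. The second estimate is where the strict inequality $\Vert \mathbf{q}\Vert ^{n}<T$ is used.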

It is not difficult to see that $D_{1,1}(\unicode[STIX]{x1D713}_{1})=\mathbb{R}$, and that for general $m,n$, almost every matrix is $\unicode[STIX]{x1D713}_{1}$-Dirichlet. In contrast, it was proved in [DS69] for $\min (m,n)=1$, and in [KW08] for the general case, that for any $c<1$, the set $D_{m,n}(c\unicode[STIX]{x1D713}_{1})$ of $c\unicode[STIX]{x1D713}_{1}$-Dirichlet matrices has Lebesgue measure zero. This naturally motivates the following question.

Question 1.4. What is a necessary and sufficient condition on a non-increasing function $\unicode[STIX]{x1D713}$ (presumably expressed in the form of convergence/divergence of a certain series) guaranteeing that the set $D_{m,n}(\unicode[STIX]{x1D713})$ has zero or full measure?

In [KW18] we give an answer to this question for $m=n=1$, but in general Question 1.4 seems to be much harder than its counterpart for the sets $W_{m,n}(\unicode[STIX]{x1D713})$, answered by Theorem 1.3. We comment later in the paper on the reason for this difficulty, but the main subject of this paper is different: we take up an analogous inhomogeneous approximation problem, describe the analogues of the statements and concepts discussed in this section, and then show how an inhomogeneous analogue of Question 1.4 admits a complete solution based on a correspondence between Diophantine approximation and dynamics on homogeneous spaces.

1.2 Inhomogeneous approximation: the main result

The theory of inhomogeneous Diophantine approximation starts when one replaces the values of a system of linear forms $A\mathbf{q}$ by those of a system of affine forms $\mathbf{q}\mapsto A\mathbf{q}+\mathbf{b}$ , where $A\in M_{m,n}$ and $\mathbf{b}\in \mathbb{R}^{m}$ . Consider a non-increasing function $\unicode[STIX]{x1D713}:[T_{0},\infty )\rightarrow \mathbb{R}_{+}$ and, following the definition of the set $D_{m,n}(\unicode[STIX]{x1D713})$ , let us say that a pair $(A,\mathbf{b})\in M_{m,n}\times \mathbb{R}^{m}$ is $\unicode[STIX]{x1D713}$ -Dirichlet if there exist $\mathbf{p}\in \mathbb{Z}^{m}$ , $\mathbf{q}\in \mathbb{Z}^{n}$ such that

(1.5) $$\begin{eqnarray}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert ^{m}<\unicode[STIX]{x1D713}(T),\quad \Vert \mathbf{q}\Vert ^{n}<T,\end{eqnarray}$$

whenever $T$ is large enough. (Note that in this set-up there is no need to single out the case $\mathbf{q}=0$ .) Denote the set of $\unicode[STIX]{x1D713}$ -Dirichlet pairs by $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ . Note that, as is the case with $D_{m,n}(\unicode[STIX]{x1D713})$ , membership in $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ depends only on the solubility of these inequalities at integer values of  $T$ , provided $\unicode[STIX]{x1D713}$ is non-increasing. Hence without loss of generality one can assume $\unicode[STIX]{x1D713}$ to be continuous.

Let us start with the simplest case: $\unicode[STIX]{x1D713}\equiv c$ is a constant function, or $\unicode[STIX]{x1D713}=c\unicode[STIX]{x1D713}_{0}$ in our notation. It is a trivial consequence of Dirichlet’s theorem that whenever $c>0$ ,

$$\begin{eqnarray}\Vert A\mathbf{q}-\mathbf{p}\Vert ^{m}<c,\quad \Vert \mathbf{q}\Vert ^{n}<T,\end{eqnarray}$$

is solvable in $\mathbf{p}\in \mathbb{Z}^{m}$ , $\mathbf{q}\in \mathbb{Z}^{n}\smallsetminus \{0\}$ whenever $T>c^{-1}$ . By contrast, it is clear that one cannot always solve

$$\begin{eqnarray}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert ^{m}<c,\quad \Vert \mathbf{q}\Vert ^{n}<T,\end{eqnarray}$$

for $c\leqslant 1/2^{m}$; for example, take $A$ to be an integer matrix and take $\mathbf{b}$ with coordinates in $\mathbb{Z}+\frac{1}{2}$. However, it follows from Kronecker’s theorem [Cas57, § 3.5] that, for a given $A\in M_{m,n}$, there exist $\mathbf{b}\in \mathbb{R}^{m}$ and $c>0$ such that $(A,\mathbf{b})\notin \widehat{D}_{m,n}(c\unicode[STIX]{x1D713}_{0})$, which amounts to saying that $A\mathbb{Z}^{n}$ is not dense in $\mathbb{R}^{m}/\mathbb{Z}^{m}$, only if $A^{t}(\mathbb{Z}^{m}\smallsetminus \{0\})$ contains an integer vector. The set of such $A$ has measure zero since it is the countable union over $\mathbf{q}\in \mathbb{Z}^{n}$, $\mathbf{p}\in \mathbb{Z}^{m}\smallsetminus \{0\}$ of the sets $\{A:A^{t}\mathbf{p}=\mathbf{q}\}$, each of which is a proper affine subspace of $M_{m,n}$. Thus for every $c>0$, $\widehat{D}_{m,n}(c\unicode[STIX]{x1D713}_{0})$ has full measure.
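The half-integer counterexample above can be verified in exact arithmetic. The sketch below is illustrative only: the matrix `A`, the vector `b` and the helper `min_sup_dist` are our own choices, with $m=n=2$. For each $\mathbf{q}$ in a small box it computes $\min _{\mathbf{p}}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert ^{m}$ and collects the values that occur.

```python
from fractions import Fraction
from itertools import product


def min_sup_dist(A, b, q):
    """Minimum over integer p of the sup norm of A q + b - p (exact arithmetic)."""
    coords = [sum(a * x for a, x in zip(row, q)) + bi for row, bi in zip(A, b)]
    # The best p rounds every coordinate separately.
    return max(abs(c - round(c)) for c in coords)


A = [[1, 2], [3, -1]]                  # hypothetical integer matrix, m = n = 2
b = [Fraction(1, 2), Fraction(-3, 2)]  # coordinates in Z + 1/2
m = len(A)

# Every coordinate of A q + b is a half-integer, so the distance is always 1/2.
values = {min_sup_dist(A, b, q) ** m for q in product(range(-5, 6), repeat=2)}
print(values)  # the only value that occurs is 1/2^m
```

Since the value $1/2^{m}$ is attained for every $\mathbf{q}$, the strict inequality with $c\leqslant 1/2^{m}$ has no solution for any $T$.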

Once $\unicode[STIX]{x1D713}$ is allowed to decay to zero, the sets $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ become smaller. In particular, using dynamics on the space of grids in $\mathbb{R}^{m+n}$ , one can easily prove (see Proposition 2.3 below) that $\widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{1})$ is null for any $C>0$ . Thus one can naturally ask the following inhomogeneous analogue of Question 1.4.

Question 1.5. What is a necessary and sufficient condition on a non-increasing function $\unicode[STIX]{x1D713}$ (presumably expressed in the form of convergence/divergence of a certain series) guaranteeing that the set $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ has zero or full measure?

The remainder of this work will be given to a proof of the following answer.

Theorem 1.6. Given a non-increasing $\unicode[STIX]{x1D713}$ , the set $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ has zero (respectively, full) measure if and only if the series

(1.6) $$\begin{eqnarray}\mathop{\sum }_{j}\frac{1}{\unicode[STIX]{x1D713}(j)j^{2}}\end{eqnarray}$$

diverges (respectively, converges).

Note that this immediately gives results such as:

  • $\widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{a})$ has zero (respectively, full) measure if $a\geqslant 1$ (respectively, $a<1$ );

  • for $\unicode[STIX]{x1D713}(T)=C(\log T)^{b}\unicode[STIX]{x1D713}_{1}(T)$ , $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ has zero (respectively, full) measure if $b\leqslant 1$ (respectively, $b>1$ ).
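Both items follow from the integral test applied to (1.6). As a sketch (the lower limit $j=2$ is chosen only to keep $\log j$ positive), for $\unicode[STIX]{x1D713}=C\unicode[STIX]{x1D713}_{a}$ one has

$$\begin{eqnarray}\mathop{\sum }_{j}\frac{1}{C\unicode[STIX]{x1D713}_{a}(j)j^{2}}=\frac{1}{C}\mathop{\sum }_{j}j^{a-2},\end{eqnarray}$$

which diverges if and only if $a-2\geqslant -1$, that is, $a\geqslant 1$. For $\unicode[STIX]{x1D713}(T)=C(\log T)^{b}\unicode[STIX]{x1D713}_{1}(T)$ one has

$$\begin{eqnarray}\mathop{\sum }_{j\geqslant 2}\frac{1}{\unicode[STIX]{x1D713}(j)j^{2}}=\frac{1}{C}\mathop{\sum }_{j\geqslant 2}\frac{1}{j(\log j)^{b}},\end{eqnarray}$$

which diverges if and only if $b\leqslant 1$.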

Our argument is based on a correspondence between Diophantine approximation and homogeneous dynamics. In the next section we introduce the space of grids in $\mathbb{R}^{m+n}$ and reduce the aforementioned inhomogeneous approximation problem to a shrinking target phenomenon for a flow on that space. We present a warm-up problem, Proposition 2.3, that demonstrates the usefulness of the reduction to dynamics and introduces several key ideas to be used later. This is followed by the statement of the main dynamical result, Theorem 3.6, which we prove in the two subsequent sections. The last section contains some concluding remarks, in particular a discussion of Question 1.4 and other open questions.

2 Dynamics on the space of grids: a warm-up

Fix $k\in \mathbb{N}$ and let

$$\begin{eqnarray}G_{k}:=\operatorname{SL}_{k}(\mathbb{R})\quad \text{and}\quad \widehat{G}_{k}:=\operatorname{ASL}_{k}(\mathbb{R})=G_{k}\rtimes \mathbb{R}^{k};\end{eqnarray}$$

the latter is the group of volume-preserving affine transformations of $\mathbb{R}^{k}$ . Also put

$$\begin{eqnarray}\unicode[STIX]{x1D6E4}_{k}:=\operatorname{SL}_{k}(\mathbb{Z})\quad \text{and}\quad \widehat{\unicode[STIX]{x1D6E4}}_{k}:=\operatorname{ASL}_{k}(\mathbb{Z})=\unicode[STIX]{x1D6E4}_{k}\rtimes \mathbb{Z}^{k}.\end{eqnarray}$$

Elements of $\widehat{G}_{k}$ will be denoted by $\langle g,\mathbf{w}\rangle$ where $g\in G_{k}$ and $\mathbf{w}\in \mathbb{R}^{k}$ ; that is, $\langle g,\mathbf{w}\rangle$ is the affine transformation $\mathbf{x}\mapsto g\mathbf{x}+\mathbf{w}$ . Denote by $\widehat{X}_{k}$ the space of translates of unimodular lattices in  $\mathbb{R}^{k}$ ; elements of $\widehat{X}_{k}$ will be referred to as unimodular grids. Clearly $\widehat{X}_{k}$ is canonically identified with $\widehat{G}_{k}/\widehat{\unicode[STIX]{x1D6E4}}_{k}$ via

$$\begin{eqnarray}\langle g,\mathbf{w}\rangle \widehat{\unicode[STIX]{x1D6E4}}_{k}\in \widehat{G}_{k}/\widehat{\unicode[STIX]{x1D6E4}}_{k}\quad \longleftrightarrow \quad g\mathbb{Z}^{k}+\mathbf{w}\in \widehat{X}_{k}.\end{eqnarray}$$

Similarly, $X_{k}:=G_{k}/\unicode[STIX]{x1D6E4}_{k}$ is identified with the space of unimodular lattices in $\mathbb{R}^{k}$ (i.e. unimodular grids containing the zero vector). Note that $\widehat{\unicode[STIX]{x1D6E4}}_{k}$ (respectively, $\unicode[STIX]{x1D6E4}_{k}$) is a lattice in $\widehat{G}_{k}$ (respectively, $G_{k}$). We will denote by $\widehat{\unicode[STIX]{x1D707}}$ and $\unicode[STIX]{x1D707}$ the normalized Haar measures on $\widehat{X}_{k}$ and $X_{k}$, respectively.

Now fix $m,n\in \mathbb{N}$ with $m+n=k$ , and for $t\in \mathbb{R}$ let

(2.1) $$\begin{eqnarray}g_{t}:=\operatorname{diag}(e^{t/m},\ldots ,e^{t/m},e^{-t/n},\ldots ,e^{-t/n}),\end{eqnarray}$$

where there are $m$ copies of $e^{t/m}$ and $n$ copies of $e^{-t/n}$ . The so-called expanding horospherical subgroup of $\widehat{G}_{k}$ with respect to $\{g_{t}:t>0\}$ is given by

(2.2) $$\begin{eqnarray}H:=\{u_{A,\mathbf{b}}:A\in M_{m,n},\boldsymbol{ b}\in \mathbb{ R}^{m}\},\quad \text{where }u_{A,\mathbf{b}}:=\langle \biggl(\begin{array}{@{}cc@{}}I_{m} & A\\ 0 & I_{n}\end{array}\biggr),\biggl(\begin{array}{@{}c@{}}\mathbf{b}\\ 0\end{array}\biggr)\rangle .\end{eqnarray}$$

On the other hand,

(2.3) $$\begin{eqnarray}\tilde{H}:=\bigg\{\langle \biggl(\begin{array}{@{}cc@{}}P & 0\\ R & Q\end{array}\biggr),\biggl(\begin{array}{@{}c@{}}0\\ \boldsymbol{ d}\end{array}\biggr)\rangle \,\biggr|\,\begin{array}{@{}c@{}}P\in M_{m,m},\,Q\in M_{n,n},\,\det (P)\det (Q)=1\\ R\in M_{n,m},\mathbf{d}\in \mathbb{R}^{n}\end{array}\bigg\}\end{eqnarray}$$

is a subgroup of $\widehat{G}_{k}$ complementary to $H$ which is non-expanding with respect to conjugation by $g_{t}$ , $t\geqslant 0$ : it is easy to see that

(2.4) $$\begin{eqnarray}g_{t}\langle \biggl(\begin{array}{@{}cc@{}}P & 0\\ R & Q\end{array}\biggr),\biggl(\begin{array}{@{}c@{}}0\\ \boldsymbol{ d}\end{array}\biggr)\rangle g_{-t}=\langle \biggl(\begin{array}{@{}cc@{}}P & 0\\ e^{-((m+n)/mn)t}R & Q\end{array}\biggr),\biggl(\begin{array}{@{}c@{}}0\\ e^{-t/n}\mathbf{d}\end{array}\biggr)\rangle .\end{eqnarray}$$
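Formula (2.4) can be checked numerically. The sketch below is an illustration under our own conventions: affine maps are stored as a pair (matrix, translation vector), composition is $\langle g,\mathbf{w}\rangle \langle h,\mathbf{v}\rangle =\langle gh,g\mathbf{v}+\mathbf{w}\rangle$, and the names `g_diag` and `conjugate` as well as the sample element of $\tilde{H}$ are hypothetical. Since $g_{t}$ is diagonal, conjugation multiplies the $(i,j)$ entry of the matrix by the ratio of the $i$th and $j$th diagonal entries.

```python
import math


def g_diag(t, m, n):
    """Diagonal entries of g_t from (2.1)."""
    return [math.exp(t / m)] * m + [math.exp(-t / n)] * n


def conjugate(t, M, w, m, n):
    """Return g_t <M, w> g_{-t} = <g_t M g_{-t}, g_t w> for a k x k matrix M."""
    d = g_diag(t, m, n)
    k = m + n
    M_conj = [[d[i] * M[i][j] / d[j] for j in range(k)] for i in range(k)]
    w_conj = [d[i] * w[i] for i in range(k)]
    return M_conj, w_conj


# Sample element of H-tilde with m = 2, n = 1: det(P) * det(Q) = 1.
m, n, t = 2, 1, 1.5
M = [[2.0, 1.0, 0.0],
     [0.0, 0.5, 0.0],
     [3.0, 4.0, 1.0]]   # blocks P (top left), R (bottom left), Q (bottom right)
w = [0.0, 0.0, 5.0]     # translation part (0, d)

M_conj, w_conj = conjugate(t, M, w, m, n)
shrink = math.exp(-(m + n) * t / (m * n))  # factor multiplying R in (2.4)
print(M_conj[2][:2], [3.0 * shrink, 4.0 * shrink])
```

The blocks $P$ and $Q$ are unchanged, while $R$ and $\mathbf{d}$ are contracted for $t\geqslant 0$, which is the non-expanding property used below.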

Let us also denote

(2.5) $$\begin{eqnarray}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}}:=u_{A,\mathbf{b}}\mathbb{Z}^{k}=\bigg\{\biggl(\begin{array}{@{}c@{}}A\mathbf{q}+\mathbf{b}-\mathbf{p}\\ \boldsymbol{ q}\end{array}\biggr):\mathbf{p}\in \mathbb{Z}^{m},\,\mathbf{q}\in \mathbb{Z}^{n}\bigg\}.\end{eqnarray}$$

The reduction of Diophantine properties of $(A,\mathbf{b})$ to the behavior of the $g_{t}$-trajectory of $\unicode[STIX]{x1D6EC}_{A,\mathbf{b}}$ described below mimics the classical Dani correspondence for homogeneous Diophantine approximation [Dan85, KM99] and dates back to [Kle99] (see also more recent papers [Sha11, ET11, GV18]). The crucial role is played by a function $\unicode[STIX]{x1D6E5}:\widehat{X}_{k}\rightarrow [-\infty ,\infty )$ given by

(2.6) $$\begin{eqnarray}\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC}):=\log \inf _{\mathbf{v}\in \unicode[STIX]{x1D6EC}}\Vert \mathbf{v}\Vert .\end{eqnarray}$$

Note that $\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})=-\infty$ if and only if $\unicode[STIX]{x1D6EC}\ni 0$ . Also it is easy to see that $\unicode[STIX]{x1D6E5}$ is uniformly continuous outside of the set where it takes small values.

Lemma 2.1. For any $z\in \mathbb{R}$ , $\unicode[STIX]{x1D6E5}$ is uniformly continuous on the set $\unicode[STIX]{x1D6E5}^{-1}([z,\infty ))$ . That is, for any $z\in \mathbb{R}$ and any $\unicode[STIX]{x1D700}>0$ there exists a neighborhood $U$ of the identity in $\widehat{G}_{k}$ such that whenever $\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})\geqslant z$ and $g\in U$ , one has $|\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})-\unicode[STIX]{x1D6E5}(g\unicode[STIX]{x1D6EC})|<\unicode[STIX]{x1D700}$ .

Proof. Let $c>1$ , $z\in \mathbb{R}$ . Choose $\unicode[STIX]{x1D6FF}>0$ so that

$$\begin{eqnarray}c^{-1}\Vert \mathbf{v}\Vert \leqslant \Vert \mathbf{v}+\mathbf{w}\Vert \leqslant c\Vert \mathbf{v}\Vert\end{eqnarray}$$

whenever $\Vert \mathbf{w}\Vert \leqslant \unicode[STIX]{x1D6FF}$ and $\log \Vert \mathbf{v}\Vert \geqslant z-\log c$. Then if $\log \Vert \mathbf{v}\Vert \geqslant z$, $\Vert \mathbf{w}\Vert <\unicode[STIX]{x1D6FF}$ and the operator norms of both $g$ and $g^{-1}$ are not greater than $c$ (the latter two conditions define an open neighborhood $U$ of the identity in $\widehat{G}_{k}$ such that $\langle g,\mathbf{w}\rangle \in U$), we have

$$\begin{eqnarray}\frac{\Vert \mathbf{v}\Vert }{c^{2}}\leqslant \frac{\Vert g\mathbf{v}\Vert }{c}\leqslant \Vert g\mathbf{v}+\mathbf{w}\Vert \leqslant c\cdot \Vert g\mathbf{v}\Vert \leqslant c^{2}\cdot \Vert \mathbf{v}\Vert .\end{eqnarray}$$

Thus if $\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})\geqslant z$ and $\langle g,\mathbf{w}\rangle \in U$ , we have

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})-2\log c\leqslant \unicode[STIX]{x1D6E5}(g\unicode[STIX]{x1D6EC}+\mathbf{w})\leqslant \unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})+2\log c.\end{eqnarray}$$

Since $c>1$ is arbitrary, $\unicode[STIX]{x1D6E5}$ is uniformly continuous on $\unicode[STIX]{x1D6E5}^{-1}([z,\infty ))$ .◻

Another important feature of $\unicode[STIX]{x1D6E5}$ is that it is unbounded from above; indeed, the grid

$$\begin{eqnarray}\operatorname{diag}(1,\ldots ,1,{\textstyle \frac{1}{4}}e^{-z},4e^{z})\mathbb{Z}^{k}+(0,\ldots ,0,2e^{z})\end{eqnarray}$$

is disjoint from the ball centered at 0 of radius $e^{z}$ . Consequently, sets $\unicode[STIX]{x1D6E5}^{-1}([z,\infty ))$ have non-empty interior for all $z\in \mathbb{R}$ .

Let us now describe a basic special case of the correspondence between inhomogeneous improvement of Dirichlet’s theorem and dynamics on $\widehat{X}_{k}$. The next lemma is essentially an inhomogeneous analogue of [KW08, Proposition 2.1].

Lemma 2.2. Let $C>0$ and put $z=(\log C)/(m+n)$ . Then $(A,\mathbf{b})\in \widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{1})$ if and only if $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<z$ for all large enough $t>0$ .

Proof. For $T>1$ , put $\unicode[STIX]{x1D713}(T)=C\unicode[STIX]{x1D713}_{1}(T)=C/T$ , and define

$$\begin{eqnarray}t:=\log T-\frac{n}{m+n}\log C\quad \Longleftrightarrow \quad T=C^{n/(m+n)}e^{t}.\end{eqnarray}$$

Then $\unicode[STIX]{x1D713}(T)=C/T=C^{m/(m+n)}e^{-t}$, and the system (1.5) can be written as

$$\begin{eqnarray}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert ^{m}<C^{m/(m+n)}e^{-t},\quad \Vert \mathbf{q}\Vert ^{n}<C^{n/(m+n)}e^{t},\end{eqnarray}$$

which is the same as

$$\begin{eqnarray}e^{t/m}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert <C^{1/(m+n)},\quad e^{-t/n}\Vert \mathbf{q}\Vert <C^{1/(m+n)}.\end{eqnarray}$$

In view of (2.1), (2.5) and (2.6), the solvability of (1.5) in $(\mathbf{p},\mathbf{q})\in \mathbb{Z}^{m+n}$ is equivalent to

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<\frac{\log C}{m+n}=z,\end{eqnarray}$$

and the conclusion follows. ◻
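The equivalence in Lemma 2.2 can be tested numerically for $m=n=1$. The following sketch is an illustration under our own assumptions: the names `system_solvable` and `delta_below_z` are hypothetical, floating-point arithmetic is used, and $T$ is taken non-integer to stay away from boundary cases. The first function tests (1.5) with $\unicode[STIX]{x1D713}=C\unicode[STIX]{x1D713}_{1}$ directly; the second tests whether the grid $g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}}$ contains a point of supremum norm less than $e^{z}$, with $t=\log T-\frac{1}{2}\log C$ and $z=\frac{1}{2}\log C$.

```python
import math


def dist_to_nearest_int(x):
    """Distance from x to the nearest integer."""
    return abs(x - round(x))


def system_solvable(alpha, b, C, T):
    """Is |alpha*q + b - p| < C/T, |q| < T solvable in integers p, q?"""
    Q = math.ceil(T)
    return any(abs(q) < T and dist_to_nearest_int(alpha * q + b) < C / T
               for q in range(-Q, Q + 1))


def delta_below_z(alpha, b, C, T):
    """Is Delta(g_t Lambda) < z, where t = log T - (log C)/2 and z = (log C)/2?"""
    t = math.log(T) - 0.5 * math.log(C)
    z = 0.5 * math.log(C)
    # Grid points with |q| >= T have sup norm >= e^z, so a finite search suffices.
    Q = math.ceil(T)
    best = min(max(math.exp(t) * dist_to_nearest_int(alpha * q + b),
                   math.exp(-t) * abs(q))
               for q in range(-Q, Q + 1))
    # Compare norms rather than logarithms, since best may be 0.
    return best < math.exp(z)


if __name__ == "__main__":
    alpha, b = math.sqrt(2), 0.3  # hypothetical sample pair
    for C in (0.05, 1.0):
        print(C, system_solvable(alpha, b, C, 7.5), delta_below_z(alpha, b, C, 7.5))
```

For each $q$ the optimal $p$ is the nearest integer to $\unicode[STIX]{x1D6FC}q+b$, which is why both functions only loop over $q$.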

We will use the above lemma and the ergodicity of the $g_{t}$-action on $\widehat{X}_{k}$ to compute the Lebesgue measure of $\widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{1})$. The proof contains a Fubini theorem argument (following [KM99, Theorem 8.7] and dating back to [Dan85]) used to pass from an almost-everywhere statement for grids to an almost-everywhere statement for pairs in $M_{m,n}\times \mathbb{R}^{m}$. We will refer to this argument twice more in the sequel.

Proposition 2.3. For any $m,n\in \mathbb{N}$ and any $C>0$ , the set $\widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{1})$ has Lebesgue measure zero.

Proof. Suppose $U$ is a subset of $M_{m,n}\times \mathbb{R}^{m}$ ( $\cong H$ as in (2.2)) of positive Lebesgue measure such that $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<(\log C)/(m+n)$ for any $(A,\mathbf{b})\in U$ and all large enough  $t$ . Then there exists a neighborhood $V$ of identity in $\tilde{H}$ as in (2.3) such that for all $g\in V$ , $(A,\mathbf{b})\in U$ and all large enough  $t$ ,

(2.7) $$\begin{eqnarray}\unicode[STIX]{x1D6E5}(g_{t}g\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})=\unicode[STIX]{x1D6E5}(g_{t}gg_{t}^{-1}g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<\frac{\log C}{m+n}+1.\end{eqnarray}$$

Indeed, by (2.4) the conjugates $g_{t}gg_{t}^{-1}$, $t\geqslant 0$, stay in a prescribed neighborhood of the identity once $g\in \tilde{H}$ is close enough to the identity; hence one can use Lemma 2.1 to choose $V$ such that if (2.7) does not hold for $g\in V$, then $|\unicode[STIX]{x1D6E5}(g_{t}gg_{t}^{-1}g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})-\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})|<1$, which is impossible since $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<(\log C)/(m+n)$. Since the product map $\tilde{H}\times H\rightarrow \widehat{G}_{k}$ is a local diffeomorphism, $V\times U$ is mapped onto a set of positive measure. It follows that $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC})<(\log C)/(m+n)+1$ for all large enough $t$ and for a set of grids $\unicode[STIX]{x1D6EC}$ of positive Haar measure in $\widehat{X}_{k}$.

On the other hand, from Moore’s ergodicity theorem [Moo66] together with the ergodicity criterion of Brezin and Moore (see [BM81, Theorem 6.1] or [Mar91, Theorem 6]) it follows that every unbounded subgroup of $G_{k}$, in particular, $\{g_{t}:t\in \mathbb{R}\}$ as above, acts ergodically on $\widehat{X}_{k}$. Since for any $C>0$ the set $\unicode[STIX]{x1D6E5}^{-1}([(\log C)/(m+n)+1,\infty ))$ has a non-empty interior, it follows that $\widehat{\unicode[STIX]{x1D707}}$-almost every $\unicode[STIX]{x1D6EC}\in \widehat{X}_{k}$ must visit any such set at unbounded times under the action of $g_{t}$, a contradiction. ◻

3 A correspondence between Dirichlet improvability and dynamics

Lemma 2.2 relates the complement of $\widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{1})$ to the set of grids visiting certain ‘target’ subsets of $\widehat{X}_{k}$ at unbounded times under the diagonal flow $g_{t}$. This is the special case where the target does not change with the time parameter $t$. For general non-increasing $\unicode[STIX]{x1D713}$, we get a family of ‘shrinking targets’ $\unicode[STIX]{x1D6E5}^{-1}([z_{\unicode[STIX]{x1D713}}(t),\infty ))$ (which in fact are shrinking only in a weak sense; see Remark 3.3), where $z_{\unicode[STIX]{x1D713}}$ is obtained by the following change of variables, known as the Dani correspondence.

Lemma 3.1 (see [KM99, Lemma 8.3]).

Let positive integers $m,n$ and $T_{0}\in \mathbb{R}_{+}$ be given. Suppose $\unicode[STIX]{x1D713}:[T_{0},\infty )\rightarrow \mathbb{R}_{+}$ is a continuous, non-increasing function. Then there exists a unique continuous function

$$\begin{eqnarray}z=z_{\unicode[STIX]{x1D713}}:[t_{0},\infty )\rightarrow \mathbb{R},\end{eqnarray}$$

where $t_{0}:=(m/(m+n))\log T_{0}-(n/(m+n))\log \unicode[STIX]{x1D713}(T_{0})$ , such that:

  (i) the function $t\mapsto t+nz(t)$ is strictly increasing and unbounded;

  (ii) the function $t\mapsto t-mz(t)$ is non-decreasing;

  (iii) $\unicode[STIX]{x1D713}(e^{t+nz(t)})=e^{-t+mz(t)}$ for all $t\geqslant t_{0}$.
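For a concrete $\unicode[STIX]{x1D713}$ the value $z_{\unicode[STIX]{x1D713}}(t)$ can be found numerically from property (iii). The sketch below is only an illustration under our own assumptions: the function name `z_psi` is hypothetical, $\unicode[STIX]{x1D713}$ is passed through $P(\unicode[STIX]{x1D706}):=-\log \unicode[STIX]{x1D713}(e^{\unicode[STIX]{x1D706}})$ to avoid overflow, $P$ is assumed to be defined on the whole search bracket, and the bracket is assumed to contain the root. Taking logarithms, (iii) reads $P(t+nz)=t-mz$, and the difference of the two sides is strictly increasing in $z$, so bisection applies.

```python
import math


def z_psi(P, t, m, n, lo=-50.0, hi=50.0, iters=200):
    """Solve P(t + n*z) = t - m*z for z by bisection.

    P(lam) = -log(psi(exp(lam))) is non-decreasing when psi is non-increasing,
    so F(z) = P(t + n*z) - (t - m*z) is strictly increasing in z.
    The bracket [lo, hi] is an assumption and must contain the root.
    """
    def F(z):
        return P(t + n * z) - (t - m * z)

    if not (F(lo) < 0 < F(hi)):
        raise ValueError("root not bracketed")
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if F(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    m, n, C = 2, 3, 3.0
    # psi(x) = C / x, i.e. P(lam) = lam - log C; the solution is (log C)/(m+n).
    print(z_psi(lambda lam: lam - math.log(C), 5.0, m, n), math.log(C) / (m + n))
```

For $\unicode[STIX]{x1D713}(x)=Cx^{-a}$ the equation is linear in $z$ and gives $z(t)=(\log C+(1-a)t)/(m+an)$, which the numerical solution should reproduce.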

Remark 3.2. The function $z$ of Lemma 3.1 differs from the function $r$ of [KM99, Lemma 8.3] by a minus sign. This reflects the difference between the asymptotic and uniform approximation problems.

Remark 3.3. For future reference, we point out that properties (i) and (ii) of Lemma 3.1 imply that any $z=z_{\unicode[STIX]{x1D713}}$ does not oscillate too wildly. Namely,

$$\begin{eqnarray}z(s)-\frac{1}{n}\leqslant z(u)\leqslant z(s)+\frac{1}{m}\quad \text{whenever}~s\leqslant u\leqslant s+1.\end{eqnarray}$$

Indeed, for $s\leqslant u\leqslant s+1$ property (i) gives $s+nz(s)\leqslant u+nz(u)$, hence $z(u)\geqslant z(s)-(u-s)/n$, and property (ii) gives $s-mz(s)\leqslant u-mz(u)$, hence $z(u)\leqslant z(s)+(u-s)/m$.

Now we can state a general version of the correspondence between the improvability of the inhomogeneous Dirichlet theorem and dynamics on $\widehat{X}_{k}$, generalizing Lemma 2.2.

Lemma 3.4. Let $\unicode[STIX]{x1D713}:[T_{0},\infty )\rightarrow \mathbb{R}_{+}$ be a non-increasing continuous function, and let $z=z_{\unicode[STIX]{x1D713}}$ be the function associated to $\unicode[STIX]{x1D713}$ by Lemma 3.1. The pair $(A,\mathbf{b})$ is in $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ if and only if $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<z_{\unicode[STIX]{x1D713}}(t)$ for all sufficiently large $t$.

Proof. We argue as in the proof of Lemma 2.2. Since $t\mapsto t+nz(t)$ is increasing and unbounded, $(A,\mathbf{b})\in \widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ if and only if for all large enough $t$ we have

$$\begin{eqnarray}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert ^{m}<\unicode[STIX]{x1D713}(e^{t+nz(t)})=e^{-t+mz(t)},\quad \Vert \mathbf{q}\Vert ^{n}<e^{t+nz(t)},\end{eqnarray}$$

for some $\mathbf{q}\in \mathbb{Z}^{n}$ , $\mathbf{p}\in \mathbb{Z}^{m}$ . This is the same as the solvability of

$$\begin{eqnarray}e^{t/m}\Vert A\mathbf{q}+\mathbf{b}-\mathbf{p}\Vert <e^{z(t)},\quad e^{-t/n}\Vert \mathbf{q}\Vert <e^{z(t)},\end{eqnarray}$$

which is the same as $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<z_{\unicode[STIX]{x1D713}}(t)$ .◻

Thus a pair fails to be $\unicode[STIX]{x1D713}$ -Dirichlet if and only if the associated grid visits the ‘target’ $\unicode[STIX]{x1D6E5}^{-1}([z_{\unicode[STIX]{x1D713}}(t),\infty ))$ at unbounded times $t$ under the flow  $g_{t}$ . This is known as a ‘shrinking target phenomenon’. Our next goal is to recast condition (1.6) using the function  $z_{\unicode[STIX]{x1D713}}$ .

Lemma 3.5. Let $\unicode[STIX]{x1D713}:[T_{0},\infty )\rightarrow \mathbb{R}_{+},T_{0}\geqslant 0$ , be a non-increasing continuous function, and $z=z_{\unicode[STIX]{x1D713}}$ the function associated to $\unicode[STIX]{x1D713}$ by Lemma 3.1. Then we have

$$\begin{eqnarray}\mathop{\sum }_{j=\lceil T_{0}\rceil }^{\infty }\frac{1}{j^{2}\unicode[STIX]{x1D713}(j)}<\infty \quad \text{if and only if}\quad \mathop{\sum }_{t=\lceil t_{0}\rceil }^{\infty }e^{-(m+n)z(t)}<\infty .\end{eqnarray}$$

Proof. We follow the lines of the proof of [KM99, Lemma 8.3]. Using the monotonicity of $\unicode[STIX]{x1D713}$ and Remark 3.3, we may replace the sums with integrals

$$\begin{eqnarray}\int _{T_{0}}^{\infty }x^{-2}\unicode[STIX]{x1D713}(x)^{-1}\,dx\quad \text{and}\quad \int _{t_{0}}^{\infty }e^{-(m+n)z(t)}\,dt,\end{eqnarray}$$

respectively. Define

$$\begin{eqnarray}P:=-\log \circ \,\unicode[STIX]{x1D713}\circ \exp :[\log T_{0},\infty )\rightarrow \mathbb{R}\quad \text{and}\quad \unicode[STIX]{x1D706}(t):=t+nz(t).\end{eqnarray}$$

Since $\unicode[STIX]{x1D713}(e^{\unicode[STIX]{x1D706}})=e^{-P(\unicode[STIX]{x1D706})}$, we have

$$\begin{eqnarray}\int _{T_{0}}^{\infty }x^{-2}\unicode[STIX]{x1D713}(x)^{-1}\,dx=\int _{\log T_{0}}^{\infty }\unicode[STIX]{x1D713}(e^{\unicode[STIX]{x1D706}})^{-1}e^{-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}=\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}.\end{eqnarray}$$

Using $P(\unicode[STIX]{x1D706}(t))=t-mz(t)$ , we also have

$$\begin{eqnarray}\displaystyle \int _{t_{0}}^{\infty }e^{-(m+n)z(t)}\,dt & = & \displaystyle \int _{\log T_{0}}^{\infty }e^{-(m+n)z(m\unicode[STIX]{x1D706}/(m+n)+nP(\unicode[STIX]{x1D706})/(m+n))}\,d\biggl[\frac{m}{m+n}\unicode[STIX]{x1D706}+\frac{n}{m+n}P(\unicode[STIX]{x1D706})\biggr]\nonumber\\ \displaystyle & = & \displaystyle \frac{m}{m+n}\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}+\frac{n}{m+n}\int _{\log T_{0}}^{\infty }e^{-\unicode[STIX]{x1D706}}e^{P(\unicode[STIX]{x1D706})}\,dP(\unicode[STIX]{x1D706})\nonumber\\ \displaystyle & = & \displaystyle \frac{m}{m+n}\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}+\frac{n}{m+n}\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}\nonumber\\ \displaystyle & & \displaystyle +\,\frac{n}{m+n}\Bigl(\lim _{\unicode[STIX]{x1D706}\rightarrow \infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}-1\Bigr),\nonumber\end{eqnarray}$$

where we integrated by parts in the last line. Since all these quantities (aside from the constant $-1$) are positive, the convergence of $\int _{t_{0}}^{\infty }e^{-(m+n)z(t)}\,dt$ implies the convergence of $\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}$. Conversely, suppose $\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}$ converges, yet $\int _{t_{0}}^{\infty }e^{-(m+n)z(t)}\,dt$ diverges. Then since $u\mapsto \int _{t_{0}}^{u}e^{-(m+n)z(t)}\,dt$ is increasing in $u$, and the first two terms of the sum above converge, we must have $e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}$ eventually increasing in $\unicode[STIX]{x1D706}$ (recall that $\unicode[STIX]{x1D706}$ is an increasing and unbounded function). But this contradicts the convergence of $\int _{\log T_{0}}^{\infty }e^{P(\unicode[STIX]{x1D706})-\unicode[STIX]{x1D706}}\,d\unicode[STIX]{x1D706}$. ◻
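As a consistency check (a worked example, not part of the proof), take $\unicode[STIX]{x1D713}=C\unicode[STIX]{x1D713}_{a}$ with $a\geqslant 0$. Property (iii) of Lemma 3.1 reads $Ce^{-a(t+nz(t))}=e^{-t+mz(t)}$, so that

$$\begin{eqnarray}z(t)=\frac{\log C+(1-a)t}{m+an}\quad \text{and}\quad e^{-(m+n)z(t)}=C^{-(m+n)/(m+an)}\exp \biggl(-\frac{(1-a)(m+n)}{m+an}\,t\biggr).\end{eqnarray}$$

Hence $\sum _{t}e^{-(m+n)z(t)}$ converges if and only if $a<1$, which is exactly the range in which $\sum _{j}1/(j^{2}\unicode[STIX]{x1D713}(j))=C^{-1}\sum _{j}j^{a-2}$ converges. For $a=1$ the formula gives the constant $z=(\log C)/(m+n)$ of Lemma 2.2.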

Now we are ready to reduce Theorem 1.6 to the following statement concerning dynamics on  $\widehat{X}_{k}$ .

Theorem 3.6. Fix $k\in \mathbb{N}$ and let $\{g_{t}:t\in \mathbb{R}\}$ be a diagonalizable unbounded one-parameter subgroup of  $G_{k}$ . Also take an arbitrary sequence $\{z(t):t\in \mathbb{N}\}$ of real numbers. Then the set

(3.1) $$\begin{eqnarray}\{\unicode[STIX]{x1D6EC}\in \widehat{X}_{k}:\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC})\geqslant z(t)~\text{for infinitely many}~t\in \mathbb{N}\}\end{eqnarray}$$

is null (respectively, conull) if the sum

(3.2) $$\begin{eqnarray}\mathop{\sum }_{t=1}^{\infty }e^{-kz(t)}\end{eqnarray}$$

converges (respectively, diverges).
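To get a feel for the dichotomy in Theorem 3.6, one can look at the test family $z(t)=(c/k)\log t$, for which $e^{-kz(t)}=t^{-c}$. This family and the helper name `partial_sum` are illustrative assumptions, not taken from the text. For this family the series (3.2) converges exactly when $c>1$, so the theorem would give a null set for $c>1$ and a conull set for $c\leqslant 1$. A minimal sketch of the partial sums:

```python
import math


def partial_sum(k, c, n_terms):
    """Partial sum of sum_{t=1}^{n_terms} exp(-k * z(t)) for the
    hypothetical test sequence z(t) = (c / k) * log(t).

    Each summand equals t ** (-c), so the value does not depend on k;
    k is kept only to mirror the shape of the series (3.2).
    """
    total = 0.0
    for t in range(1, n_terms + 1):
        z_t = (c / k) * math.log(t)
        total += math.exp(-k * z_t)
    return total


if __name__ == "__main__":
    for c in (2.0, 1.0):
        sums = [partial_sum(3, c, 10 ** j) for j in (2, 3, 4, 5)]
        print(c, sums)
```

For $c=2$ the partial sums stay below $\pi ^{2}/6$, while for $c=1$ they grow like $\log N$; the numerical experiment only illustrates the hypothesis of the theorem, not its conclusion.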

Proof of Theorem 1.6 assuming Theorem 3.6.

Suppose that the series (1.6) converges, and take $z(t)=z_{\unicode[STIX]{x1D713}}(t)$ , the function associated to $\unicode[STIX]{x1D713}$ by Lemma 3.1. In view of Lemma 3.5, the series (3.2) converges as well. In particular, it follows that $z(t)\geqslant 0$ for all large enough $t\in \mathbb{N}$ , and also that $\sum _{t=1}^{\infty }e^{-k(z(t)-C)}<\infty$ for any $C>0$ . Take $g_{t}$ as in (2.1); Theorem 3.6 then implies that

(3.3) $$\begin{eqnarray}\widehat{\unicode[STIX]{x1D707}}(\{\unicode[STIX]{x1D6EC}\in \widehat{X}_{k}:\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC})\geqslant z(t)-C~\text{for infinitely many}~t\in \mathbb{N}\})=0.\end{eqnarray}$$

Suppose that the Lebesgue measure of $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})^{c}$ is positive. Lemma 3.4 asserts that there exists a set $U$ of positive measure consisting of pairs $(A,\mathbf{b})$ for which $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})\geqslant z(t)$ for an unbounded set of $t\geqslant 0$ . Then, using $z(t)\geqslant 0$ and Lemma 2.1, we can replace $t$ with its integer part:

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}(g_{\lfloor t\rfloor }\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})=\unicode[STIX]{x1D6E5}(g_{(\lfloor t\rfloor -t)}g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})\geqslant \unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})-c\geqslant z(t)-c\geqslant z(\lfloor t\rfloor )-c-1/m,\end{eqnarray}$$

where $c$ is a positive constant and the last inequality follows from Remark 3.3. Therefore we get $\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})\geqslant z(t)-c-1/m$ for an unbounded set of $t\in \mathbb{N}$ as long as $(A,\mathbf{b})\in U$ .

Now recall the groups $H$ and $\tilde{H}$ from Equations (2.2) and (2.3). As in the proof of Proposition 2.3, we may identify $U$ with a subset of $H$ and, using the uniform continuity of $\unicode[STIX]{x1D6E5}$ (Lemma 2.1), find a neighborhood of identity $V\subset \tilde{H}$ such that, for all $g\in V$ and $(A,\mathbf{b})\in U$ ,

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}(g_{t}g\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})=\unicode[STIX]{x1D6E5}(g_{t}gg_{t}^{-1}g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})\geqslant \unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})-1\end{eqnarray}$$

for all $t\geqslant 0$, hence $\unicode[STIX]{x1D6E5}(g_{t}g\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})\geqslant z(t)-1-c-1/m$ for an unbounded set of $t\in \mathbb{N}$. Since the product map $\tilde{H}\times H\rightarrow \widehat{G}_{k}$ is a local diffeomorphism, the image of $V\times U$ in $\widehat{X}_{k}$ is a set of positive measure, contradicting (3.3) with $C=1+c+1/m$.

The proof of the divergence case proceeds along the same lines. If (1.6) diverges, by Lemma 3.5 so does (3.2). Define $z^{\prime }(t):=\max (z(t),0)$ ; then we have $\sum _{t=1}^{\infty }e^{-k(z^{\prime }(t))}=\infty$ as well, therefore $\sum _{t=1}^{\infty }e^{-k(z^{\prime }(t)+C)}=\infty$ for any $C>0$ . In view of Theorem 3.6,

(3.4) $$\begin{eqnarray}\text{the set}~\{\unicode[STIX]{x1D6EC}\in \widehat{X}_{k}:\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC})\geqslant z^{\prime }(t)+C~\text{for infinitely many}~t\in \mathbb{N}\}\text{ has full measure.}\end{eqnarray}$$

Now assume that the set $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ has positive measure. Then using Lemma 3.4, one can choose a set $U$ of positive measure consisting of pairs $(A,\mathbf{b})$ for which

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<z(t)\leqslant z^{\prime }(t)\end{eqnarray}$$

for all large enough $t$ . Then, as before, using Lemma 2.1 with $z=0$ and (2.4), one finds a neighborhood of identity $V\subset \tilde{H}$ such that for all $g\in V$ and $(A,\mathbf{b})\in U$ ,

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}(g_{t}g\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})=\unicode[STIX]{x1D6E5}(g_{t}gg_{t}^{-1}g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<\max (\unicode[STIX]{x1D6E5}(g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}}),0)+1\end{eqnarray}$$

for all $t\geqslant 0$ ; hence $\unicode[STIX]{x1D6E5}(g_{t}g\unicode[STIX]{x1D6EC}_{A,\mathbf{b}})<z^{\prime }(t)+1$ for all large enough  $t$ . Again using the local product structure of $\widehat{G}_{k}$ , one concludes that the image of $V\times U$ in $\widehat{X}_{k}$ is a set of positive measure, contradicting (3.4).◻

We are now left with the task of proving Theorem 3.6. The proof will have two ingredients. In the next section we will establish a dynamical Borel–Cantelli lemma (Theorem 4.4) showing that the limsup set (3.1) is null or conull according to the convergence or divergence of the series

(3.5) $$\begin{eqnarray}\mathop{\sum }_{t=1}^{\infty }\widehat{\unicode[STIX]{x1D707}}(\{\unicode[STIX]{x1D6EC}\in \widehat{X}_{k}:\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})\geqslant z(t)\}).\end{eqnarray}$$

The proof is based on the methods of [KM99, KM18]; namely, it uses the exponential mixing of the $g_{t}$-action on $\widehat{X}_{k}$, as well as the so-called DL property of $\unicode[STIX]{x1D6E5}$. The latter will be established in § 6. Moreover, there we will relate (3.2) and (3.5) by showing that the summands in (3.5) are equal to $e^{-kz(t)}$ up to a multiplicative constant (Theorem 4.6).

4 A general dynamical Borel–Cantelli lemma and exponential mixing

In this section we let $G$ be a Lie group and $\unicode[STIX]{x1D6E4}$ a lattice in $G$. Denote by $X$ the homogeneous space $G/\unicode[STIX]{x1D6E4}$ and by $\unicode[STIX]{x1D707}$ the $G$-invariant probability measure on $X$. In what follows, $\Vert \cdot \Vert _{p}$ will stand for the $L^{p}$-norm. Fix a basis $\{Y_{1},\ldots ,Y_{n}\}$ for the Lie algebra $\mathfrak{g}$ of $G$, and, given a smooth function $h\in C^{\infty }(X)$ and $\ell \in \mathbb{Z}_{+}$, define the ‘$L^{2}$, order $\ell$’ Sobolev norm $\Vert h\Vert _{2,\ell }$ of $h$ by

$$\begin{eqnarray}\Vert h\Vert _{2,\ell }\stackrel{\text{def}}{=}\mathop{\sum }_{|\unicode[STIX]{x1D6FC}|\leqslant \ell }\Vert D^{\unicode[STIX]{x1D6FC}}h\Vert _{2},\end{eqnarray}$$

where $\unicode[STIX]{x1D6FC}=(\unicode[STIX]{x1D6FC}_{1},\ldots ,\unicode[STIX]{x1D6FC}_{n})$ is a multi-index, $|\unicode[STIX]{x1D6FC}|=\sum _{i=1}^{n}\unicode[STIX]{x1D6FC}_{i}$ , and $D^{\unicode[STIX]{x1D6FC}}$ is a differential operator of order $|\unicode[STIX]{x1D6FC}|$ which is a monomial in $Y_{1},\ldots ,Y_{n}$ , namely $D^{\unicode[STIX]{x1D6FC}}=Y_{1}^{\unicode[STIX]{x1D6FC}_{1}}\cdots Y_{n}^{\unicode[STIX]{x1D6FC}_{n}}$ . This definition depends on the basis; however, a change of basis would only distort $\Vert h\Vert _{2,\ell }$ by a bounded factor. We also let

$$\begin{eqnarray}C_{2}^{\infty }(X)=\{h\in C^{\infty }(X):\Vert h\Vert _{2,\ell }<\infty ~\text{for any}~\ell \in \mathbb{Z}_{+}\}.\end{eqnarray}$$

Fix a right-invariant Riemannian metric on $G$ and the corresponding metric ‘dist’ on  $X$ . For $g\in G$ , let us denote by $\Vert g\Vert$ the distance between $g\in G$ and the identity element of  $G$ . Note that $\Vert g\Vert =\Vert g^{-1}\Vert$ due to the right-invariance of the metric.

Definition 4.1. Let $L$ be a subgroup of $G$ . Say that the $L$ -action on $X$ is exponentially mixing if there exist $\unicode[STIX]{x1D6FE},E>0$ and $\ell \in \mathbb{Z}_{+}$ such that for any $\unicode[STIX]{x1D711},\unicode[STIX]{x1D713}\in C_{2}^{\infty }(X)$ and for any $g\in L$ one has

(EM) $$\begin{eqnarray}\biggl|\langle g\unicode[STIX]{x1D711},\unicode[STIX]{x1D713}\rangle -\int _{X}\unicode[STIX]{x1D711}\,d\unicode[STIX]{x1D707}\int _{X}\unicode[STIX]{x1D713}\,d\unicode[STIX]{x1D707}\biggr|\leqslant Ee^{-\unicode[STIX]{x1D6FE}\Vert g\Vert }\Vert \unicode[STIX]{x1D711}\Vert _{2,\ell }\Vert \unicode[STIX]{x1D713}\Vert _{2,\ell }.\end{eqnarray}$$

Here $\langle \cdot ,\cdot \rangle$ stands for the inner product in $L^{2}(X,\unicode[STIX]{x1D707})$ .

We also need two more definitions from [KM99, KM18].

Definition 4.2. A sequence $\{f_{t}:t\in \mathbb{N}\}$ of elements of $G$ is called exponentially divergent if

(4.1) $$\begin{eqnarray}\sup _{t\in \mathbb{N}}\mathop{\sum }_{s=1}^{\infty }e^{-\unicode[STIX]{x1D6FE}\Vert f_{s}f_{t}^{-1}\Vert }<\infty \quad \forall \,\unicode[STIX]{x1D6FE}>0.\end{eqnarray}$$

Now let $\unicode[STIX]{x1D6E5}$ be a real-valued function on $X$ , and for $z\in \mathbb{R}$ denote

$$\begin{eqnarray}\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z)\stackrel{\text{def}}{=}\unicode[STIX]{x1D707}(\unicode[STIX]{x1D6E5}^{-1}([z,\infty ))).\end{eqnarray}$$

Definition 4.3. Say that $\unicode[STIX]{x1D6E5}$ is DL (an abbreviation for ‘distance-like’) if there exists $z_{0}\in \mathbb{R}$ such that $\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z_{0})>0$ and

  (a) $\unicode[STIX]{x1D6E5}$ is uniformly continuous on $\unicode[STIX]{x1D6E5}^{-1}([z_{0},\infty ))$; that is, for all $\unicode[STIX]{x1D700}>0$ there exists a neighborhood $U$ of identity in $G$ such that for any $x\in X$ with $\unicode[STIX]{x1D6E5}(x)\geqslant z_{0}$,

    $$\begin{eqnarray}g\in U\quad \Longrightarrow \quad |\unicode[STIX]{x1D6E5}(x)-\unicode[STIX]{x1D6E5}(gx)|<\unicode[STIX]{x1D700};\end{eqnarray}$$
  (b) the function $\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}$ does not decrease very fast; more precisely,

    (4.2) $$\begin{eqnarray}\exists \,c,\unicode[STIX]{x1D6FF}>0~\text{such that}~\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z)\geqslant c\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z-\unicode[STIX]{x1D6FF})\quad \forall \,z\geqslant z_{0}.\end{eqnarray}$$

The next theorem is a direct consequence of [KM18, Theorem 1.3].

Theorem 4.4. Suppose that the action of a subgroup $L\subset G$ on $X$ is exponentially mixing. Let $\{f_{t}:t\in \mathbb{N}\}$ be a sequence of elements of $L$ satisfying (4.1), and let $\unicode[STIX]{x1D6E5}$ be a DL function on  $X$ . Also let $\{z(t):t\in \mathbb{N}\}$ be a sequence of real numbers. Then the set

(4.3) $$\begin{eqnarray}\{\unicode[STIX]{x1D6EC}\in X:\unicode[STIX]{x1D6E5}(f_{t}\unicode[STIX]{x1D6EC})\geqslant z(t)~\text{for infinitely many}~t\in \mathbb{N}\}\end{eqnarray}$$

is null (respectively, conull) if the sum

(4.4) $$\begin{eqnarray}\mathop{\sum }_{t=1}^{\infty }\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z(t))\end{eqnarray}$$

converges (respectively, diverges).

Proof. The convergence case is immediate from the classical Borel–Cantelli lemma. The divergence case is established in [KM99, KM18] for $L=G$, but the argument applies verbatim if $G$ is replaced by a subgroup.◻

From now on we will take $k\geqslant 2$ and consider the case $G=\widehat{G}_{k}$ , $\unicode[STIX]{x1D6E4}=\widehat{\unicode[STIX]{x1D6E4}}_{k}$ , $X=\widehat{X}_{k}$ , and $L=G_{k}$ , with notation as in the previous section. Then we have the following theorem.

Theorem 4.5. The $G_{k}$ -action on $\widehat{X}_{k}=\widehat{G}_{k}/\widehat{\unicode[STIX]{x1D6E4}}_{k}$ is exponentially mixing.

Proof. According to [KM99, Theorem 3.4], exponential mixing holds whenever the regular representation of $G_{k}$ on the space $L_{0}^{2}(\widehat{X}_{k})$ (functions in $L^{2}(\widehat{X}_{k})$ with integral zero) is isolated in the Fell topology from the trivial representation. This is immediate if $k>2$ since in this case $G_{k}$ has Property (T).

If $k=2$, let us write $L_{0}^{2}(\widehat{X}_{2})$ as a direct sum of two spaces: the space of functions invariant under the action of $\mathbb{R}^{2}$ by translations, and its orthogonal complement. The first representation is isomorphic to the regular representation of $\operatorname{SL}_{2}(\mathbb{R})$ on $L_{0}^{2}(\operatorname{SL}_{2}(\mathbb{R})/\operatorname{SL}_{2}(\mathbb{Z}))$, which is isolated from the trivial representation by [KM99, Theorem 1.12]. As for the second component, one can use [HT92, Theorem V.3.3.1] (see also [GGN18, Theorem 4.3]), which asserts that for any unitary representation $(\unicode[STIX]{x1D70C},V)$ of $\operatorname{ASL}_{2}(\mathbb{R})$ with no non-zero vectors fixed by $\mathbb{R}^{2}$, the restriction of $\unicode[STIX]{x1D70C}$ to $\operatorname{SL}_{2}(\mathbb{R})$ is tempered, that is, there exists a dense set of vectors in $V$ whose matrix coefficients are in $L^{2+\unicode[STIX]{x1D700}}$ for any $\unicode[STIX]{x1D700}>0$. Exponential mixing thus follows from [KS94, Theorem 3.1], which establishes exponential decay of matrix coefficients of strongly $L^{p}$ irreducible unitary representations of connected semisimple centerfree Lie groups. See also the preprint [Edw13] for more precise estimates.◻

Now let $\unicode[STIX]{x1D6E5}$ be the function on $\widehat{X}_{k}$ defined by (2.6). In the next section we will establish the following two-sided estimate for the measure of super-level sets of  $\unicode[STIX]{x1D6E5}$ .

Theorem 4.6. For any $k\geqslant 2$ there exist $c,C>0$ such that

(4.5) $$\begin{eqnarray}ce^{-kz}\leqslant \unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z)\leqslant Ce^{-kz}\quad \text{for all}~z\geqslant 0.\end{eqnarray}$$

This is all one needs to settle Theorem 3.6.

Proof of Theorem 3.6 modulo Theorem 4.6.

Let $\{g_{t}:t\in \mathbb{R}\}$ be a diagonalizable unbounded one-parameter subgroup of $G_{k}$ . By Theorem 4.5, the action of $G_{k}$ on $\widehat{X}_{k}$ is exponentially mixing. Observe also that one has $\Vert g_{t}\Vert \geqslant \unicode[STIX]{x1D6FC}t$ for some $\unicode[STIX]{x1D6FC}>0$ , which immediately implies (4.1). It is easy to see that (4.5) implies (4.2) with $z_{0}=0$ , and part (a) of Definition 4.3 is given by Lemma 2.1. The conditions of Theorem 4.4 are therefore met, and Theorem 3.6 follows. ◻
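The two verifications left to the reader in the proof above can be sketched as follows. This is only an outline, under the assumption that the bound $\Vert g_{t}\Vert \geqslant \unicode[STIX]{x1D6FC}t$ holds for all $t\geqslant 0$; the constants $\unicode[STIX]{x1D6FF}$ and $c^{\prime }$ below are our notation. For (4.1) with $f_{t}=g_{t}$, note that $g_{s}g_{t}^{-1}=g_{s-t}$ and, since $\Vert g\Vert =\Vert g^{-1}\Vert$, that $\Vert g_{s-t}\Vert \geqslant \unicode[STIX]{x1D6FC}|s-t|$. Hence, for every $\unicode[STIX]{x1D6FE}>0$ and $t\in \mathbb{N}$,

$$\begin{eqnarray}\mathop{\sum }_{s=1}^{\infty }e^{-\unicode[STIX]{x1D6FE}\Vert g_{s}g_{t}^{-1}\Vert }\leqslant \mathop{\sum }_{j\in \mathbb{Z}}e^{-\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D6FC}|j|}=\frac{1+e^{-\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D6FC}}}{1-e^{-\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D6FC}}}<\infty ,\end{eqnarray}$$

a bound that does not depend on $t$. For (4.2), fix any $\unicode[STIX]{x1D6FF}>0$. If $z\geqslant \unicode[STIX]{x1D6FF}$, then (4.5) applies to both $z$ and $z-\unicode[STIX]{x1D6FF}$ and gives

$$\begin{eqnarray}\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z)\geqslant ce^{-kz}=\frac{c}{C}e^{-k\unicode[STIX]{x1D6FF}}\cdot Ce^{-k(z-\unicode[STIX]{x1D6FF})}\geqslant \frac{c}{C}e^{-k\unicode[STIX]{x1D6FF}}\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z-\unicode[STIX]{x1D6FF}),\end{eqnarray}$$

while if $0\leqslant z<\unicode[STIX]{x1D6FF}$, then $\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z-\unicode[STIX]{x1D6FF})\leqslant 1$ and $\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z)\geqslant ce^{-kz}\geqslant ce^{-k\unicode[STIX]{x1D6FF}}$. Thus (4.2) holds with $z_{0}=0$ and with $c^{\prime }=\min (c,c/C)e^{-k\unicode[STIX]{x1D6FF}}$ in place of $c$; moreover, $\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(0)\geqslant c>0$, as Definition 4.3 requires.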

5 $\unicode[STIX]{x1D6E5}$ is distance-like: a warm-up

For the rest of the paper we keep the notation

$$\begin{eqnarray}G=G_{k},\quad \widehat{G}=\widehat{G}_{k}=G_{k}\rtimes \mathbb{R}^{k},\quad X=X_{k}=G_{k}/\unicode[STIX]{x1D6E4}_{k},\quad \widehat{X}=\widehat{X}_{k}=\widehat{G}_{k}/\widehat{\unicode[STIX]{x1D6E4}}_{k},\end{eqnarray}$$

and let $\unicode[STIX]{x1D707}$ (respectively, $\widehat{\unicode[STIX]{x1D707}}$ ) be the Haar probability measure on $X$ (respectively, $\widehat{X}$ ). We denote by $\unicode[STIX]{x1D707}_{G}$ and $\unicode[STIX]{x1D707}_{\widehat{G}}$ the left-invariant Haar measures on $G$ and $\widehat{G}$ , respectively, which are locally pushed forward to $\unicode[STIX]{x1D707}$ and  $\widehat{\unicode[STIX]{x1D707}}$ .

Recall that

$$\begin{eqnarray}\unicode[STIX]{x1D6F7}_{\unicode[STIX]{x1D6E5}}(z)=\widehat{\unicode[STIX]{x1D707}}(\{\unicode[STIX]{x1D6EC}\in \widehat{X}:\unicode[STIX]{x1D6E5}(\unicode[STIX]{x1D6EC})\geqslant z\})=\widehat{\unicode[STIX]{x1D707}}(\{\unicode[STIX]{x1D6EC}\in \widehat{X}:\unicode[STIX]{x1D6EC}\cap B(0,e^{z})=\varnothing \}),\end{eqnarray}$$

where for $\mathbf{v}\in \mathbb{R}^{k}$ and $r\geqslant 0$ we let $B(\mathbf{v},r)$ be the open ball in $\mathbb{R}^{k}$ centered at $\mathbf{v}$ of radius $r$ with respect to the supremum norm. It will be convenient to write

$$\begin{eqnarray}S_{r}:=\unicode[STIX]{x1D6E5}^{-1}([\log r,\infty ))=\{\unicode[STIX]{x1D6EC}\in \widehat{X}:B(0,r)\cap \unicode[STIX]{x1D6EC}=\varnothing \}.\end{eqnarray}$$

Our goal is thus to prove that

(5.1) $$\begin{eqnarray}cr^{-k}\leqslant \widehat{\unicode[STIX]{x1D707}}(S_{r})\leqslant Cr^{-k}\quad \text{for all}~r\geqslant 1,\end{eqnarray}$$

where $c,C$ are constants dependent only on $k$ .

First let us discuss the upper bound. It is in fact a special case of a recent result due to Athreya, namely a random Minkowski-type theorem for the space of grids [Ath15, Theorem 1].

Proposition 5.1 (Athreya).

For a measurable $E\subset \mathbb{R}^{k}$ ,

$$\begin{eqnarray}\widehat{\unicode[STIX]{x1D707}}(\{\unicode[STIX]{x1D6EC}\in \widehat{X}:\unicode[STIX]{x1D6EC}\cap E=\varnothing \})\leqslant \frac{1}{1+\unicode[STIX]{x1D706}(E)}.\end{eqnarray}$$

Here and hereafter $\unicode[STIX]{x1D706}$ stands for Lebesgue measure on $\mathbb{R}^{k}$. Taking $E=B(0,r)$, for which $\unicode[STIX]{x1D706}(E)=(2r)^{k}$, shows that $\widehat{\unicode[STIX]{x1D707}}(S_{r})\leqslant 1/(1+(2r)^{k})<2^{-k}r^{-k}$. Thus it only remains to establish a lower bound in (5.1).

There exists an obvious projection, $\unicode[STIX]{x1D70B}:\widehat{X}\rightarrow X$ , making $\widehat{X}$ into a $\mathbb{T}^{k}$ -bundle over $X$ ( $\unicode[STIX]{x1D70B}$  simply translates one of the vectors in a grid to the origin). It is easy to see that $\unicode[STIX]{x1D707}_{\widehat{G}}$ is the product of $\unicode[STIX]{x1D707}_{G}$ and $\unicode[STIX]{x1D706}$ . Therefore one has the following Fubini formula:

(5.2) $$\begin{eqnarray}\widehat{\unicode[STIX]{x1D707}}(S_{r})=\int _{X}Q(\unicode[STIX]{x1D6EC},r)\,d\unicode[STIX]{x1D707}(\unicode[STIX]{x1D6EC}),\quad \text{where}~Q(\unicode[STIX]{x1D6EC},r):=\unicode[STIX]{x1D706}(S_{r}\cap \unicode[STIX]{x1D70B}^{-1}(\unicode[STIX]{x1D6EC})).\end{eqnarray}$$

Here, for $\unicode[STIX]{x1D6EC}\in X$ , $\unicode[STIX]{x1D70B}^{-1}(\unicode[STIX]{x1D6EC})$ is identified with $\mathbb{R}^{k}/\unicode[STIX]{x1D6EC}$ via

(5.3) $$\begin{eqnarray}[\mathbf{v}]\in \mathbb{R}^{k}/\unicode[STIX]{x1D6EC}\longleftrightarrow \unicode[STIX]{x1D6EC}-\mathbf{v},\end{eqnarray}$$

and, in the hope that it will not cause any confusion, we will let $\unicode[STIX]{x1D706}$ stand for the normalized Haar measure on $\mathbb{R}^{k}/\unicode[STIX]{x1D6EC}$ for any $\unicode[STIX]{x1D6EC}\in X$ . Writing $\unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}$ for the projection $\mathbb{R}^{k}\rightarrow \mathbb{R}^{k}/\unicode[STIX]{x1D6EC}$ , we have

(5.4) $$\begin{eqnarray}\displaystyle S_{r}\cap \unicode[STIX]{x1D70B}^{-1}(\unicode[STIX]{x1D6EC}) & = & \displaystyle \{[\mathbf{v}]\in \mathbb{R}^{k}/\unicode[STIX]{x1D6EC}:B(0,r)\cap (\unicode[STIX]{x1D6EC}-\mathbf{v})=\varnothing \}\nonumber\\ \displaystyle & = & \displaystyle \{[\mathbf{v}]\in \mathbb{R}^{k}/\unicode[STIX]{x1D6EC}:B(\mathbf{v},r)\cap \unicode[STIX]{x1D6EC}=\varnothing \}=\unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}\bigg(\mathbb{R}^{k}\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\mathbf{v},r)\bigg),\end{eqnarray}$$

so that $Q(\unicode[STIX]{x1D6EC},r)$ is the volume of the region in a fundamental domain (parallelepiped) in $\mathbb{R}^{k}$ for $\unicode[STIX]{x1D6EC}$ consisting of points which are at distance at least $r$ from its vertices, that is, from all points of $\unicode[STIX]{x1D6EC}$.

Recall that $\operatorname{SL}_{2}(\mathbb{R})$ double-covers the unit tangent bundle of the hyperbolic upper-half plane, $\mathbb{H}^{2}$ . Since the action of $\operatorname{SL}_{2}(\mathbb{Z})$ on $\mathbb{H}^{2}$ has a convenient fundamental domain, there are convenient coordinates for a set of full measure in $\operatorname{SL}_{2}(\mathbb{R})/\operatorname{SL}_{2}(\mathbb{Z})$ . This enables us to give a rather tidy proof for the two-dimensional case of (5.1), handling both bounds simultaneously without using Proposition 5.1. This proof also illustrates the main idea necessary to proving the lower bound in the general case. We therefore start with a separate, redundant proof of the two-dimensional case.

Proof of (5.1) for $k=2$ .

For fixed $r$ , consider the map $(\unicode[STIX]{x1D705},n,a)\mapsto Q(\unicode[STIX]{x1D705}na\mathbb{Z}^{2},r)$ , whose domain is $K\times N\times A$ , the Iwasawa decomposition for $G=\operatorname{SL}_{2}(\mathbb{R})$ . (Here $K$ , $N$ , $A$ are the groups of orthogonal, upper-triangular unipotent, and diagonal matrices respectively.) We first show that a change of $\unicode[STIX]{x1D705}$ does not significantly change the value of  $Q$ . Indeed, since rotation perturbs the sup norm by no more than a factor of  $\sqrt{2}$ , for any $\unicode[STIX]{x1D705}\in K$ and $\mathbf{x},\mathbf{y}\in \mathbb{R}^{2}$ we have

$$\begin{eqnarray}\Vert \unicode[STIX]{x1D705}\mathbf{x}-\unicode[STIX]{x1D705}\mathbf{y}\Vert \geqslant \sqrt{2}r\;\Longrightarrow \;\Vert \mathbf{x}-\mathbf{y}\Vert \geqslant r\;\Longrightarrow \;\Vert \unicode[STIX]{x1D705}\mathbf{x}-\unicode[STIX]{x1D705}\mathbf{y}\Vert \geqslant r/\sqrt{2},\end{eqnarray}$$

hence

$$\begin{eqnarray}\unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}\biggl(\mathbb{R}^{2}\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\unicode[STIX]{x1D705}\mathbf{v},\sqrt{2}r)\biggr)\subset \unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}\biggl(\mathbb{R}^{2}\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\mathbf{v},r)\biggr)\subset \unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}\biggl(\mathbb{R}^{2}\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\unicode[STIX]{x1D705}\mathbf{v},r/\sqrt{2})\biggr).\end{eqnarray}$$

By (5.4), this implies

(5.5) $$\begin{eqnarray}Q(\unicode[STIX]{x1D705}\unicode[STIX]{x1D6EC},\sqrt{2}r)\leqslant Q(\unicode[STIX]{x1D6EC},r)\leqslant Q(\unicode[STIX]{x1D705}\unicode[STIX]{x1D6EC},r/\sqrt{2}).\end{eqnarray}$$

Let $a=\operatorname{diag}(\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FC}^{-1})$ . If

(5.6) $$\begin{eqnarray}2r>\unicode[STIX]{x1D6FC},\end{eqnarray}$$

then the lattice $na\mathbb{Z}^{2}$ consists of horizontal rows of vectors, each closer than $2r$ to its horizontal neighbors. Thus the boxes making up the union $\bigcup _{\mathbf{v}\in na\mathbb{Z}^{2}}B(\mathbf{v},r)$ overlap in the horizontal direction, creating horizontal strips. Thus, by (5.4), $Q(na\mathbb{Z}^{2},r)$ is just the area of the fundamental parallelogram $na(I\times I)$ minus the strips on top and bottom, as in the following figure.

This smaller parallelogram has area $\unicode[STIX]{x1D6FC}(\unicode[STIX]{x1D6FC}^{-1}-2r)$ , provided it is non-empty, that is, provided $2r\leqslant \unicode[STIX]{x1D6FC}^{-1}$ . Thus from (5.5), if $\unicode[STIX]{x1D6EC}=\unicode[STIX]{x1D705}na\mathbb{Z}^{2}$ , where $a=\operatorname{diag}(\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FC}^{-1})$ , we have

$$\begin{eqnarray}Q(\unicode[STIX]{x1D6EC},r)\leqslant Q(\unicode[STIX]{x1D705}^{-1}\unicode[STIX]{x1D6EC},r/\sqrt{2})=Q(na\mathbb{Z}^{2},r/\sqrt{2}).\end{eqnarray}$$

Then if $\sqrt{2}r>\unicode[STIX]{x1D6FC}$ (so that (5.6) holds for $na\mathbb{Z}^{2}$ , after adjusting $r$ as above), we have

(5.7) $$\begin{eqnarray}\begin{array}{@{}rl@{}}\unicode[STIX]{x1D6FC}(\unicode[STIX]{x1D6FC}^{-1}-2\sqrt{2}r)\leqslant Q(\unicode[STIX]{x1D6EC},r)\leqslant \unicode[STIX]{x1D6FC}(\unicode[STIX]{x1D6FC}^{-1}-\sqrt{2}r) & \text{if}~\sqrt{2}r\leqslant \unicode[STIX]{x1D6FC}^{-1},\\ Q(\unicode[STIX]{x1D6EC},r)=0 & \text{if}~\sqrt{2}r\geqslant \unicode[STIX]{x1D6FC}^{-1}.\end{array}\end{eqnarray}$$
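As a sanity check on the strip picture behind (5.7), the following sketch estimates $Q(na\mathbb{Z}^{2},r)$ numerically and can be compared with the value $\unicode[STIX]{x1D6FC}(\unicode[STIX]{x1D6FC}^{-1}-2r)=1-2r\unicode[STIX]{x1D6FC}$ obtained above when $\unicode[STIX]{x1D6FC}<2r\leqslant \unicode[STIX]{x1D6FC}^{-1}$. The function name `uncovered_fraction`, its parameters (`nu` is the off-diagonal entry of $n$), and the midpoint sampling scheme are illustrative assumptions and not part of the original argument.

```python
import math


def uncovered_fraction(alpha, nu, r, grid=200):
    """Estimate Q(n a Z^2, r) for a = diag(alpha, 1/alpha), n = [[1, nu], [0, 1]].

    Samples the fundamental parallelogram n a [0,1)^2 (which has area 1)
    at grid*grid midpoints and returns the fraction of sample points
    lying in no open sup-norm ball B(v, r) centred at a lattice point v.
    """
    inv = 1.0 / alpha
    # Rows of the lattice sit at heights q / alpha; only rows with
    # |t - q| < r * alpha (t in [0, 1)) can come within vertical distance r.
    q_reach = math.ceil(r * alpha) + 1
    uncovered = 0
    for i in range(grid):
        s = (i + 0.5) / grid
        for j in range(grid):
            t = (j + 0.5) / grid
            # The sample point n a (s, t)^T.
            x = alpha * s + nu * inv * t
            y = inv * t
            covered = False
            for q in range(-q_reach, q_reach + 2):
                if abs(y - inv * q) >= r:
                    continue
                # Points of row q are (alpha * p + nu * q / alpha, q / alpha);
                # pick the horizontally nearest one.
                offset = x - nu * inv * q
                dx = abs(offset - alpha * round(offset / alpha))
                if dx < r:
                    covered = True
                    break
            if not covered:
                uncovered += 1
    return uncovered / (grid * grid)


if __name__ == "__main__":
    alpha, nu, r = 0.5, -0.3, 0.4
    print(uncovered_fraction(alpha, nu, r), 1 - 2 * r * alpha)
```

For instance, with $\unicode[STIX]{x1D6FC}=0.5$ and $r=0.4$ the estimate should come out close to $1-2r\unicode[STIX]{x1D6FC}=0.6$, and it should drop to $0$ once $2r>\unicode[STIX]{x1D6FC}^{-1}$.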

We now identify $X_{2}$ with $\operatorname{SL}_{2}(\mathbb{Z})\backslash T^{1}(\mathbb{H}^{2})$ via $g\mathbb{Z}^{2}\mapsto g^{-1}(i,i)$ (here, the matrix $g^{-1}$ acts on $(i,i)$ as a fractional-linear transformation). Recall that

$$\begin{eqnarray}F:=T^{1}\{|z|\geqslant 1,|\text{Re}(z)|\leqslant 1/2\}\end{eqnarray}$$

is a fundamental domain for the action of $\operatorname{SL}_{2}(\mathbb{Z})$ on $T^{1}(\mathbb{H}^{2})$ . Under the correspondence $g^{-1}(i,i)\mapsto g\mathbb{Z}^{2}$ , a point $(z,\unicode[STIX]{x1D703})\in F$ maps to a lattice $\unicode[STIX]{x1D6EC}=\unicode[STIX]{x1D705}na\mathbb{Z}^{2}$ with $\sqrt{3}/2\leqslant \operatorname{Im}(z)=\unicode[STIX]{x1D6FC}^{-2}$ . Thus if

(5.8) $$\begin{eqnarray}\sqrt{3}r^{2}>1\end{eqnarray}$$

(which ensures that the condition $\sqrt{2}r>\unicode[STIX]{x1D6FC}$ for (5.7) is met) then the estimates (5.7) hold with $\operatorname{Im}(z)=\unicode[STIX]{x1D6FC}^{-2}$ for any lattice $\unicode[STIX]{x1D6EC}_{(z,\unicode[STIX]{x1D703})}$ , $(z,\unicode[STIX]{x1D703})\in F$ . Writing $y=\operatorname{Im}(z)$ , the estimates become

(5.9) $$\begin{eqnarray}\begin{array}{@{}rl@{}}\displaystyle 1-\frac{2\sqrt{2}r}{\sqrt{y}}\leqslant Q(\unicode[STIX]{x1D6EC}_{(z,\unicode[STIX]{x1D703})},r)\leqslant 1-\frac{\sqrt{2}r}{\sqrt{y}} & \text{if}~\sqrt{2}r\leqslant \sqrt{y},\\ Q(\unicode[STIX]{x1D6EC}_{(z,\unicode[STIX]{x1D703})},r)=0 & \text{if}~\sqrt{2}r\geqslant \sqrt{y}.\end{array}\end{eqnarray}$$

Since the Haar measure on $X_{2}$ corresponds to the hyperbolic measure $(1/y^{2})\,dx\,dy\,d\unicode[STIX]{x1D703}$ on $F$ , we have

$$\begin{eqnarray}\int _{X_{2}^{\prime }}Q(\unicode[STIX]{x1D6EC},r)\,d\unicode[STIX]{x1D707}^{\prime }(\unicode[STIX]{x1D6EC})=\int _{F}Q(\unicode[STIX]{x1D6EC}_{(x+iy,\unicode[STIX]{x1D703})},r)\cdot \frac{1}{y^{2}}\,dx\,dy\,d\unicode[STIX]{x1D703}.\end{eqnarray}$$

Finally, since $r$ is large (see footnote 3), $Q(\unicode[STIX]{x1D6EC}_{(z,\unicode[STIX]{x1D703})},r)$ vanishes in the region of $F$ between the line $y=2r^{2}$ and the arc of the unit circle, permitting us to integrate over an unbounded rectangular region. The estimates (5.9) give

$$\begin{eqnarray}2\unicode[STIX]{x1D70B}\int _{8r^{2}}^{\infty }\biggl(1-\frac{2\sqrt{2}r}{\sqrt{y}}\biggr)\frac{dy}{y^{2}}\leqslant \int _{X_{2}}Q(\unicode[STIX]{x1D6EC},r)\,d\unicode[STIX]{x1D707}(\unicode[STIX]{x1D6EC})\leqslant 2\unicode[STIX]{x1D70B}\int _{2r^{2}}^{\infty }\biggl(1-\frac{\sqrt{2}r}{\sqrt{y}}\biggr)\frac{dy}{y^{2}},\end{eqnarray}$$

where the $2\unicode[STIX]{x1D70B}$ comes from integrating a constant function over the $\unicode[STIX]{x1D703}$ factor. Computing these integrals gives

$$\begin{eqnarray}\frac{\unicode[STIX]{x1D70B}}{12r^{2}}\leqslant \widehat{\unicode[STIX]{x1D707}}(S_{r})\leqslant \frac{\unicode[STIX]{x1D70B}}{3r^{2}}\quad \text{whenever}~r\geqslant 3^{-1/4}.\end{eqnarray}$$

This proves (5.1). ◻
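The two integrals evaluated above can also be checked numerically. The sketch below is only a sanity check and not part of the proof: the helper names `tail_integral` and `bounds`, the substitution $y=\text{lower}/u$, and the midpoint rule are our choices. Since $x$ ranges over an interval of length $1$, only the $y$-integral and the factor $2\unicode[STIX]{x1D70B}$ are reproduced.

```python
import math


def tail_integral(lower, b, steps=200_000):
    """Midpoint-rule value of the integral over [lower, inf) of
    (1 - b / sqrt(y)) / y**2 dy.

    The substitution y = lower / u maps [lower, inf) onto (0, 1] and turns
    the integral into (1 / lower) * int_0^1 (1 - (b / sqrt(lower)) * sqrt(u)) du.
    """
    ratio = b / math.sqrt(lower)
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        total += 1.0 - ratio * math.sqrt(u)
    return total * h / lower


def bounds(r):
    """Return the (lower, upper) bounds from the last display, numerically."""
    low = 2 * math.pi * tail_integral(8 * r * r, 2 * math.sqrt(2) * r)
    high = 2 * math.pi * tail_integral(2 * r * r, math.sqrt(2) * r)
    return low, high


if __name__ == "__main__":
    r = 1.5
    print(bounds(r), (math.pi / (12 * r * r), math.pi / (3 * r * r)))
```

In both integrals the ratio $b/\sqrt{\text{lower}}$ equals $1$, so the inner integral is $1-2/3=1/3$; this is where the constants $\unicode[STIX]{x1D70B}/12$ and $\unicode[STIX]{x1D70B}/3$ come from.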

6 Completion of the proof of Theorem 4.6

We now set up the proof of the general case ( $k\geqslant 2$ ) with some notation and remarks on Siegel sets. Then the proof will be given following two lemmas generalizing some statements from the proof of the two-dimensional case.

As before, we wish to write $Q(\unicode[STIX]{x1D6EC},r)$ introduced in (5.2) in terms of the coordinates of the Iwasawa decomposition of a representative $g\in G$ for $\unicode[STIX]{x1D6EC}=g\mathbb{Z}^{k}$ . We will assume $g$ lies in a subset of a particular Siegel set. Specifically, for elements of $G$ of the form

(6.1) $$\begin{eqnarray}n=\left[\begin{array}{@{}ccccc@{}}1 & \unicode[STIX]{x1D708}_{1,1} & \unicode[STIX]{x1D708}_{1,2} & \cdots \, & \unicode[STIX]{x1D708}_{1,k-1}\\ 0 & 1 & \unicode[STIX]{x1D708}_{2,1} & \cdots \, & \unicode[STIX]{x1D708}_{2,k-2}\\ 0 & 0 & 1 & \ddots & \vdots \\ \vdots & \vdots & \ddots & \ddots & \unicode[STIX]{x1D708}_{k-1,1}\\ 0 & 0 & \cdots \, & 0 & 1\end{array}\right],\quad a=\left[\begin{array}{@{}ccccc@{}}a_{1} & 0 & 0 & \cdots \, & 0\\ 0 & a_{2} & 0 & \cdots \, & 0\\ 0 & 0 & a_{3} & \ddots & \vdots \\ \vdots & \vdots & \ddots & \ddots & 0\\ 0 & 0 & \cdots \, & 0 & a_{k}\end{array}\right],\end{eqnarray}$$

and for $d,e\in \mathbb{R}$ , $c\in \mathbb{R}_{+}$ , define

$$\begin{eqnarray}\displaystyle A_{c} & := & \displaystyle \{a\in A:a_{j+1}\geqslant ca_{j}>0\hspace{5.69054pt}(j=1,\ldots ,k-1)\},\nonumber\\ \displaystyle N_{e,d} & := & \displaystyle \{n\in N:e\leqslant \unicode[STIX]{x1D708}_{i,j}\leqslant d~(1\leqslant i,j\leqslant k-1)\}.\nonumber\end{eqnarray}$$

Also write $K$ for $\operatorname{SO}(k)$. It is known that $KA_{1/2}N_{-1,0}$ is a ‘coarse fundamental domain’ for $\unicode[STIX]{x1D6E4}_{k}$ in $G_{k}$ (see [Mor15, § 19.4(ii), following Remark 7.3.4] and footnote 4). That is, $KA_{1/2}N_{-1,0}$ contains a fundamental domain for the right-action of $\unicode[STIX]{x1D6E4}_{k}$ on $G_{k}$, and it is covered by finitely many $\unicode[STIX]{x1D6E4}_{k}$-translates of that domain. Therefore $KA_{1}N_{-1,0}$ is contained in a coarse fundamental domain, and since we are interested in a lower bound for $\int _{X}Q(\unicode[STIX]{x1D6EC},r)\,d\unicode[STIX]{x1D707}(\unicode[STIX]{x1D6EC})$, it will suffice to bound the integral

(6.2) $$\begin{eqnarray}\int _{KA_{1}N_{-1,0}}Q(g\mathbb{Z}^{k},r)\,d\unicode[STIX]{x1D707}_{G}(g)\end{eqnarray}$$

from below.

For the purpose of the lower bound it will suffice to restrict ourselves to the subset of $KA_{1}N_{-1,0}$ with $a$ satisfying

(6.3) $$\begin{eqnarray}0<a_{1}\leqslant a_{2}\leqslant \cdots \leqslant a_{k-1}<2r\leqslant a_{k};\end{eqnarray}$$

as we will show, the integral over this set contains the highest-order term of (6.2) as a function of  $r$ .

Lemma 6.1. Suppose $a$ and $n$ are as in (6.1), and assume that $a$ satisfies (6.3). Then

(6.4) $$\begin{eqnarray}Q(na\mathbb{Z}^{k},r)=1-2ra_{1}\ldots a_{k-1}.\end{eqnarray}$$

Proof. The proof follows that of the two-dimensional case. Write $\unicode[STIX]{x1D6EC}=na\mathbb{Z}^{k}$ and let $\unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}:\mathbb{R}^{k}\rightarrow \mathbb{R}^{k}/\unicode[STIX]{x1D6EC}$ be the projection. Using (5.4), one can write

(6.5) $$\begin{eqnarray}Q(\unicode[STIX]{x1D6EC},r)=\unicode[STIX]{x1D706}\biggl(\unicode[STIX]{x1D70C}_{\unicode[STIX]{x1D6EC}}\biggl(\mathbb{R}^{k}\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\mathbf{v},r)\biggr)\biggr)=\unicode[STIX]{x1D706}\biggl(naI^{k}\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\mathbf{v},r)\biggr),\end{eqnarray}$$

where $I^{k}=[0,1]\times \cdots \times [0,1]$ , and  $\unicode[STIX]{x1D706}$ , as before, stands for both the normalized volume on $\unicode[STIX]{x1D70B}^{-1}(\unicode[STIX]{x1D6EC})$ and Lebesgue measure on $\mathbb{R}^{k}$ . Equation (6.3) implies

$$\begin{eqnarray}\mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\mathbf{v},r)=\mathbb{R}^{k-1}\times \mathop{\bigcup }_{\ell \in \mathbb{Z}}(\ell a_{k}-r,\ell a_{k}+r),\end{eqnarray}$$

so that the measure of $naI^{k}\smallsetminus \bigcup _{\mathbf{v}\in \unicode[STIX]{x1D6EC}}B(\mathbf{v},r)$ is the measure of a parallelepiped of dimensions $a_{1},a_{2},\ldots ,a_{k-1}$ and $a_{k}-2r$, precisely as in the two-dimensional case. In fact the figure used in the proof of the two-dimensional case is still illustrative: just replace the squares with hypercubes, let the $y$-axis stand for the $a_{k}$-axis, and let the $x$-axis stand for the hyperplane $a_{k}=0$. Since $a_{1}\cdots a_{k}=\det a=1$, this measure equals $a_{1}\cdots a_{k-1}(a_{k}-2r)=1-2ra_{1}\cdots a_{k-1}$, which is (6.4).◻

The next lemma will allow us to disregard the factor $K$ when estimating the integral (6.2).

Lemma 6.2. For $\unicode[STIX]{x1D705}\in K$ and $\unicode[STIX]{x1D6EC}\in X$ ,

$$\begin{eqnarray}Q(\unicode[STIX]{x1D6EC},k^{1/2}r)\leqslant Q(\unicode[STIX]{x1D705}\unicode[STIX]{x1D6EC},r)\leqslant Q(\unicode[STIX]{x1D6EC},k^{-1/2}r).\end{eqnarray}$$

Proof. If $P\subset \mathbb{R}^{k}$ is a fundamental parallelepiped for the action of $\unicode[STIX]{x1D6EC}$ on $\mathbb{R}^{k}$, (6.5) gives

$$\begin{eqnarray}Q(\unicode[STIX]{x1D705}\unicode[STIX]{x1D6EC},r)=\unicode[STIX]{x1D706}\biggl(\unicode[STIX]{x1D705}P\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}\{B(0,r)+\unicode[STIX]{x1D705}\mathbf{v}\}\biggr)=\unicode[STIX]{x1D706}\biggl(P\smallsetminus \mathop{\bigcup }_{\mathbf{v}\in \unicode[STIX]{x1D6EC}}\{\unicode[STIX]{x1D705}^{-1}B(0,r)+\mathbf{v}\}\biggr).\end{eqnarray}$$

But

$$\begin{eqnarray}B(0,rk^{-1/2})\subset \unicode[STIX]{x1D705}^{-1}B(0,r)\subset B(0,rk^{1/2}),\end{eqnarray}$$

so the result follows from another application of (6.5). ◻
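The inclusion used in this proof amounts to the inequality $k^{-1/2}\Vert \mathbf{v}\Vert \leqslant \Vert \unicode[STIX]{x1D705}\mathbf{v}\Vert \leqslant k^{1/2}\Vert \mathbf{v}\Vert$ for the supremum norm. The following is a minimal numerical sketch of that inequality, assuming $K=\operatorname{SO}(k)$ and sampling rotations as products of random plane rotations; the function names are ours and purely illustrative.

```python
import math
import random

def sup_norm(v):
    """Supremum norm of a vector given as a list of floats."""
    return max(abs(c) for c in v)

def rotate_plane(v, i, j, theta):
    """Apply the rotation by angle theta in the (i, j) coordinate plane."""
    w = list(v)
    c, s = math.cos(theta), math.sin(theta)
    w[i] = c * v[i] - s * v[j]
    w[j] = s * v[i] + c * v[j]
    return w

def random_rotation_image(v, rng, steps=25):
    """Image of v under a product of random plane rotations (assumed model of SO(k))."""
    w = list(v)
    for _ in range(steps):
        i, j = rng.sample(range(len(v)), 2)
        w = rotate_plane(w, i, j, rng.uniform(0.0, 2.0 * math.pi))
    return w

def distortion_range(k, trials=2000, seed=0):
    """Smallest and largest observed ratio ||kappa v|| / ||v|| in the sup norm."""
    rng = random.Random(seed)
    ratios = []
    for _ in range(trials):
        v = [rng.uniform(-1.0, 1.0) for _ in range(k)]
        w = random_rotation_image(v, rng)
        ratios.append(sup_norm(w) / sup_norm(v))
    return min(ratios), max(ratios)
```

For each $k$ the observed ratios should stay inside $[k^{-1/2},k^{1/2}]$, since rotations preserve the Euclidean norm and the Euclidean and supremum norms differ by at most a factor $k^{1/2}$.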

Now we are ready to write down the proof of (5.1) for the ( $k>2$ )-dimensional case.

Proof of (5.1) for $k>2$ .

Let $da,dn,d\unicode[STIX]{x1D705}$ denote Haar measures on $A$ , $N$ , and $K$ . Define

$$\begin{eqnarray}\unicode[STIX]{x1D702}:A\rightarrow \mathbb{R},\quad a=\operatorname{diag}(a_{1},\ldots ,a_{k})\mapsto \mathop{\prod }_{i<j}\frac{a_{i}}{a_{j}}.\end{eqnarray}$$

Then the Iwasawa decomposition identifies $\unicode[STIX]{x1D707}_{G}$ with the product measure $\unicode[STIX]{x1D702}(a)\,d\unicode[STIX]{x1D705}\,da\,dn$ (cf. [Reference Bekka and MayerBM00, V.2.4]). Recall that we aim to bound the integral (6.2) from below. Let us write $n^{a}=ana^{-1}$ for $n\in N$ , $a\in A$ . By decomposing $\unicode[STIX]{x1D707}_{G}$ as above and restricting the domain of integration, we have

$$\begin{eqnarray}\displaystyle \int _{KA_{1}N_{-1,0}}Q(g\mathbb{Z}^{k},r)\,d\unicode[STIX]{x1D707}_{G}(g) & = & \displaystyle \int _{KA_{1}N_{-1,0}}Q(\unicode[STIX]{x1D705}an\mathbb{Z}^{k},r)\unicode[STIX]{x1D702}(a)\,d\unicode[STIX]{x1D705}\,da\,dn\nonumber\\ \displaystyle & = & \displaystyle \int _{KA_{1}N_{-1,0}}Q(\unicode[STIX]{x1D705}n^{a}a\mathbb{Z}^{k},r)\unicode[STIX]{x1D702}(a)\,d\unicode[STIX]{x1D705}\,da\,dn\nonumber\\ \displaystyle & {\geqslant} & \displaystyle \int _{K}\int _{N_{-1,0}}\int _{\{a\in A_{1}:a_{k-1}\leqslant 2r\sqrt{k}\leqslant a_{k}\}}Q(\unicode[STIX]{x1D705}n^{a}a\mathbb{Z}^{k},r)\unicode[STIX]{x1D702}(a)\,da\,dn\,d\unicode[STIX]{x1D705}.\nonumber\end{eqnarray}$$

By Lemma 6.2, the latter integral is not smaller than

$$\begin{eqnarray}\int _{K}\int _{N_{-1,0}}\int _{\{a\in A_{1}:a_{k-1}\leqslant 2r\sqrt{k}\leqslant a_{k}\}}Q(n^{a}a\mathbb{Z}^{k},k^{1/2}r)\unicode[STIX]{x1D702}(a)\,da\,dn\,d\unicode[STIX]{x1D705},\end{eqnarray}$$

and by Lemma 6.1 this is the same as

$$\begin{eqnarray}\int _{K}\int _{N_{-1,0}}\int _{\{a\in A_{1}:a_{k-1}\leqslant 2r\sqrt{k}\leqslant a_{k}\}}(1-2rk^{1/2}a_{1}\ldots a_{k-1})\unicode[STIX]{x1D702}(a)\,da\,dn\,d\unicode[STIX]{x1D705}.\end{eqnarray}$$

Since this integrand depends only on $a$ , and the other factors have finite measure, it suffices to consider

$$\begin{eqnarray}\int _{\{a\in A_{1}:a_{k-1}\leqslant 2r\sqrt{k}\leqslant a_{k}\}}(1-2rk^{1/2}a_{1}\ldots a_{k-1})\unicode[STIX]{x1D702}(a)\,da.\end{eqnarray}$$

Finally we identify $da$ with Lebesgue measure (up to a constant) on $\mathbb{R}^{k-1}$ via

$$\begin{eqnarray}\operatorname{diag}(a_{1},\ldots ,a_{k})\mapsto (\log (a_{1}),\log (a_{2}),\ldots ,\log (a_{k-1})),\end{eqnarray}$$

see [Reference Bekka and MayerBM00, V.2.3] (cf. footnote 5). We are therefore left with the integral

$$\begin{eqnarray}\int _{b_{1}\leqslant b_{2}\leqslant \cdots \leqslant b_{k-1}\leqslant \log (2r\sqrt{k})\leqslant -\mathop{\sum }_{i=1}^{k-1}b_{i}}\biggl(1-2rk^{1/2}\exp \biggl[\mathop{\sum }_{i=1}^{k-1}b_{i}\biggr]\biggr)\exp \biggl[\mathop{\sum }_{i<j}(b_{i}-b_{j})\biggr]\,d\unicode[STIX]{x1D706},\end{eqnarray}$$

where the $b_{k}$ occurring in the exponent of the second factor of the integrand must be understood to stand for $-\sum _{i=1}^{k-1}b_{i}$ .
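With $b_{k}=-\sum _{i=1}^{k-1}b_{i}$ , the exponent $\sum _{i<j}(b_{i}-b_{j})$ collapses to $\sum _{i=1}^{k-1}2(k-i)b_{i}$ , which is the form used in the computation that follows. The sketch below checks this identity on integer inputs; the helper names are hypothetical and serve only as a sanity check of the algebra.

```python
def pairwise_exponent(b_head):
    """Sum over i < j of (b_i - b_j), where b_k := -(b_1 + ... + b_{k-1})."""
    b = list(b_head) + [-sum(b_head)]
    k = len(b)
    return sum(b[i] - b[j] for i in range(k) for j in range(i + 1, k))

def weighted_exponent(b_head):
    """Sum over i = 1..k-1 of 2 (k - i) b_i, i.e. sum of 2 i x_i with x_i = b_{k-i}."""
    k = len(b_head) + 1
    return sum(2 * (k - i) * b_head[i - 1] for i in range(1, k))
```

For instance, with $k=3$ and $(b_{1},b_{2})=(1,2)$ both functions are evaluated on the same input and can be compared directly.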

Now the challenge is not the integrand (which consists of nice exponential functions) but the domain of integration. Thankfully we only have to integrate over a piece of it, since we are interested in a lower bound. The piece we will consider is the following set:

(6.6) $$\begin{eqnarray}\bigg\{(b_{1},\ldots ,b_{k-1}):b_{i}\leqslant b_{i+1}\leqslant \frac{-\text{log}(2r\sqrt{k})}{k-1}~(1\leqslant i\leqslant k-2)\bigg\}.\end{eqnarray}$$

This set is clearly contained in the domain of integration above. Reordering the variables $x_{i}:=b_{k-i}$ , and using the identity $\sum _{i<j}(b_{i}-b_{j})=\sum _{i=1}^{k-1}2ix_{i}$ (with $b_{k}=-\sum _{i=1}^{k-1}b_{i}$ as above), we can compute the integral of $Q(a\mathbb{Z}^{k},\sqrt{k}r)\unicode[STIX]{x1D702}(a)$ over (6.6) as an iterated integral:

(6.7) $$\begin{eqnarray}\displaystyle & & \displaystyle \int _{-\infty }^{-\text{log}(2r\sqrt{k})/(k-1)}\int _{x_{k-1}}^{-\text{log}(2r\sqrt{k})/(k-1)}\cdots \int _{x_{2}}^{-\text{log}(2r\sqrt{k})/(k-1)}\nonumber\\ \displaystyle & & \displaystyle \quad \times \,(e^{\mathop{\sum }_{i=1}^{k-1}2ix_{i}}-2r\sqrt{k}\,e^{\mathop{\sum }_{i=1}^{k-1}(2i+1)x_{i}})\,dx_{1}\,dx_{2}\cdots dx_{k-1}.\end{eqnarray}$$

It is easily seen by induction that for $2\leqslant \ell \leqslant k-1$ ,

$$\begin{eqnarray}\displaystyle & & \displaystyle \int _{x_{\ell }}^{-\text{log}(2r\sqrt{k})/(k-1)}\int _{x_{\ell -1}}^{-\text{log}(2r\sqrt{k})/(k-1)}\cdots \int _{x_{2}}^{-\text{log}(2r\sqrt{k})/(k-1)}\nonumber\\ \displaystyle & & \displaystyle \quad \times \,(e^{\mathop{\sum }_{i=1}^{k-1}2ix_{i}}-2r\sqrt{k}\,e^{\mathop{\sum }_{i=1}^{k-1}(2i+1)x_{i}})\,dx_{1}\,dx_{2}\cdots dx_{\ell -1}\nonumber\end{eqnarray}$$

is a sum of terms of the form

$$\begin{eqnarray}c(2r\sqrt{k})^{-m/(k-1)}e^{\mathop{\sum }_{i=\ell }^{k-1}p_{i}x_{i}}\end{eqnarray}$$

where $c$ is a non-zero real constant, $m$ is an integer, the $p_{i}$ are positive integers, and $m+\sum _{i=\ell }^{k-1}p_{i}=k(k-1)$ . Indeed,

$$\begin{eqnarray}\displaystyle & & \displaystyle \int _{x_{\ell +1}}^{-\text{log}(2r\sqrt{k})/(k-1)}c(2r\sqrt{k})^{-m/(k-1)}e^{\mathop{\sum }_{i=\ell }^{k-1}p_{i}x_{i}}\,dx_{\ell }\nonumber\\ \displaystyle & & \displaystyle \quad =\frac{c}{p_{\ell }}(2r\sqrt{k})^{-(m+p_{\ell })/(k-1)}\exp \biggl[\mathop{\sum }_{i=\ell +1}^{k-1}p_{i}x_{i}\biggr]\nonumber\\ \displaystyle & & \displaystyle \qquad -\,\frac{c}{p_{\ell }}(2r\sqrt{k})^{-m/(k-1)}\exp \biggl[(p_{\ell }+p_{\ell +1})x_{\ell +1}+\mathop{\sum }_{i=\ell +2}^{k-1}p_{i}x_{i}\biggr],\nonumber\end{eqnarray}$$

so that we have only to notice that

$$\begin{eqnarray}(m+p_{\ell })+\mathop{\sum }_{i=\ell +1}^{k-1}p_{i}=m+\biggl[(p_{\ell }+p_{\ell +1})+\mathop{\sum }_{i=\ell +2}^{k-1}p_{i}\biggr]=m+\mathop{\sum }_{i=\ell }^{k-1}p_{i}=k(k-1)\end{eqnarray}$$

from the induction hypothesis. Thus (6.7) is a sum of terms of the form

$$\begin{eqnarray}\displaystyle \int _{-\infty }^{-\text{log}(2r\sqrt{k})/(k-1)}c(2r\sqrt{k})^{-m/(k-1)}e^{p_{k-1}x_{k-1}}\,dx_{k-1} & = & \displaystyle \frac{c}{p_{k-1}}(2r\sqrt{k})^{-(m+p_{k-1})/(k-1)}\nonumber\\ \displaystyle & = & \displaystyle \frac{c}{p_{k-1}}(2r\sqrt{k})^{-k(k-1)/(k-1)}\nonumber\\ \displaystyle & = & \displaystyle \frac{c}{p_{k-1}}(2r\sqrt{k})^{-k},\nonumber\end{eqnarray}$$

where we have used $m+p_{k-1}=k(k-1)$ . Since the integral is positive, the sum of the coefficients $c/p_{k-1}$ must be positive, and hence the integral is bounded below by a positive multiple of $r^{-k}$ . ◻
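The induction above can be carried out mechanically. The sketch below tracks terms $c\,R^{-m/(k-1)}e^{\sum p_{i}x_{i}}$ with $R=2r\sqrt{k}$ in exact rational arithmetic and returns the constant $C$ for which the iterated integral equals $CR^{-k}$ . It assumes that the coefficient of the second exponential is $2r\sqrt{k}$ , as in the integrand displayed before (6.6); the function name and data layout are ours, not part of the original argument.

```python
from fractions import Fraction

def lower_bound_coefficient(k):
    """Return C such that the iterated integral equals C * R**(-k), R = 2 r sqrt(k).

    A term (c, m, p) stands for c * R**(-m/(k-1)) * exp(sum_i p[i] * x_i),
    where p lists the exponents of the variables not yet integrated out.
    """
    terms = [
        # exp(sum 2 i x_i): c = 1, m = 0
        (Fraction(1), 0, tuple(2 * i for i in range(1, k))),
        # -R exp(sum (2i+1) x_i): c = -1, m = -(k-1)   (assumed coefficient R)
        (Fraction(-1), -(k - 1), tuple(2 * i + 1 for i in range(1, k))),
    ]
    # Integrate x_1, ..., x_{k-2}, each over [x_{l+1}, L] with exp(L) = R**(-1/(k-1)).
    for _ in range(k - 2):
        new_terms = []
        for c, m, p in terms:
            p0, rest = p[0], p[1:]
            # Upper endpoint: exp(p0 * L) contributes R**(-p0/(k-1)).
            new_terms.append((c / p0, m + p0, rest))
            # Lower endpoint: exp(p0 * x_{l+1}) merges into the next exponent.
            new_terms.append((-c / p0, m, (rest[0] + p0,) + rest[1:]))
        terms = new_terms
    # Last variable is integrated over (-infinity, L].
    total = Fraction(0)
    for c, m, p in terms:
        assert m + p[0] == k * (k - 1)  # invariant from the induction
        total += c / p[0]
    return total
```

For $k=3$ this gives $C=1/60$ , in agreement with a direct evaluation of the double integral; only the positivity of $C$ is used in the proof.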

7 Concluding remarks and open questions

7.1 The homogeneous problem

Here we return to the homogeneous case and discuss the approach to Question 1.4 suggested by the foregoing argument. Recall that $X=X_{k}=\operatorname{SL}_{k}(\mathbb{R})/\operatorname{SL}_{k}(\mathbb{Z})$ is the space of unimodular lattices in  $\mathbb{R}^{k}$ . Define

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}_{0}:X_{k}\rightarrow \mathbb{R},\quad \unicode[STIX]{x1D6EC}\mapsto \log \inf _{\mathbf{v}\in \unicode[STIX]{x1D6EC}\smallsetminus 0}\Vert \mathbf{v}\Vert ,\end{eqnarray}$$

and for $A\in M_{m,n}$ , define

$$\begin{eqnarray}\unicode[STIX]{x1D6EC}_{A}:=\biggl(\begin{array}{@{}cc@{}}I_{m} & A\\ 0 & I_{n}\end{array}\biggr)\mathbb{Z}^{m+n}\in X_{k},\end{eqnarray}$$

where $k=m+n$ . If we restrict the flow $g_{t}$ to $X_{k}$ , it is not difficult to show (see footnote 6) the following homogeneous version of Lemma 3.4.

Proposition 7.1. Fix positive integers $m,n$ , and let $\unicode[STIX]{x1D713}:[t_{0},\infty )\rightarrow (0,1)$ be continuous and non-increasing. Let $z=z_{\unicode[STIX]{x1D713}}$ be as in Lemma 3.1. Then $A\in D_{m,n}(\unicode[STIX]{x1D713})$ if and only if

$$\begin{eqnarray}\unicode[STIX]{x1D6E5}_{0}(g_{s}\unicode[STIX]{x1D6EC}_{A})<z_{\unicode[STIX]{x1D713}}(s)\end{eqnarray}$$

for all sufficiently large $s$ .

This way Question 1.4 reduces to a shrinking target problem for the flow $(X,g_{t})$ , where the targets are super-level sets $\unicode[STIX]{x1D6E5}_{0}^{-1}([z,\infty ))$ . But the family of super-level sets of $\unicode[STIX]{x1D6E5}_{0}$ differs in important ways from the family of super-level sets of  $\unicode[STIX]{x1D6E5}$ . In particular, by Minkowski’s theorem, $\unicode[STIX]{x1D6E5}_{0}^{-1}[z,\infty )$ is empty for $z>0$ . Hence the problem reduces to the case where the values $z_{\unicode[STIX]{x1D713}}(t)$ accumulate at 0, so that the targets shrink to the set $\unicode[STIX]{x1D6E5}_{0}^{-1}(0)$ . The latter set is a union of finitely many compact submanifolds of $X$ whose structure is explicitly described by the Hajós–Minkowski theorem (see [Reference CasselsCas71, § XI.1.3] or [Reference ShahSha10, Theorem 2.3]). In particular, the function $\unicode[STIX]{x1D6E5}_{0}$ is not DL, and Theorem 4.4 is not applicable. Other approaches to shrinking target problems on homogeneous spaces [Reference KelmerKel17, Reference Kelmer and YuKY19, Reference Kleinbock and ZhaoKZ18, Reference MaucourantMau06] also do not seem to be directly applicable.
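To illustrate the consequence of Minkowski's theorem mentioned above, the following sketch evaluates $\unicode[STIX]{x1D6E5}_{0}$ along an orbit in the case $m=n=1$ by brute force. It assumes that $g_{s}=\operatorname{diag}(e^{s},e^{-s})$ , which is our reading of the flow and is not restated in this section, so that the vectors of $g_{s}\unicode[STIX]{x1D6EC}_{\unicode[STIX]{x1D6FC}}$ are $(e^{s}(p+\unicode[STIX]{x1D6FC}q),e^{-s}q)$ ; the function name is hypothetical.

```python
import math

def delta0_on_orbit(alpha, s):
    """log of the shortest sup-norm length of a non-zero vector of g_s Lambda_alpha.

    Assumption: m = n = 1 and g_s = diag(e^s, e^{-s}).
    """
    expand, contract = math.exp(s), math.exp(-s)
    best = expand  # the vector with q = 0, p = 1
    # A vector of sup norm <= 1 needs |q| <= e^s; by symmetry take q >= 1.
    for q in range(1, int(math.floor(expand)) + 1):
        p = -round(alpha * q)  # nearest-integer choice minimises |p + alpha q|
        length = max(expand * abs(p + alpha * q), contract * q)
        best = min(best, length)
    return math.log(best)
```

Since every unimodular lattice contains a non-zero vector of supremum norm at most 1, the returned value should never be positive; how close it comes to 0 along the orbit is what the shrinking target problem measures.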

On the other hand, the one-dimensional case ( $m=n=1$ ) has been completely settled in [Reference Kleinbock and WadleighKW18]. In particular, the following zero–one law has been established.

Theorem 7.2 [Reference Kleinbock and WadleighKW18, Theorem 1.8].

Let $\unicode[STIX]{x1D713}:[t_{0},\infty )\rightarrow \mathbb{R}_{+}$ be non-increasing, and suppose the function $t\mapsto t\unicode[STIX]{x1D713}(t)$ is non-decreasing and

(7.1) $$\begin{eqnarray}t\unicode[STIX]{x1D713}(t)<1\quad \text{for all}~t\geqslant t_{0}.\end{eqnarray}$$

If

(7.2) $$\begin{eqnarray}\mathop{\sum }_{i}\frac{-\text{log}(1-i\unicode[STIX]{x1D713}(i))(1-i\unicode[STIX]{x1D713}(i))}{i}=\infty \quad (\text{respectively}<\infty ),\end{eqnarray}$$

then the Lebesgue measure of $D_{1,1}(\unicode[STIX]{x1D713})$ (respectively, of $D_{1,1}(\unicode[STIX]{x1D713})^{c}$ ) is zero.

The proof is based on the observation that the condition $\unicode[STIX]{x1D6FC}\in D_{1,1}(\unicode[STIX]{x1D713})$ can be explicitly described in terms of the continued fraction expansion of  $\unicode[STIX]{x1D6FC}$ . However, this phenomenon is inherently one-dimensional, and new ideas are needed to settle the general case.
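To get a feeling for the criterion (7.2), one can compare partial sums for two sample functions. The sketch below uses $\unicode[STIX]{x1D713}(t)=c/t$ and $\unicode[STIX]{x1D713}(t)=(1-at^{-\unicode[STIX]{x1D70F}})/t$ as hypothetical examples (with $c=1/2$ and $a=\unicode[STIX]{x1D70F}=1$ as default parameters); partial sums only suggest, and of course do not prove, divergence or convergence.

```python
import math

def summand(psi, i):
    """The i-th term of the series (7.2)."""
    u = 1.0 - i * psi(i)
    return -math.log(u) * u / i

def partial_sum(psi, start, stop):
    """Sum of the terms of (7.2) for start <= i <= stop."""
    return sum(summand(psi, i) for i in range(start, stop + 1))

def psi_constant(t, c=0.5):
    """Hypothetical example psi(t) = c / t with 0 < c < 1."""
    return c / t

def psi_power(t, a=1.0, tau=1.0):
    """Hypothetical example psi(t) = (1 - a t^{-tau}) / t, used here for t >= 2."""
    return (1.0 - a * t ** (-tau)) / t
```

For $\unicode[STIX]{x1D713}(t)=c/t$ the terms are a constant multiple of $1/i$ , so the series diverges; for the second example with $a=\unicode[STIX]{x1D70F}=1$ the terms equal $\log (i)/i^{2}$ , so the series converges.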

7.2 Hausdorff dimension

A sequel [Reference Hussain, Kleinbock, Wadleigh and WangHKWW18] to the paper [Reference Kleinbock and WadleighKW18] computes the Hausdorff dimension of limsup sets $D_{1,1}(\unicode[STIX]{x1D713})^{c}$ , and, more generally, establishes zero–infinity laws for the Hausdorff measure of those sets. For example, it is proved there that

$$\begin{eqnarray}\dim (D_{1,1}(\unicode[STIX]{x1D713})^{c})=\frac{2}{2+\unicode[STIX]{x1D70F}}\quad \text{when}~\unicode[STIX]{x1D713}(t)=\frac{1-at^{-\unicode[STIX]{x1D70F}}}{t}~(a>0,\unicode[STIX]{x1D70F}>0).\end{eqnarray}$$

One can ask similar questions for higher-dimensional versions, both in homogeneous and inhomogeneous settings. Even the $m=n=1$ case of the inhomogeneous problem is open.

7.3 Singly versus doubly metric problems

The main result of the present paper computes the Lebesgue measure of the set $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})\subset M_{m,n}\times \mathbb{R}^{m}$ . As often happens in inhomogeneous Diophantine problems, one can fix either $A$ or $\mathbf{b}$ and ask for the Lebesgue (or Hausdorff) measure of the corresponding slices of $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ . It seems plausible that the convergence/divergence of the same series (1.6) is responsible for a full/zero measure dichotomy for the slices

$$\begin{eqnarray}\{A\in M_{m,n}:(A,\mathbf{b})\in \widehat{D}_{m,n}(\unicode[STIX]{x1D713})\}\end{eqnarray}$$

for any fixed $\mathbf{b}\notin \mathbb{Z}^{m}$ . On the other hand, the Lebesgue measure of the set

$$\begin{eqnarray}\{\mathbf{b}\in \mathbb{R}^{m}:(A,\mathbf{b})\in \widehat{D}_{m,n}(\unicode[STIX]{x1D713})\}\end{eqnarray}$$

for a fixed $A\in M_{m,n}$ seems to depend heavily on Diophantine properties of  $A$ . For example, if $A$ has rational entries, then $(A,\mathbf{b})$ is not in $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ whenever $\mathbf{b}\notin \mathbb{Q}^{m}$ and $\unicode[STIX]{x1D713}(T)\rightarrow 0$ as $T\rightarrow \infty$ . And on the other end of the approximation spectrum, if $A$ is badly approximable it is easy to see that there exists $C>0$ such that for all $\mathbf{b}\in \mathbb{R}^{m}$ , $(A,\mathbf{b})$ belongs to the (null) set $\widehat{D}_{m,n}(C\unicode[STIX]{x1D713}_{1})$ . Indeed, by the classical Dani correspondence, $A$ is badly approximable if and only if the trajectory $\{g_{t}\unicode[STIX]{x1D6EC}_{A}:t>0\}$ is bounded in $X_{k}$ , which is the case if and only if $\{g_{t}\unicode[STIX]{x1D6EC}_{A,\mathbf{b}}:t>0\}$ is bounded in $\widehat{X}_{k}$ for any $\mathbf{b}\in \mathbb{R}^{m}$ . Thus the claim follows in view of Lemma 2.2. It would be interesting to describe, for a given arbitrary non-increasing function  $\unicode[STIX]{x1D713}$ , explicit Diophantine conditions on $A\in M_{m,n}$ guaranteeing that $(A,\mathbf{b})\in \widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ for all (or almost all) $\mathbf{b}\in \mathbb{R}^{m}$ .

7.4 Eventually always hitting

Finally, let us connect our results on improving the inhomogeneous Dirichlet theorem with a shrinking target property introduced recently by Kelmer [Reference KelmerKel17]. We start by setting some notation. Let $\unicode[STIX]{x1D6FC}$ be a measure-preserving $\mathbb{Z}^{n}$ -action on a probability space $(Y,\unicode[STIX]{x1D708})$ . For any $N\in \mathbb{N}$ denote

$$\begin{eqnarray}D_{N}:=\{\mathbf{q}\in \mathbb{Z}^{n}:\Vert \mathbf{q}\Vert \leqslant N\}\end{eqnarray}$$

(here, as before, $\Vert \cdot \Vert$ stands for the supremum norm). Then, given a nested family ${\mathcal{B}}=\{B_{N}:N\in \mathbb{N}\}$ of subsets of $Y$ , let us say that the $\unicode[STIX]{x1D6FC}$ -orbit of a point $x\in Y$ eventually always hits ${\mathcal{B}}$ if $\unicode[STIX]{x1D6FC}(D_{N})x\cap B_{N}\neq \varnothing$ for all sufficiently large $N\in \mathbb{N}$ . Following [Reference KelmerKel17], denote by ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ the set of points of $Y$ with $\unicode[STIX]{x1D6FC}$ -orbits eventually always hitting  ${\mathcal{B}}$ . This is a liminf set with a rather complicated structure. In [Reference KelmerKel17] sufficient conditions for sets ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ to be of full measure were found for unipotent and diagonalizable actions $\unicode[STIX]{x1D6FC}$ on hyperbolic manifolds. Namely, it was shown (see [Reference KelmerKel17, Theorem 22 and Proposition 24] and footnote 7) that for rotation-invariant monotonically shrinking families  ${\mathcal{B}}$ , $\unicode[STIX]{x1D708}({\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}}))=1$ if the series

(7.3) $$\begin{eqnarray}\mathop{\sum }_{j}\frac{1}{2^{nj}\unicode[STIX]{x1D708}(B_{2^{j}})}\end{eqnarray}$$

converges. See also [Reference Kelmer and YuKY19] for some extensions to actions on homogeneous spaces of semisimple Lie groups. However, to the best of the authors’ knowledge, there are no non-trivial examples of measure-preserving systems for which necessary and sufficient conditions for sets ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ to be of full measure exist in the literature.

Now, given $A\in M_{m,n}$ , take $Y=\mathbb{T}^{m}$ with normalized Lebesgue measure $\unicode[STIX]{x1D708}$ and consider the $\mathbb{Z}^{n}$ -action

(7.4) $$\begin{eqnarray}\mathbf{x}\mapsto \unicode[STIX]{x1D6FC}(\mathbf{q})\mathbf{x}:=\mathbf{x}+A\mathbf{q}\quad \text{mod}~\mathbb{Z}^{m}\end{eqnarray}$$

on $Y$ (generated by $n$ independent rotations of $\mathbb{T}^{m}$ by the column vectors of  $A$ ). Also fix $\mathbf{y}\in Y$ and a non-increasing sequence $\{r(N):N\in \mathbb{N}\}$ of positive numbers, and consider the family ${\mathcal{B}}$ of open balls

(7.5) $$\begin{eqnarray}B_{N}:=\{\mathbf{x}\in \mathbb{T}^{m}:\Vert \mathbf{x}-\mathbf{y}\Vert <r(N)\}.\end{eqnarray}$$

Then it is easy to see that $\mathbf{x}\in {\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ if and only if for all sufficiently large $N\in \mathbb{N}$ there exist $\mathbf{q}\in \mathbb{Z}^{n}$ and $\mathbf{p}\in \mathbb{Z}^{m}$ such that

(7.6) $$\begin{eqnarray}\Vert \mathbf{q}\Vert <N+1\quad \text{and}\quad \Vert \mathbf{x}+A\mathbf{q}-\mathbf{p}-\mathbf{y}\Vert <r(N).\end{eqnarray}$$
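In the simplest case $m=n=1$ the condition (7.6) can be tested directly for finitely many $N$ . The following sketch does this for a circle rotation; the function names, the sample parameters and the choice $r(N)=N^{-1/2}$ in the usage note are ours, and a finite check of this kind says nothing rigorous about the eventual behaviour.

```python
import math

def dist_to_integers(t):
    """Distance from the real number t to the nearest integer."""
    return abs(t - round(t))

def hits_target(alpha, x, y, radius, N):
    """True if some integer q with |q| <= N satisfies ||x + alpha q - y|| < radius(N)."""
    return any(
        dist_to_integers(x + alpha * q - y) < radius(N)
        for q in range(-N, N + 1)
    )

def first_failure(alpha, x, y, radius, N_max):
    """Smallest N in 1..N_max at which the orbit misses B_N, or None if there is none."""
    for N in range(1, N_max + 1):
        if not hits_target(alpha, x, y, radius, N):
            return N
    return None
```

For example, with $\unicode[STIX]{x1D6FC}=(\sqrt{5}-1)/2$ , $x=0$ , $y=0.3$ and $r(N)=N^{-1/2}$ no failure occurs for $N\leqslant 50$ , whereas for the rational rotation $\unicode[STIX]{x1D6FC}=1/2$ with $y=1/4$ and constant radius $0.1$ the orbit $\{0,1/2\}$ never enters the target.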

Here and hereafter $\unicode[STIX]{x1D6FC}$ and $A$ are related via (7.4). A connection to the improvement of the inhomogeneous Dirichlet theorem is now straightforward. Indeed, from Theorem 1.6 one can derive the following corollary.

Corollary 7.3. Fix $\mathbf{y}\in \mathbb{T}^{m}$ and let ${\mathcal{B}}=\{B_{N}:N\in \mathbb{N}\}$ be as in (7.5), where $\{r(N):N\in \mathbb{N}\}$ is a non-increasing sequence of positive numbers. Then for Lebesgue-almost every $A\in M_{m,n}$ the set ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ has zero (respectively, full) measure provided the sum (7.3) diverges (respectively, converges).

Proof. Extend $r(\cdot )$ to a non-increasing continuous function on $\mathbb{R}_{+}$ in an arbitrary way (for example, piecewise linearly). Then, similarly to the observation made after (1.4), one can notice that $\mathbf{x}\in {\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ if and only if the system (7.6) is solvable in integers $\mathbf{p},\mathbf{q}$ for all sufficiently large $N\in \mathbb{R}_{+}$ . The latter happens if and only if the pair $(A,\mathbf{x}-\mathbf{y})$ belongs to $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ , where

$$\begin{eqnarray}\unicode[STIX]{x1D713}(T):=r(T^{1/n}-1)^{m}.\end{eqnarray}$$

In view of Theorem 1.6, the divergence of the sum

$$\begin{eqnarray}\displaystyle \mathop{\sum }_{j}\frac{1}{\unicode[STIX]{x1D713}(j)j^{2}} & = & \displaystyle \mathop{\sum }_{j}\frac{1}{r(j^{1/n}-1)^{m}j^{2}}\asymp \int \frac{dx}{r(x^{1/n}-1)^{m}x^{2}}\asymp \int \frac{(y+1)^{n-1}\,dy}{r(y)^{m}(y+1)^{2n}}\nonumber\\ \displaystyle & \asymp & \displaystyle \int \frac{dy}{r(y)^{m}y^{n+1}}\asymp \int \frac{2^{z}\,dz}{r(2^{z})^{m}2^{z(n+1)}}\asymp \mathop{\sum }_{j}\frac{1}{r(2^{j})^{m}2^{nj}}\asymp \mathop{\sum }_{j}\frac{1}{2^{nj}\unicode[STIX]{x1D708}(B_{2^{j}})}\nonumber\end{eqnarray}$$

implies that $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ has measure zero. Hence for almost every $A$ the set ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ is null. Similarly, the convergence of (7.3) implies that $\widehat{D}_{m,n}(\unicode[STIX]{x1D713})$ is conull. Thus for Lebesgue-generic $A$ the set ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ has full measure.◻
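As an illustration of Corollary 7.3, consider the hypothetical power-law radii $r(N)=N^{-\unicode[STIX]{x1D6FE}}$ . Then $\unicode[STIX]{x1D708}(B_{N})=(2r(N))^{m}$ as soon as $r(N)\leqslant 1/2$ , so the terms of (7.3) are eventually geometric with ratio $2^{\unicode[STIX]{x1D6FE}m-n}$ , and the series converges exactly when $\unicode[STIX]{x1D6FE}<n/m$ . The sketch below computes these terms; all names are ours.

```python
def ball_measure(radius, m):
    """Normalised Lebesgue measure of an open sup-norm ball of given radius in T^m."""
    return min(1.0, (2.0 * radius) ** m)

def dyadic_term(r, m, n, j):
    """The j-th term 1 / (2^{nj} nu(B_{2^j})) of the series (7.3)."""
    return 1.0 / (2.0 ** (n * j) * ball_measure(r(2 ** j), m))

def power_law_radius(gamma):
    """Hypothetical radius sequence r(N) = N^{-gamma}."""
    return lambda N: float(N) ** (-gamma)
```

With $m=n=1$ , the choice $\unicode[STIX]{x1D6FE}=2$ gives consecutive term ratio 2 (divergence, so ${\mathcal{A}}_{\mathbf{a}\mathbf{h}}^{\unicode[STIX]{x1D6FC}}({\mathcal{B}})$ is null for almost every $A$ ), while $\unicode[STIX]{x1D6FE}=1/2$ gives ratio $2^{-1/2}$ (convergence, hence full measure for almost every $A$ ).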

Acknowledgements

The authors would like to thank Alexander Gorodnik, Dubi Kelmer and Shucheng Yu for helpful discussions, and the anonymous referee for useful comments.

Footnotes

The first-named author was supported by NSF grants DMS-1101320 and DMS-1600814.

1 This definition essentially coincides with the one given in [Reference Kleinbock and MargulisKM99] but differs slightly from other sources, such as [Reference Beresnevich, Dickinson and VelaniBDV06, § 13], where the inequality $\Vert A\mathbf{q}-\mathbf{p}\Vert <\Vert \mathbf{q}\Vert \unicode[STIX]{x1D713}(\Vert \mathbf{q}\Vert )$ is used instead of (1.3).

2 The monotonicity condition can be removed unless $m=n=1$ .

3 Specifically we need $\sqrt{2}r\geqslant 1$ , which is already covered by (5.8).

4 Our definition is that of [Reference MooreMoo66] post-composed with $g\mapsto g^{-1}$ , since our action is on the right.

5 Our identification is theirs composed with a linear isomorphism of $\mathbb{R}^{k-1}$ .

6 See [Reference Kleinbock and WadleighKW18, Proposition 4.5], though notice that the function used there differs from $\unicode[STIX]{x1D6E5}_{0}$ by a minus sign.

7 Note that Kelmer considered the eventually always hitting property for forward orbits, that is, with sets $D_{N}^{+}:=\{\mathbf{q}\in \mathbb{Z}^{n}:q_{i}\geqslant 0,\Vert \mathbf{q}\Vert \leqslant N\}$ in place of  $D_{N}$ .

References

Athreya, J., Random affine lattices, Contemp. Math. 639 (2015), 169–174, doi:10.1090/conm/639/12793.
Bekka, M. B. and Mayer, M., Ergodic theory and topological dynamics of group actions on homogeneous spaces, London Mathematical Society Lecture Note Series, vol. 269 (Cambridge University Press, Cambridge, 2000), doi:10.1017/CBO9780511758898.
Beresnevich, V., Dickinson, D. and Velani, S., Measure theoretic laws for lim sup sets, Mem. Amer. Math. Soc. 179 (2006).
Brezin, J. and Moore, C. C., Flows on homogeneous spaces: a new look, Amer. J. Math. 103 (1981), 571–613, doi:10.2307/2374105.
Cassels, J. W. S., An introduction to Diophantine approximation, Cambridge Tracts, vol. 45 (Cambridge University Press, Cambridge, 1957).
Cassels, J. W. S., An introduction to the geometry of numbers, Grundlehren der mathematischen Wissenschaften, Band 99 (Springer, Berlin, 1971).
Dani, S. G., Divergent trajectories of flows on homogeneous spaces and Diophantine approximation, J. Reine Angew. Math. 359 (1985), 55–89.
Davenport, H. and Schmidt, W. M., Dirichlet’s theorem on diophantine approximation. II, Acta Arith. 16 (1969/1970), 413–424, doi:10.4064/aa-16-4-413-424.
Davenport, H. and Schmidt, W. M., Dirichlet’s theorem on diophantine approximation, Symposia Mathematica, vol. IV (Academic, 1970).
Edwards, S., The rate of mixing for diagonal flows on spaces of affine lattices, Preprint (2013), http://uu.diva-portal.org/smash/get/diva2:618047/FULLTEXT01.pdf.
Einsiedler, M. and Tseng, J., Badly approximable systems of affine forms, fractals, and Schmidt games, J. Reine Angew. Math. 660 (2011), 83–97.
Ghosh, A., Gorodnik, A. and Nevo, A., Best possible rates of distribution of dense lattice orbits in homogeneous spaces, J. Reine Angew. Math. 745 (2018), 155–188, doi:10.1515/crelle-2016-0001.
Gorodnik, A. and Vishe, P., Simultaneous Diophantine approximation – logarithmic improvements, Trans. Amer. Math. Soc. 370 (2018), 487–507, doi:10.1090/tran/6953.
Groshev, A. V., Une théorème sur les systèmes des formes linéaires, Dokl. Akad. Nauk SSSR 9 (1938), 151–152.
Howe, R. and Tan, E.-C., Nonabelian harmonic analysis. Applications of SL(2, ℝ), Universitext (Springer, New York, 1992).
Hussain, M., Kleinbock, D., Wadleigh, N. and Wang, B.-W., Hausdorff measure of sets of Dirichlet non-improvable numbers, Mathematika 64 (2018), 502–518, doi:10.1112/S0025579318000074.
Katok, A. and Spatzier, R., First cohomology of Anosov actions of higher rank Abelian groups and applications to rigidity, Publ. Math. Inst. Hautes Études Sci. 79 (1994), 131–156, doi:10.1007/BF02698888.
Kelmer, D., Shrinking targets for discrete time flows on hyperbolic manifolds, Geom. Funct. Anal. 27 (2017), 1257–1287, doi:10.1007/s00039-017-0421-z.
Kim, D. H. and Liao, L., Dirichlet uniformly well-approximated numbers, Int. Math. Res. Not. IMRN, rny015, doi:10.1093/imrn/rny015.
Kleinbock, D., Badly approximable systems of affine forms, J. Number Theory 79 (1999), 83–102, doi:10.1006/jnth.1999.2419.
Kleinbock, D. and Margulis, G. A., Logarithm laws for flows on homogeneous spaces, Invent. Math. 138 (1999), 451–494, doi:10.1007/s002220050350.
Kleinbock, D. and Margulis, G. A., Erratum to: Logarithm laws for flows on homogeneous spaces, Invent. Math. 211 (2018), 855–862, doi:10.1007/s00222-017-0751-3.
Kleinbock, D. and Wadleigh, N., A zero-one law for improvements to Dirichlet’s theorem, Proc. Amer. Math. Soc. 146 (2018), 1833–1844, doi:10.1090/proc/13685.
Kleinbock, D. and Weiss, B., Dirichlet’s theorem on diophantine approximation and homogeneous flows, J. Mod. Dyn. 4 (2008), 43–62.
Kelmer, D. and Yu, S., Shrinking target problems for flows on homogeneous spaces, Trans. Amer. Math. Soc., doi:10.1090/tran/7783.
Kleinbock, D. and Zhao, X., An application of lattice points counting to shrinking target problems, Discrete Contin. Dyn. Syst. 38 (2018), 155–168, doi:10.3934/dcds.2018007.
Margulis, G. A., Dynamical and ergodic properties of subgroup actions on homogeneous spaces with applications to number theory, in Proceedings of the International Congress of Mathematicians, Vol. I, II (Kyoto, 1990) (Mathematical Society of Japan, Tokyo, 1991), 193–215.
Maucourant, F., Dynamical Borel–Cantelli lemma for hyperbolic spaces, Israel J. Math. 152 (2006), 143–155, doi:10.1007/BF02771980.
Moore, C. C., Ergodicity of flows on homogeneous spaces, Amer. J. Math. 88 (1966), 154–178, doi:10.2307/2373052.
Morris, D. W., Introduction to arithmetic groups (Deductive Press, 2015).
Shah, N., Expanding translates of curves and Dirichlet–Minkowski theorem on linear forms, J. Amer. Math. Soc. 23 (2010), 563–589, doi:10.1090/S0894-0347-09-00657-2.
Shapira, U., A solution to a problem of Cassels and Diophantine properties of cubic numbers, Ann. of Math. (2) 173 (2011), 543–557, doi:10.4007/annals.2011.173.1.11.
Sprindžuk, V., Metric theory of Diophantine approximations (V. H. Winston and Sons, Washington DC, 1979), 45–48.
Waldschmidt, M., Recent advances in Diophantine approximation, in Number theory, analysis and geometry (Springer, New York, 2012), 659–704.