1. Introduction
1.1. Statement of results
In this paper, let $\rho :\mathbb {Z}^r\to \operatorname {GL}_d(\mathbb {Z})=\operatorname {Aut}(\mathbb {T}^d)$ be a group morphism and denote indifferently by $\rho $ the group action it induces on $\mathbb {T}^d$ . Our main result is the following theorem.
Theorem 1.1. If an action $\rho :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ by toral automorphisms contains no hyperbolic automorphisms, then for any $\tau>0$ , there exists an action $\alpha :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ by $C^1$ -diffeomorphisms such that:
-
(1) $d_{C^1}(\alpha ,\rho )<\tau $ ;
-
(2) $\alpha ^{\mathbf {n}}=\widetilde {H}\circ \rho \circ \widetilde {H}^{-1}$ for a homeomorphism $\widetilde {H}:\mathbb {T}^d\to \mathbb {T}^d$ that is homotopic to $\mathrm {id}$ ;
-
(3) neither $\widetilde {H}$ nor $\widetilde {H}^{-1}$ is differentiable.
Here the $C^1$ -distance $d_{C^1}$ between two actions is defined as $d_{C^1}(\alpha ,\rho )=\max _{\mathbf {n}\in \Xi }\|\alpha ^{\mathbf {n}}-\rho ^{\mathbf {n}}\|_{C^1}$ , where $\Xi \in \mathbb {Z}^r$ is the generating set
with $\mathbf {e}_i$ being the ith coordinate vector.
Definition 1.2. [Reference Damjanović and KatokDK10, Section 1.3.2]
An action $\rho :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ by toral automorphisms is genuinely partially hyperbolic if $\rho $ is ergodic with respect to the Haar measure on $\mathbb {T}^d$ , but $\rho ^{\mathbf {n}}$ is not hyperbolic for any $\mathbf {n}$ .
As remarked in [Reference Damjanović and KatokDK10], a genuinely partially hyperbolic action contains an element which has no root of unity among its eigenvalues, or equivalently an ergodic toral automorphim.
Corollary 1.3. Suppose $\rho :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ is a genuinely partially hyperbolic action by toral automorphisms. Then for any $\tau>0$ , there exists an action $\alpha :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ by $C^1$ -diffeomorphisms such that:
-
(1) $d_{C^1}(\alpha ,\rho )<\tau $ ;
-
(2) $\alpha $ and $\rho $ are not $C^1$ -conjugate.
Corollary 1.3 is deduced from Theorem 1.1 through a standard argument.
Proof. Let $\alpha $ be given by Theorem 1.1 and assume $\widetilde {G}:\mathbb {T}^d\to \mathbb {T}^d$ is a $C^1$ diffeomorphism such that $\alpha ^{\mathbf {n}}\circ \widetilde {G}=\widetilde {G}\circ \rho ^{\mathbf {n}}$ for all $\mathbf {n}\in \mathbb {Z}^d$ . Then $G:=\widetilde {H}^{-1}\circ \widetilde {G}$ is a homeomorphism of $\mathbb {T}^d$ such that
Since at least one of the $\rho ^{\mathbf {n}}$ is an ergodic toral automorphism, G is affine by [Reference WaltersW70, Corollary 2]. So $\widetilde {G}=\widetilde {H}\circ G$ cannot be $C^1$ because $\widetilde {H}$ is not, which contradicts our assumption.
1.2. Background
Faithful linear actions by higher rank abelian groups on tori and nilmanifolds, that is, $\mathbb {Z}^r$ -actions generated by automorphisms where $r\geq 2$ , have since been long expected to be rigid, in the following sense: under some additional assumptions, a smooth action $\alpha $ in the same homotopy class should be smoothly conjugated to the linear action itself, which we denote by $\rho $ . The issue we address in this paper is whether the conjugacy, denoted by h, should have the same smoothness as $\alpha $ .
One important rigidity phenomenon is the local rigidity of the actions $\rho $ described above, which stands for rigidity under perturbative assumptions. An action $\rho $ is said to be $C^{l,m,n}$ -locally rigid if all $C^l$ -actions that are sufficiently close to $\rho $ in $C^m$ topology are $C^n$ -conjugate to $\rho $ . For Cartan actions (that is, faithful linear actions by $\mathbb {Z}^r$ with the largest possible r, modulo restriction to a finite index subgroup) on tori, $C^{\infty ,1,\infty }$ local rigidity was proved by Katok and Lewis [Reference Katok and LewisKL91]. For some more general classes of hyperbolic actions, $C^{\infty ,1,\infty }$ local rigidity was proved by Katok and Spatzier [Reference Katok and SpatzierKS94, Reference Katok and SpatzierKS97] and Einsiedler and Fisher [Reference Einsiedler and FisherEF07]. For global rigidity see [Reference FranksF69], [Reference Fisher, Kalinin and SpatzierFKS11], [Reference Fisher, Kalinin and SpatzierFKS13] and [Reference Rodriguez HertzRH07]. Damjanović and Katok [Reference Damjanović and KatokDK10] proved $C^{\infty ,r,\infty }$ local rigidity for genuinely partially hyperbolic $\mathbb {Z}^r$ -actions by toral automorphisms by the Kolmogorov–Arnold–Moser (KAM) method. For finitely differentiable actions, $C^{l,1,l}$ is not expected to follow from KAM methods because of the loss of regularity when solving a cocycle equation of the form (2.1) below. When $r=1$ , that is, for the dynamics of a single toral automorphism A of $\mathbb {T}^d$ that is partially hyperbolic, such loss of regularity in the cocycle equation was discussed by Veech in [Reference VeechV86], where it was shown that, although the cocycle equation $g\circ A-A\circ g=f$ can be solved in $C^n$ if $f\in C^l$ and $n<l-d$ , there exists a $C^1$ -function f for which the equation has no $C^1$ -solutions.
Section 3 of this paper will describe similar loss of regularity when solving the cocycle equation for general genuinely partially hyperbolic $\mathbb {Z}^r$ -actions by toral automorphisms. In §2, we propose a reversed KAM scheme that allows an accumulation of such losses at certain sequences of periodic points, which leads to the failure of $C^{1,1,1}$ -rigidity in Theorem 1.1.
1.3. Notation
In the rest of this paper:
-
• $\rho $ will be fixed;
-
• all implicit constants in expression of the forms $X\ll Y$ and $X=O(Y)$ will be assumed to be dependent on r, d, $\rho $ , and $\Xi $ , but independent of other variables;
-
• $e(t)$ will denote the function $e^{2\pi i t}$ .
2. The inductive scheme
2.1. Sequence of conjugacies
We employ a reversed KAM scheme to construct a counterexample. A sequence of conjugacies $H_m$ will be constructed in later sections, where $H_m=\mathrm {id}+h_m$ for a sequence of $C^\infty $ smooth functions $h_m:\mathbb {T}^d\to \mathbb {R}^d$ that are small in $C^1$ norm. Inductively define
and
For $m=0$ , set $\widetilde {H}_0=\mathrm {id}$ and $\alpha _0=\rho $ .
Notice that as $H_m$ is homotopic to $\mathrm {id}$ , all the $\alpha _m$ terms are homotopic to $\rho $ .
Define a twisted coboundary $g_m: \mathbb {Z}^r\times \mathbb {T}^d\to \mathbb {R}^d$ over $\rho $ by
We pose a list of technical conditions on $h_m$ and $g_m$ as follows.
Condition 2.1. The sequence $\{h_m\}_{m=1}^\infty $ will be chosen, together with:
-
• a positive number $\tau \in (0,1)$ ;
-
• a sequence of positive numbers $\{\theta _m\}_{m=1}^\infty $ ;
-
• unit vectors $v,w\in \mathbb {R}^d$ , as well as two sequences of non-zero vectors $\{v_m\}_{m=1}^\infty , \{v_m^*\}_{m=1}^\infty $ from $\mathbb {R}^d$ ,
so that, for all $m\in \mathbb {N}$ :
-
(i) $\sum _{m=1}^\infty \theta _m<\tau $ ;
-
(ii) $\|h_m\|_{C^1}\ll \tau $ and
$$ \begin{align*}\Big(\max_{m'=1}^{m-1}\|\widetilde{H}_{m'}^{-1}\|_{C^1}\Big)\Big(\max_{m'=1}^{m-1}\|\widetilde{H}_{m'}\|_{C^1}\Big)\|h_m\|_{C^0}<\theta_m;\end{align*} $$ -
(iii) $\|\widetilde {H}_{m-1}\|_{C^2}\|\widetilde {H}_{m-1}^{-1}\|_{C^1}\|g_m^{\mathbf {n}}\|_{C^1}<\theta _m$ ;
-
(iv) $h_m(0)=0$ and either $(D_0\widetilde {H}_m)v=v+\tau w$ if m is odd or $(D_0\widetilde {H}_m)v=v$ if m is even;
-
(v) either $w=v$ or $(D_0\widetilde {H}_m)w=w$ ;
-
(vi) $h_m(v_{m'})=h_m(v_{m'}^*)=0$ for all $1\leq m'\leq m-1$ , where $v_{m'}$ is identified with its projection in $\mathbb {T}^d$ ;
-
(vii) $\|\widetilde {H}_m\|_{C^2}|v_m|<\theta _m$ , $\|\widetilde {H}_m\|_{C^1}|{v_m}/{|v_m|}-v|<\theta _m$ , $\|\widetilde {H}_m\|_{C^2}|v_m^*|<\theta _m$ , and $\|\widetilde {H}_m\|_{C^1} |{v_m^*}/{|v_m^*|}-({v+\tau w})/{|v+\tau w|}|<\theta _m$ .
Along our proof, it will turn out that v and w may or may not be the same.
2.2. Sufficient inductive conditions
We now show the following proposition.
Proposition 2.2. Given the action $\rho $ , if Condition 2.1 is satisfied and the constant $\tau>0$ therein is sufficiently small, then:
-
(1) $\{\widetilde {H}_m\}_{m=1}^\infty $ converges in $C^0$ to a homeomorphism $\widetilde {H}$ that is homotopic to $\mathrm {id}$ ;
-
(2) for all $\mathbf {n}\in \Xi $ , $\widetilde {H}\circ \rho ^{\mathbf {n}}\circ \widetilde {H}^{-1}$ is $C^1$ differentiable and
$$ \begin{align*}\|\widetilde{H}\circ\rho^{\mathbf{n}}\circ\widetilde{H}^{-1}-\rho^{\mathbf{n}}\|_{C^1}\ll\tau;\end{align*} $$ -
(3) neither $\widetilde {H}$ nor $\widetilde {H}^{-1}$ is differentiable.
We first recall a few technical facts regarding $C^k$ norms.
Lemma 2.3. For smooth maps $\phi ,\psi :\mathbb {T}^d\to \mathbb {T}^d$ and $\Delta :\mathbb {T}^d\to \mathbb {R}^d$ :
-
(1) $\|\phi \circ \psi \|_{C^2}\ll \|\phi \|_{C^2}(1+\|\psi \|_{C^0})^2(1+\|\psi \|_{C^2})$ . If $\psi $ is not homotopically trivial, then $\|\phi \circ \psi \|_{C^1}\leq \|\phi \|_{C^1}\|\psi \|_{C^1}$ ;
-
(2) $\|\phi \circ (\psi +\Delta )-\phi \circ \psi \|_{C^1}\ll \|\phi \|_{C^2}(1+\|\psi \|_{C^1})\|\Delta \|_{C^1}$ ;
-
(3) there is $\epsilon =\epsilon (d)$ such that if $\|\phi -\mathrm {id}\|_{C^1}\leq \epsilon $ , then $\phi $ is invertible, and $\|\phi ^{-1}\|_{C_1}\ll 1+\|\phi \|_{C_1}$ and $\|\phi ^{-1}\|_{C_2}\ll 1+\|\phi \|_{C_2}$ .
Proof of Lemma 2.3
(1) The $C^2$ bound is in [Reference KrikorianK99, Proposition A.2.3]. For the $C^1$ bound, note $\|\phi \circ \psi \|_{C^0}=\|\phi \|_{C^0}\leq \|\phi \|_{C^1}\|\psi \|_{C^1}$ , where we used $\|\psi \|_{C^1}\leq 1$ because $\psi $ is not homotopically trivial. In addition, $\|D(\phi \circ \psi )\|_{C^0}=\|(D\phi \circ \psi ) D\psi \|_{C^0}\leq \|\phi \|_{C^1}\|\psi \|_{C^1}$ .
(2) We have
Moreover,
(3) Is proven in [Reference HamiltonH82, Lemma 2.3.6].
Proof of Proposition 2.2
In the proof below, we will repeatedly use the fact that, because $\widetilde {H}_{m-1}$ is homotopic to $\mathrm {id}$ ,
(1) By Lemma 2.3, when $\tau $ is sufficiently small depending on the dimension d, $H_m=\mathrm {id}+h_m$ is invertible, and $H_m^{-1}$ is $C^1$ differentiable and homotopic to $\mathrm {id}$ . So every $\widetilde {H}_m$ is invertible in $C^1$ .
By Condition 2.1(ii) and (2.4), for all $x\in \mathbb {T}^d$ ,
It follows that $\{\widetilde {H}_m\}$ is a Cauchy, and hence convergent, sequence in $C^0$ . Its limit, which we denote by $\widetilde {H}$ , is a continuous map that is homotopic to $\mathrm {id}$ . Note
However, it is easy to see that $H_m^{-1}=\mathrm {id}+h_m^*$ , where $h_m^*=-h_m\circ H_m^{-1}$ . In particular, $\|h_m^*\|_{C^0}=\|h_m\|_{C^0}$ and
As $\widetilde {H}_m^{-1}=\widetilde {H}_{m-1}^{-1}+h_m^*\circ \widetilde {H}_{m-1}^{-1}$ , it follows that $\{\widetilde {H}_m^{-1}\}$ is a Cauchy sequence in $C^0$ topology, and thus converges to a continuous map $\widetilde {H}^*$ . Additionally, $\widetilde {H}^*$ is homotopic to $\mathrm {id}$ . We also have
Thus, for all m,
where we used equation (2.7) and the parts (i), (ii) of Condition 2.1. As $\sum _{m=1}^\infty \theta _m<\tau $ , it follows that $\|\widetilde {H}\circ \widetilde {H}^*-\mathrm {id}\|_{C^0}=0$ . Therefore, $\widetilde {H}\circ \widetilde {H}^*=\mathrm {id}$ .
Similarly, for all m,
As above, we know $\widetilde {H}^*\circ \widetilde {H}=\mathrm {id}$ .
We can now conclude that $\widetilde {H}^*=\widetilde {H}^{-1}$ and $\widetilde {H}$ is a homeomorphism of $\mathbb {T}^d$ .
(2) By Lemma 2.3, for $\mathbf {n}\in \Xi $ ,
Because $\sum _{m=1}^\infty \theta _m<\tau $ , the sequence $\{\alpha _m^{\mathbf {n}}\}$ is Cauchy in $C^1$ topology. Denote the limit by $\alpha ^{\mathbf {n}}$ . Since $\rho ^{\mathbf {n}}=\alpha _0^{\mathbf {n}}$ ,
Finally, we want to show that $\alpha ^{\mathbf {n}}=\widetilde {H}\circ \rho ^{\mathbf {n}}\circ \widetilde {H}^{-1}$ . For all $m\in \mathbb {N}$ and $\mathbf {n}\in \Xi $ ,
which decays to $0$ as $m\to \infty $ . Thus, $\widetilde {H}\circ \rho ^{\mathbf {n}}\circ \widetilde {H}^{-1}$ is the $C^0$ limit of $\alpha _m^{\mathbf {n}}$ , which coincides with $\alpha ^{\mathbf {n}}$ .
The extension of the definition $\alpha ^{\mathbf {n}}=\widetilde {H}\circ \rho ^{\mathbf {n}}\circ \widetilde {H}^{-1}$ to general $\mathbf {n}\in \mathbb {Z}^r$ forms a $C^1$ action generated by $\{\alpha ^{\mathbf {n}}:\ \mathbf {n}\in \Xi \}$ .
(3) Since $H_m(0)=0+h_m(0)=0$ ,
In addition, for all positive integers $m'>m\geq 1$ , $h_{m'}(v_m)=0$ and thus $H_{m'}(v_m)=v_m+h_{m'}(v_m)=v_m$ . Therefore, for all $k>m\geq 1$ ,
and
Set $y_m=v_m+\sum _{m'=1}^m h_{m'}\circ H_{m'+1}\circ \cdots \circ H_m(v_m)$ . Then $\widetilde {H}(v_m)=\widetilde {H}_m(v_m)$ is the projection of $y_m$ to $\mathbb {T}^d$ , which we indifferently denote by $y_m$ .
We first claim that $\widetilde {H}$ is not differentiable at $0$ . To show this, it is helpful to study the asymptotic behavior of the sequence of vectors ${y_m}/{|v_m|}$ .
Remark that since $\sum _{m=1}^\infty \theta _m<\tau $ , $\theta _m\to 0$ . Moreover, as $\widetilde {H}_m$ is homotopic to $\mathrm {id}$ , $\|\widetilde {H}_m\|_{C^2}{\kern-1pt}\geq{\kern-1pt} \|\widetilde {H}_m\|_{C^1}{\kern-1pt}\geq{\kern-1pt} 1$ . Thus, Condition 2.1(vii) shows $|v_m|{\kern-1pt}\leq{\kern-1pt} \theta _m$ and ${|{v_m}/{|v_m|}{\kern-1pt}-{\kern-1pt}v|{\kern-1pt}\leq{\kern-1pt} \theta _m}$ . Thus, $v_m\to ~0$ and ${v_m}/{|v_m|}\to v$ as $m\to \infty $ .
As $\widetilde {H}_m(v_m)=y_m$ , by Condition 2.1(vii),
This shows, using Condition 2.1(iv),
and similarly,
Non-differentiability of ${\widetilde {H}}$ : Assume for the sake of contradiction that $\widetilde {H}$ is differentiable at $0$ . Then, as $\widetilde {H}(v_m)=y_m$ as well,
as $m\to \infty $ . This contradicts equations (2.14) and (2.15) where different subsequences of ${y_m}/{|v_m|}$ have different limits. Therefore, $\widetilde {H}$ cannot be differentiable at $0$ .
Non-differentiability of ${\widetilde {H}^{-1}}$ : By equation (2.14), $\lim _{l\to \infty }({|y_{2l+1}|}/{|v_{2l+1}|})=|v+\tau w|$ . Thus,
and
However, using $v_m^*$ instead, we can define $y_m^*=\widetilde {H}(v_m^*)=\widetilde {H}_{m}(y_m^*)$ as in equation (2.12). Then $|v_m^*|\to 0$ and $|y_m^*|\to 0$ as $m\to \infty $ . The same computations in equations (2.13), (2.14), and (2.15) give rise to, in lieu of equation (2.16),
If $w=v$ , then
Therefore, $\lim _{l\to \infty }({y_{2l}^*}/{|v_{2l}^*|})=1$ and
If $w\neq v$ , then by equation (2.19) and properties (iv), (v) of Condition 2.1,
and therefore, $\lim _{l\to \infty }({y_{2l}^*}/{|v_{2l}^*|})=1$ and
As $v_{2l}^*=\widetilde {H}^{-1}(y_{2l}^*)$ and $v_{2l+1}=\widetilde {H}^{-1}(y_{2l+1})$ , in both the cases of equations (2.20) and (2.21), the same argument as in equation (2.16) shows $\widetilde {H}^{-1}$ is not differentiable at $0$ either.
2.3. Fulfillment of the inductive conditions
We will construct the sequence $\{h_m\}_{m=1}^\infty $ based on the following proposition.
Proposition 2.4. If the linear action $\rho :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ contains no hyperbolic automorphism, then there exist unit vectors $v,w\in \mathbb {R}^d$ , such that for all $\delta>0$ and $Q\in \mathbb {N}$ , there exists a $C^\infty $ function $h:\mathbb {T}^d\to \mathbb {R}^d$ , such that:
-
(1) $h(x)=0$ for all $x\in ((1/Q)\mathbb {Z}^d)/\mathbb {Z}^d\subseteq \mathbb {T}^d$ ;
-
(2) $(D_0h)v=w$ ; in addition, either $v=w$ or $(D_0h)w=0$ ;
-
(3) $\|h\|_{C^0}<\delta $ and $\|h\|_{C^1}\ll 1$ ;
-
(4) for all $\mathbf {n}\in \Xi $ , $g^{\mathbf {n}}:=\rho ^{\mathbf {n}} h-h\circ \rho ^{\mathbf {n}}$ satisfies $\|g^{\mathbf {n}}\|_{C^1}<\delta $ .
The proof of the proposition will be deferred to §3.
Proposition 2.5. Suppose the linear action $\rho :\mathbb {Z}^r\curvearrowright \mathbb {T}^d$ contains no hyperbolic automorphism and v, w are as in Proposition 2.4. Then for all sufficiently small $\tau>0$ and positive numbers $\{\theta _m\}_{m=1}^\infty $ that satisfy $\sum _{m=1}^\infty \theta _m<\tau $ , there exist sequences $\{h_m\}_{m=1}^\infty $ , $\{v_m\}_{m=1}^\infty $ and $\{v_m^*\}_{m=1}^\infty $ that satisfy Condition 2.1.
Proof. Part (i) is already assumed. So we only need to fulfill the remaining assumptions from Condition 2.1.
To inductively construct $h_m$ , assume for all $1\leq m'\leq m-1$ , there exist a $C^\infty $ function $h_{m'}$ , and non-zero vectors $v_{m'}, v_{m'}^*\in \mathbb {Q}^d$ that satisfy, together with v, w, the remaining properties from Condition 2.1. Then the diffeomorphism $\widetilde {H}_{m'}$ is also determined for all ${1\leq m'\leq m-1}$ by equation (2.1). Remark that with the convention $\widetilde {H}_0=\mathrm {id}$ , the requirements of $(D_0\widetilde {H}_m)v=v$ and $(D_0\widetilde {H}_m)w=w$ from parts (iv) and (v) of the condition are satisfied at the initial step $m=0$ .
Let
and $Q_m$ be the least common multiple of the denominators of $v_1$ , $\ldots $ , $v_{m-1}$ , $v_1^*$ , $\ldots $ , $v_{m-1}^*\in \mathbb {Q}^d$ . We obtain a $C^\infty $ function ${\mathring {h}}_m$ by applying Proposition 2.4 with parameters $\delta _m$ and $Q_m$ , and define
It in turn determines $H_m\hspace{-1pt}=\hspace{-1pt}\mathrm {id}\hspace{-1pt}+\hspace{-1pt}h_m$ and $\widetilde {H}_m\hspace{-1pt}=\hspace{-1pt}\widetilde {H}_{m-1}\hspace{-1pt}\circ\hspace{-1pt} H_m$ . Remark that $|{-\tau }/({1\hspace{-1pt}+\hspace{-1pt}\tau })|\hspace{-1pt}<\hspace{-1pt}\tau $ .
We claim $h_m$ , $H_m$ , and $\widetilde {H}_m$ satisfy the clauses (ii)–(vii) in Condition 2.1:
(ii) $\|h_m\|_{C^1}\leq \tau \|{\mathring {h}}_m\|_{C^1}\ll \tau $ and
(iii) For all $\mathbf {n}\in \Xi $ , with ${\mathring {g}}_m^{\mathbf {n}}={\mathring {h}}_m\circ \rho ^{\mathbf {n}}-\rho ^{\mathbf {n}}{\mathring {h}}_m$ ,
(iv) Since $0\in ((1/Q)\mathbb {Z}^d)/\mathbb {Z}^d$ , ${\mathring {h}}_m(0)=0$ and thus $h_m(0)=0$ . As it was assumed that $h_1(0)=\cdots =h_{m-1}(0)=0$ , we know $H_1(0)=\cdots =H_m(0)=0$ and $\widetilde {H}_m(0)=\widetilde {H}_{m-1} (0)=0$ . So
If m is odd and $v=w$ , then $v+(D_0h_m)v=v+\tau (D_0{\mathring {h}}_m)v=(1+\tau )v$ , and by inductive assumption, $(D_0\widetilde {H}_{m-1})v=v$ . So $(D_0\widetilde {H}_m)v=(D_0\widetilde {H}_{m-1})((1+\tau )v)=v+\tau v=v+\tau w$ .
If m is even and $v=w$ , then $v+(D_0h_m)v=v-\tau /({1+\tau })(D_0{\mathring {h}}_m)v=v-(\tau /({1+\tau })) v=v/({1+\tau })$ , and by inductive assumption, $(D_0\widetilde {H}_{m-1})v=v+\tau w=(1+\tau )v$ . So $(D_0\widetilde {H}_m)v=(D_0\widetilde {H}_{m-1})(v/({1+\tau }))=v$ .
If m is odd and $v\neq w$ , then $v+(D_0h_m)v=v+\tau (D_0{\mathring {h}}_m)v=v+\tau w$ , and by inductive assumption, $(D_0\widetilde {H}_{m-1})v=v$ , $(D_0\widetilde {H}_{m-1})w=w$ . So $(D_0\widetilde {H}_m)v= (D_0\widetilde {H}_{m-1})(v+\tau w)= v+\tau w$ .
If m is even and $v\neq w$ , then $v+(D_0h_m)v=v-\tau (D_0{\mathring {h}}_m)v=v-\tau w$ , and by inductive assumption, $(D_0\widetilde {H}_{m-1})v=v+\tau w$ , $(D_0\widetilde {H}_{m-1})w=w$ . So $(D_0\widetilde {H}_m)v=(D_0\widetilde {H}_{m-1}) (v-\tau w)=(v+\tau w)-\tau \cdot w=v$ .
Therefore, we have proved that property (iv) continues to hold at the mth step in all cases.
(v) Suppose $v\neq w$ . Then $(D_0{\mathring {h}}_m)w=0$ and, thus, $(D_0 h_m)w=0$ too. So $(D_0 H_m) w=(\mathrm {id}+(D_0 h_m))w=w$ . Since by inductive assumption $(D_0\widetilde {H}_{m-1})w=w$ , we still have $(D_0\widetilde {H}_m)w=(D_0\widetilde {H}_{m-1})(D_0 H_m)w=w$ .
(vi) By the choice of $Q_m$ , we know $v_{m'}$ , $v_{m'}^*$ are in $((1/{Q_m})\mathbb {Z})^d$ for all $1\leq m'\leq m-1$ . By Proposition 2.4, ${\mathring {h}}_m(v_{m'})={\mathring {h}}_m(v_{m'}^*)=0$ . So $h_m(v_{m'})=h_m(v_{m'}^*)=0$ as $h_m$ is proportional to ${\mathring {h}}_m$ .
(vii) Now that $h_m$ and $\widetilde {H}_m$ have been constructed, to finish the inductive step, it remains to choose rational vectors $v_m$ , $v_m^*$ that meet the requirement of property (vii), which can obviously be achieved. In fact, it suffices to take any rational vector $u\in \mathbb {Q}^d$ such that $|u-v|<{\theta _m}/{2\|\widetilde {H}_m\|_{C^1}}$ , and set $v_m=u/L$ for any sufficiently large integer $L>{2\|\widetilde {H}_m\|_{C^1}}/{\theta _m}$ . Additionally, $v_m^*$ can be similarly chosen near the direction of $v+\tau w$ .
3. Cocycles with small coboundaries
In this section, we complete the only still missing component of the argument, namely the proof of Proposition 2.4.
3.1. The linear algebra of commuting integer matrices
The linear algebra of the action $\rho $ is characterized by the following basic fact.
Lemma 3.1. Suppose $\rho :\mathbb {Z}^r\to \operatorname {GL}_d(\mathbb {Z})$ is a representation of $\mathbb {Z}^r$ in the group of toral automorphism of $\mathbb {T}^d$ . Then for some $J_1,J_2\geq 0$ and every $1\leq j\leq J_1+2J_2$ , there exist:
-
• a number field $\mathbb {F}_j$ embedded in $\mathbb {L}_j$ , where $\mathbb {L}_1=\cdots =\mathbb {L}_{J_1}=\mathbb {R}$ and $\mathbb {L}_{J_1+1}=\cdots =\mathbb {L}_{J_1+2J_2}=\mathbb {C}$ ;
-
• a positive dimension $d_j\geq 1$ ;
-
• a group morphism $\zeta _j:\mathbf {n}\to \zeta _j^{\mathbf {n}}$ from $\mathbb {Z}^r$ to the multiplicative group $\mathbb {F}_j^\times $ of $\mathbb {F}_j$ ;
-
• a group morphism $A_j:\mathbf {n}\to A_j^{\mathbf {n}}$ from $\mathbb {Z}^r$ to the group $\operatorname {N}_{d_j}(\mathbb {F}_j)$ of upper triangular nilpotent matrices in $\operatorname {SL}_{d_j}(\mathbb {F}_j)$ ;
-
• a linear transform $\mu _j\in \operatorname {Mat}_{d_j\times d}(\mathbb {F}_j)$ ;
such that:
-
(1) $\{\zeta _j^{\mathbf {n}}:\mathbf {n}\in \mathbb {Z}^r\}\not \subseteq \mathbb {R}$ generates $\mathbb {F}_j$ as a number field, and spans $\mathbb {L}_j$ over $\mathbb {R}$ ;
-
(2) $\zeta _1,\ldots ,\zeta _{J_1+2J_2}$ are distinct and this list is invariant under the action by the Galois group $\operatorname {Gal}(\overline {\mathbb {Q}}/\mathbb {Q})$ . Actually, for all $1\leq j\leq J_1+2J_2$ and $\sigma \in \operatorname {Gal}(\mathbb {F}_j/\mathbb {Q})$ , there exists a unique $1\leq j'\leq J_1+2J_2$ such that $\sigma (\mathbb {F}_j)=\mathbb {F}_{j'}$ , $d_j=d_{j'}$ , $\sigma (\zeta _j^{\mathbf {n}})=\zeta _{j'}^{\mathbf {n}}$ , $\sigma (A_j^{\mathbf {n}})=A_{j'}^{\mathbf {n}}$ and $\sigma (\mu _j)=\sigma (\mu _{j'})$ ;
-
(3) $\overline {\zeta _j^{\mathbf {n}}}=\zeta _{J_2+j}^{\mathbf {n}}$ for all $J_1\leq j\leq J_1+J_2$ , $\mathbf {n}\in \mathbb {Z}^r$ ;
-
(4) with $\iota _j=\mu _j$ for $1\leq j\leq J_1$ and $\iota _j=2\operatorname {Re} \mu _j$ for $J_1+1\leq j\leq J_1+J_2$ , the linear transform $\iota =\bigoplus _{j=1}^{J_1+J_2}\iota _j$ from $\bigoplus _{j=1}^{J_1+J_2}\mathbb {L}_j^{d_j}$ to $\mathbb {R}^d$ is an $\mathbb {R}$ -linear isomorphism and satisfies
$$ \begin{align*}\iota\circ \bigoplus_{j=1}^{J_1+J_2}\zeta_j^{\mathbf{n}} A_j^{\mathbf{n}}=\rho^{\mathbf{n}}\circ\iota.\end{align*} $$
The lemma should be a standard fact for experts. However, we still include the proof for completeness.
Proof. Thanks to the commutativity of $\mathbb {Z}^r$ , it is easy to show (see e.g. the proof of [Reference Rodriguez Hertz and WangRHW14, Lemma 2.2]) that $\mathbb {C}^d=(\mathbb {R}^d)\otimes _{\mathbb {R}} \mathbb {C}$ splits as a direct sum $\bigoplus _{j=1}^{\widetilde {J}} E_j^{\mathbb {C}}$ , where each $E_j^{\mathbb {C}}$ is a maximal common generalized eigenspace of all the $\rho ^{\mathbf {n}}$ terms. More precisely, for every j, there exists a group morphism from $\mathbb {Z}^r$ : $\zeta _j$ to $\mathbb {C}^\times $ such that
(1) Because $\rho ^{\mathbf {n}}\in \operatorname {GL}_d(\mathbb {Z})$ , every eigenvalue $\zeta _j^{\mathbf {n}}$ is an algebraic integer. Denote by $\mathbb {F}_j$ the field generated by $\{\zeta _j^{\mathbf {n}}:\mathbf {n}\in \mathbb {Z}^r\}$ , which is a number field as $\mathbb {Z}^r$ is finitely generated. Let $\mathbb {L}_j\in \{\mathbb {R},\mathbb {C}\}$ be the $\mathbb {R}$ -span of $\mathbb {F}_j$ .
(2) As the $\rho ^{\mathbf {n}}|_{E_j^{\mathbb {C}}}$ terms commute, they can be triangularized simultaneously over $\mathbb {C}$ . Actually, equation (3.1) asserts that $E_j^{\mathbb {C}}$ is a linear subspace defined over $\mathbb {F}_j$ . Together with the fact that the $\rho ^{\mathbf {n}}\in \operatorname {GL}_d(\mathbb {Z})$ , this shows that the simultaneous triangularization can be carried out over $\mathbb {F}_j$ . In other words, one can find a basis $y_{j1},\ldots ,y_{jd_j}\in E_j^{\mathbb {C}}\cap \mathbb {F}_j^d$ of $E_j^{\mathbb {C}}$ , such that the linear isomorphism $\mu _j:\mathbb {C}^{d_j}\to E_j^{\mathbb {C}}$ sending the kth coordinate vector to $y_{jk}$ satisfies
Note that $\mu _j$ is actually a matrix with coefficients in $\mathbb {F}_j$ .
Moreover, we can make the choices above equivariant under Galois conjugacies. Indeed, for every $\sigma \in \operatorname {Gal}(\overline {\mathbb {Q}}/\mathbb {Q})$ , the correspondence $\mathbf {n}\to \sigma (\zeta _j^{\mathbf {n}})$ is a group morphism from $\mathbb {Z}^r$ to $\sigma (\mathbb {F}_j)^\times $ . By equation (3.1), $\sigma (E_j^{\mathbb {C}}\cap \overline {\mathbb {Q}}^d)=\bigcap _{\mathbf {n}\in \mathbb {Z}^r}\ker _{\overline {\mathbb {Q}}^d}(\rho ^{\mathbf {n}}-\sigma (\zeta _j^{\mathbf {n}})\mathrm {id})^d$ is a non-empty $\overline {\mathbb {Q}}$ subspace of dimension $\dim _{\mathbb {C}} E_j^{\mathbb {C}}$ and its $\mathbb {C}$ -span is $\bigcap _{\mathbf {n}\in \mathbb {Z}^r}\ker _{\mathbb {C}^d}(\rho ^{\mathbf {n}}-\sigma (\zeta _j^{\mathbf {n}})\mathrm {id})^d$ , which is $E_{j'}^{\mathbb {C}}$ for some $1\leq j'\leq {\widetilde {J}}$ . (Note $j=j'$ if and only if $\sigma $ fixes every $\zeta _j^{\mathbf {n}}$ , or equivalently $\sigma $ acts trivially on $\mathbb {F}_j$ .) In this case, $d_{j'}=d_j$ and $\zeta _{j'}^{\mathbf {n}}=\sigma (\zeta _j^{\mathbf {n}})$ . Furthermore, one may choose the basis $y_{j1},\ldots ,y_{jd_j}$ for all the indices j in such a way that, in the situation above, $y_{j'k}=\sigma (y_{jk})$ for $1\leq k\leq d_j$ , or equivalently $\mu _{j'}=\sigma (\mu _j)$ . Then applying $\sigma $ to equation (3.2) yields
Since $\mu _{j'}$ is a linear embedding, this forces $A_{j'}^{\mathbf {n}}=\sigma (A_j^{\mathbf {n}})$ .
(3) By choice, $\zeta _1,\ldots ,\zeta _{\widetilde {J}}$ are distinct. Additionally, the previous paragraph shows that, by letting $\sigma \in \operatorname {Gal}(\overline {\mathbb {Q}}/\mathbb {Q})$ be the complex conjugation, each $\overline {\zeta _j}$ is also in the list. Remark that $\zeta _j=\overline {\zeta _j}$ if and only if $\{\zeta _j^{\mathbf {n}}:\mathbf {n}\in \mathbb {Z}^r\}\subseteq \mathbb {R}$ , or equivalently $\mathbb {F}_j=\mathbb {R}$ . After rearranging the list, we may assume that there are $J_1$ , $J_2$ such that $J_1+2J_2={\widetilde {J}}$ , $\mathbb {F}_j=\mathbb {R}$ assume real values for $j=1,\ldots J_1$ ; and that $\mathbb {F}_{J_2+j}=\mathbb {F}_j=\mathbb {C}$ and $\zeta _{J_2+j}=\overline {\zeta _j}$ for $j=J_1+1,\ldots ,J_1+J_2$ .
(4) As in the statement, set $\iota _j=\mu _j$ for $1\leq j\leq J_1$ and $\iota _j=2\operatorname {Re} \mu _j$ for $J_1+1\leq j\leq J_1+J_2$ . To show $\iota \circ \bigoplus _{j=1}^{J_1+J_2}\zeta _j^{\mathbf {n}} A_j^{\mathbf {n}}=\rho ^{\mathbf {n}}\circ \iota $ , we need for each $1\leq j\leq J_2$ that
For $1\leq j\leq J_1$ , this is just equation (3.2). For $J_1+1\leq j\leq J_1+J_2$ , let $u\in \mathbb {C}^{d_j}$ , because $\rho ^{\mathbf {n}}$ is a real matrix, for all $\mathbf {n}\in \mathbb {Z}^r$ and $z\in \mathbb {C}^{d_j}$ ,
So equation (3.3) holds for all $1\leq j\leq J_1+J_2$ .
It remains to show that $\iota $ is an isomorphism. Recall that $\mathbb {C}^d=\bigoplus _{j=1}^{J_1+2J_2}E_j^{\mathbb {C}}$ is a direct sum. However, the image of $\iota _j=\mu _j$ is contained in $E_j^{\mathbb {C}}$ for $1\leq j\leq J_1$ ; and the image of $\iota _j=2\operatorname {Re}\mu _j=\mu _j+\overline {\mu _j}=\mu _j+\mu _{J_2+j}$ is contained in $E_j^{\mathbb {C}}\oplus E_{J_2+j}^{\mathbb {C}}$ for $J_1+1\leq j\leq J_1+J_2$ . Hence, the images of $\iota $ is the direct sum $\bigoplus _{j=1}^{J_1+J_2} \iota _j(\mathbb {L}_j^{d_j})$ .
In addition, we claim each $\iota _j$ is injective. This is obvious in the case $1\leq j\leq J_1$ , where $\iota _j=\mu _j$ . For $J_1+1\leq j\leq J_1+J_2$ , if $\iota _j=2\operatorname {Re}\mu _j$ is not injective, then $\mu _j(z)=-\overline {\mu _j(z)}$ for some non-zero $z\in \mathbb {C}^{d_j}$ . However, $\mu _j(z)\neq 0$ , as $\mu _j$ is an embedding. This shows $E_j^{\mathbb {C}}\cap E_{J_1+j}^{\mathbb {C}}\neq \{0\}$ as $\mu _j(z)\in E_j^{\mathbb {C}}$ and $\overline {\mu _j(z)}\in E_{J_2+j}^{\mathbb {C}}$ , which contradicts the fact that $\bigoplus _{j=1}^{J_1+2J_2}E_j^{\mathbb {C}}$ is a direct sum. Hence, $\iota _j$ is injective for all $1\leq j\leq J_1+J_2$ .
So we may conclude that $\iota =\bigoplus _{j=1}^{J_1+J_2}\iota _j$ is injective from $\bigoplus _{j=1}^{J_1+ J_2}\mathbb {L}_j^{d_j}$ to $\mathbb {R}^d$ . As
$\iota $ must be a linear isomorphism. The proof is completed.
Corollary 3.2. Suppose $1\leq k\leq J_1+J_2$ and P is a $\mathbb {L}_k$ -vector subspace defined over $\mathbb {Q}$ of the kth component $\mathbb {L}_k^{d_k}$ in $\bigoplus _{j=1}^{J_1+ J_2}\mathbb {L}_j^{d_j}$ , then there exists a subspace $P'\subset \mathbb {R}^d$ defined over $\mathbb {Q}$ such that $P=\iota _k^{-1}(P')$ .
Proof. Choose a linear basis $\{p_1,\ldots ,p_N\}$ of P from $\mathbb {Q}^{d_k}\subset \mathbb {L}_k^{d_k}$ .
There are $j_1,\ldots ,j_{M_1}{\kern-1pt}\in{\kern-1pt} \{1,\ldots ,J_1\}$ and ${j_{M_1+1},\ldots , j_{M_1+M_2}{\kern-1pt}\in{\kern-1pt} \{J_1{\kern-1pt}+{\kern-1pt}1,\ldots , J_1{\kern-1pt}+{\kern-1pt}J_2\}}$ such that, after defining $j_{M_2+m}=J_2+j_m$ for every $M_1+1\leq m\leq M_1+M_2$ , $\{\zeta _{j_1},\ldots ,\zeta _{j_{M_1+2M_2}}\!\}$ form the orbit of $\zeta _k$ under the action by the Galois group $\operatorname {Gal}(\mathbb {F}_k/\mathbb {Q})$ . For each m, let $\sigma _m\in \operatorname {Gal}(\mathbb {F}_k/\mathbb {Q})$ be the element such that $\sigma _m(\zeta _k)=\zeta _{j_m}$ .
Define $(P')^{\mathbb {C}}\subseteq \mathbb {C}^d$ as the $\mathbb {C}$ -linear span of
Because $\mu _{j_m}=\sigma _m(\mu _k)$ and has image in $E_{j_m}^{\mathbb {C}}$ , these vectors have algebraic entries and are linearly independent, and this set is invariant by Galois conjugacies from $\operatorname {Gal}(\overline {\mathbb {Q}}/\mathbb {Q})$ . Hence, $(P')^{\mathbb {C}}$ is defined over $\mathbb {Q}$ of dimension $(M_1+2M_2)N$ . The intersection $P':=(P')^{\mathbb {C}}\cap \mathbb {R}^d$ is a real vector space defined over $\mathbb {Q}$ over the same dimension.
For each $p_n$ , $\iota _k(p_n)$ is either $\mu _k(p_n)$ if $1\leq k\leq J_1$ or $2\operatorname {Re}\mu _k(p_n)= \mu _k(p_n)+\mu _{J_2+k}(p_n)$ if $J_1+1\leq k\leq J_1+J_2$ . In these cases, either k or both k and $J_2+k$ are among the list $\{j_1,\ldots ,j_{M_1+2M_2}\}$ . It follows that $\iota _k(p_n)\in (P')^{\mathbb {C}}$ and hence $\iota _k(p_n)\in P'$ . We obtain that $P\subseteq \iota _k^{-1}(P')$ .
It remains to show that the equality holds. If $1\leq k\leq J_1$ , then $\mathbb {L}_k=\mathbb {R}$ and $\iota _k(\mathbb {L}_k^{d_j})=\mu _k(\mathbb {L}_k^{d_j})\subseteq E_k^{\mathbb {C}}$ . So $\iota _k(\iota _k^{-1}(P'))\subseteq P'\cap E_k^{\mathbb {C}}$ . As $(P')^{\mathbb {C}}\cap E_k^{\mathbb {C}}$ is the $\mathbb {C}$ -span of $\mu _k(p_1),\ldots ,\mu _k(p_N)$ , all of which are real vectors, $P'\cap E_k^{\mathbb {C}}$ is contained in the $\mathbb {R}$ -span of them. Because $\iota _k$ is an embedding, $\dim _{\mathbb {R}}\iota _k^{-1}(P')\leq N=\dim _{\mathbb {R}} P$ . Assume instead $J_1+1\leq k\leq J_1+J_2$ . Then $\mathbb {L}_k=\mathbb {C}$ and $\iota _k(\mathbb {L}_k^{d_j})=(\mu _k+\mu _{J_2+k})(\mathbb {L}_k^{d_j})\subseteq E_k^{\mathbb {C}}\oplus E_{J_2+k}^{\mathbb {C}}$ . So $\iota _k(\iota _k^{-1}(P'))$ is contained in $P' \cap (E_k^{\mathbb {C}}\oplus E_{J_2+k}^{\mathbb {C}})$ . As $(P')^{\mathbb {C}} \cap (E_k^{\mathbb {C}}\oplus E_{J_2+k}^{\mathbb {C}})$ is the $\mathbb {C}$ -span of $\mu _k(p_1)$ , $\ldots $ , $\mu _k(p_N)$ , $\mu _{J_2+k}(p_1)$ , $\ldots $ , $\mu _{J_2+k}(p_N)$ and has complex dimension $2N$ . Here, $P' \cap (E_k^{\mathbb {C}}\oplus E_{J_2+k}^{\mathbb {C}})=\mathbb {R}^d\cap (P')^{\mathbb {C}} \cap (E_k^{\mathbb {C}}\oplus E_{J_2+k}^{\mathbb {C}})$ has real dimension $2N$ . Again, since $\iota _k$ is injective, $\dim _{\mathbb {R}}(\iota _k^{-1}(P'))\leq 2N=2\dim _{\mathbb {C}} P=\dim _{\mathbb {R}} P$ . We conclude that in both cases, $P=\iota _k^{-1}(P')$ .
For $1\leq j\leq J$ , $1\leq k\leq d_j$ , write $u_{jk}$ for the kth coordinate vector in $\mathbb {L}_j^{d_j}$ , so that all vectors $s\in \bigoplus _{j=1}^J\mathbb {L}_j^{d_j}$ have the form
where $\pi _{jk}$ is the projection to the $u_{jk}$ coordinate.
Since none of the $\rho ^{\mathbf {n}}$ terms is hyperbolic, there must be at least one $j_0$ such that $|\zeta _{j_0}^{\mathbf {n}}|=1$ for all $\mathbf {n}\in \mathbb {Z}^r$ . This is because otherwise, the linear functionals $\mathbf {n}\to \log |\zeta _{j_0}^{\mathbf {n}}|$ on $\mathbb {Z}^r$ are all non-zero and one can find one $\mathbf {n}_*$ that is not in the kernel of any of such functionals. Then $|\zeta _j^{\mathbf {n}_*}|\neq 1$ for all j. In other words, $\rho ^{\mathbf {n}_*}$ has no eigenvalues in the unit circle, so $\rho ^{\mathbf {n}_*}$ is a hyperbolic matrix, which contradicts our assumption.
After renormalizing $\iota $ if necessary, we may assume
We define vectors ${\mathring {v}},{\mathring {w}}\in \mathbb {L}_{j_0}^{d_{j_0}}$ and $v,w\in \mathbb {R}^d$ by
as well as projections $\pi _{\mathring {v}}:\bigoplus _{j=1}^J\mathbb {L}_j^{d_j}\to \mathbb {L}_{j_0}$ and $\psi _v\in (\mathbb {R}^d)^*$ by
Note that
In the case where $d_{j_0}=1$ , we have $w=v$ and $\psi _v(w)=\psi _v(v)=1$ . However, when $d_{j_0}>1$ , ${\mathring {v}}\neq {\mathring {w}}$ and thus $\pi _{\mathring {v}}({\mathring {w}})=0$ , so $\psi _v(w)=0$ . In summary,
Let $W=\iota _{j_0}(\mathbb {L}_{j_0}{\mathring {w}})$ , which is isomorphic to $\mathbb {L}_{j_0}$ as a real vector space. For all $\mathbf {n}\in \mathbb {Z}^r$ and $w'\in W$ , since $w'=\iota (z{\mathring {w}})$ for some $z\in \mathbb {L}_{j_0}$ , and $A_{j_0}^{\mathbf {n}}$ is an upper triangular nilpotent matrix, $A_{j_0}^{\mathbf {n}}{\mathring {w}}={\mathring {w}}$ and thus
So W is $\rho $ -invariant and
Furthermore, for $u\in \mathbb {L}_{d_{j_0}}^{j_0}$ , $\pi _{\mathring {v}}(\zeta _{j_0}^{\mathbf {n}} A_{j_0}^{\mathbf {n}} u)=\zeta _{j_0}^{\mathbf {n}}\pi _{\mathring {v}}(u)$ and thus
for all $u\in \bigoplus _{j=1}^J\mathbb {L}_j^{d_j}$ . So
In particular, as $|\zeta _{j_0}^{\mathbf {n}}|=1$ , the size of $(\rho ^{\mathbf {n}})^{\mathrm {T}}\psi _v\in (\mathbb {R}^d)^*$ is uniformly bounded by
If $d_{j_0}>1$ , by applying Corollary 3.2 to the $\mathbb {L}_{j_0}$ -subspace $\bigoplus _{k=2}^{d_{j_0}}\mathbb {L}_{j_0}u_{j_0k}$ of $\mathbb {L}_{j_0}^{d_{j_0}}$ , there is a subspace $W'\subseteq \mathbb {R}^d$ defined over $\mathbb {Q}$ such that $\iota _{j_0}^{-1}(W')=\bigoplus _{k=2}^{d_{j_0}}\mathbb {L}_{d_{j_0}}u_{j_0k}$ . In particular, $W'$ contains $W=\iota _{j_0}(\mathbb {L}_{j_0}u_{j_0d_{j_0}})$ . Set $\Psi =\{\psi \in (\mathbb {R}^d)^*:\psi |_{W'}=0\}$ . Then $\Psi $ is a subspace defined over $\mathbb {Q}$ , and
Moreover,
It follows that $\psi _v=\operatorname {Re} \pi _{\mathring {v}}\circ \iota ^{-1}$ annihilates $W'$ , or equivalently, $\psi _v\in \Psi $ . Furthermore, for all $\mathbf {n}\in \mathbb {Z}^r$ , we have
Because $A_{j_0}^{\mathbf {n}}$ is an upper triangular nilpotent matrix, $\zeta _{j_0}^{\mathbf {n}}(A_{j_0}^{\mathbf {n}}-\mathrm {id}){\mathring {v}}\in \bigoplus _{k=2}^{d_{j_0}}\mathbb {L}_{d_{j_0}}u_{j_0k}$ and $\iota (\zeta _{j_0}^{\mathbf {n}}(A_{j_0}^{\mathbf {n}}-\mathrm {id}){\mathring {v}})\in W'$ . Thus,
If $d_{j_0}=1$ , take $\Psi =(\mathbb {R}^d)^*$ instead, which is also a rational subspace that contains $\psi _v$ . Additionally, equation (3.14) remains true in this case, because $A_{j_0}^{\mathbf {n}}=\mathrm {id}$ . To summarize, we have in any case the following corollary.
Corollary 3.3. There exists a subspace $\Psi \subset (\mathbb {R}^d)^*$ defined over $\mathbb {Q}$ which contains $\psi _v$ and satisfies equation (3.14). In addition, if $d_{j_0}>1$ , then equation (3.13) holds as well.
It should be remarked that all the constructions above are determined by the actions $\rho $ .
3.2. The construction of the cocycle
The construction is inspired by the construction of Veech in [Reference VeechV86, Proposition 1.5].
Let $\epsilon>0$ be a small parameter to be specified later.
We identify $(\mathbb {R}^d)^*$ with $\mathbb {R}^d$ in the standard way so that $(\mathbb {T}^d)^*\subset (\mathbb {R}^d)^*$ is realized as $\mathbb {Z}^d$ . Let $\Psi $ be as in Corollary 3.3. Then $\Psi _{\mathbb {Z}}:=\Psi \cap \mathbb {Z}^d$ is a lattice in $\Psi $ . There is a constant $R>0$ such that for every $\psi \in \Psi $ , there exists $\eta \in \Psi _{\mathbb {Z}}$ with $|\psi -\eta |<R$ . The choice of R depends only on $\rho $ .
Let $\eta _v$ be the nearest vector to $ (Q/\epsilon ) \psi _v$ in the lattice $Q\Psi _{\mathbb {Z}}$ . Then
Recall $W=\iota _{j_0}(\mathbb {L}_{j_0}{\mathring {w}})$ , which is isomorphic to $\mathbb {L}_{j_0}$ as an $\mathbb {R}$ -vector space and contains w. The function $h:\mathbb {T}^d\to \mathbb {R}^d$ will take value in $W\subseteq \mathbb {R}^d$ and have the form
for some $c>0$ , $N\in \mathbb {N}$ , and $ w_\Delta \in W$ , all of which will be defined later. Remark that h is $C^\infty $ as it is a Fourier series supported on finitely many frequencies.
Proof. Since $\eta _v\in Q\Psi _{\mathbb {Z}}\subseteq Q\mathbb {Z}^d$ and $\rho ^{\mathbf {n}}\in \operatorname {GL}(d,\mathbb {Z})$ , $(\rho ^{\mathbf {n}})^{\mathrm {T}}\eta _v\in (Q\mathbb {Z})^d$ for all $\mathbf {n}$ . Moreover, if $x\in ((1/Q)\mathbb {Z}^d)/\mathbb {Z}^d$ , then $e(\eta _v\cdot x)=1$ and $e((\rho ^{\mathbf {n}})^{\mathrm {T}}\eta \cdot x)=1$ for all $\mathbf {n}\in \mathbb {Z}^r$ . Therefore, $h(x)=0$ . This proves part (1).
The derivative of equation (3.16) at $x=0$ is the matrix
We first study the values of the first two terms in equation (3.17) with v or w as linear input. By definition of v and w,
If $\mathbb {L}_{j_0}=\mathbb {R}$ , then ${\mathring {w}}\in \mathbb {R}^{d_{j_0}}$ , $\zeta _{j_0}^{-\mathbf {n}}\overline {\zeta _{j_0}^{\mathbf {n}}}=1$ , and thus
If $\mathbb {L}_{j_0}=\mathbb {C}$ , then by Lemma 3.1(1), there is at least one $i\in \{1,\ldots ,r\}$ , say $i=1$ without loss of generality, such that $\zeta _{j_0}^{\mathbf {e}_i}\notin \mathbb {R}$ . Then ${\overline {\zeta _{j_0}^{\mathbf {e}_1}}}/{\zeta _{j_0}^{\mathbf {e}_1}}$ is in the unit circle but not equal to $1$ . In this case, $\sum _{n=-N}^N({\overline {\zeta _{j_0}^{\mathbf {e}_1}}}/{\zeta _{j_0}^{\mathbf {e}_1}})^n$ is uniformly bounded when N varies. Therefore,
So
Both equations (3.19) and (3.21) can be expressed as
We now attend to the second term in equation (3.17).
Since $\eta _v- (Q/\epsilon ) \psi _v\in \Psi $ , by equations (3.14), (3.15), and the fact that $|\zeta _{j_0}^{\mathbf {n}}|=1$ ,
Moreover, $|\rho ^{-\mathbf {n}}w|\ll 1$ by equation (3.10). So
Choose
Then by equations (3.22) and (3.23),
To make $(D_0h)v=w$ , one needs to find the solution $ w_\Delta \in W$ to
which by equation (3.25) is
Since $\psi _v(v)=1$ , by equation (3.15), $\eta _v(v)= Q/\epsilon +O(Q)= (Q/\epsilon )(1+O(\epsilon ))$ and thus, we have
as long as $\epsilon \ll 1$ . Note that $ w_\Delta $ is automatically in W because equation (3.25) and $w\in W$ .
Moreover, if $w\neq v$ , or in other words $d_{j_0}=1$ , then by Corollary 3.3 and the fact that $\eta _v\in \Psi $ , $\eta _v|_W=0$ . As $\rho ^{\mathbf {n}} w\in W$ for all $\mathbf {n}$ , in this case,
Lemma 3.5. Given c and h respectively from equations (3.16) and (3.24), for $N,Q\in \mathbb {N}$ and sufficiently small $\epsilon \ll 1$ , there exist $ w_\Delta \in W$ of size $O((\epsilon / Q)(1/N+\epsilon ))$ such that $(D_0h)v=w$ . In addition, $(D_0h)w=0$ if $w\neq v$ .
The first part of part (3) in Property 2.4 is given by the following lemma.
Lemma 3.6. Suppose c, $ w_\Delta $ , and h are chosen as above. Then $\|h\|_{C^0}\ll \epsilon / Q$ .
Proof. By equations (3.16), (3.24), and Lemma 3.5,
To bound the $C^1$ norms of h and $g^{\mathbf {n}}$ , write
for the matrix norm of the linear action $\rho $ , so that
For $\mathbf {n}\in \mathbb {Z}^r$ , we deduce from equations (3.12) and (3.15) that
By the construction in equation (3.16) of h, Lemma 3.6, as well as the bounds in equations (3.10), (3.12), (3.27), and (3.30),
For every $\mathbf {n}\in \Xi $ , $g^{\mathbf {n}}=\rho ^{\mathbf {n}} h-h\circ \rho ^{\mathbf {n}}$ is linearly controlled by h in $C^0$ norm:
In addition, $g^{\mathbf {n}}$ has the form
Because $\mathbf {n}\in \Xi $ , the summations $\sum _{\substack {\mathbf {a}\in \mathbb {Z}^r\\|\mathbf {a}|>N,|\mathbf {a}+\mathbf {n}|\leq N}}$ and $\sum _{\substack {\mathbf {a}\in \mathbb {Z}^r\\|\mathbf {a}|\leq N,|\mathbf {a}+\mathbf {n}|>N}}$ each has $O(N^{r-1})$ terms. Since $|\mathbf {n}|=1$ for all $\mathbf {n}\in \Xi $ , in all the terms in both summations, $|\mathbf {a}|\leq N+1$ and $|\mathbf {a}+\mathbf {n}|\leq N+1$ . For each of these terms, the derivative is bounded by
thanks to equations (3.10), (3.12), and (3.30). As $ w_\Delta \in W$ , $|\rho ^{\mathbf {n}} w_\Delta |\ll | w_\Delta |$ by equation (3.10), and the derivative of $((e(\eta _v\cdot x)-1)\rho ^{\mathbf {n}} w_\Delta -(e((\rho ^{\mathbf {n}})^T\eta _v\cdot x)-1) w_\Delta )$ is bounded by
thanks to equations (3.12) and (3.10).
Combining the above inequalities yields:
To summarize equations (3.31) and (3.36), we have the following lemma.
Lemma 3.7. Suppose c, $ w_\Delta $ , and h are chosen as above. Then $\|h\|_{C^1}\ll 1+\|\rho \|^N\epsilon $ and $\|g^{\mathbf {n}}\|_{C^1}\ll (1+\|\rho \|^N\epsilon )(1/N+\epsilon )$ for all $\mathbf {n}\in \Xi $ .
Proof of Proposition 2.4
The proposition follows directly from Lemmas 3.4, 3.5, 3.6, and 3.7 after choosing N and $\epsilon $ appropriately. Indeed, with $C>1$ denoting the largest among the implicit constants from Lemmas 3.6 and 3.7, choose $\epsilon $ sufficiently small such that $N:=\lfloor \log _{\|\rho \|}(1/\epsilon )\rfloor>{4C }/\delta $ and $ C \cdot (\epsilon / Q)<\delta $ . Then $1+\|\rho \|^N\epsilon < 2$ and $1/N+\epsilon \leq 2/N\leq \delta /{2C }$ . So $\|h\|_{C^0}\leq C \cdot (\epsilon / Q)<\delta ;$ $\|h\|_{C^1}\leq C (1+\|\rho \|^N\epsilon )<2C ;$ and $\|g\|_{C^1}<C (1+\|\rho \|^N\epsilon )(1/N+\epsilon )< C \cdot 2\cdot (\delta /{2C })=\delta $ .
Acknowledgements
We would like to thank the referee for their helpful comments. F.R.H. was supported by NSF grant DMS-1900778. Z.W. was supported by NSF grant DMS-1753042.