1 Introduction
In the seminal work [Reference DworkDw], Dwork laid the foundation of the theory of p-adic hypergeometric differential equations. His principal working example was the differential equation
whose analytic solution at the origin
encodes periods of the Legendre family $y^2=t(t-1)(t-x)$ . Dwork used the approximations
which are nothing but truncations of the infinite sum in Equation (1-2) and clearly converge to it in the disk $D_{0,1}=\{x\mid |x|_p<1\}$ , to show that the uniform limit $F_{s+1}(x)/F_s(x^p)$ as $s\to \infty $ exists in a larger domain $\mathfrak D^{\text {Dw}}$ and this limit, named the ‘unit root’, corresponds to a root of the local zeta function of the x-fiber in the family. Dwork’s work boosted the whole body of research in the area; we limit ourselves to mentioning some recent contributions on the theme [Reference Adolphson and SperberAS, Reference Beukers and VlasenkoBV, Reference Long, Tu, Yui and ZudilinLTYZ].
Dwork indicates in [Reference DworkDw] that in the p-adic case, he adopts a point of view similar to that of Igusa in [Reference IgusaIg] on the modulo p solutions of Equation (1-1). Namely, the cycles of the elliptic curve $y^2=t(t-1)(t-x)$ for a given x can be thought of as the local at x analytic solutions of the differential equation in Equation (1-1). At the same time, Igusa’s modulo p solution
of Equation (1-1), though indeed coinciding with Dwork’s $F_1(x)$ modulo p, hints at a somewhat different way for approximating the function in Equation (1-2) p-adically through different truncations of the infinite sum in Equation (1-2). Notice the difference of the limits of summations in Equations (1-3) and (1-4).
This recipe seemed to escape its own development until recently, when Schechtman and Varchenko constructed in [Reference Schechtman and VarchenkoSV2] polynomial solutions modulo p to general Knizhnik–Zamolodchikov (KZ) equations, recovering Igusa’s polynomial as a particular case.
The principal goal of this paper is to show that the p-approximation approach in [Reference Schechtman and VarchenkoSV2] goes in parallel with that of Dwork in [Reference DworkDw] and leads to Dwork-type congruences for solutions of the KZ equations, at least for the cases in which the technicality of proofs does not overshadow the beauty of outcomes.
In this paper, we study certain $p^s$ -approximation polynomials of hypergeometric periods. We consider an integral of hypergeometric type like in Equation (1-1) without specifying the cycle of integration. For any positive integer s, we replace the integrand by a polynomial $\Phi _{s}(t,x)$ with integer coefficients called the master polynomial and define the $p^s$ -approximation polynomial as the coefficient of $t^{p^s-1}$ in the master polynomial. This is our $p^s$ -analog of the initial integral. In the example of Equation (1-1), the master polynomial is $\Phi _{s}(t,x) = t^{(p^s-1)/2}(t-1)^{(p^s-1)/2}(t-x)^{(p^s-1)/2}$ and the $p^s$ -approximation polynomial is
We prove the Dwork-type congruence,
in Theorem 4.2.
In Section 6, we consider the simplest example of the KZ connection. In our example, the KZ connection is identified with the Gauss–Manin connection of the family of elliptic curves $y^2=(t-z_1)(t-z_2)(t-z_3)$ . We study the $p^s$ -approximation polynomials to the elliptic period $\int \!dt/y$ and show that the p-adic KZ connection of our example has an invariant line subbundle. This is a new p-adic feature since the corresponding complex KZ connection has no proper invariant subbundle due to the irreducibility of its monodromy representation.
Notice that usually, the invariant subbundles of the KZ connection over ${\mathbb C}$ are related to some additional conformal block constructions; for example see [Reference Feigin, Schechtman and VarchenkoFSV, Reference Schechtman and VarchenkoSV2, Reference VarchenkoV3]. Apparently, our subbundle is of a different p-adic nature.
The results above require proving p-adic convergence, which in turn rests upon establishing certain special congruences. Considering a general hypergeometric series $F(x)=\sum _{n=0}^\infty A(n)x^n$ and its $p^s$ -truncations $F_s(x)=\sum _{n=0}^{p^s-1}A(n)x^n$ , Dwork showed that
in [Reference DworkDw, Theorem 2]. This allowed him to conclude the existence of the p-adic limit $F_{s+1}(x)/F_s(x^p)$ as $s\to \infty $ in [Reference DworkDw, Theorem 3]. As an auxiliary component of Dwork’s argument, another set of congruences, à la Lucas, was established for the sequence of coefficients $A(n)$ :
(see [Reference DworkDw, Corollary 1(ii), page 36]). These two different-looking families of congruences in Equations (1-7) and (1-8) are both known as Dwork congruences, and to distinguish between the two, we name them type I and type II, respectively.
Our Igusa-inspired $p^s$ -approximations of solutions of a hypergeometric equation, like the $p^s$ -approximation in Equation (1-5), are of dual nature. Although congruences in Equation (1-6) look like congruences in Equation (1-7) of type I, we may view the sequence $(P_s(x))_{s\geqslant 1}$ as a subsequence of a suitable polynomial sequence $A(n;x)$ depending on the extra parameter x and satisfying the congruences
Then the restriction of these type-II Dwork congruences to the subsequence $(P_s(x))_{s\geqslant 1}$ reads as type-I congruences for the $p^s$ -approximation polynomials in parameter x. (We make this explicit in the remark after Theorem 4.4.)
In summary, our principal tools for establishing the existence of p-adic convergence are the Dwork-type congruences in Equation (1-9), of which the required type-I congruences are particular instances.
Initially, general theorems toward Dwork’s congruences were given by Mellit [Reference MellitMe] and independently by Samol and van Straten [Reference Samol and van StratenSvS]. They were generalized in [Reference Mellit and VlasenkoMV, Reference VlasenkoVl]. As these results are insufficiently general for us, we extend Dwork-type congruences further using the elegant method from Mellit’s unpublished preprint [Reference MellitMe]. Our Dwork-type congruences are displayed in Section 2, and their power is illustrated by the congruences in Equation (1-6) from Theorem 4.2 and by several other quite different applications in later sections. In particular, see Section 6 for applications to the KZ equations.
In Section 7, we conjecture some stronger congruences for the polynomials $P_s(x)$ .
We finish the introduction with a remark on the differential KZ equations. The differential KZ equations were discovered by physicists Knizhnik and Zamolodchikov [Reference Knizhnik and ZamolodchikovKZ] as differential equations satisfied by conformal blocks in conformal field theory. It was realized later that versions of KZ equations appear in different situations, for example, as quantum differential equations for Nakajima quiver varieties [Reference Maulik and OkounkovMO]. The KZ equations are closely related to quantum integrable systems and the Bethe ansatz method.
The KZ equations were identified with suitable Gauss–Manin connections in [Reference Schechtman and VarchenkoSV1], and integral representations for solutions of the KZ equations were constructed in the form of multidimensional hypergeometric integrals depending on parameters. Integral representations have connected the KZ equations with the theory of special functions and the hypergeometric functions.
In [Reference Schechtman and VarchenkoSV2], a new side of the KZ equations was observed, namely, polynomial solutions modulo p of the KZ equations were constructed. Those are certain vectors of polynomials with integer coefficients which solve the KZ differential equations modulo p. Those polynomial solutions are, in some sense, p-approximations of the multidimensional hypergeometric solutions of the KZ equations. This brings us a general problem of studying arithmetic properties of solutions of the KZ equations with possible applications to enumerative geometry of Nakajima varieties. The present paper is a step in this direction.
2 On ghosts
In this paper, p is an odd prime.
2.1 Mellit’s theorem
Let $\Lambda (t)$ be a Laurent polynomial in variables $t=(t_1,\ldots ,t_r)$ with coefficients in ${\mathbb Z}_p$ and constant term $\operatorname {CT}_t(\Lambda )$ . Assume that the Newton polytope of $\Lambda (t)$ contains only one interior point $\{0\}$ .
For a tuple $a=(a_0,a_1,\ldots ,a_{l-1})$ , denote by $l(a)=l$ its length. For two tuples a and b, the concatenation product $a*b$ is the tuple of length $l(a)+l(b)$ obtained by gluing a and b together. For $a=(a_0$ , …, $a_{l-1})$ of length l, denote by $a'=(a_1,\ldots ,a_{l-1})$ the ‘derivative’ tuple of length $l-1$ . If a is a tuple of numbers, denote $|a|=\sum _{i=0}^{l-1}a_i$ .
For a tuple $m=(m_0,\ldots ,m_{l-1})$ of integers from $\{1,\ldots ,p-1\}$ , denote by $\operatorname {CT}_t(\Lambda ^m)$ the constant term of the Laurent polynomial $\Lambda (t)^{m_0+m_1p+m_2p^2+\cdots +m_{l-1}p^{l-1}}$ .
Theorem 2.1 [Reference MellitMe].
Let $a,b,c$ be tuples of integers from $\{1,\ldots ,p-1\}$ , where $b,c,a'$ can be empty, that is, of length 0. Then,
We modify the statement and three-page Mellit’s proof of Theorem 2.1 and prove Theorem 2.9.
2.2 Convex polytopes
Given a positive integer r, we consider convex polytopes, which are convex hulls of finite subsets of ${\mathbb Z}^r\subset {\mathbb R}^r$ .
Definition 2.2. A tuple $(N_0, N_1,\ldots ,N_{l-1})$ of convex polytopes is called $\operatorname {admissible}$ if for any $0\leqslant i\leqslant j \leqslant l-1$ ,
2.3 Definition of ghosts
Let $\Lambda (t,z)$ be a Laurent polynomial in some variables $t=(t_1,\ldots ,t_r)$ , $z=(z_1,\ldots ,z_{r'})$ with coefficients in ${\mathbb Z}_p$ . We define the ghost terms $R_m(\Lambda )$ , $m\geqslant 0,$ as the unique sequence of Laurent polynomials in $t,z$ satisfying the following two properties.
-
(i) For each m,
$$ \begin{align*} \Lambda(t,z)^{p^m} = R_{0}(\Lambda)(t^{p^{m}}, z^{p^{m}}) +R_{1}(\Lambda)(t^{p^{m-1}}, z^{p^{m-1}})+\cdots + R_{m}(\Lambda)(t, z). \end{align*} $$ -
(ii) For each m, the coefficients of $R_m(\Lambda )(t,z)$ are divisible by $p^m$ in ${\mathbb Z}_p$ .
Properties (i) and (ii) recursively determine the polynomials $R_{m}(\Lambda )(t, z)$ . Namely,
Let $F(t,z)$ be a Laurent polynomial in $t,z$ . Let $N(F)$ be the Newton polytope of $F(t,z)$ with respect to the t variables only. Clearly,
2.4 Composed ghosts
Let $\lambda = (\Lambda _0(t,z), \ldots , \Lambda _{l-1}(t,z))$ be a tuple of Laurent polynomials with coefficients in ${\mathbb Z}_p$ . We decompose the product
into the sum of ghost terms of $\Lambda _0, \ldots , \Lambda _{l-1}$ . As the result, we obtain that $\tilde \lambda $ is the sum of the products
where $m=(m_0,\ldots ,m_{l-1})$ runs over the set of all l-tuples of integers satisfying ${0\leqslant m_i \leqslant i}$ . Clearly,
and
2.5 Indecomposable tuples
Denote by $S_k$ the set of all k-tuples $m=(m_0,\ldots ,m_{k-1})$ of integers such that $0\leqslant m_i\leqslant i$ . Put $S=\bigcup _{k=1}^\infty S_k$ . A tuple $m\in S$ is called indecomposable if it cannot be presented as $m'*m"$ for $m',m"\in S$ . Denote by $S_k^{\operatorname {ind}}$ the set of all indecomposable k-tuples and put $S^{\operatorname {ind}} = \bigcup _{k=1}^\infty S_k^{\operatorname {ind}}$ .
Lemma 2.3. If $m\in S_k^{\operatorname {ind}}$ , then $|m|\geqslant k-1$ .
Proof. If m is indecomposable, then for each $i\in \{1,\ldots ,k-1\}$ , there exists $j\geqslant i$ such that $m_j>j-i$ , that is, $j\geqslant i>j-m_j$ . The number of such i for a given j is $m_j$ . The total number of such i is $k-1$ ; therefore, the sum of $m_j$ is at least $k-1$ .
Lemma 2.4. For each $m\in S$ , there exist unique indecomposable $m^1,\ldots , m^r$ such that $m=m^1*\cdots *m^r$ .
Proof. The proof is by induction on $l(m)$ . If $l(m)=1$ , then $m=(m_0)=(0)$ and m is indecomposable. Let us prove the induction step. Let
be two decompositions into indecomposable factors. We may assume that $l(n^s)\geqslant l(m^r)$ . If $l(n^s) = l(m^r)$ , then $n^s = m^r$ . In this case, we can conclude that $m^1*\cdots *m^{r-1} = n^1*\cdots *n^{s-1}$ , and the statement follows from the induction hypothesis. If $l(n^s)> l(m^r)$ , then the sequence $n^s$ contains the sequence $m^r$ as its last $l(m^r)$ -part. This contradicts to the indecomposability of $n^s$ . The lemma is proved.
2.6 Polynomials $I_\lambda $
For an l-tuple $\lambda =(\Lambda _0(t,z),\Lambda _1(t,z),\ldots ,\Lambda _{l-1}(t,z))$ of Laurent polynomials with coefficients in ${\mathbb Z}_p$ , define
We have
by Lemma 2.3 and
Lemma 2.5. We have
where the sum is over the set of all possible decompositions of the tuple $\lambda $ into a product of tuples.
Proof. We have
For any $m\in S_{l}$ , let $m=m^1*\cdots *m^s$ be its unique indecomposition into indecomposable factors. Let $\lambda =\lambda ^1*\cdots *\lambda ^s$ be the corresponding factorization of the sequence $\lambda $ . Then,
This product contributes to the expansion of the product
into the sum, and conversely each summand in the expansion of Equation (2-2) comes from Equation (2-1) for a unique indecomposable factorization $m=m^1*\cdots *m^s$ .
2.7 Admissible tuples of Laurent polynomials
Definition 2.6. A tuple $\lambda =(\Lambda _0(t,z),\Lambda _1(t,z),\ldots ,\Lambda _{l-1}(t,z))$ of Laurent polynomials is called $\operatorname {admissible}$ if the tuple $(N(\Lambda _0(t,z)), N(\Lambda _1(t,z)),\ldots ,N(\Lambda _{l-1}(t,z)))$ of its Newton polytopes with respect to variables t is admissible.
Denote by $\operatorname {CT}_t(\Lambda )(z)$ the constant term of the Laurent polynomial $\Lambda (t,z)$ with respect to the variables t. The constant term $\operatorname {CT}_t(\Lambda )(z)$ is a Laurent polynomial in z.
Lemma 2.7. Let $\lambda =(\Lambda _0(t,z),\Lambda _1(t,z),\ldots ,\Lambda _{l-1}(t,z))$ be an admissible tuple of Laurent polynomials with coefficients in ${\mathbb Z}_p$ and $\lambda =\lambda ^1*\cdots *\lambda ^s$ . Then,
Proof. We have
Hence,
and
Thus, by induction on s, we prove the statement.
Corollary 2.8. We have
where the sum is over the set of all decompositions of $\lambda $ into a product of tuples.
2.8 Dwork congruence for tuples of Laurent polynomials
Theorem 2.9. Let $a,b,c$ be tuples of Laurent polynomials in $t,z$ with coefficients in ${\mathbb Z}_p$ , where $b,c,a'$ can be empty. Assume that the tuples $a*b, a*c, a'*b, a'*c$ of Laurent polynomials are admissible. Then,
Proof. The left-hand side and right-hand side of Equation (2-3) are
and
respectively. Since we work modulo $p^{l(a)}$ , all the terms with
may be dropped off from consideration. That inequality can be reformulated as $l(a)+l(b)+l(a)+l(c)-1 - q-s \geqslant l(a)$ , equivalently, as
Let us prove that the remaining terms in both expressions are in a bijective correspondence such that the corresponding terms are equal.
Namely, take one of the remaining summands on the left-hand side:
the summand corresponding to the presentation $a\kern1.2pt{*}\kern1.2ptb\kern1.2pt{=}\kern1.2pt x^1\kern1.2pt{*}\kern1.2pt\cdots *x^q$ , $a'\kern1.2pt{*}\kern1.2ptc\kern1.2pt{=}\kern1.2pt y^1 \kern1.2pt{*}\cdots \kern1.2pt{*}\kern1.2pt y^s$ .
Lemma 2.10. There exist indices $i\geqslant 1$ and $j\geqslant 0$ such that
Proof. If $l(x^1)=1$ , then $i=1$ and $j=0$ are the required indices.
Assume that $l(x^1)>1$ and the required i, j do not exist. Then each number in $\{2, \ldots ,l(a)\}$ cannot be represented simultaneously as $l(x^1)+\cdots +l(x^i)$ and $l(y^1)+\cdots +l(y^j) + 1$ . Therefore, the sum of the total number of $i\geqslant 1$ , such that $l(x^1)+\cdots +l(x^i)\leqslant l(a)$ , and the total number of $j\geqslant 1$ , such that $l(y^1)+\cdots +l(y^j) + 1\leqslant l(a)$ is at most $l(a)-1$ . The number of remaining i is at most $l(b)$ and the number of remaining j is at most $l(c)$ . Therefore, $q+s \leqslant l(a) - 1 + l(b) + l(c)$ , which is the same as Equation (2-5). Hence, the corresponding summand must have been dropped off. This establishes the existence of indices i and j required.
Now we return to the remaining summand in Equation (2-6). Choose the minimal indices $i\geqslant 1$ and $j\geqslant 0$ such that Equation (2-7) holds. Then it is easy to see that
and the summand in Equation (2-4) corresponding to the presentations in Equation (2-8) equals the product in Equation (2-6). This clearly gives the desired bijection.
3 $p^s$ -Approximation of a hypergeometric integral
Let $\alpha ,\beta , \gamma $ be rational numbers with $|\alpha |_p=|\beta |_p=|\gamma |_p=1$ . Consider a hypergeometric integral
where $C\subset {\mathbb C}-\{0,1,x\}$ is a contour on which the integrand takes its initial value when t encircles C. The function $I^{(C)}(x)$ satisfies the hypergeometric differential equation
This follows from Stokes’ theorem and the following identity of differential forms. Denote $\Phi (t,x) = t^\alpha (t-1)^\beta (t-x)^\gamma $ ,
Then,
The differential equation in Equation (3-2) turns into the standard hypergeometric differential equation
if $\alpha =a-c$ , $\beta =c-b-1$ , $\gamma =-a$ . If $c\notin {\mathbb Z}_{\leqslant 0}$ , then for a suitable choice of C and multiplication of the integral by a constant, the integral in Equation (3-1) can be expanded as a power series
Here, $(a)_n=\Gamma (a+n)/\Gamma (a)=\prod _{k=0}^{n-1}(a+k)$ stands for Pochhammer’s symbol.
We consider the following $p^s$ -approximation of the integral in Equation (3-1). Given a positive integer s, let $1\leqslant \alpha _s, \beta _s, \gamma _s \leqslant p^s$ be the unique positive integers such that
Define the master polynomial
and the $p^s$ -approximation polynomial $I_s(x)$ as the coefficient of $t^{p^s-1}$ in the master polynomial $\Phi _s(t,x)$ . Then,
The polynomial $I_s(x)$ has integer coefficients.
Theorem 3.1. The polynomial $I_s(x)$ is a solution of the hypergeometric differential equation in Equation (3-2) modulo $p^s$ ,
Proof. The theorem follows from Equation (3-3).
In this paper, we prove Dwork-type congruences for the $p^s$ -approximation polynomials $I_s(x)$ in several basic examples and leave general considerations for another occasion.
For more general versions of the $p^s$ -approximation construction, see [Reference Schechtman and VarchenkoSV2].
4 Function $_2F_1(\tfrac 12, \tfrac 12; 1,x)$
4.1 Polynomials $P_s(x)$
The function
satisfies the hypergeometric differential equation
Define the master polynomial
The number $M=({p^s-1})/2=({p-1})/2+({p-1})/2 p+\cdots +({p-1})/2 p^{s-1}$ is the unique positive integer such that $1\leqslant M\leqslant p^s$ and $M\equiv -1/2 \ \,(\mathrm {mod}\,{p^s})$ . Define the $p^s$ -approximation polynomial $P_s(x)$ as the coefficient of $t^{p^s-1}$ in the master polynomial $\Phi _{p^s}(t,x)$ . Then,
Define $P_0(x)=1$ .
Recall the hypergeometric function ${}_2F_1(a,b;c;x)$ . Then,
The polynomial $P_s(x)$ is a solution of the hypergeometric equation in Equation (4-2) modulo $p^s$ . This follows from Theorem 3.1 or from Equation (4-4).
4.2 Baby congruences
Let $\varphi _s(x)=(x+1)^{(p^s-1)/2}$ . Then,
This follows from $(x+1)^{p^s}\equiv (x^p+1)^{p^{s-1}}\,(\mathrm {mod}\,{p^s})$ .
Lemma 4.1. The master polynomials $\Phi _{s}(t,x)$ satisfy the baby congruence
Proof. The lemma follows from Equation (4-5).
4.3 Congruences for $P_s(x)$
Theorem 4.2. The approximation polynomials $P_s(x)$ satisfy the congruence
This theorem follows from a more general Theorem 4.4.
Using Equation (4-4), we may rewrite Equation (4-7) as the congruence
Let $\alpha $ be a rational number which is a p-adic unit, $\alpha = \alpha _0 + \alpha _1 p+ \alpha _2p^2+\cdots $ . Denote by $[\alpha ]_s$ the sum of the first s summands. Then the congruence in Equation (4-8) takes the form:
4.4 Coefficients of master polynomials
Consider
where
In particular, $C_{s,0}(x) = P_s(x)$ . Every coefficient $C_{s,j}(x)$ is a hypergeometric function:
while a hypergeometric expression in the case $j<0$ comes out from the following simple fact.
Lemma 4.3. We have $\hat \Phi _s(x/t,x) = \hat \Phi _s(t,x)$ and hence
We expand the congruence in Equation (4-6) into a congruence of polynomials $C_{s,j}(x)$ . The constant term in t gives us
The following Theorem 4.4 establishes the congruences of individual pairs of terms in Equation (4-10).
Theorem 4.4. For any k appearing in Equation (4-10),
In particular, for $k=0$ , we have the congruence in Equation (4-7).
Proof. Every index k appearing in Equation (4-10) can be written uniquely as
Using Equation (4-9), we reformulate Equation (4-11) as
and prove it. We have
It is easy to see that the $(s+1)$ -tuple of Laurent polynomials
is admissible in the sense of Definition 2.6. Now the application of Theorem 2.9 gives the congruence in Equation (4-12) and hence the congruence in Equation (4-11).
Remark 4.5. Denote $A(n,x) :={}_2F_1(-n,-n;1;x) = \sum _k \binom {n}{k}^2 x^k$ . Let
where $0\leqslant n_i<p$ . Then for any $m\in {\mathbb Z}_{\geqslant 0}$ ,
The proof follows from Theorem 2.9 and the identity
4.5 Limits of $P_s(x)$
For $\alpha \in {\mathbb Z}_p$ , there exists a unique solution $\omega (\alpha )\in {\mathbb Z}_p$ of the equation $\omega (\alpha )^p=\omega (\alpha )$ that is congruent to x modulo p. The element $\omega (\alpha )$ is called the Teichmüller representative of $\alpha $ . For $\alpha \in {\mathbb F}_p$ , $r>0$ , define the disc
Denote
see Equation (4-4). Denote
Theorem 4.6. For $s\geqslant 1$ , the rational function ${\bar P_{s+1}(x)}/{\bar P_s(x^p)}$ is regular on $\mathfrak D$ . The sequence $({\bar P_{s+1}(x)}/{\bar P_s(x^p)})_{s\geqslant 1}$ uniformly converges on $\mathfrak D$ . The limiting analytic function $f(x)$ equals the ratio ${F(x)}/{F(x^p)}$ on the disc $D_{0,1}$ , where $F(x):={}_2F_1(1/2,1/2;1;x)$ is defined by the convergent power series in Equation (4-1). We also have $|f(x)|_p=1$ for $x\in \mathfrak D$ .
Proof. We have ${\mathbb Z}_p = \bigcup _{\alpha \in {\mathbb F}_p}D_{\alpha ,1}$ and also $\mathfrak D= \bigcup _{\alpha \in {\mathbb F}_p,\, |\bar P_1(\omega (\alpha ))|_p =1} D_{\alpha ,1}$ since $\bar P_1(x)$ has coefficients in ${\mathbb Z}_p$ . In particular, $D_{0,1}\subset \mathfrak D$ . We also have
for the same reason.
We have $\bar P_s(x) \equiv \bar P_1(x) \bar P_1(x^p)\cdots \bar P_1(x^{p^{s-1}}) \ \,(\mathrm {mod}\,{p})$ . Indeed, the polynomial $ P_s(x)$ is the coefficient of $x^{p^s-1}$ in the master polynomials $\Phi _{p^s}(t,x)$ and
Hence, $|\bar P_s(x)|_p = |\bar P_s(x^p)|_p=1$ for $s\geqslant 1$ , $x\in \mathfrak D$ . Hence, the rational functions ${\bar P_{s+1}(x)}/{\bar P_s(x^p)}$ are regular on $\mathfrak D$ .
The congruence in Equation (4-7) implies that
This shows the uniform convergence of our sequence of rational functions on the domain $\mathfrak D$ . For the limiting function $f(x)$ , we have $|f(x)|_p=1$ for $x\in \mathfrak D$ .
Clearly, for any fixed index k, the coefficient $\binom {(p^s-1)/2}{k}^2$ of $x^k$ in $\bar P_s(x)$ converges p-adically to the coefficient $\binom {-1/2}{k}^2$ of $x^k$ in $F(x)$ . Hence, the sequence $(\bar P_s(x))_{s\geqslant 1}$ converges to $F(x)$ on $D_{0,1}$ , so that $f(x) ={F(x)}/{F(x^p)}$ on $D_{0,1}$ . The theorem is proved.
Dwork gives in [Reference DworkDw] a different construction of analytic continuation of the ratio ${F(x)}/{F(x^p)}$ from $D_{0,1}$ to a larger domain. He considers the sequence of polynomials
which are truncations of the hypergeometric series $F(x)$ , and shows that the sequence of rational functions $({F_{s+1}(x)}/{F_s(x^p)})_{s\geqslant 1}$ uniformly converges on the domain $\mathfrak D^{\operatorname {Dw}} =\{x\in {\mathbb Z}_p\ |\ |g(x)|_p=1\}$ , where the polynomial
is attributed by Dwork to Igusa [Reference IgusaIg]. Clearly, his limiting function $f^{\operatorname {Dw}}(x)$ equals the ratio ${F(x)}/{F(x^p)}$ on $D_{0,1}$ .
It is easy to see that the two sequences of rational functions $({\bar P_{s+1}(x)}/{\bar P_s(x^p)})_{s\geqslant 1}$ and $({F_{s+1}(x)}/{F_s(x^p)})_{s\geqslant 1}$ have the same limiting functions on the same domain. Indeed, $\bar P_1(x)\equiv g(x) \ \,(\mathrm {mod}\,{p})$ and hence $\mathfrak D=\mathfrak D^{\operatorname {Dw}}$ . Also, $f(x) = f^{\operatorname {Dw}}(x)$ on $D_{0,1}$ and hence on $\mathfrak D$ .
Dwork shows in [Reference DworkDw] interesting properties of the function $f(x)$ . For example, let $\alpha \in {\mathbb F}^\times _p-\{1\}$ be such that $\omega (\alpha ) \in {\mathfrak D}$ . Dwork shows that the zeta function of the elliptic curve defined over ${\mathbb F}_p$ by the equation $y^2=x(x-1)(x-\alpha )$ has two zeros, which are $1/((-1)^{(p-1)/2} f(\omega (\alpha )))$ and $(-1)^{(p-1)/2} f(\omega (\alpha ))/p$ . Clearly, $|f(\omega (\alpha ))|_p=1$ . The number $(-1)^{(p-1)/2} f(\omega (\alpha ))$ is called the unit root of that elliptic curve.
According to our discussion, this unit root can be calculated as the value at $x=\omega (\alpha )$ of the limit as $s\to \infty $ of the ratio ${\bar P_{s+1}(x)}/{\bar P_s(x^p)}$ of approximation polynomials multiplied by $(-1)^{(p-1)/2}$ .
5 Function $_2F_1(2/3, \tfrac 13; 1;x)$
5.1 Two hypergeometric integrals
The integral
where $C\subset {\mathbb C}-\{0,1,x\}$ is a contour on which the integrand takes its initial value when t encircles C, satisfies the hypergeometric differential equation
For a suitable choice of C, the integral $I^{(C)}(x)$ presents the hypergeometric function
The integral
where $D\subset {\mathbb C}-\{0,1,x\}$ is a contour on which the integrand takes its initial value when t encircles D, satisfies the same hypergeometric differential equation. For a suitable choice of D, the integral $J^{(D)}(x)$ presents the same hypergeometric function $_2F_1(2/3,\tfrac 13; 1,x)$ .
The differential form $t^{-1/3}(t-1)^{-1/3}(t-z)^{-2/3}\,dt$ is transformed to the differential form $-t^{-2/3}(t-1)^{-2/3}(t-z)^{-1/3}\,dt$ by the change of variable $t\mapsto (t-z)/(t-1)$ .
In this section, we discuss the $p^s$ -approximations of the integrals $I^{(C)}(x)$ and $J^{(D)}(x)$ .
5.2 The case $p=3\ell +1$
The master polynomial for $I^{(C)}(x)$ is given by the formula
The $p^s$ -approximation polynomial $Q_s(x)$ is defined as the coefficient of $t^{p^s-1}$ in $\Phi _s(t,x)$ ,
Define $Q_0(x)=1$ . We have
since $(-1)^{(p^s-1)/3}=1$ .
The polynomial $Q_s(x)$ is a solution of the hypergeometric equation in Equation (5-1) modulo $p^s$ . This follows from Theorem 3.1 or from Equation (5-2).
The master polynomial for $J^{(D)}(x)$ is given by the formula
The $p^s$ -approximation polynomial $R_s(x)$ is defined as the coefficient of $t^{p^s-1}$ in $\Psi _s(t,x)$ ,
Define $R_0(x)=1$ . We have
The master polynomials satisfy the baby congruences,
by Equation (4-5).
Theorem 5.1. For $p=3\ell +1$ , the approximation polynomials $R_s(x)$ and $Q_s(x)$ satisfy the congruences
Proof. Denote $\hat \Phi _1(t,x) = (t-1)^{\ell }(1-x/t)^{2\ell }$ . Then, $Q_{s+1}(x) = \operatorname {CT}_t[\hat \Phi _1(t,x)^{1+p+\cdots +p^s}]$ . It is easy to see that the $(s+1)$ -tuple $(\hat \Phi _1(t,x),\hat \Phi _1(t,x), \,\ldots , \hat \Phi _1(t,x))$ of Laurent polynomials is admissible in the sense of Definition 2.6. Now the application of Theorem 2.9 gives the congruence in Equation (5-5). The congruence in Equation (5-6) is proved in the same way applied to the formula $R_{s+1}(x) = \operatorname {CT}_t[\hat \Psi _1(t,x)^{1+p+\cdots +p^s}]$ , where $\hat \Psi _1(t,x) = (t-1)^{2\ell }(1-x/t)^{\ell }$ . The congruence in Equation (5-6) also follows from Equation (5-5) since $R_s(x)=Q_s(x)$ .
Equations (5-2) and (5-3) imply that for $p=3\ell +1$ , $s\geqslant 1$ ,
Using the expansions
we conclude that for $p=3\ell +1$ and $s\geqslant 1$ ,
5.3 The case $p=3\ell +2>2$
The master polynomial for $I^{(C)}(x)$ is given by the formulas
The $p^s$ -approximation polynomial $Q_s(x)$ is defined as the coefficient of $t^{p^s-1}$ in $\Phi _s(t,x)$ ,
Define $Q_0(x)=1$ . We have
Here we use the fact that for $p=3\ell +2>2$ , we have $(-1)^{(2p^s-1)/3}=-1 $ for odd s and $(-1)^{(p^s-1)/3}=1$ for even s.
The polynomial $Q_s(x)$ is a solution of the hypergeometric equation in Equation (5-1) modulo $p^s$ . This follows from Theorem 3.1 or from Equation (5-8).
The master polynomial for $J^{(D)}(x)$ is given by the formulas
The $p^s$ -approximation polynomial $R_s(x)$ is defined as the coefficient of $t^{p^s-1}$ in $\Phi _s(t,x)$ ,
Define $R_0(x)=1$ . We have
Here we use the fact that for $p=3\ell +2>2$ , we have $(-1)^{(p^s-2)/3}=-1$ for odd s and $(-1)^{(2p^s-1)/3}=1 $ for even s.
The polynomial $R_s(x)$ is a solution of the hypergeometric equation in Equation (5-1) modulo $p^s$ . This follows from Theorem 3.1 or from Equation (5-9).
Lemma 5.2. The master polynomials satisfy the baby congruences,
Proof. We prove Equation (5-10) for an odd s. The case of an even s and the congruence in Equation (5-11) are proved similarly. The left-hand side of Equation (5-10) for an odd $s=2k+1$ equals
while the right-hand side equals
Now the congruence in Equation (5-10) for an odd s follows from Equation (4-5).
Theorem 5.3. For $p=3\ell +2>2$ , the approximation polynomials $R_s(x)$ and $Q_s(x)$ satisfy the congruences
Proof. We prove Equation (5-12) for an odd s. The case of an even s and the congruence in Equation (5-13) are proved similarly. Denote
It is easy to see that for an odd s,
Observe that the $(s+1)$ -tuple of Laurent polynomials $(f(t,x),g(t,x),\ldots , f(t,x), g(t,x))$ is admissible in the sense of Definition 2.6. Now the application of Theorem 2.9 gives the congruence in Equation (5-12) for an odd s.
Remark 5.4. In general, we may take any admissible tuple of Laurent polynomials and obtain the corresponding Dwork congruences. For example, the tuples $(f,g,f,f,f,g,g,f,\ldots )$ and $(f,f,\ldots )$ are admissible.
Using Equations (5-8) and (5-9), we may reformulate the congruences in Equations (5-12) and (5-13) as
Recall that in these congruences, we have $p=3\ell +2$ .
Consider the p-adic presentations
Recall that $[-1/3]_s$ (respectively, $[-2/3]_s$ ) is the sum of the first s summands in Equation (5-16) (respectively, (5-17)). Then the congruences in Equations (5-14) and (5-15) imply that for $p=3\ell +2$ , $s\geqslant 1$ ,
5.4 Limits of $\overline {Q}_s(x)$
Define
Then for any prime $p>3$ ,
by Equations (5-7) and (5-18).
Theorem 5.5. For any prime $p>3$ and integer $s\geqslant 1$ , the rational function ${\bar Q_{s+1}(x)}/{\bar Q_s(x^p)}$ is regular on the domain
The sequence $({\bar Q_{s+1}(x)}/{\bar Q_s(x^p)})_{s\geqslant 1}$ uniformly converges on $\mathfrak D$ . The limiting analytic function $f(x)$ equals the ratio ${F(x)}/{F(x^p)}$ on the disc $D_{0,1}$ , where $F(x):={}_2F_1(2/3,1/3;1;x)$ is defined by the corresponding convergent power series.
Proof. The proof is the same as the proof of Theorem 4.6.
5.5 Remark
Although the congruences in Equations (5-7) and (5-18) look the same for $p=3\ell +1$ and $p=3\ell +2$ , the proofs of them are different as already presented. The proof of Equation (5-7) for $p=3\ell +1$ uses just any one of the two master polynomials: $\Phi _s(t,x)$ or $\Psi _s(t,x)$ , while the proof of Equation (5-18) for $p=3\ell +2$ uses the interaction of the two master polynomials $\Phi _s(t,x)$ and $\Psi _s(t,x)$ . See the baby congruences in Equations (5-4), (5-10), (5-11).
5.6 Congruences related to $-\tfrac 15$ , $-\tfrac 25$ , $-\tfrac 35$ , $-\tfrac 45$
In this section, we formulate the congruences related to the above rational numbers. The proof of these congruences is similar to the corresponding proofs in Sections 4 and 5.
For $p=5\ell \pm 2$ and any $s\geqslant 1$ ,
For $p=5\ell \pm 1$ and any $s\geqslant 1$ ,
Similar congruences hold for rational numbers of the form $a/b$ , where b is a prime and $1-b\leqslant a \leqslant -1$ . These congruences are described somewhere else.
6 KZ equations
6.1 KZ equations
Let ${\mathfrak g}$ be a simple Lie algebra with an invariant scalar product. The Casimir element is
where $(h_i)\subset {\mathfrak g}$ is an orthonormal basis. Let $V=\otimes _{i=1}^n V_i$ be a tensor product of ${\mathfrak g}$ -modules, $\kappa \in {\mathbb C}^\times $ a nonzero number. The KZ equations are the system of differential equations on a V-valued function $I(z_1,\ldots ,z_n)$ ,
where $\Omega _{i,j}:V\to V$ is the Casimir operator acting in the i th and j th tensor factors; see [Reference Knizhnik and ZamolodchikovKZ, Reference Etingof, Frenkel and KirillovEFK].
This system is a system of Fuchsian first-order linear differential equations. The equations are defined on the complement in ${\mathbb C}^n$ to the union of all diagonal hyperplanes.
The object of our discussion is the following particular case. We consider the following system of differential and algebraic equations for a column $3$ -vector $I=(I_1,I_2,I_3)$ depending on variables $z=(z_1,z_2,z_3)$ :
where $\Omega _{ij}=\Omega _{ji}$ and
Denote
Then the KZ equations can be written as the system of equations,
System in Equation (6-1) is the system of KZ equations with parameter $\kappa =2$ associated with the Lie algebra ${\mathfrak {sl}}_2$ and the subspace of singular vectors of weight $1$ of the tensor power $({\mathbb C}^2)^{\otimes 3}$ of two-dimensional irreducible ${\mathfrak {sl}}_2$ -modules, up to a gauge transformation; see this example in [Reference VarchenkoV2, Section 1.1].
6.2 Solutions over ${\mathbb C}$
Define the master function
and the column 3-vector
where $C\subset {\mathbb C}-\{z_1,z_2,z_3\}$ is a contour on which the integrand takes its initial value when t encircles C.
Theorem 6.1 (See [Reference VarchenkoV4]).
The function $I^{(C)}(z)$ is a solution of the system in Equation (6-1).
This theorem is a very particular case of the results in [Reference Schechtman and VarchenkoSV1].
Proof. The theorem follows from Stokes’ theorem and the two identities:
where $\Psi ^i(t,z)$ is the column $3$ -vector $(0,\ldots ,0,-{\Phi (t,z)}/{t-z_i},0,\ldots ,0)$ with the nonzero element at the i th place.
Theorem 6.2 (See [Reference VarchenkoV1, Equation (1.3)]).
All solutions of the system in Equation (6-1) have this form. Namely, the complex vector space of solutions of the form in Equation (6-2) is $2$ -dimensional.
6.3 Solutions as vectors of first derivatives
Consider the elliptic integral
Then,
Denote $\nabla T = ( {\partial T}/{\partial z_1}, {\partial T}/{\partial z_2}, {\partial T}/{\partial z_3})$ . Then the column gradient vector of the function $T(z)$ satisfies the following system of (KZ) equations:
This is a system of second-order linear differential equations on the function $T(z)$ .
6.4 Solutions modulo $p^s$
For an integer $s\geqslant 1$ , define the master polynomial
Define the column 3-vector
as the coefficient of $t^{p^s-1}$ in the polynomial
Theorem 6.3 [Reference VarchenkoV4].
The polynomial $I_s(z)$ is a solution of the system in Equation (6-1) modulo $p^s$ .
Proof. We have the following modifications of the identities in Equations (6-3), (6-4):
where $\Psi _s^i(t,z)$ is the column $3$ -vector $(0,\ldots ,0,-{\Phi _s(t,z)}/({t-z_i}),0,\ldots ,0)$ with the nonzero element at the i th place. Theorem 6.3 follows from these identities.
6.5 $p^s$ -Approximation polynomials of $T(z)$
Define the $p^s$ -approximation polynomial $T_s(z)$ of the elliptic integral $T(z)$ as the coefficient of $t^{p^s-1}$ in the master polynomial $\Phi _s(t,z)$ ,
We put $T_0(x)=1$ .
The polynomial $T_s(z_1,z_2,z_3)$ is symmetric with respect to permutations of $z_1,z_2,z_3$ and
where $P_s(x)$ is defined in Equation (4-3). The gradient vector
of the $p^s$ -approximation polynomial $T_s(z)$ is a solution modulo $p^s$ of the system in Equation (6-1) since
Lemma 6.4. For $s\geqslant 1$ , the master polynomials satisfy the baby congruences,
Theorem 6.5. For $s\geqslant 1$ ,
Proof. Let $h(t,z) = t^{1-p} ((t-z_1)(t-z_2)(t-z_3))^{(p-1)/2}$ . Then,
The tuple of Laurent polynomials $(h(t,z),h(t,z), \ldots )$ is admissible in the sense of Definition 2.6. Now the application of Theorem 2.9 gives the congruence in Equation (6-7).
6.6 Limits of $T_s(z)$
Denote $\bar T_s(z) := (-1)^{(p^s-1)/2} T_s(z)$ ; see Equation (6-6),
Notice that ${\mathbb Z}_p^3 = \bigcup _{\alpha ,\beta ,\gamma \in {\mathbb F}_p} D_{\alpha ,1}\times D_{\beta ,1}\times D_{\gamma ,1}$ . Since $\bar T_1(z)$ has coefficients in ${\mathbb Z}_p$ ,
where the summation ${\bigcup }^o$ is over all $\alpha ,\beta ,\gamma \in {\mathbb F}_p$ such that $|T(\omega (\alpha ),\omega (\beta ),\omega (\gamma ))|_p=1$ . For the same reason,
Denote
Lemma 6.6. We have $ \mathfrak E\subset D_{1,1}\times D_{0,1}\times D_{0,1}\subset \mathfrak D$ .
Proof. The first inclusion is clear. The second inclusion follows from the equality $\bar T_s(1,0,0)=1$ .
Theorem 6.7. For $s\geqslant 1$ , the rational function ${\bar T_{s+1}(z)}/{\bar T_s(z^p)}$ is regular on $\mathfrak D$ . The sequence $({\bar T_{s+1}(z)}/{\bar T_s(z^p)})_{s\geqslant 1}$ uniformly converges on $\mathfrak D$ . The limiting analytic function $f(z)$ , restricted to $\mathfrak E$ , equals the ratio ${F(z_2)}/{F(z_2^p)}$ , where $F(z_2):={}_2F_1(1/2,1/2;1;z_2)$ is defined by the convergent power series in Equation (4-1). We also have $|f(z)|_p=1$ for every $z\in \mathfrak D$ .
Proof. Similarly to the proof of Theorem 4.6, we have $\bar T_s(z) \equiv \bar T_1(z) \bar T_1(z^p)\cdots \bar T_1(x^{p^{s-1}}) (\mathrm { mod}\,{p})$ . Hence, $|\bar T_s(z)|_p = |\bar T_s(z^p)|_p=1$ for $s\geqslant 1$ , $z\in \mathfrak D$ . Hence, the rational functions ${\bar T_{s+1}(z)}/{\bar T_s(z^p)}$ are regular on $\mathfrak D$ .
The congruence in Equation (6-7) implies that
This shows the uniform convergence of $({\bar T_{s+1}(z)}/{\bar T_s(z^p)})_{s\geqslant 1}$ on $\mathfrak D$ . For the limiting function $f(z)$ , we have $|f(z)|_p=1$ for $z\in \mathfrak D$ .
We have $\bar T_s(1,z_2,0) = \bar P_s(z_2) = \sum _{k} \binom {(p^s-1)/2}{k}^2z_2^k$ . Clearly, for any fixed index k, the coefficient $\binom {(p^s-1)/2}{k}^2$ of $z_2^k$ in $\bar T_s(1,z_2,0)$ converges p-adically to the coefficient $\binom {-1/2}{k}^2$ of $z_2^k$ in $F(z_2)$ . Hence, the sequence $(\bar T_s(1,z_2,0))_{s\geqslant 1}$ converges to $F(z_2)$ on $\mathfrak E$ , so that $f(1,z_2,0) ={F(z_2)}/{F(z^p)}$ on $\mathfrak E$ . The theorem is proved.
Remark 6.8. The analytic function $f(z)$ of Theorem 6.10 exhibits behavior very different from the behavior of the corresponding ratio $T^{(C)}(z)/T^{(C)}(z^p)$ of complex elliptic integrals.
By Theorem 6.10, the function $f(z)$ restricted to the one-dimensional discs $\{ (z_1,0,1)\in {\mathbb Z}_p^3\ |\ |z_1|_p<1 \}$ , $\{ (1,z_2,0)\in {\mathbb Z}_p^3\ |\ |z_2|_p<1 \}$ , $\{ (0,1,z_3)\in {\mathbb Z}_p^3\ |\ |z_3|_p<1 \}$ equals ${F(z_1)}/{F(z_1^p)}$ , ${F(z_2)}/{F(z_2^p)}$ , ${F(z_3)}/{F(z_3^p)}$ , respectively.
In the complex case, for the ratio $T^{(C_1)}(z)/T^{(C_1)}(z^p)$ to be equal to ${F(z_1)}/{F(z_1^p)}$ on $\{ (z_1,0,1)\in {\mathbb C}^3\ |\ |z_1|<1 \}$ , the contour $C_1$ must be the cycle on the elliptic curve $y^2=(t-z_1)t(t-1)$ vanishing at $z_1=0$ . Similarly, for $T^{(C_2)}(z)/T^{(C_2)}(z^p)$ to be equal to ${F(z_2)}/{F(z_2^p)}$ on $\{ (1,z_2,0)\in {\mathbb C}^3\ |\ |z_2|<1 \}$ , the contour $C_2$ must be the cycle on the elliptic curve $y^2=(t-1)(t-z_2)t$ vanishing at $z_2=0$ , and for $T^{(C_3)}(z)/T^{(C_3)}(z^p)$ to be equal to ${F(z_3)}/{F(z_3^p)}$ on $\{ (0,1,z_3)\in {\mathbb C}^3\ |\ |z_3|<1 \}$ , the contour $C_3$ must be the cycle on the elliptic curve $y^2=t(t-1)(t-z_3)$ vanishing at $z_3=0$ . However, these three local complex analytic functions are not restrictions of a single univalued complex analytic function due to the irreducibility of the monodromy representation of the Gauss–Manin connection associated with the family of elliptic curves $y^2=(t-z_1)(t-z_2)(t-z_3)$ .
For $i,j\in \{1,2,3\}$ and $s\geqslant 1$ , denote
Theorem 6.9. For $s\geqslant 1$ , the rational functions $\eta ^{(i)}_s(z)$ and $\eta ^{(ij)}_s(z)$ are regular on $\mathfrak D$ . The sequences of rational functions $(\eta ^{(i)}_s(z))_{s\geqslant 1}$ and $(\eta ^{(ij)}_s(z))_{\geqslant 1}$ converge uniformly on $\mathfrak D$ to analytic functions. If $\eta ^{(j)}$ and $\eta ^{(ij)}$ denote the corresponding limits, then
Proof. Denote $\delta _i=z_i({\partial }/{\partial z_i})$ . By Theorem 6.7, the sequence $(f_s)$ uniformly converges to the analytic function f on $\mathfrak D$ . Therefore, the sequence of the derivatives $(({\partial }/{\partial z_i})f_s)$ uniformly converges on $\mathfrak D$ to $({\partial }/{\partial z_i})f$ . Hence, the sequence $((\delta _if_s)/f_s)$ uniformly converges on $\mathfrak D$ to the function $(\delta _if)/f$ . At the same time,
and, more generally,
Summing the relations up with suitable weights to get telescoping, we obtain, for any $r\leqslant s$ ,
Choosing $r=[s/2]$ and taking the limit as $s\to \infty $ on both sides,
The series on the left uniformly converges on $\mathfrak D$ . Hence, there exists the limit on the right-hand side. This means that
One can further differentiate the resulting equality with respect to any of the variables $z_1,z_2,z_3$ to get, by induction, formulas for $\eta ^{(ij)}$ and more generally for $\eta ^{(ijk\ldots )}$ . Note that Equation (6-11) comes out from differentiating logarithmic derivatives.
Equations (6-9) and (6-10) follow from Equation (6-5). The theorem is proved.
Theorem 6.10. We have the following system of equations on $\mathfrak D$ :
Theorem 6.11. The column vector
is nonzero at every point $z\in \mathfrak D$ .
Proof. On the one hand, if $\vec \eta (a) =0$ for some $a\in \mathfrak D$ , then all derivatives of $\vec \eta (z)$ at a are equal to zero. This follows from the first three equations in Equation (6-12) written as
Hence, $\vec \eta (z)$ equals zero identically on $\mathfrak D$ . On the other hand, $\eta ^{(2)}(1,0,0) = F'(0)/F(0) = 1/4$ by Theorem 6.7. This contradiction implies the theorem.
6.7 Subbundle $\mathcal L \,\to \, \mathfrak D$
Denote $W=\{(I_1,I_2,I_3)\in {\mathbb Q}_p^3\ |\ I_1+I_2+I_3=0\}$ . The differential operators $\nabla ^{\operatorname {KZ}}_i$ , $i=1,2,3$ , define a connection on the trivial bundle $W\times \mathfrak D \to \mathfrak D$ , called the KZ connection. The KZ connection is flat,
The flat sections of the KZ connection are solutions of the system in Equation (6-1) of KZ equations.
For any $a\in \mathfrak D$ , let $\mathcal L_a \subset W$ be the one-dimensional vector subspace generated by $\vec \eta (a)$ . Then,
is an analytic line subbundle of the trivial bundle $W\times \mathfrak D \to \mathfrak D$ .
Theorem 6.12. The subbundle $\mathcal L \,\to \, \mathfrak D$ is invariant with respect to the KZ connection. In other words, if $s(z)$ is any section of $\mathcal L \,\to \, \mathfrak D$ , then the sections $\nabla _i s(z)$ , $i=1,2,3$ , also are sections of $\mathcal L \,\to \, \mathfrak D$ .
Proof. The theorem follows from Equation (6-13).
Remark 6.13. For any $a\in \mathfrak D$ , we may find locally a scalar analytic function $u(z)$ such that $u(z)\cdot \vec \eta (z)$ is a solution of the KZ equations in Equation (6-1). Such a function is a solution of the system of equations $ {\partial u}/{\partial z_i} = - \eta ^{(i)} u$ , $i=1,2,3$ . This system is compatible since $ {\partial \eta ^{(j)}}/{\partial z_i} = \eta ^{(ij)} - \eta ^{(i)}\eta ^{(j)} = {\partial \eta ^{(i)}}/{\partial z_j}$ .
Remark 6.14. The corresponding complex KZ connection does not have invariant line subbundles due to irreducibility of the monodromy of the KZ connection, which in our case is the Gauss–Manin connection of the family $y^2=(t-z_1)(t-z_2)(t-z_3)$ . Thus, the existence of the KZ invariant line subbundle $\mathcal L \to \mathfrak D$ is a pure p-adic feature.
Remark 6.15. The invariant subbundles of the KZ connection over ${\mathbb C}$ usually are related to some additional conformal block constructions; see [Reference Feigin, Schechtman and VarchenkoFSV, Reference Schechtman and VarchenkoSV2, Reference VarchenkoV3]. Apparently, the subbundle $\mathcal L \to \mathfrak D$ is of a different p-adic nature; see [Reference VarchenkoV4].
Remark 6.16. Following Dwork, we may expect that locally at any point $a\in \mathfrak D$ , the solutions of the KZ equations of the form $u(z)\cdot \vec \eta (z),$ where $u(z)$ is a scalar function, are given at a by power series in $z_i-a_i$ , $i=1,2,3$ , bounded in their polydiscs of convergence, while any other local solution at a is given by a power series unbounded in its polydisc of convergence; see [Reference DworkDw] and [Reference VarchenkoV4, Theorem A.4].
6.8 Other definitions of subbundle $\mathcal L \to \mathfrak D$
6.8.1 Line subbundle $\mathcal M \to \mathfrak D$
Define a polynomial $U_s(z)$ as the coefficient of $t^{p^s-1}$ in the master polynomial $\Phi _s(t+z_3,z) = ((t-(z_1-z_3))(t-(z_2-z_3))t)^{(p^s-1)/2}$ . It is easy to see that $U_1(z)=T_1(z)\ \,(\mathrm {mod}\,{p})$ . Similarly to Theorem 6.5, we conclude that
Hence, the sequence $({U_{s+1}(z)}/{U_s(z^p)})_{s\geqslant 1}$ uniformly converges to an analytic function on the domain $\mathfrak D$ defined in Equation (6-8). The vector-valued polynomial $\nabla U_s(z) =({\partial U_s}/{\partial z_1}, {\partial U_s}/{\partial z_2}, {\partial U_s}/{\partial z_3} )$ is a solution modulo $p^s$ of the KZ equations in Equation (6-1); see [Reference VarchenkoV4, Theorem 9.1], and see the proof of Theorem 6.3. Consider the function
defined on the same domain $\mathfrak D$ . Similarly to the proofs of Theorems 6.9–6.12, we conclude that the function $\vec {\mu }(z)$ is nonzero on $\mathfrak D$ and its values span an analytic line subbundle
of the trivial bundle $W\times \mathfrak D\to \mathfrak D$ ; here, $\mathcal M_a\subset W$ is the one-dimensional subspace generated by $\vec \mu (a)$ . The line subbundle $\mathcal M \,\to \, \mathfrak D$ is invariant with respect to the KZ connection.
Theorem 6.17. The line bundles $\mathcal M \,\to \, \mathfrak D$ and $\mathcal L \,\to \, \mathfrak D$ coincide.
Proof. The proof rests on the following two lemmas.
Lemma 6.18. The line bundles $\mathcal M \,\to \, \mathfrak D$ and $\mathcal L \,\to \, \mathfrak D$ coincide if there is $a\in \mathfrak D$ such that $\mathcal M_a=\mathcal L_a$ .
Proof. Let $\mathcal M_a=\mathcal L_a$ for some $a\in \mathfrak D$ . Then, $\mathcal M_z = \mathcal L_s$ in some neighborhood of a, since locally the subbundles are generated by the values of the solutions with the same initial condition at $z=a$ . Hence, $\mathcal M_z = \mathcal L_z$ on $\mathfrak D$ .
Lemma 6.19. For $i=1,2,3$ , the functions ${\partial T_s}/{\partial z_i}(z)/ T_s(z)$ and ${\partial U_s}/{\partial z_i}(z)/ U_s(z)$ are equal on the line $z_1=1$ , $z_3=0$ .
Hence, $\mathcal M=\mathcal L$ over the points of that line and, therefore, $\mathcal M_z = \mathcal L_z$ for $z\in \mathfrak D$ .
6.8.2 Line subbundle $\mathcal N\to \hat {\mathfrak D}$
Let $\omega (x) =F'(x)/F(x)$ , where $F(x)\ \mathrm{is\ defined\ as} {}_2F_1(1/2, 1/2; 1;x)$ . We have $\omega (x) = \lim _{s\to \infty }({P_s'(x)}/{P_s(x)})$ on $D_{0,1}$ . Introduce new variables
and a vector-valued function
Define
For any $\sigma =(i,j,k)\in S_3$ , define
where $g(\lambda )$ is the Igusa polynomial in Equation (4-13).
Using Dwork’s results in [Reference DworkDw], it is shown in [Reference VarchenkoV4, Appendix] that the values of the analytic continuation of the function $\vec \omega (u)$ generate a line bundle $\mathcal N\to \hat {\mathfrak D}$ invariant with respect to the KZ connection.
Theorem 6.20. The line bundles $\mathcal M\to \mathfrak D$ and $\mathcal N \to \hat {\mathfrak D}$ coincide on $\mathfrak D\cap \hat {\mathfrak D}$ .
Thus, we identify the line bundles $\mathcal L\to \mathfrak D$ , $\mathcal M\to \mathfrak D$ , and $\mathcal N \to \hat {\mathfrak D}$ over $\mathfrak D\cap \hat {\mathfrak D}$ .
Proof. We have
Hence,
Clearly, the limit of this vector equals $\vec \omega (u)$ as $s\to \infty $ . The theorem is proved.
7 Concluding remarks
7.1 Conjectural stronger congruences for $\overline {P}_s(x)$
By Theorem 4.2, we have for polynomials $\bar P_s(x) := (-1)^{(p^s-1)/2} P_s(x)$ ,
In particular, for the coefficient of $x^{N_0+N_1p+N_2p^2 +N_3p^3}$ in $\bar P_4(x) \bar P_2(x^p) -\bar P_3(x)\bar P_3(x^p)$ ,
Computer experiments show that this sum can be split into subsums with at most four terms so that each subsum is divisible by $p^3$ . More precisely, let $0\leqslant a, b, \,c,c',d,d'\leqslant p-1$ be integers. Define
and
We expect that the integer $B(a,b;c,c';d,d')$ is divisible by $p^3$ .
More generally, define
Set
where $\operatorname {Sym}$ denotes the symmetrization with respect to the index j in $k^{(i)}_j$ in each group $k^{(i)}=(k^{(i)}_1,k^{(i)}_2)$ . Thus, the symmetrization has $2^s$ summands; the case $s=2$ of this symmetrization is displayed in Equation (7-1).
Conjecture 7.1. The integer $B(a,b;k)$ is divisible by $p^{s+1}$ .
This conjecture is supported by computer experiments and is checked for $s=1$ using [Reference GranvilleGr].
7.2 Papers [Reference Beukers and VlasenkoBV, Reference VlasenkoVl]
After reading this paper, Masha Vlasenko was able to invent a new proof of our congruences in Theorems 4.2 and 5.1 (private communication). Her proof was based on the results of [Reference Beukers and VlasenkoBV, Reference VlasenkoVl].
Acknowledgements
The authors thank Frits Beukers, Andrew Granville, Anton Mellit, Richárd Rimányi, Steven Sperber, and Masha Vlasenko for useful discussions.
Support in part by NSF grant DMS-1954266 is acknowledged.