This is another sequel to a recent post in which I showed the Riemann zeta function can be locally approximated by a polynomial, in the sense that for randomly chosen one has an approximation

where grows slowly with , and is a polynomial of degree . It turns out that in the function field setting there is an exact version of this approximation which captures many of the known features of the Riemann zeta function, namely Dirichlet -functions for a random character of given modulus over a function field. This model was (essentially) studied in a fairly recent paper by Andrade, Miller, Pratt, and Trinh; I am not sure if there is any further literature on this model beyond this paper (though the number field analogue of low-lying zeroes of Dirichlet -functions is certainly well studied). In this model it is possible to set fixed and let go to infinity, thus providing a simple finite-dimensional model problem for problems involving the statistics of zeroes of the zeta function.

In this post I would like to record this analogue precisely. We will need a finite field of some order and a natural number , and set

We will primarily think of as being large and as being either fixed or growing very slowly with , though it is possible to also consider other asymptotic regimes (such as holding fixed and letting go to infinity). Let be the ring of polynomials of one variable with coefficients in , and let be the multiplicative semigroup of monic polynomials in ; one should view and as the function field analogue of the integers and natural numbers respectively. We use the valuation for polynomials (with ); this is the analogue of the usual absolute value on the integers. We select an irreducible polynomial of size (i.e., has degree ). The multiplicative group can be shown to be cyclic of order . A Dirichlet character of modulus is a completely multiplicative function of modulus , that is periodic of period and vanishes on those not coprime to . From Fourier analysis we see that there are exactly Dirichlet characters of modulus . A Dirichlet character is said to be *odd* if it is not identically one on the group of non-zero constants; there are only non-odd characters (including the principal character), so in the limit most Dirichlet characters are odd. We will work primarily with odd characters in order to be able to ignore the effect of the place at infinity.

Let be an odd Dirichlet character of modulus . The Dirichlet -function is then defined (for of sufficiently large real part, at least) as

Note that for , the set is invariant under shifts whenever ; since this covers a full set of residue classes of , and the odd character has mean zero on this set of residue classes, we conclude that the sum vanishes for . In particular, the -function is entire, and for any real number and complex number , we can write the -function as a polynomial

where and the coefficients are given by the formula

Note that can easily be normalised to zero by the relation

In particular, the dependence on is periodic with period (so by abuse of notation one could also take to be an element of ).

Fourier inversion yields a functional equation for the polynomial :

Proposition 1 (Functional equation)Let be an odd Dirichlet character of modulus , and . There exists a phase (depending on ) such thatfor all , or equivalently that

where .

*Proof:* We can normalise . Let be the finite field . We can write

where denotes the subgroup of consisting of (residue classes of) polynomials of degree less than . Let be a non-trivial character of whose kernel lies in the space (this is easily achieved by pulling back a non-trivial character from the quotient ). We can use the Fourier inversion formula to write

where

From change of variables we see that is a scalar multiple of ; from Plancherel we conclude that

for some phase . We conclude that

The inner sum equals if , and vanishes otherwise, thus

For in , and the contribution of the sum vanishes as is odd. Thus we may restrict to , so that

By the multiplicativity of , this factorises as

From the one-dimensional version of (3) (and the fact that is odd) we have

for some phase . The claim follows.

As one corollary of the functional equation, is a phase rotation of and thus is non-zero, so has degree exactly . The functional equation is then equivalent to the zeroes of being symmetric across the unit circle. In fact we have the stronger

Theorem 2 (Riemann hypothesis for Dirichlet -functions over function fields)Let be an odd Dirichlet character of modulus , and . Then all the zeroes of lie on the unit circle.

We derive this result from the Riemann hypothesis for curves over function fields below the fold.

In view of this theorem (and the fact that ), we may write

for some unitary matrix . It is possible to interpret as the action of the geometric Frobenius map on a certain cohomology group, but we will not do so here. The situation here is simpler than in the number field case because the factor arising from very small primes is now absent (in the function field setting there are no primes of size between and ).

We now let vary uniformly at random over all odd characters of modulus , and uniformly over , independently of ; we also make the distribution of the random variable conjugation invariant in . We use to denote the expectation with respect to this randomness. One can then ask what the limiting distribution of is in various regimes; we will focus in this post on the regime where is fixed and is being sent to infinity. In the spirit of the Sato-Tate conjecture, one should expect to converge in distribution to the circular unitary ensemble (CUE), that is to say Haar probability measure on . This may well be provable from Deligne’s “Weil II” machinery (in the spirit of this monograph of Katz and Sarnak), though I do not know how feasible this is or whether it has already been done in the literature; here we shall avoid using this machinery and study what partial results towards this CUE hypothesis one can make without it.

If one lets be the eigenvalues of (ordered arbitrarily), then we now have

and hence the are essentially elementary symmetric polynomials of the eigenvalues:

One can take log derivatives to conclude

On the other hand, as in the number field case one has the Dirichlet series expansion

where has sufficiently large real part, , and the von Mangoldt function is defined as when is the power of an irreducible and otherwise. We conclude the “explicit formula”

Similarly on inverting we have

Since we also have

for sufficiently large real part, where the Möbius function is equal to when is the product of distinct irreducibles, and otherwise, we conclude that the Möbius coefficients

are just the complete homogeneous symmetric polynomials of the eigenvalues:

One can then derive various algebraic relationships between the coefficients from various identities involving symmetric polynomials, but we will not do so here.

What do we know about the distribution of ? By construction, it is conjugation-invariant; from (2) it is also invariant with respect to the rotations for any phase . We also have the function field analogue of the Rudnick-Sarnak asymptotics:

Proposition 3 (Rudnick-Sarnak asymptotics)Let be nonnegative integers. Ifis equal to in the limit (holding fixed) unless for all , in which case it is equal to

Comparing this with Proposition 1 from this previous post, we thus see that all the low moments of are consistent with the CUE hypothesis (and also with the ACUE hypothesis, again by the previous post). The case of this proposition was essentially established by Andrade, Miller, Pratt, and Trinh.

*Proof:* We may assume the homogeneity relationship

since otherwise the claim follows from the invariance under phase rotation . By (6), the expression (9) is equal to

where

and consists of copies of for each , and similarly consists of copies of for each .

The polynomials and are monic of degree , which by hypothesis is less than the degree of , and thus they can only be scalar multiples of each other in if they are identical (in ). As such, we see that the average

vanishes unless , in which case this average is equal to . Thus the expression (9) simplifies to

There are at most choices for the product , and each one contributes to the above sum. All but of these choices are square-free, so by accepting an error of , we may restrict attention to square-free . This forces to all be irreducible (as opposed to powers of irreducibles); as is a unique factorisation domain, this forces and to be a permutation of . By the size restrictions, this then forces for all (if the above expression is to be anything other than ), and each is associated to possible choices of . Writing and then reinstating the non-squarefree possibilities for , we can thus write the above expression as

Using the prime number theorem , we obtain the claim.

Comparing this with Proposition 1 from this previous post, we thus see that all the low moments of are consistent with the CUE and ACUE hypotheses:

Corollary 4 (CUE statistics at low frequencies)Let be the eigenvalues of , permuted uniformly at random. Let be a linear combination of monomials where are integers with either or . Then

The analogue of the GUE hypothesis in this setting would be the CUE hypothesis, which asserts that the threshold here can be replaced by an arbitrarily large quantity. As far as I know this is not known even for (though, as mentioned previously, in principle one may be able to resolve such cases using Deligne’s proof of the Riemann hypothesis for function fields). Among other things, this would allow one to distinguish CUE from ACUE, since as discussed in the previous post, these two distributions agree when tested against monomials up to threshold , though not to .

*Proof:* By permutation symmetry we can take to be symmetric, and by linearity we may then take to be the symmetrisation of a single monomial . If then both expectations vanish due to the phase rotation symmetry, so we may assume that and . We can write this symmetric polynomial as a constant multiple of plus other monomials with a smaller value of . Since , the claim now follows by induction from Proposition 3 and Proposition 1 from the previous post.

Thus, for instance, for , the moment

is equal to

because all the monomials in are of the required form when . The latter expectation can be computed exactly (for any natural number ) using a formula

of Baker-Forrester and Keating-Snaith, thus for instance

and more generally

when , where are the integers

and more generally

(OEIS A039622). Thus we have

for if and is sufficiently slowly growing depending on . The CUE hypothesis would imply that that this formula also holds for higher . (The situation here is cleaner than in the number field case, in which the GUE hypothesis only suggests the correct lower bound for the moments rather than an asymptotic, due to the absence of the wildly fluctuating additional factor that is present in the Riemann zeta function model.)

Now we can recover the analogue of Montgomery’s work on the pair correlation conjecture. Consider the statistic

where

is some finite linear combination of monomials independent of . We can expand the above sum as

Assuming the CUE hypothesis, then by Example 3 of the previous post, we would conclude that

This is the analogue of Montgomery’s pair correlation conjecture. Proposition 3 implies that this claim is true whenever is supported on . If instead we assume the ACUE hypothesis (or the weaker Alternative Hypothesis that the phase gaps are non-zero multiples of ), one should instead have

for arbitrary ; this is the function field analogue of a recent result of Baluyot. In any event, since is non-negative, we unconditionally have the lower bound

By applying (12) for various choices of test functions we can obtain various bounds on the behaviour of eigenvalues. For instance suppose we take the Fejér kernel

Then (12) applies unconditionally and we conclude that

The right-hand side evaluates to . On the other hand, is non-negative, and equal to when . Thus

The sum is at least , and is at least if is not a simple eigenvalue. Thus

and thus the expected number of simple eigenvalues is at least ; in particular, at least two thirds of the eigenvalues are simple asymptotically on average. If we had (12) without any restriction on the support of , the same arguments allow one to show that the expected proportion of simple eigenvalues is .

Suppose that the phase gaps in are all greater than almost surely. Let is non-negative and non-positive for outside of the arc . Then from (13) one has

so by taking contrapositives one can force the existence of a gap less than asymptotically if one can find with non-negative, non-positive for outside of the arc , and for which one has the inequality

By a suitable choice of (based on a minorant of Selberg) one can ensure this for for large; see Section 5 of these notes of Goldston. This is not the smallest value of currently obtainable in the literature for the number field case (which is currently , due to Goldston and Turnage-Butterbaugh, by a somewhat different method), but is still significantly less than the trivial value of . On the other hand, due to the compatibility of the ACUE distribution with Proposition 3, it is not possible to lower below purely through the use of Proposition 3.

In some cases it is possible to go beyond Proposition 3. Consider the mollified moment

where

for some coefficients . We can compute this moment in the CUE case:

Proposition 5We have

*Proof:* From (5) one has

hence

where we suppress the dependence on the eigenvalues . Now observe the Pieri formula

where are the hook Schur polynomials

and we adopt the convention that vanishes for , or when and . Then also vanishes for . We conclude that

As the Schur polynomials are orthonormal on the unitary group, the claim follows.

The CUE hypothesis would then imply the corresponding mollified moment conjecture

(See this paper of Conrey, and this paper of Radziwill, for some discussion of the analogous conjecture for the zeta function, which is essentially due to Farmer.)

From Proposition 3 one sees that this conjecture holds in the range . It is likely that the function field analogue of the calculations of Conrey (based ultimately on deep exponential sum estimates of Deshouillers and Iwaniec) can extend this range to for any , if is sufficiently large depending on ; these bounds thus go beyond what is available from Proposition 3. On the other hand, as discussed in Remark 7 of the previous post, ACUE would also predict (14) for as large as , so the available mollified moment estimates are not strong enough to rule out ACUE. It would be interesting to see if there is some other estimate in the function field setting that can be used to exclude the ACUE hypothesis (possibly one that exploits the fact that GRH is available in the function field case?).

** — 1. Deriving RH for Dirichlet -functions from RH for curves — **

In this section we show how every Dirichlet -function over a function field with squarefree modulus is a factor of the zeta function of some curve over a function field up to a finite number of local factors, thus giving RH for the former as a consequence of RH for the latter (which can in turn be established by elementary methods such as Stepanov’s method, as discussed in this previous post). The non-squarefree case is more complicated (and can be established using the machinery of Carlitz modules), but we will not need to develop that case here. Thanks to Felipe Voloch and Will Sawin for explaining some of the arguments in this section (from this MathOverflow post).

Let be the order of the Dirichlet character in question. We first deal with the simplest case, in which the modulus is irreducible, and divides ; furthermore we assume that in , that is to say at least one of and is even.

In this case, we will show that

up to a finite number of local factors, where ranges over all Dirichlet characters of modulus and order at most , and is the curve over . Taking logarithmic derivatives, this amounts to requiring the identity

The term is only non-zero when is of the form for some and some irreducible of degree , in which case . Each such has distinct roots in , which by the Frobenius action can be given as . Each such root can be a choice for on the right-hand side of (15), and gives choices for if is an power in , and otherwise. Thus we reduce to showing that for all but finitely many , we have the power reciprocity law

We can exclude the case , so are now coprime. Let denote the degree of . As is assumed irreducible, the multiplicative group is cyclic of order . From this it is easy to see that is equal to when , and zero otherwise. Thus we need to show that

Let denote the roots of (in some extension of ). The condition can be rewritten as

Factoring , this becomes

Using resultants, this is just

In a similar vein, as the multiplicative group of is cyclic of order , one has for some if and only if

which on factoring out (and noting that ) becomes

or using resultants

As , we obtain the claim in this case.

Next, we continue to assume that and , but now allow to be the product of distinct irreducibles . The multiplicative group now splits by the Chinese remainder theorem as the direct product of the cyclic groups , . It is then not difficult to repeat the above arguments, replacing by the curve

we leave the details to the reader.

Finally, we now remove the hypotheses that and . As is square-free, the Euler totient function is the product of quantities of the form and is thus coprime to ; in particular, as must divide this Euler totient function, is also coprime to . There must then exist some power of such that ; if is odd, one can also ensure that , thus in either case we have . Thus to reduce to the previous case, we somehow need to change to (note that will be squarefree in any field extension, since finite fields are perfect).

Let be a degree field extension of , then is a degree extension of . Let be the norm map. Let be the composition of the original Dirichlet character with the norm map; this can then be checked to be a Dirichlet character on with modulus , of order dividing . We claim that

where ranges over the complex roots of unity, which allows us to establish RH for from that of . Taking logarithms, we see that it suffices to show that

or equivalently

for all . But each that gives a non-zero contribution on the right-hand side is the power of some irreducible in , which then splits into (say) distinct irreducibles in , with degree that of , and all of norm . This gives contributions to the right-hand side, each of which is times that of the left-hand side; conversely, every term in the right-hand side arises precisely once in this fashion. The claim follows.

## 12 comments

Comments feed for this article

17 May, 2019 at 2:09 pm

BRYou mention the large q, fixed N limit and there is indeed a result in this case due to Katz (the main theorem in http://web.math.princeton.edu/~nmk/dirichlet32.pdf. For a re-statement in language closer to analytic number theory, see Theorem 5.2 in this paper of Keating and Rudnick: https://arxiv.org/pdf/1204.0708.pdf). It would be interesting if there was an elementary way to see this equidistribution result if only for characters modulo some ‘fixed’ small squarefree polynomial e.g. X^3+1. I would guess the consensus is that it requires some machinery even in this very simple setting.

In your previous post you mentioned that one can sometimes go a little further using estimates for Kloosterman sums. Are function field analogues of these already worked out some place?

18 May, 2019 at 8:11 am

Will SawinEven for a cubic polynomial the problem is going to be quite tricky. The problem you typically face is given a sum over characters, because there is no way to attack it directly, the only route seems to be to use orthogonality of characters to return it to a sum over polynomials. But a very strong estimate for the resulting sums follows from the Reimann hypothesis from the resulting L-functions. This estimate is hard to prove other than by returning to the world of L-functions and using either non-elementary means or Bombieri-Stepanov, and to prove equidistribution you need to get an even stronger estimate, without resorting to L-functions!

The super strong form of the Riemann hypothesis due to Deligne (combined with categorical arguments) provides a solution, as in the result of Katz you mention, but this only gets an additional factor of sqrt(q) beyong the Riemann savings, where really (except in the case of cubic poynomials, I guess) much more should be true.

With regards to Kloosterman sums, I believe the mentioned estimates over the integers involve passing through automorphic problems using tools like the Kusnetsov formula. These tools are not fully developed over function fields, so the best current results on the fourth moment (see e.g. https://arxiv.org/abs/1901.06295) give only a little cancellation in the error term.

18 May, 2019 at 10:58 am

Will Sawin1. I suspect elementary methods to rule out the alternative hypothesis and other similar scenarios are not available. One reason this may be tricky is that, if we change the rules slightly (allow to be reducible and work with even characters) then a version of the alternate hypothesis can actually happen:

Suppose that is a power of and that . In this case a result of Katz shows the relevant monodromy group is finite, which has some surprising consequences (all spacings are integer multiples of some fixed spacing, but unlike in the alternative hypothesis, a spacing of zero happens with positive probability). So any method to rule out this kind of behavior over function fields must somehow fail badly in this case.

2. As BR mentioned, the full equidistribution is known, by work of Katz, in the limit, via a geometric method, as one of several recent papers on -function equidistribution over function fields. Let me mention an interesting fact about them:

These geometric results always require calculation of a monodromy group. There are two main strategies for the calculation of this group. One of these (not the one Katz used for this particular result, though it can probably be proved by this method) relies on as the key statement exactly the story of low-frequency equidistribution you prove in Proposition 3 and Corollary 4. Groups have a rigidity property that allows one to deduce high-frequency equidistribution from low-frequency equidistribution.

3. For the large and fixed case of these problems (or even for growing faster than a constant multiple of ) we do not know much more here than over the integers (and on some things, know less).

In some of my own research I have guessed that the best approach to the problem is to study the average of the trace of one irreducible representation of at a time. For this reason it might be interesting to look at the average of traces of irreducible representations over the distribution of the alternative hypothesis, and see how they grow or shrink with N and with the highest weight of the irreducible representation. Do you know how these averages behave?

18 May, 2019 at 8:28 pm

Terence Tao1. is quite interesting! It means that in addition to the ACUE distribution, there is another “fake CUE” distribution that is consistent with all the low-frequency CUE statistics but now has a positive proportion of repeated zeroes. So there is a limit to what one can do to bound the proportion of repeated zeroes from this sort of information. I assume the explicit form of distribution can be extracted from Katz’s work?

[EDIT: I wonder if there is any relation between Katz’s examples and the examples of Conrad-Conrad-Gross regarding the failure of the polynomial Chowla conjecture in function fields for certain specific polynomials in low characteristic?]

As for your last question, one can compute for any two Schur polynomials (where by abuse of notation we use to denote applied to the eigenvalues of ) without too much difficulty. For CUE this is just by character orthogonality. Returning to ACUE, this amounts (up to some normalisation factors) to sampling on the roots of unity, which by Poisson summation amounts to summing the Fourier integrals for . Using the determinantal formula for the Schur polynomials, which cancels off the Vandermonde determinant factors, one ends up with the combinatorial problem of counting solutions to the equation for permutations (weighted by ), where is the long word (and I guess dividing by to normalise). For comparison with the CUE case, one is counting solutions to (weighted by ), which after dividing by is just .

19 May, 2019 at 8:38 am

Will SawinUnfortunately 1 may not be as interesting as you think. The relevant distribution is valid only for . I convinced myself that the monodromy group is the central extension of , so one just takes the distribution of the characteristic polynomials of random elements of this central extension inside (and then rotates them around the unit circle, if desired. Such a measure satisfies your Corollary 4 except with replaced by the stronger bound of .

But this is an exceptional phenomenon in low dimension and it is not possible for a measure arising from a group to satisfy Corollary 4 but not full equidistribution if is large ( should be enough). Since large is what is relevant for the actual Riemann zeta function, groups will not tell us anything new about zeta. Unless there is some trick where the case can be used to say something about larger ?

With regards to Conrad-Conrad-Gross, I suspect not, as Conrad-Congrad-Gross is a large-degree phenomenon, while this is a phenomenon mainly about low-degree polynomials. The Schur polynomials associated to the complete symmetric functions are related to Mobius sums, but I do not think they are the same sort of Mobius sums.

Right, so with the expectation of Schur functions over the ACUE, if you take the case natural to me where corresponds to a one-dimensional character, you can see that the normalized expectation is always or , because fixing there can be at most one choice for , and if there is one choice for a given , there is one for every . One strange consequence of this nice formula is that there could maybe be a perfectly reasonable geometric explanation for the distribution of the function field -function zeroes being for some constant , uniform in and , which would be quite strange behavior.

19 May, 2019 at 1:25 pm

shabiatiyaHi Will, in your cv, you mentioned a paper to appear with Xiochun Li. Are you sure this name is correct?

20 May, 2019 at 8:32 am

Terence TaoI think the expectation of Schur functions (divided by the normalising power of the determinant) over ACUE may also be equal to when the permutations have opposing sign.

I’ll try to work out exactly what distribution your example of the binary icosahedral group gives. When N=2 and one does the arbitrary phase rotation, the only independent random variable is , or equivalently the even moments . For CUE this variable has the quarter-circle distribution and the moments are the Catalan numbers for (i.e. equal to ). For ACUE the variable is uniformly distributed on and the moments are (i.e. equal to for ). For Dirichlet characters one is trying to count solutions to the system for monic linear polynomials and various scalars (with given a weight of and all other a weight of ); this is trivial for when has degree 3 but I don’t know how easy it is to use the Weil II machinery to count this for higher .

UPDATE: Okay, for the binary icosahedral group (viewed as a finite subgroup of the unit quaternion group) the variable takes values in with probabilities , so the moments are , given by for , where is the golden ratio and are the Lucas numbers. (This moment sequence does not appear to be in the OEIS, though for some reason https://oeis.org/A202061 is a near miss; I’ve just submitted it to the OEIS. UPDATE: Now at https://oeis.org/A308329 ) It is remarkable how closely the moments of this discrete distribution match the Catalan number moments of the semicircular distribution – as you said, one has to go to degree 12 moments before one can distinguish this distribution from CUE. (There is also presumably some representation theoretic reason as to why these moments are all integers, although it eludes me at present. EDIT: Duh. It’s the dimension of the invariant component of the tensor product of the k^th tensor power of the standard representation of the binary icosahedral group with its conjugate.)

20 May, 2019 at 9:44 am

AnonymousIt would be interesting to find the generating function of these moments (which may help explaining why they are all integers.)

21 May, 2019 at 3:30 am

VincentSorry, minor notational issue: when you say that the multiplicative group is cyclic of order $Q – 1$ do you mean order $|Q| -1 = T -1$?

[Corrected, thanks – T.]21 May, 2019 at 3:40 am

VincentSorry to keep going on about notation, but I don’t understand the second sum in the definition of the $L$-function. To me it looks like the $(n)$ should be below the bar and then above the bar there should be some extra brackets or bars indicating that you are counting the number of elements of a set. But then the expression as written comes back a number of times in the text so maybe it is written as intended? But then I there is probably some notational convention at work that I don’t know. Could you clarify? Thanks in advance!

[Oops, this was a TeX error propagated by cut and paste, now fixed – T.]7 June, 2019 at 9:57 am

Quinonsthis is why terence tao doesnt come to my sleepovers anymore: https://twitter.com/benorlin/status/1136687249653981184?s=19

2 August, 2019 at 3:00 am

Michele NardelliDear Prof. Tao,

with regard the Riemann zeta-function, in particular the value of zeta(2) = 1.6449, can be obtained interesting mathematical connections with some developements of the Ramanujan’s Mock Theta Functions, in turn connected with some sectors of Particle Physics, Cosmology and some expressions concerning the Black Hole entropies and the hypothetical mass of Dark Matter particles. The link of paper which concerns these mathematical results, also including zeta(2), is:

Thank You and best regards