The alternative hypothesis for unitary matrices

8 May, 2019 in expository, math.GR, math.NT, math.PR | Tags: Alternative hypothesis, random matrices, Riemann zeta function, Siegel zero | by Terence Tao

In a recent post I discussed how the Riemann zeta function ${\zeta}$ can be locally approximated by a polynomial, in the sense that for randomly chosen ${t \in [T,2T]}$ one has an approximation

$\displaystyle \zeta(\frac{1}{2} + it - \frac{2\pi i z}{\log T}) \approx P_t( e^{2\pi i z/N} ) \ \ \ \ \ (1)$

where ${N}$ grows slowly with ${T}$ , and ${P_t}$ is a polynomial of degree ${N}$ . Assuming the Riemann hypothesis (as we will throughout this post), the zeroes of ${P_t}$ should all lie on the unit circle, and one should then be able to write ${P_t}$ as a scalar multiple of the characteristic polynomial of (the inverse of) a unitary matrix ${U = U_t \in U(N)}$ , which we normalise as

$\displaystyle P_t(Z) = \exp(A_t) \mathrm{det}(1 - ZU). \ \ \ \ \ (2)$

Here ${A_t}$ is some quantity depending on ${t}$ . We view ${U}$ as a random element of ${U(N)}$ ; in the limit ${T \rightarrow \infty}$ , the GUE hypothesis is equivalent to ${U}$ becoming equidistributed with respect to Haar measure on ${U(N)}$ (also known as the Circular Unitary Ensemble, CUE; it is to the unit circle what the Gaussian Unitary Ensemble (GUE) is on the real line). One can also view ${U}$ as analogous to the “geometric Frobenius” operator in the function field setting, though unfortunately it is difficult at present to make this analogy any more precise (due, among other things, to the lack of a sufficiently satisfactory theory of the “field of one element“).

Taking logarithmic derivatives of (2), we have

$\displaystyle -\frac{P'_t(Z)}{P_t(Z)} = \mathrm{tr}( U (1-ZU)^{-1} ) = \sum_{j=1}^\infty Z^{j-1} \mathrm{tr} U^j \ \ \ \ \ (3)$

and hence on taking logarithmic derivatives of (1) in the ${z}$ variable we (heuristically) have

$\displaystyle -\frac{2\pi i}{\log T} \frac{\zeta'}{\zeta}( \frac{1}{2} + it - \frac{2\pi i z}{\log T}) \approx \frac{2\pi i}{N} \sum_{j=1}^\infty e^{2\pi i jz/N} \mathrm{tr} U^j.$

Morally speaking, we have

$\displaystyle - \frac{\zeta'}{\zeta}( \frac{1}{2} + it - \frac{2\pi i z}{\log T}) = \sum_{n=1}^\infty \frac{\Lambda(n)}{n^{1/2+it}} e^{2\pi i z (\log n/\log T)}$

so on comparing coefficients we expect to interpret the moments ${\mathrm{tr} U^j}$ of ${U}$ as a finite Dirichlet series:

$\displaystyle \mathrm{tr} U^j \approx \frac{N}{\log T} \sum_{T^{(j-1)/N} < n \leq T^{j/N}} \frac{\Lambda(n)}{n^{1/2+it}}. \ \ \ \ \ (4)$

To understand the distribution of ${U}$ in the unitary group ${U(N)}$ , it suffices to understand the distribution of the moments

$\displaystyle {\bf E}_t \prod_{j=1}^k (\mathrm{tr} U^j)^{a_j} (\overline{\mathrm{tr} U^j})^{b_j} \ \ \ \ \ (5)$

where ${{\bf E}_t}$ denotes averaging over ${t \in [T,2T]}$ , and ${k, a_1,\dots,a_k, b_1,\dots,b_k \geq 0}$ . The GUE hypothesis asserts that in the limit ${T \rightarrow \infty}$ , these moments converge to their CUE counterparts

$\displaystyle {\bf E}_{\mathrm{CUE}} \prod_{j=1}^k (\mathrm{tr} U^j)^{a_j} (\overline{\mathrm{tr} U^j})^{b_j} \ \ \ \ \ (6)$

where ${U}$ is now drawn uniformly in ${U(n)}$ with respect to the CUE ensemble, and ${{\bf E}_{\mathrm{CUE}}}$ denotes expectation with respect to that measure.

The moment (6) vanishes unless one has the homogeneity condition

$\displaystyle \sum_{j=1}^k j a_j = \sum_{j=1}^k j b_j. \ \ \ \ \ (7)$

This follows from the fact that for any phase ${\theta \in {\bf R}}$ , ${e(\theta) U}$ has the same distribution as ${U}$ , where we use the number theory notation ${e(\theta) := e^{2\pi i\theta}}$ .

In the case when the degree ${\sum_{j=1}^k j a_j}$ is low, we can use representation theory to establish the following simple formula for the moment (6), as evaluated by Diaconis and Shahshahani:

Proposition 1 (Low moments in CUE model) If

$\displaystyle \sum_{j=1}^k j a_j \leq N, \ \ \ \ \ (8)$

then the moment (6) vanishes unless ${a_j=b_j}$ for all ${j}$ , in which case it is equal to

$\displaystyle \prod_{j=1}^k j^{a_j} a_j!. \ \ \ \ \ (9)$

Another way of viewing this proposition is that for ${U}$ distributed according to CUE, the random variables ${\mathrm{tr} U^j}$ are distributed like independent complex random variables of mean zero and variance ${j}$ , as long as one only considers moments obeying (8). This identity definitely breaks down for larger values of ${a_j}$ , so one only obtains central limit theorems in certain limiting regimes, notably when one only considers a fixed number of ${j}$ ‘s and lets ${N}$ go to infinity. (The paper of Diaconis and Shahshahani writes ${\sum_{j=1}^k a_j + b_j}$ in place of ${\sum_{j=1}^k j a_j}$ , but I believe this to be a typo.)

Proof: Let ${D}$ be the left-hand side of (8). We may assume that (7) holds since we are done otherwise, hence

$\displaystyle D = \sum_{j=1}^k j a_j = \sum_{j=1}^k j b_j.$

Our starting point is Schur-Weyl duality. Namely, we consider the ${n^D}$ -dimensional complex vector space

$\displaystyle ({\bf C}^n)^{\otimes D} = {\bf C}^n \otimes \dots \otimes {\bf C}^n.$

This space has an action of the product group ${S_D \times GL_n({\bf C})}$ : the symmetric group ${S_D}$ acts by permutation on the ${D}$ tensor factors, while the general linear group ${GL_n({\bf C})}$ acts diagonally on the ${{\bf C}^n}$ factors, and the two actions commute with each other. Schur-Weyl duality gives a decomposition

$\displaystyle ({\bf C}^n)^{\otimes D} \equiv \bigoplus_\lambda V^\lambda_{S_D} \otimes V^\lambda_{GL_n({\bf C})} \ \ \ \ \ (10)$

where ${\lambda}$ ranges over Young tableaux of size ${D}$ with at most ${n}$ rows, ${V^\lambda_{S_D}}$ is the ${S_D}$ -irreducible unitary representation corresponding to ${\lambda}$ (which can be constructed for instance using Specht modules), and ${V^\lambda_{GL_n({\bf C})}}$ is the ${GL_n({\bf C})}$ -irreducible polynomial representation corresponding with highest weight ${\lambda}$ .

Let ${\pi \in S_D}$ be a permutation consisting of ${a_j}$ cycles of length ${j}$ (this is uniquely determined up to conjugation), and let ${g \in GL_n({\bf C})}$ . The pair ${(\pi,g)}$ then acts on ${({\bf C}^n)^{\otimes D}}$ , with the action on basis elements ${e_{i_1} \otimes \dots \otimes e_{i_D}}$ given by

$\displaystyle g e_{\pi(i_1)} \otimes \dots \otimes g_{\pi(i_D)}.$

The trace of this action can then be computed as

$\displaystyle \sum_{i_1,\dots,i_D \in \{1,\dots,n\}} g_{\pi(i_1),i_1} \dots g_{\pi(i_D),i_D}$

where ${g_{i,j}}$ is the ${ij}$ matrix coefficient of ${g}$ . Breaking up into cycles and summing, this is just

$\displaystyle \prod_{j=1}^k \mathrm{tr}(g^j)^{a_j}.$

But we can also compute this trace using the Schur-Weyl decomposition (10), yielding the identity

$\displaystyle \prod_{j=1}^k \mathrm{tr}(g^j)^{a_j} = \sum_\lambda \chi_\lambda(\pi) s_\lambda(g) \ \ \ \ \ (11)$

where ${\chi_\lambda: S_D \rightarrow {\bf C}}$ is the character on ${S_D}$ associated to ${V^\lambda_{S_D}}$ , and ${s_\lambda: GL_n({\bf C}) \rightarrow {\bf C}}$ is the character on ${GL_n({\bf C})}$ associated to ${V^\lambda_{GL_n({\bf C})}}$ . As is well known, ${s_\lambda(g)}$ is just the Schur polynomial of weight ${\lambda}$ applied to the (algebraic, generalised) eigenvalues of ${g}$ . We can specialise to unitary matrices to conclude that

$\displaystyle \prod_{j=1}^k \mathrm{tr}(U^j)^{a_j} = \sum_\lambda \chi_\lambda(\pi) s_\lambda(U)$

and similarly

$\displaystyle \prod_{j=1}^k \mathrm{tr}(U^j)^{b_j} = \sum_\lambda \chi_\lambda(\pi') s_\lambda(U)$

where ${\pi' \in S_D}$ consists of ${b_j}$ cycles of length ${j}$ for each ${j=1,\dots,k}$ . On the other hand, the characters ${s_\lambda}$ are an orthonormal system on ${L^2(U(N))}$ with the CUE measure. Thus we can write the expectation (6) as

$\displaystyle \sum_\lambda \chi_\lambda(\pi) \overline{\chi_\lambda(\pi')}. \ \ \ \ \ (12)$

Now recall that ${\lambda}$ ranges over all the Young tableaux of size ${D}$ with at most ${N}$ rows. But by (8) we have ${D \leq N}$ , and so the condition of having ${N}$ rows is redundant. Hence ${\lambda}$ now ranges over all Young tableaux of size ${D}$ , which as is well known enumerates all the irreducible representations of ${S_D}$ . One can then use the standard orthogonality properties of characters to show that the sum (12) vanishes if ${\pi}$ , ${\pi'}$ are not conjugate, and is equal to ${D!}$ divided by the size of the conjugacy class of ${\pi}$ (or equivalently, by the size of the centraliser of ${\pi}$ ) otherwise. But the latter expression is easily computed to be ${\prod_{j=1}^k j^{a_j} a_j!}$ , giving the claim. $\Box$

Example 2 We illustrate the identity (11) when ${D=3}$ , ${n \geq 3}$ . The Schur polynomials are given as

$\displaystyle s_{3}(g) = \sum_i \lambda_i^3 + \sum_{i<j} \lambda_i^2 \lambda_j + \lambda_i \lambda_j^2 + \sum_{i<j<k} \lambda_i \lambda_j \lambda_k$

$\displaystyle s_{2,1}(g) = \sum_{i < j} \lambda_i^2 \lambda_j + \sum_{i < j,k} \lambda_i \lambda_j \lambda_k$

$\displaystyle s_{1,1,1}(g) = \sum_{i<j<k} \lambda_i \lambda_j \lambda_k$

where ${\lambda_1,\dots,\lambda_n}$ are the (generalised) eigenvalues of ${g}$ , and the formula (11) in this case becomes

$\displaystyle \mathrm{tr}(g^3) = s_{3}(g) - s_{2,1}(g) + s_{1,1,1}(g)$

$\displaystyle \mathrm{tr}(g^2) \mathrm{tr}(g) = s_{3}(g) - s_{1,1,1}(g)$

$\displaystyle \mathrm{tr}(g)^3 = s_{3}(g) + 2 s_{2,1}(g) + s_{1,1,1}(g).$

The functions ${s_{1,1,1}, s_{2,1}, s_3}$ are orthonormal on ${U(n)}$ , so the three functions ${\mathrm{tr}(g^3), \mathrm{tr}(g^2) \mathrm{tr}(g), \mathrm{tr}(g)^3}$ are also, and their ${L^2}$ norms are ${\sqrt{3}}$ , ${\sqrt{2}}$ , and ${\sqrt{6}}$ respectively, reflecting the size in ${S_3}$ of the centralisers of the permutations ${(123)}$ , ${(12)}$ , and ${\mathrm{id}}$ respectively. If ${n}$ is instead set to say ${2}$ , then the ${s_{1,1,1}}$ terms now disappear (the Young tableau here has too many rows), and the three quantities here now have some non-trivial covariance.

Example 3 Consider the moment ${{\bf E}_{\mathrm{CUE}} |\mathrm{tr} U^j|^2}$ . For ${j \leq N}$ , the above proposition shows us that this moment is equal to ${D}$ . What happens for ${j>N}$ ? The formula (12) computes this moment as

$\displaystyle \sum_\lambda |\chi_\lambda(\pi)|^2$

where ${\pi}$ is a cycle of length ${j}$ in ${S_j}$ , and ${\lambda}$ ranges over all Young tableaux with size ${j}$ and at most ${N}$ rows. The Murnaghan-Nakayama rule tells us that ${\chi_\lambda(\pi)}$ vanishes unless ${\lambda}$ is a hook (all but one of the non-zero rows consisting of just a single box; this also can be interpreted as an exterior power representation on the space ${{\bf C}^j_{\sum=0}}$ of vectors in ${{\bf C}^j}$ whose coordinates sum to zero), in which case it is equal to ${\pm 1}$ (depending on the parity of the number of non-zero rows). As such we see that this moment is equal to ${N}$ . Thus in general we have

$\displaystyle {\bf E}_{\mathrm{CUE}} |\mathrm{tr} U^j|^2 = \min(j,N). \ \ \ \ \ (13)$

Now we discuss what is known for the analogous moments (5). Here we shall be rather non-rigorous, in particular ignoring an annoying “Archimedean” issue that the product of the ranges ${T^{(j-1)/N} < n \leq T^{j/N}}$ and ${T^{(k-1)/N} < n \leq T^{k/N}}$ is not quite the range ${T^{(j+k-1)/N} < n \leq T^{j+k/N}}$ but instead leaks into the adjacent range ${T^{(j+k-2)/N} < n \leq T^{j+k-1/N}}$ . This issue can be addressed by working in a “weak" sense in which parameters such as ${j,k}$ are averaged over fairly long scales, or by passing to a function field analogue of these questions, but we shall simply ignore the issue completely and work at a heuristic level only. For similar reasons we will ignore some technical issues arising from the sharp cutoff of ${t}$ to the range ${[T,2T]}$ (it would be slightly better technically to use a smooth cutoff).

One can morally expand out (5) using (4) as

$\displaystyle (\frac{N}{\log T})^{J+K} \sum_{n_1,\dots,n_J,m_1,\dots,m_K} \frac{\Lambda(n_1) \dots \Lambda(n_J) \Lambda(m_1) \dots \Lambda(m_K)}{n_1^{1/2} \dots n_J^{1/2} m_1^{1/2} \dots m_K^{1/2}} \times \ \ \ \ \ (14)$

$\displaystyle \times {\bf E}_t (m_1 \dots m_K / n_1 \dots n_J)^{it}$

where ${J := \sum_{j=1}^k a_j}$ , ${K := \sum_{j=1}^k b_j}$ , and the integers ${n_i,m_i}$ are in the ranges

$\displaystyle T^{(j-1)/N} < n_{a_1 + \dots + a_{j-1} + i} \leq T^{j/N}$

for ${j=1,\dots,k}$ and ${1 \leq i \leq a_j}$ , and

$\displaystyle T^{(j-1)/N} < m_{b_1 + \dots + b_{j-1} + i} \leq T^{j/N}$

for ${j=1,\dots,k}$ and ${1 \leq i \leq b_j}$ . Morally, the expectation here is negligible unless

$\displaystyle m_1 \dots m_K = (1 + O(1/T)) n_1 \dots n_J \ \ \ \ \ (15)$

in which case the expecation is oscillates with magnitude one. In particular, if (7) fails (with some room to spare) then the moment (5) should be negligible, which is consistent with the analogous behaviour for the moments (6). Now suppose that (8) holds (with some room to spare). Then ${n_1 \dots n_J}$ is significantly less than ${T}$ , so the ${O(1/T)}$ multiplicative error in (15) becomes an additive error of ${o(1)}$ . On the other hand, because of the fundamental integrality gap – that the integers are always separated from each other by a distance of at least ${1}$ – this forces the integers ${m_1 \dots m_K}$ , ${n_1 \dots n_J}$ to in fact be equal:

$\displaystyle m_1 \dots m_K = n_1 \dots n_J. \ \ \ \ \ (16)$

The von Mangoldt factors ${\Lambda(n_1) \dots \Lambda(n_J) \Lambda(m_1) \dots \Lambda(m_K)}$ effectively restrict ${n_1,\dots,n_J,m_1,\dots,m_K}$ to be prime (the effect of prime powers is negligible). By the fundamental theorem of arithmetic, the constraint (16) then forces ${J=K}$ , and ${n_1,\dots,n_J}$ to be a permutation of ${m_1,\dots,m_K}$ , which then forces ${a_j = b_j}$ for all ${j=1,\dots,k}$ ._ For a given ${n_1,\dots,n_J}$ , the number of possible ${m_1 \dots m_K}$ is then ${\prod_{j=1}^k a_j!}$ , and the expectation in (14) is equal to ${1}$ . Thus this expectation is morally

$\displaystyle (\frac{N}{\log T})^{J+K} \sum_{n_1,\dots,n_J} \frac{\Lambda^2(n_1) \dots \Lambda^2(n_J) }{n_1 \dots n_J} \prod_{j=1}^k a_j!$

and using Mertens’ theorem this soon simplifies asymptotically to the same quantity in Proposition 1. Thus we see that (morally at least) the moments (5) associated to the zeta function asymptotically match the moments (6) coming from the CUE model in the low degree case (8), thus lending support to the GUE hypothesis. (These observations are basically due to Rudnick and Sarnak, with the degree ${1}$ case of pair correlations due to Montgomery, and the degree ${2}$ case due to Hejhal.)

With some rare exceptions (such as those estimates coming from “Kloostermania”), the moment estimates of Rudnick and Sarnak basically represent the state of the art for what is known for the moments (5). For instance, Montgomery’s pair correlation conjecture, in our language, is basically the analogue of (13) for ${{\mathbf E}_t}$ , thus

$\displaystyle {\bf E}_{t} |\mathrm{tr} U^j|^2 \approx \min(j,N) \ \ \ \ \ (17)$

for all ${j \geq 0}$ . Montgomery showed this for (essentially) the range ${j \leq N}$ (as remarked above, this is a special case of the Rudnick-Sarnak result), but no further cases of this conjecture are known.

These estimates can be used to give some non-trivial information on the largest and smallest spacings between zeroes of the zeta function, which in our notation corresponds to spacing between eigenvalues of ${U}$ . One such method used today for this is due to Montgomery and Odlyzko and was greatly simplified by Conrey, Ghosh, and Gonek. The basic idea, translated to our random matrix notation, is as follows. Suppose ${Q_t(Z)}$ is some random polynomial depending on ${t}$ of degree at most ${N}$ . Let ${\lambda_1,\dots,\lambda_n}$ denote the eigenvalues of ${U}$ , and let ${c > 0}$ be a parameter. Observe from the pigeonhole principle that if the quantity

$\displaystyle \sum_{j=1}^n \int_0^{c/N} |Q_t( e(\theta) \lambda_j )|^2\ d\theta \ \ \ \ \ (18)$

exceeds the quantity

$\displaystyle \int_{0}^{2\pi} |Q_t(e(\theta))|^2\ d\theta, \ \ \ \ \ (19)$

then the arcs ${\{ e(\theta) \lambda_j: 0 \leq \theta \leq c \}}$ cannot all be disjoint, and hence there exists a pair of eigenvalues making an angle of less than ${c/N}$ ( ${c}$ times the mean angle separation). Similarly, if the quantity (18) falls below that of (19), then these arcs cannot cover the unit circle, and hence there exists a pair of eigenvalues making an angle of greater than ${c}$ times the mean angle separation. By judiciously choosing the coefficients of ${Q_t}$ as functions of the moments ${\mathrm{tr}(U^j)}$ , one can ensure that both quantities (18), (19) can be computed by the Rudnick-Sarnak estimates (or estimates of equivalent strength); indeed, from the residue theorem one can write (18) as

$\displaystyle \frac{1}{2\pi i} \int_0^{c/N} (\int_{|z| = 1+\varepsilon} - \int_{|z|=1-\varepsilon}) Q_t( e(\theta) z ) \overline{Q_t}( \frac{1}{e(\theta) z} ) \frac{P'_t(z)}{P_t(z)}\ dz$

for sufficiently small ${\varepsilon>0}$ , and this can be computed (in principle, at least) using (3) if the coefficients of ${Q_t}$ are in an appropriate form. Using this sort of technology (translated back to the Riemann zeta function setting), one can show that gaps between consecutive zeroes of zeta are less than ${\mu}$ times the mean spacing and greater than ${\lambda}$ times the mean spacing infinitely often for certain ${0 < \mu < 1 < \lambda}$ ; the current records are ${\mu = 0.50412}$ (due to Goldston and Turnage-Butterbaugh) and ${\lambda = 3.18}$ (due to Bui and Milinovich, who input some additional estimates beyond the Rudnick-Sarnak set, namely the twisted fourth moment estimates of Bettin, Bui, Li, and Radziwill, and using a technique based on Hall’s method rather than the Montgomery-Odlyzko method).

It would be of great interest if one could push the upper bound ${\mu}$ for the smallest gap below ${1/2}$ . The reason for this is that this would then exclude the Alternative Hypothesis that the spacing between zeroes are asymptotically always (or almost always) a non-zero half-integer multiple of the mean spacing, or in our language that the gaps between the phases ${\theta}$ of the eigenvalues ${e^{2\pi i\theta}}$ of ${U}$ are nasymptotically always non-zero integer multiples of ${1/2N}$ . The significance of this hypothesis is that it is implied by the existence of a Siegel zero (of conductor a small power of ${T}$ ); see this paper of Conrey and Iwaniec. (In our language, what is going on is that if there is a Siegel zero in which ${L(1,\chi)}$ is very close to zero, then ${1*\chi}$ behaves like the Kronecker delta, and hence (by the Riemann-Siegel formula) the combined ${L}$ -function ${\zeta(s) L(s,\chi)}$ will have a polynomial approximation which in our language looks like a scalar multiple of ${1 + e(\theta) Z^{2N+M}}$ , where ${q \approx T^{M/N}}$ and ${\theta}$ is a phase. The zeroes of this approximation lie on a coset of the ${(2N+M)^{th}}$ roots of unity; the polynomial ${P}$ is a factor of this approximation and hence will also lie in this coset, implying in particular that all eigenvalue spacings are multiples of ${1/(2N+M)}$ . Taking ${M = o(N)}$ then gives the claim.)

Unfortunately, the known methods do not seem to break this barrier without some significant new input; already the original paper of Montgomery and Odlyzko observed this limitation for their particular technique (and in fact fall very slightly short, as observed in unpublished work of Goldston and of Milinovich). In this post I would like to record another way to see this, by providing an “alternative” probability distribution to the CUE distribution (which one might dub the Alternative Circular Unitary Ensemble (ACUE) which is indistinguishable in low moments in the sense that the expectation ${{\bf E}_{ACUE}}$ for this model also obeys Proposition 1, but for which the phase spacings are always a multiple of ${1/2N}$ . This shows that if one is to rule out the Alternative Hypothesis (and thus in particular rule out Siegel zeroes), one needs to input some additional moment information beyond Proposition 1. It would be interesting to see if any of the other known moment estimates that go beyond this proposition are consistent with this alternative distribution. (UPDATE: it looks like they are, see Remark 7 below.)

To describe this alternative distribution, let us first recall the Weyl description of the CUE measure on the unitary group ${U(n)}$ in terms of the distribution of the phases ${\theta_1,\dots,\theta_N \in {\bf R}/{\bf Z}}$ of the eigenvalues, randomly permuted in any order. This distribution is given by the probability measure

$\displaystyle \frac{1}{N!} |V(\theta)|^2\ d\theta_1 \dots d\theta_N; \ \ \ \ \ (20)$

where

$\displaystyle V(\theta) := \prod_{1 \leq i<j \leq N} (e(\theta_i)-e(\theta_j))$

is the Vandermonde determinant; see for instance this previous blog post for the derivation of a very similar formula for the GUE distribution, which can be adapted to CUE without much difficulty. To see that this is a probability measure, first observe the Vandermonde determinant identity

$\displaystyle V(\theta) = \sum_{\pi \in S_N} \mathrm{sgn}(\pi) e(\theta \cdot \pi(\rho))$

where ${\theta := (\theta_1,\dots,\theta_N)}$ , ${\cdot}$ denotes the dot product, and ${\rho := (1,2,\dots,N)}$ is the “long word”, which implies that (20) is a trigonometric series with constant term ${1}$ ; it is also clearly non-negative, so it is a probability measure. One can thus generate a random CUE matrix by first drawing ${(\theta_1,\dots,\theta_n) \in ({\bf R}/{\bf Z})^N}$ using the probability measure (20), and then generating ${U}$ to be a random unitary matrix with eigenvalues ${e(\theta_1),\dots,e(\theta_N)}$ .

For the alternative distribution, we first draw ${(\theta_1,\dots,\theta_N)}$ on the discrete torus ${(\frac{1}{2N}{\bf Z}/{\bf Z})^N}$ (thus each ${\theta_j}$ is a ${2N^{th}}$ root of unity) with probability density function

$\displaystyle \frac{1}{(2N)^N} \frac{1}{N!} |V(\theta)|^2 \ \ \ \ \ (21)$

shift by a phase ${\alpha \in {\bf R}/{\bf Z}}$ drawn uniformly at random, and then select ${U}$ to be a random unitary matrix with eigenvalues ${e^{i(\theta_1+\alpha)}, \dots, e^{i(\theta_N+\alpha)}}$ . Let us first verify that (21) is a probability density function. Clearly it is non-negative. It is the linear combination of exponentials of the form ${e(\theta \cdot (\pi(\rho)-\pi'(\rho))}$ for ${\pi,\pi' \in S_N}$ . The diagonal contribution ${\pi=\pi'}$ gives the constant function ${\frac{1}{(2N)^N}}$ , which has total mass one. All of the other exponentials have a frequency ${\pi(\rho)-\pi'(\rho)}$ that is not a multiple of ${2N}$ , and hence will have mean zero on ${(\frac{1}{2N}{\bf Z}/{\bf Z})^N}$ . The claim follows.

From construction it is clear that the matrix ${U}$ drawn from this alternative distribution will have all eigenvalue phase spacings be a non-zero multiple of ${1/2N}$ . Now we verify that the alternative distribution also obeys Proposition 1. The alternative distribution remains invariant under rotation by phases, so the claim is again clear when (8) fails. Inspecting the proof of that proposition, we see that it suffices to show that the Schur polynomials ${s_\lambda}$ with ${\lambda}$ of size at most ${N}$ and of equal size remain orthonormal with respect to the alternative measure. That is to say,

$\displaystyle \int_{U(N)} s_\lambda(U) \overline{s_{\lambda'}(U)}\ d\mu_{\mathrm{CUE}}(U) = \int_{U(N)} s_\lambda(U) \overline{s_{\lambda'}(U)}\ d\mu_{\mathrm{ACUE}}(U)$

when ${\lambda,\lambda'}$ have size equal to each other and at most ${N}$ . In this case the phase ${\alpha}$ in the definition of ${U}$ is irrelevant. In terms of eigenvalue measures, we are then reduced to showing that

$\displaystyle \int_{({\bf R}/{\bf Z})^N} s_\lambda(\theta) \overline{s_{\lambda'}(\theta)} |V(\theta)|^2\ d\theta = \frac{1}{(2N)^N} \sum_{\theta \in (\frac{1}{2N}{\bf Z}/{\bf Z})^N} s_\lambda(\theta) \overline{s_{\lambda'}(\theta)} |V(\theta)|^2.$

By Fourier decomposition, it then suffices to show that the trigonometric polynomial ${s_\lambda(\theta) \overline{s_{\lambda'}(\theta)} |V(\theta)|^2}$ does not contain any components of the form ${e( \theta \cdot 2N k)}$ for some non-zero lattice vector ${k \in {\bf Z}^N}$ . But we have already observed that ${|V(\theta)|^2}$ is a linear combination of plane waves of the form ${e(\theta \cdot (\pi(\rho)-\pi'(\rho))}$ for ${\pi,\pi' \in S_N}$ . Also, as is well known, ${s_\lambda(\theta)}$ is a linear combination of plane waves ${e( \theta \cdot \kappa )}$ where ${\kappa}$ is majorised by ${\lambda}$ , and similarly ${s_{\lambda'}(\theta)}$ is a linear combination of plane waves ${e( \theta \cdot \kappa' )}$ where ${\kappa'}$ is majorised by ${\lambda'}$ . So the product ${s_\lambda(\theta) \overline{s_{\lambda'}(\theta)} |V(\theta)|^2}$ is a linear combination of plane waves of the form ${e(\theta \cdot (\kappa - \kappa' + \pi(\rho) - \pi'(\rho)))}$ . But every coefficient of the vector ${\kappa - \kappa' + \pi(\rho) - \pi'(\rho)}$ lies between ${1-2N}$ and ${2N-1}$ , and so cannot be of the form ${2Nk}$ for any non-zero lattice vector ${k}$ , giving the claim.

Example 4 If ${N=2}$ , then the distribution (21) assigns a probability of ${\frac{1}{4^2 2!} 2}$ to any pair ${(\theta_1,\theta_2) \in (\frac{1}{4} {\bf Z}/{\bf Z})^2}$ that is a permuted rotation of ${(0,\frac{1}{4})}$ , and a probability of ${\frac{1}{4^2 2!} 4}$ to any pair that is a permuted rotation of ${(0,\frac{1}{2})}$ . Thus, a matrix ${U}$ drawn from the alternative distribution will be conjugate to a phase rotation of ${\mathrm{diag}(1, i)}$ with probability ${1/2}$ , and to ${\mathrm{diag}(1,-1)}$ with probability ${1/2}$ .

A similar computation when ${N=3}$ gives ${U}$ conjugate to a phase rotation of ${\mathrm{diag}(1, e(1/6), e(1/3))}$ with probability ${1/12}$ , to a phase rotation of ${\mathrm{diag}( 1, e(1/6), -1)}$ or its adjoint with probability of ${1/3}$ each, and a phase rotation of ${\mathrm{diag}(1, e(1/3), e(2/3))}$ with probability ${1/4}$ .

Remark 5 For large ${N}$ it does not seem that this specific alternative distribution is the only distribution consistent with Proposition 1 and which has all phase spacings a non-zero multiple of ${1/2N}$ ; in particular, it may not be the only distribution consistent with a Siegel zero. Still, it is a very explicit distribution that might serve as a test case for the limitations of various arguments for controlling quantities such as the largest or smallest spacing between zeroes of zeta. The ACUE is in some sense the distribution that maximally resembles CUE (in the sense that it has the greatest number of Fourier coefficients agreeing) while still also being consistent with the Alternative Hypothesis, and so should be the most difficult enemy to eliminate if one wishes to disprove that hypothesis.

In some cases, even just a tiny improvement in known results would be able to exclude the alternative hypothesis. For instance, if the alternative hypothesis held, then ${|\mathrm{tr}(U^j)|}$ is periodic in ${j}$ with period ${2N}$ , so from Proposition 1 for the alternative distribution one has

$\displaystyle {\bf E}_{\mathrm{ACUE}} |\mathrm{tr} U^j|^2 = \min_{k \in {\bf Z}} |j-2Nk|$

which differs from (13) for any ${|j| > N}$ . (This fact was implicitly observed recently by Baluyot, in the original context of the zeta function.) Thus a verification of the pair correlation conjecture (17) for even a single ${j}$ with ${|j| > N}$ would rule out the alternative hypothesis. Unfortunately, such a verification appears to be on comparable difficulty with (an averaged version of) the Hardy-Littlewood conjecture, with power saving error term. (This is consistent with the fact that Siegel zeroes can cause distortions in the Hardy-Littlewood conjecture, as (implicitly) discussed in this previous blog post.)

Remark 6 One can view the CUE as normalised Lebesgue measure on ${U(N)}$ (viewed as a smooth submanifold of ${{\bf C}^{N^2}}$ ). One can similarly view ACUE as normalised Lebesgue measure on the (disconnected) smooth submanifold of ${U(N)}$ consisting of those unitary matrices whose phase spacings are non-zero integer multiples of ${1/2N}$ ; informally, ACUE is CUE restricted to this lower dimensional submanifold. As is well known, the phases of CUE eigenvalues form a determinantal point process with kernel ${K(\theta,\theta') = \frac{1}{N} \sum_{j=0}^{N-1} e(j(\theta - \theta'))}$ (or one can equivalently take ${K(\theta,\theta') = \frac{\sin(\pi N (\theta-\theta'))}{N\sin(\pi(\theta-\theta'))}}$ ; in a similar spirit, the phases of ACUE eigenvalues, once they are rotated to be ${2N^{th}}$ roots of unity, become a discrete determinantal point process on those roots of unity with exactly the same kernel (except for a normalising factor of ${\frac{1}{2}}$ ). In particular, the ${k}$ -point correlation functions of ACUE (after this rotation) are precisely the restriction of the ${k}$ -point correlation functions of CUE after normalisation, that is to say they are proportional to ${\mathrm{det}( K( \theta_i,\theta_j) )_{1 \leq i,j \leq k}}$ .

Remark 7 One family of estimates that go beyond the Rudnick-Sarnak family of estimates are twisted moment estimates for the zeta function, such as ones that give asymptotics for

$\displaystyle \int_T^{2T} |\zeta(\frac{1}{2}+it)|^{2k} |Q(\frac{1}{2}+it)|^2\ dt$

for some small even exponent ${2k}$ (almost always ${2}$ or ${4}$ ) and some short Dirichlet polynomial ${Q}$ ; see for instance this paper of Bettin, Bui, Li, and Radziwill for some examples of such estimates. The analogous unitary matrix average would be something like

$\displaystyle {\bf E}_t |P_t(1)|^{2k} |Q_t(1)|^2$

where ${Q_t}$ is now some random medium degree polynomial that depends on the unitary matrix ${U}$ associated to ${P_t}$ (and in applications will typically also contain some negative power of ${\exp(A_t)}$ to cancel the corresponding powers of ${\exp(A_t)}$ in ${|P_t(1)|^{2k}}$ ). Unfortunately such averages generally are unable to distinguish the CUE from the ACUE. For instance, if all the coefficients of ${Q}$ involve products of traces ${\mathrm{tr}(U^k)}$ of total order less than ${N-k}$ , then in terms of the eigenvalue phases ${\theta}$ , ${|Q(1)|^2}$ is a linear combination of plane waves ${e(\theta \cdot \xi)}$ where the frequencies ${\xi}$ have coefficients of magnitude less than ${N-k}$ . On the other hand, as each coefficient of ${P_t}$ is an elementary symmetric function of the eigenvalues, ${P_t(1)}$ is a linear combination of plane waves ${e(\theta \cdot \xi)}$ where the frequencies ${\xi}$ have coefficients of magnitude at most ${1}$ . Thus ${|P_t(1)|^{2k} |Q_t(1)|^2}$ is a linear combination of plane waves where the frequencies ${\xi}$ have coefficients of magnitude less than ${N}$ , and thus is orthogonal to the difference between the CUE and ACUE measures on the phase torus ${({\bf R}/{\bf Z})^n}$ by the previous arguments. In other words, ${|P_t(1)|^{2k} |Q_t(1)|^2}$ has the same expectation with respect to ACUE as it does with respect to CUE. Thus one can only start distinguishing CUE from ACUE if the mollifier ${Q_t}$ has degree close to or exceeding ${N}$ , which corresponds to Dirichlet polynomials ${Q}$ of length close to or exceeding ${T}$ , which is far beyond current technology for such moment estimates.

Remark 8 The GUE hypothesis for the zeta function asserts that the average

$\displaystyle \lim_{T \rightarrow \infty} \frac{1}{T} \int_T^{2T} \sum_{\gamma_1,\dots,\gamma_n \hbox{ distinct}} \eta( \frac{\log T}{2\pi}(\gamma_1-t),\dots, \frac{\log T}{2\pi}(\gamma_k-t))\ dt \ \ \ \ \ (22)$

is equal to

$\displaystyle \int_{{\bf R}^n} \eta(x) \det(K(x_i-x_j))_{1 \leq i,j \leq k}\ dx_1 \dots dx_k \ \ \ \ \ (23)$

for any ${k \geq 1}$ and any test function ${\eta: {\bf R}^k \rightarrow {\bf C}}$ , where ${K(x) := \frac{\sin \pi x}{\pi x}}$ is the Dyson sine kernel and ${\gamma_i}$ are the ordinates of zeroes of the zeta function. This corresponds to the CUE distribution for ${U}$ . The ACUE distribution then corresponds to an “alternative gaussian unitary ensemble (AGUE)” hypothesis, in which the average (22) is instead predicted to equal a Riemann sum version of the integral (23):

$\displaystyle \int_0^1 2^{-k} \sum_{x_1,\dots,x_k \in \frac{1}{2} {\bf Z} + \theta} \eta(x) \det(K(x_i-x_j))_{1 \leq i,j \leq k}\ d\theta.$

This is a stronger version of the alternative hypothesis that the spacing between adjacent zeroes is almost always approximately a half-integer multiple of the mean spacing. I do not know of any known moment estimates for Dirichlet series that is able to eliminate this AGUE hypothesis (even assuming GRH). (UPDATE: These facts have also been independently observed in forthcoming work of Lagarias and Rodgers.)

30 comments

Comments feed for this article

9 May, 2019 at 2:27 am

Raphael

I like that N grow slowly with N in the introduction ;-)

[Corrected, thanks – T.]

9 May, 2019 at 4:40 am

Anonymous

It should be added that the bounds $\lambda, \mu$ for consecutive nontrivial zeta zeros are only in the “infinitely often” sense.

[Corrected, thanks – T.]

9 May, 2019 at 5:33 am

Joseph

Is it expected that there is a similar limitation for larger gaps, i.e. one can construct a random matrix model such that the Schur functions of size at most $N$ remain orthonormal, and the maximal gap is bounded by $C/N$ for some constant $C$ ?

9 May, 2019 at 7:36 am

Terence Tao

I don’t know of such a limitation, and indeed there seems to be considerably more scope to improve the bounds on $\lambda$ than on $\mu$ . For instance, the current best bound 0.515396 for $\mu$ only improves very slightly on the previous record 0.515398 of Feng and Xu, and does not seem to use any moment information beyond the Rudnick-Sarnak level if I am not mistaken, whereas the current record 3.18 for $\lambda$ is considerable improvement over the previous record 2.76 of Bredberg (or of the bound 3.072 on GRH of Feng and Xu), and uses additional moment information. (I intend to work out what the analogues of such information is in the unitary matrix setting, perhaps in a followup blog post.)

A few years ago there was an attempt (at an AIM workshop) to use the sieve that James Maynard and I discovered for detecting short intervals with many primes, to see if they could somehow be adapted to improve the bound on $\lambda$ , but my understanding was that this attempt ran into some serious obstacles (I was not directly involved in it though).

9 May, 2019 at 12:10 pm

Caroline Turnage-Butterbaugh

Dan Goldston and I recently posted a preprint on arXiv (https://arxiv.org/abs/1904.06001) where we improve the bound on mu to 0.50412 using the Montgomery-Odlyzko / Conrey-Ghosh-Gonek method with weights that are supported on numbers with a small number of prime factors.

9 May, 2019 at 5:06 pm

Terence Tao

Congratulations on your recent result! I’ve updated the blog post accordingly.

11 May, 2019 at 10:39 am

Anonymous

How is it possible that unlike Goldston-Turnage-Butterbaugh weights, Conrey-Ghosh-Gonek weights involve $d_r(k)$ with the optimal r>1 so the latter weights, when optimized, are supported mainly on numbers with a large number of prime factors?

11 May, 2019 at 7:21 pm

Daniel Goldston

For small gaps if you replace $d_r(k)$ with $1$ you get the result $\mu < 0.5182$ in place of $\mu < 0.5171$ , so the effect of the divisor function is rather small. Where $d_r(k)$ leads to dramatic improvements is when you look for large or small gaps between $r$ zeros ( $r$ -gaps) when $r$ is large. See the recent paper of Conrey-Turnage-Butterbaugh https://arxiv.org/abs/1708.00030 .

15 May, 2019 at 10:04 am

If we take Feng-Wu weights with a few terms, with $r$ in $d_r(k)$ close to zero, and then optimize (a small number of) the terms related to prime factors, then we seem to get Goldston-Turnage-Butterbaugh bound. This is strange, since Feng-Wu optimal value is $r>1$ , not $r$ close to zero.

16 May, 2019 at 2:58 am

Dear Professor Goldston,
I was able to model your weights, that are supported on numbers with a small number of prime factors, by general Feng-Wu weights with $r$ in $d_r(k)$ close to zero.
For weights supported on 1 and primes my bound matches your bound $\mu \le 0{.}667702086\ldots$ But for weights supported on numbers with $\Omega(k) \le 4$ your bound $\mu \le 0{.}50412$ seems to be incorrect.

9 November, 2019 at 5:45 pm

Daniel Goldston

We found a mistake in our preprint which invalidates our calculations which gave 0.50412. While the Wu weights are correct, our generalization only holds with functions (or polynomials) which are symmetric in all their variables. We do not yet know whether this restriction allows this method to improve on the earlier results or not.

9 May, 2019 at 1:57 pm

arch1

spacing been zeroes -> spacing between zeroes?

[Corrected, thanks -T.]

9 May, 2019 at 7:59 pm

Anonymous

In example 2, what does $i < j,k$ mean?
(1) $i < j$ and $i < k$
(2) $i < j$

[The former – T.]

10 May, 2019 at 10:57 am

Anonymous

Getting \mu T / \log T up to T). This is unfortunately a more difficult problem where the quality of the results is worse.

[You may be experiencing the wordpress issue of < and > being interpreted as HTML delimiters. Try using < and > instead – T.]

10 May, 2019 at 2:47 pm

Anonymous

Do (non-symmetric) expressions such as $\sum_{i < j} x_ix_j^2$ ever pop up in combinatorial contexts?

11 May, 2019 at 9:26 am

Terence Tao

Sometimes it is convenient to express symmetric polynomials combinatorially in terms of non-symmetric objects. For instance the Schur polynomial $s_\lambda(x)$ is a symmetric polynomial in the variables $x_1,\dots,x_n$ , but it can be split into non-symmetric monomials as $s_\lambda(x) = \sum_T x^T$ , where $T$ ranges over all semi-standard tableaux of shape $\lambda$ and entries in $\{1,\dots,n\}$ , which is what I am implicitly using in this blog post to compute the Schur polynomials. (With this definition it is not immediately obvious that the Schur polynomials are symmetric; there are other equivalent definitions, such as the Jacobi-Trudi identities, which make this more obvious.)

11 May, 2019 at 12:01 am

Anonymous

“the the Vandermonde determinant”

[Corrected, thanks – T.]

13 May, 2019 at 6:12 am

Sorry all these seem like a problem that can be encoded in quantified real formula with small number of quantifiers. Why can’t we just encode that way and run cook book algorithms or obtain approximations (small quantifiers should not blow up run time too much)?

13 May, 2019 at 9:33 am

Terence Tao

For any fixed $N$ , the problems here are indeed finitary in nature, and could potentially be used to provide some numerical ways to explore the consequences of various moment estimates. For instance, one can pose the question for any fixed $N$ of what the most extreme values of $\lambda$ and $\mu$ for probability measures on $U(N)$ that obey the analogue of Proposition 1 (or equivalently, the $N$ -point correlation function agrees with that for CUE when tested against any plane wave $e^{i\theta \cdot \xi}$ with either $|\xi|_{\ell^1} \leq 2N$ or $\sum_j \xi_j \neq 0$ ). But one is eventually interested in taking the limit as $N \to \infty$ and here one would need some theoretical analysis and not simply numerical computation.

13 May, 2019 at 3:17 pm

So $\exists\lambda\mu\forall N\dots$ type is not something that yields amenable valid quantified formula (I understand need for precise closed form expression but here at end of day you are just looking for $0.5...$ or some new tighter bound)? We are looking for extremal value and is there no expression for $\dots$ that can do this and reduce to a computation (or is that the point of the whole research in this direction)?

14 May, 2019 at 9:00 am

Terence Tao

It is possible that there is some monotonicity in $N$ , for instance if one gets an upper bound for $\mu$ at a given value of $N$ , this may imply also the same upper bound for $\mu$ in the limit $N \to \infty$ . If so then one could imagine for instance that for each $\varepsilon > 0$ there would be a value of $N_\varepsilon$ for which one could use this monotonicity to prove a bound $\mu \leq 0.5 + \varepsilon$ , but that this value would go to infinity as $\varepsilon \to 0$ and so one could not establish $\mu \leq 0.5$ this way without an infinite amount of computation, unless one could somehow find a way to verify $\mu \leq 0.5 + \varepsilon$ that was uniform in $\varepsilon$ (as opposed to requiring an increasingly large amount of computation as $\varepsilon \to 0$ ).

14 May, 2019 at 9:00 pm

Perhaps then the question would be would it be easy to get necessary and sufficient conditions that would apply to ‘broadly’ speaking these monotone conditions in general situations so that answering that would establish a path for (if not this problem) perhaps problems easier than this?

13 May, 2019 at 6:59 am

Brad Rodgers

This is a nice post! One quick comment is about the discrete CUE in remark 6: there are a few other places it seems to have cropped up in the literature (though for very different reasons!). Harold Widom made use of it for analytic reasons in “Random Hermitian Matrices and (Nonrandom) Toeplitz Matrices”, and pretty curiously it also came up in work of Johansson on domino tilings (sec 2.5 of https://arxiv.org/pdf/math/0011250.pdf) and in work of the Russian school of asymptotic rep theorists (e.g. https://projecteuclid.org/euclid.dmj/1194547695).

In fact Jeff Lagarias and I have recently been thinking about some closely related topics to this post, though using slightly different methods. We had independently obtained something like the process you construct in Remark 8. I’ve just now put on my website one of the drafts we’ve been working on: https://mast.queensu.ca/~br66/bandlimited-mimickry.draft.pdf. Theorem 1.8 there was motivated with the idea of showing that even knowing (23) here for test functions $\eta$ with Fourier support in $[-1,1]^k$ for all $k$ is not enough to eliminate a distribution akin to AGUE. (This Fourier support entails knowing the Rudnick-Sarnak information.) Our methods are slightly different; instead of symmetric function theory we use an expansion of determinants and Poisson summation. We listed a few of the questions we were not able to resolve in section 5. I think the finitary perspective you give here (with N points rather than an infinite number) is probably the right one to use for thinking about things like this!

17 May, 2019 at 8:53 am

A function field analogue of Riemann zeta statistics | What's new

[…] this with Proposition 1 from this previous post, we thus see that all the low moments of are consistent with the CUE hypothesis (and also with the […]

20 May, 2019 at 10:26 am

Anonymous

Sorry if this is explained someplace that I missed it, but why not more generally sample matrices with eigenvalues of the form $\exp(2 \pi i k/M)$ in $U(N)$ (with the same Vandermonde weighting, and same translation by a random scalar)? Why take $M = 2N$ ?

20 May, 2019 at 12:54 pm

Terence Tao

$M=2N$ is the only choice which (a) is consistent with the “low frequency” moment estimates of Rudnick-Sarnak type, and (b) is consistent which the Alternative Hypothesis that the phase gaps are multiples of $1/2N$ (which is what happens in particular in the presence of a Siegel zero). Taking $M$ to be a smaller factor of $2N$ would retain (b) but not (a); taking a larger choice of $M$ would retain (a) but not (b).

22 May, 2019 at 6:06 pm

anonymous

My question is philosophically related, at least, to the theme of these several recent posts: what bearing, if any, does the Alternative Hypothesis have on the moments of the Riemann zeta function?

Keating-Snaith and others have formulated well-known conjectures about the shape of the $2k$ -th moment of the zeta function based on random matrix theory. For instance, model the zeta function by a characteristic polynomial and integrate with respect to the GUE measure and see what comes out. What happens if you do this same computation with an Alternative measure? In light of your discussion one might guess the predictions are the same for low moments, but that the predictions begin to diverge for larger moments.

22 May, 2019 at 9:50 pm

Terence Tao

Remarkably, CUE and ACUE are virtually indistinguishable from moments: Remark 7 in particular shows that the $2k^{th}$ moments agree for all $k < N$ . Given that the analogue of $N$ in the Riemann zeta case is something like $\log T$ , this means that one needs to go up to something like the $2 \log T^{th}$ moment of zeta before one should start seeing a distinction between the GUE hypothesis and the alternative GUE hypothesis!

Terry

23 May, 2019 at 12:16 am

Anonymous

Terry, this can’t be right because the log T moment of the zeta function is dominated by the single largest value of zeta which is conjectured to be exp(sqrt(log T)). But even if you don’t believe this conjecture and only assume RH then the largest value of zeta is at most exp(log T / loglog T) and it still dominates the game.

24 May, 2019 at 7:07 am

Terence Tao

Fair point; at such high moments the distribution of zeroes at the microscale becomes less relevant and the behaviour is dominated instead by the oscillation of small primes. (Though in such situations it would still be the case that it would not be possible to use these moments to distinguish GUE from AGUE.) One could work instead with mollified moments in which one first multiplies the zeta function by a suitable mollifier to damp out the effect of small primes and only retain the effect of the nearby zeroes, before raising to high powers. (Though, as mentioned in Remark 7, once one allows for mollifiers then it becomes easier to distinguish GUE/CUE from AGUE/ACUE using lower moments.)

	Anonymous on Work hard
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Erratum for “An inverse…

The alternative hypothesis for unitary matrices

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

30 comments

Leave a comment Cancel reply

For commenters

The alternative hypothesis for unitary matrices

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

30 comments

Leave a comment Cancel reply

For commenters