Heat flow and zeroes of polynomials II: zeroes on a circle

7 June, 2018 in expository, math.CA, math.GR, math.NT | Tags: heat flow, polynomials, symplectic group | by Terence Tao

This is a sequel to this previous blog post, in which we discussed the effect of the heat flow evolution

$\displaystyle \partial_t P(t,z) = \partial_{zz} P(t,z)$

on the zeroes of a time-dependent family of polynomials ${z \mapsto P(t,z)}$ , with a particular focus on the case when the polynomials ${z \mapsto P(t,z)}$ had real zeroes. Here (inspired by some discussions I had during a recent conference on the Riemann hypothesis in Bristol) we record the analogous theory in which the polynomials instead have zeroes on a circle ${\{ z: |z| = \sqrt{q} \}}$ , with the heat flow slightly adjusted to compensate for this. As we shall discuss shortly, a key example of this situation arises when ${P}$ is the numerator of the zeta function of a curve.

More precisely, let ${g}$ be a natural number. We will say that a polynomial

$\displaystyle P(z) = \sum_{j=0}^{2g} a_j z^j$

of degree ${2g}$ (so that ${a_{2g} \neq 0}$ ) obeys the functional equation if the ${a_j}$ are all real and

$\displaystyle a_j = q^{g-j} a_{2g-j}$

for all ${j=0,\dots,2g}$ , thus

$\displaystyle P(\overline{z}) = \overline{P(z)}$

and

$\displaystyle P(q/z) = q^g z^{-2g} P(z)$

for all non-zero ${z}$ . This means that the ${2g}$ zeroes ${\alpha_1,\dots,\alpha_{2g}}$ of ${P(z)}$ (counting multiplicity) lie in ${{\bf C} \backslash \{0\}}$ and are symmetric with respect to complex conjugation ${z \mapsto \overline{z}}$ and inversion ${z \mapsto q/z}$ across the circle ${\{ |z| = \sqrt{q}\}}$ . We say that this polynomial obeys the Riemann hypothesis if all of its zeroes actually lie on the circle ${\{ |z| = \sqrt{q}\}}$ . For instance, in the ${g=1}$ case, the polynomial ${z^2 - a_1 z + q}$ obeys the Riemann hypothesis if and only if ${|a_1| \leq 2\sqrt{q}}$ .
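As a sanity check on these definitions, here is a minimal Python sketch that tests the coefficient form of the functional equation and, in the ${g=1}$ case, tests the Riemann hypothesis through the quadratic formula. The function names and tolerances are illustrative choices, not part of the theory.

```python
import cmath
import math


def obeys_functional_equation(coeffs, q, tol=1e-9):
    """coeffs[j] = a_j for j = 0..2g; check a_j = q^(g-j) a_{2g-j}."""
    if len(coeffs) % 2 == 0:
        return False  # the degree 2g must be even
    g = (len(coeffs) - 1) // 2
    return all(
        abs(coeffs[j] - q ** (g - j) * coeffs[2 * g - j])
        <= tol * max(1.0, abs(coeffs[j]))
        for j in range(2 * g + 1)
    )


def quadratic_zeros(a1, q):
    """Zeros of z^2 - a1 z + q via the quadratic formula."""
    disc = cmath.sqrt(a1 * a1 - 4 * q)
    return (a1 + disc) / 2, (a1 - disc) / 2


def obeys_rh_g1(a1, q, tol=1e-9):
    """True if both zeros of z^2 - a1 z + q lie on |z| = sqrt(q)."""
    radius = math.sqrt(q)
    return all(abs(abs(z) - radius) <= tol for z in quadratic_zeros(a1, q))
```

For example, with ${q=5}$ the coefficient list `[5.0, -3.0, 1.0]` (that is, ${z^2 - 3z + 5}$ ) passes both checks, while ${z^2 - 5z + 5}$ obeys the functional equation but has two real zeroes off the circle.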

Such polynomials arise in number theory as follows: if ${C}$ is a projective curve of genus ${g}$ over a finite field ${\mathbf{F}_q}$ , then, as famously proven by Weil, the associated local zeta function ${\zeta_{C,q}(z)}$ (as defined for instance in this previous blog post) is known to take the form

$\displaystyle \zeta_{C,q}(z) = \frac{P(z)}{(1-z)(1-qz)}$

where ${P}$ is a degree ${2g}$ polynomial obeying both the functional equation and the Riemann hypothesis. In the case that ${C}$ is an elliptic curve, then ${g=1}$ and ${P}$ takes the form ${P(z) = z^2 - a_1 z + q}$ , where ${a_1}$ is the number of ${{\bf F}_q}$ -points of ${C}$ minus ${q+1}$ . The Riemann hypothesis in this case is a famous result of Hasse.
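To illustrate the elliptic curve case, the following Python sketch counts points by brute force and checks Hasse's bound ${|a_1| \leq 2\sqrt{q}}$ . It assumes a prime field ${q = p}$ with ${p}$ an odd prime and a non-singular curve in short Weierstrass form ${y^2 = x^3 + ax + b}$ ; the function names and the sample curves are hypothetical choices for illustration.

```python
import math


def count_points(a, b, p):
    """Number of F_p-points (including the point at infinity) on
    y^2 = x^3 + a x + b, for an odd prime p, by brute force."""
    # square_counts[r] = number of y in F_p with y^2 = r
    square_counts = [0] * p
    for y in range(p):
        square_counts[y * y % p] += 1
    affine = sum(square_counts[(x * x * x + a * x + b) % p] for x in range(p))
    return affine + 1  # add the point at infinity


def hasse_check(a, b, p):
    """Return (a_1, whether |a_1| <= 2 sqrt(p)), with a_1 = #C(F_p) - (p + 1)."""
    a1 = count_points(a, b, p) - (p + 1)
    return a1, abs(a1) <= 2 * math.sqrt(p)
```

For the curve ${y^2 = x^3 + x + 1}$ over ${{\bf F}_5}$ one can count the nine points by hand, so `hasse_check(1, 1, 5)` gives ${a_1 = 3}$ , which is within the bound ${2\sqrt{5}}$ .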

Another key example of such polynomials arises from rescaled characteristic polynomials

$\displaystyle P(z) := \det( z - \sqrt{q} F ) \ \ \ \ \ (1)$

of ${2g \times 2g}$ matrices ${F}$ in the compact symplectic group ${Sp(g)}$ . These polynomials obey both the functional equation and the Riemann hypothesis. The Sato-Tate conjecture (in higher genus) asserts, roughly speaking, that “typical” polynomials ${P}$ arising from the number theoretic situation above are distributed like the rescaled characteristic polynomials (1), where ${F}$ is drawn uniformly from ${Sp(g)}$ with Haar measure.

Given a polynomial ${z \mapsto P(0,z)}$ of degree ${2g}$ with coefficients

$\displaystyle P(0,z) = \sum_{j=0}^{2g} a_j(0) z^j,$

we can evolve it in time by the formula

$\displaystyle P(t,z) = \sum_{j=0}^{2g} \exp( t(j-g)^2 ) a_j(0) z^j,$

thus ${a_j(t) = \exp(t(j-g)^2) a_j(0)}$ for ${t \in {\bf R}}$ . Informally, as one increases ${t}$ , this evolution accentuates the effect of the extreme monomials, particularly ${z^0}$ and ${z^{2g}}$ , at the expense of the intermediate monomials such as ${z^g}$ , and conversely as one decreases ${t}$ . This family of polynomials obeys the heat-type equation

$\displaystyle \partial_t P(t,z) = (z \partial_z - g)^2 P(t,z). \ \ \ \ \ (2)$
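Since ${(z \partial_z - g)^2 z^j = (j-g)^2 z^j}$ , equation (2) can be checked coefficient by coefficient. The following Python sketch does this with a central finite difference in ${t}$ ; the helper names and the step size `h` are illustrative assumptions.

```python
import math


def evolve(coeffs0, t):
    """a_j(t) = exp(t (j-g)^2) a_j(0), with coeffs0[j] = a_j(0), j = 0..2g."""
    g = (len(coeffs0) - 1) // 2
    return [math.exp(t * (j - g) ** 2) * a for j, a in enumerate(coeffs0)]


def heat_operator(coeffs):
    """Coefficients of (z d/dz - g)^2 P: z^j is an eigenfunction
    with eigenvalue (j - g)^2."""
    g = (len(coeffs) - 1) // 2
    return [(j - g) ** 2 * a for j, a in enumerate(coeffs)]


def time_derivative(coeffs0, t, h=1e-6):
    """Central finite-difference approximation to d/dt a_j(t)."""
    plus, minus = evolve(coeffs0, t + h), evolve(coeffs0, t - h)
    return [(p - m) / (2 * h) for p, m in zip(plus, minus)]
```

For instance, with ${q=2}$ , ${g=2}$ and initial coefficients `[4.0, -6.0, 7.0, -3.0, 1.0]`, the lists `time_derivative(coeffs0, t)` and `heat_operator(evolve(coeffs0, t))` agree up to discretisation error. Because the weight ${\exp(t(j-g)^2)}$ is symmetric under ${j \mapsto 2g-j}$ , the evolved coefficients also continue to satisfy ${a_j = q^{g-j} a_{2g-j}}$ .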

In view of the results of Marcus, Spielman, and Srivastava, it is also very likely that one can interpret this flow in terms of expected characteristic polynomials involving conjugation over the compact symplectic group ${Sp(g)}$ , and that it should also be tied to some sort of “ ${\beta=\infty}$ ” version of Brownian motion on this group, but we have not attempted to work this connection out in detail.

It is clear that if ${z \mapsto P(0,z)}$ obeys the functional equation, then so does ${z \mapsto P(t,z)}$ for any other time ${t}$ . Now we investigate the evolution of the zeroes. Suppose at some time ${t_0}$ that the zeroes ${\alpha_1(t_0),\dots,\alpha_{2g}(t_0)}$ of ${z \mapsto P(t_0,z)}$ are distinct, then

$\displaystyle P(t_0,z) = a_{2g}(0) \exp( t_0g^2 ) \prod_{j=1}^{2g} (z - \alpha_j(t_0) ).$

From the inverse function theorem we see that for times ${t}$ sufficiently close to ${t_0}$ , the zeroes ${\alpha_1(t),\dots,\alpha_{2g}(t)}$ of ${z \mapsto P(t,z)}$ continue to be distinct (and vary smoothly in ${t}$ ), with

$\displaystyle P(t,z) = a_{2g}(0) \exp( t g^2 ) \prod_{j=1}^{2g} (z - \alpha_j(t) ).$

Differentiating this at any ${z}$ not equal to any of the ${\alpha_j(t)}$ , we obtain

$\displaystyle \partial_t P(t,z) = P(t,z) ( g^2 - \sum_{j=1}^{2g} \frac{\alpha'_j(t)}{z - \alpha_j(t)})$

and

$\displaystyle \partial_z P(t,z) = P(t,z) ( \sum_{j=1}^{2g} \frac{1}{z - \alpha_j(t)})$

and

$\displaystyle \partial_{zz} P(t,z) = P(t,z) ( \sum_{1 \leq j,k \leq 2g: j \neq k} \frac{1}{(z - \alpha_j(t))(z - \alpha_k(t))}).$

Inserting these formulae into (2) (expanding ${(z \partial_z - g)^2}$ as ${z^2 \partial_{zz} - (2g-1) z \partial_z + g^2}$ ) and canceling some terms, we conclude that

$\displaystyle - \sum_{j=1}^{2g} \frac{\alpha'_j(t)}{z - \alpha_j(t)} = z^2 \sum_{1 \leq j,k \leq 2g: j \neq k} \frac{1}{(z - \alpha_j(t))(z - \alpha_k(t))}$

$\displaystyle - (2g-1) z \sum_{j=1}^{2g} \frac{1}{z - \alpha_j(t)}$

for ${t}$ sufficiently close to ${t_0}$ , and ${z}$ not equal to ${\alpha_1(t),\dots,\alpha_{2g}(t)}$ . Extracting the residue at ${z = \alpha_j(t)}$ , we conclude that

$\displaystyle - \alpha'_j(t) = 2 \alpha_j(t)^2 \sum_{1 \leq k \leq 2g: k \neq j} \frac{1}{\alpha_j(t) - \alpha_k(t)} - (2g-1) \alpha_j(t)$

which we can rearrange as

$\displaystyle \frac{\alpha'_j(t)}{\alpha_j(t)} = - \sum_{1 \leq k \leq 2g: k \neq j} \frac{\alpha_j(t)+\alpha_k(t)}{\alpha_j(t)-\alpha_k(t)}.$

If we make the change of variables ${\alpha_j(t) = \sqrt{q} e^{i\theta_j(t)}}$ (noting that one can make ${\theta_j}$ depend smoothly on ${t}$ for ${t}$ sufficiently close to ${t_0}$ ), this becomes

$\displaystyle \partial_t \theta_j(t) = \sum_{1 \leq k \leq 2g: k \neq j} \cot \frac{\theta_j(t) - \theta_k(t)}{2}. \ \ \ \ \ (3)$
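Equation (3) can be tested numerically in the ${g=1}$ case, where the zeroes of ${e^t z^2 - a_1 z + e^t q}$ are ${\sqrt{q} e^{\pm i \theta(t)}}$ with ${2\sqrt{q} \cos \theta(t) = a_1 e^{-t}}$ (by comparing the sum of the zeroes with the coefficients). The sketch below integrates (3) with a classical fourth-order Runge-Kutta scheme and compares against this closed form; the integrator, step count, and function names are assumptions made for illustration.

```python
import math


def velocities(thetas):
    """Right-hand side of (3): sum over k != j of cot((theta_j - theta_k)/2)."""
    return [
        sum(1.0 / math.tan((tj - tk) / 2) for k, tk in enumerate(thetas) if k != j)
        for j, tj in enumerate(thetas)
    ]


def rk4_step(thetas, dt):
    """One classical Runge-Kutta step for the system (3)."""
    def shifted(slope, scale):
        return [th + scale * s for th, s in zip(thetas, slope)]

    k1 = velocities(thetas)
    k2 = velocities(shifted(k1, dt / 2))
    k3 = velocities(shifted(k2, dt / 2))
    k4 = velocities(shifted(k3, dt))
    return [
        th + dt * (a + 2 * b + 2 * c + d) / 6
        for th, a, b, c, d in zip(thetas, k1, k2, k3, k4)
    ]


def flow(thetas, t_final, steps=1000):
    """Integrate (3) forward from time 0 to time t_final."""
    dt = t_final / steps
    for _ in range(steps):
        thetas = rk4_step(thetas, dt)
    return thetas


def exact_phase_g1(a1, q, t):
    """Closed form for g = 1: 2 sqrt(q) cos(theta) = a1 e^{-t}."""
    return math.acos(a1 * math.exp(-t) / (2 * math.sqrt(q)))
```

Starting from `[theta0, -theta0]` with `theta0 = exact_phase_g1(3.0, 5.0, 0.0)`, the first component of `flow(..., 1.0)` should agree with `exact_phase_g1(3.0, 5.0, 1.0)` up to the integration error.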

Intuitively, this equation asserts that the phases ${\theta_j}$ repel each other if they are real (and attract each other if their difference is imaginary). If ${z \mapsto P(t_0,z)}$ obeys the Riemann hypothesis, then the ${\theta_j}$ are all real at time ${t_0}$ , and the Picard uniqueness theorem (applied to ${\theta_j(t)}$ and its complex conjugate) shows that the ${\theta_j}$ are also real for ${t}$ sufficiently close to ${t_0}$ . If we then define the entropy functional

$\displaystyle H(\theta_1,\dots,\theta_{2g}) := \sum_{1 \leq j < k \leq 2g} \log \frac{1}{|\sin \frac{\theta_j-\theta_k}{2}| }$

then the above equation becomes a gradient flow

$\displaystyle \partial_t \theta_j(t) = - 2 \frac{\partial H}{\partial \theta_j}( \theta_1(t),\dots,\theta_{2g}(t) )$

which implies in particular that ${H(\theta_1(t),\dots,\theta_{2g}(t))}$ is non-increasing in time. This shows that as one evolves time forward from ${t_0}$ , there is a uniform lower bound on the separation between the phases ${\theta_1(t),\dots,\theta_{2g}(t)}$ , and hence the equation can be solved indefinitely; in particular, ${z \mapsto P(t,z)}$ obeys the Riemann hypothesis for all ${t > t_0}$ if it does so at time ${t_0}$ . Our argument here assumed that the zeroes of ${z \mapsto P(t_0,z)}$ were simple, but this assumption can be removed by the usual limiting argument.
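The monotonicity of ${H}$ and the resulting separation of the phases can be observed numerically. The sketch below uses a plain explicit Euler discretisation of (3) with a small step (an assumption made for simplicity, not a method from the argument above) and records ${H}$ along the way; all names are illustrative.

```python
import math


def entropy(thetas):
    """H = sum over j < k of log(1 / |sin((theta_j - theta_k)/2)|)."""
    return sum(
        -math.log(abs(math.sin((thetas[j] - thetas[k]) / 2)))
        for j in range(len(thetas))
        for k in range(j + 1, len(thetas))
    )


def velocities(thetas):
    """Right-hand side of (3), which equals -2 times the gradient of H."""
    return [
        sum(1.0 / math.tan((tj - tk) / 2) for k, tk in enumerate(thetas) if k != j)
        for j, tj in enumerate(thetas)
    ]


def min_gap(thetas):
    """Smallest angular gap between neighbouring phases on the circle."""
    pts = sorted(th % (2 * math.pi) for th in thetas)
    gaps = [b - a for a, b in zip(pts, pts[1:])]
    gaps.append(pts[0] + 2 * math.pi - pts[-1])  # wrap-around gap
    return min(gaps)


def entropy_along_flow(thetas, dt=1e-3, steps=2000):
    """Explicit Euler steps of (3); returns final phases and the history of H."""
    values = [entropy(thetas)]
    for _ in range(steps):
        v = velocities(thetas)
        thetas = [th + dt * vj for th, vj in zip(thetas, v)]
        values.append(entropy(thetas))
    return thetas, values
```

Starting from the conjugation-symmetric phases `[0.3, 0.7, -0.3, -0.7]` (a ${g=2}$ configuration), the recorded values of ${H}$ should be non-increasing and the minimal gap at the final time should exceed the initial minimal gap of ${0.4}$ .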

For any polynomial ${z \mapsto P(0,z)}$ obeying the functional equation, the rescaled polynomials ${z \mapsto e^{-g^2 t} P(t,z)}$ converge locally uniformly to ${a_{2g}(0) (z^{2g} + q^g)}$ as ${t \rightarrow +\infty}$ . By Rouché’s theorem, we conclude that the zeroes of ${z \mapsto P(t,z)}$ converge to the equally spaced points ${\{ \sqrt{q} e^{2\pi i(j+1/2)/2g}: j=1,\dots,2g\}}$ on the circle ${\{ |z| = \sqrt{q}\}}$ . Together with the symmetry properties of the zeroes, this implies in particular that ${z \mapsto P(t,z)}$ obeys the Riemann hypothesis for all sufficiently large positive ${t}$ . In the opposite direction, when ${t \rightarrow -\infty}$ , the polynomials ${z \mapsto P(t,z)}$ converge locally uniformly to ${a_g(0) z^g}$ , so if ${a_g(0) \neq 0}$ , ${g}$ of the zeroes converge to the origin and the other ${g}$ converge to infinity. In particular, ${z \mapsto P(t,z)}$ fails the Riemann hypothesis for sufficiently large negative ${t}$ . Thus (if ${a_g(0) \neq 0}$ ), there must exist a real number ${\Lambda}$ , which we call the de Bruijn-Newman constant of the original polynomial ${z \mapsto P(0,z)}$ , such that ${z \mapsto P(t,z)}$ obeys the Riemann hypothesis for ${t \geq \Lambda}$ and fails the Riemann hypothesis for ${t < \Lambda}$ . The situation is a bit more complicated if ${a_g(0)}$ vanishes; if ${k}$ is the first natural number such that ${a_{g+k}(0)}$ (or equivalently, ${a_{g-k}(0)}$ ) does not vanish, then by the above arguments one finds in the limit ${t \rightarrow -\infty}$ that ${g-k}$ of the zeroes go to the origin, ${g-k}$ go to infinity, and the remaining ${2k}$ zeroes converge to the equally spaced points ${\{ \sqrt{q} e^{2\pi i(j+1/2)/2k}: j=1,\dots,2k\}}$ . In this case the de Bruijn-Newman constant remains finite except in the degenerate case ${k=g}$ , in which case ${\Lambda = -\infty}$ .
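The convergence of the zeroes as ${t \rightarrow +\infty}$ can be observed numerically. The sketch below evolves the coefficients and finds the zeroes with the Durand-Kerner iteration, a generic root finder chosen here only to keep the example free of external libraries; the function names, starting points and iteration count are assumptions.

```python
import cmath
import math


def evolve(coeffs0, t):
    """a_j(t) = exp(t (j-g)^2) a_j(0), with coeffs0[j] = a_j(0), j = 0..2g."""
    g = (len(coeffs0) - 1) // 2
    return [math.exp(t * (j - g) ** 2) * a for j, a in enumerate(coeffs0)]


def poly_roots(coeffs, iterations=500):
    """Durand-Kerner iteration; coeffs[j] multiplies z^j and the leading
    coefficient is assumed to be non-zero."""
    n = len(coeffs) - 1
    monic = [c / coeffs[n] for c in coeffs]
    radius = 1 + max(abs(c) for c in monic[:n])  # Cauchy bound on the zeros
    # Start on a circle, rotated so the guesses are not conjugation-symmetric.
    roots = [radius * cmath.exp(1j * (2 * math.pi * k / n + 0.4)) for k in range(n)]
    for _ in range(iterations):
        updated = []
        for i, r in enumerate(roots):
            value = sum(c * r ** j for j, c in enumerate(monic))
            denom = 1
            for k, other in enumerate(roots):
                if k != i:
                    denom *= r - other
            updated.append(r - value / denom)
        roots = updated
    return roots


def limiting_zeros(q, g):
    """Zeros of z^{2g} + q^g: sqrt(q) exp(2 pi i (j + 1/2) / 2g)."""
    return [
        math.sqrt(q) * cmath.exp(2j * math.pi * (j + 0.5) / (2 * g))
        for j in range(2 * g)
    ]
```

With ${q=2}$ , ${g=2}$ and initial coefficients `[4.0, -6.0, 7.0, -3.0, 1.0]`, the zeroes returned by `poly_roots(evolve(coeffs0, 5.0))` should lie very close to the four points ${\pm 1 \pm i}$ listed by `limiting_zeros(2.0, 2)`.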

For instance, consider the case when ${g=1}$ and ${P(0,z) = z^2 - a_1 z + q}$ for some real ${a_1}$ with ${|a_1| \leq 2\sqrt{q}}$ . Then the quadratic polynomial

$\displaystyle P(t,z) = e^t z^2 - a_1 z + e^t q$

has zeroes

$\displaystyle \frac{a_1 \pm \sqrt{a_1^2 - 4 e^{2t} q}}{2e^t}$

and one easily checks that these zeroes lie on the circle ${\{ |z|=\sqrt{q}\}}$ when ${t \geq \log \frac{|a_1|}{2\sqrt{q}}}$ , and are on the real axis otherwise. Thus in this case we have ${\Lambda = \log \frac{|a_1|}{2\sqrt{q}}}$ (with ${\Lambda=-\infty}$ if ${a_1=0}$ ). Note how as ${t}$ increases to ${+\infty}$ , the zeroes repel each other and eventually converge to ${\pm i \sqrt{q}}$ , while as ${t}$ decreases to ${-\infty}$ , the zeroes collide and then separate on the real axis, with one zero going to the origin and the other to infinity.
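The value of ${\Lambda}$ in this example can be recovered numerically by bisecting on the property "all zeroes lie on the circle". The sketch below does this for ${g=1}$ using the explicit zeroes above; the bracketing interval, tolerance and function names are illustrative assumptions, and the bisection relies on the monotonicity in ${t}$ discussed earlier.

```python
import cmath
import math


def zeros_g1(a1, q, t):
    """Zeros of P(t,z) = e^t z^2 - a1 z + e^t q."""
    et = math.exp(t)
    disc = cmath.sqrt(a1 * a1 - 4 * et * et * q)
    return (a1 + disc) / (2 * et), (a1 - disc) / (2 * et)


def obeys_rh(a1, q, t, tol=1e-9):
    """True if both zeros of P(t, .) lie on |z| = sqrt(q), up to tol."""
    radius = math.sqrt(q)
    return all(abs(abs(z) - radius) <= tol for z in zeros_g1(a1, q, t))


def de_bruijn_newman_g1(a1, q, lo=-20.0, hi=20.0, iterations=100):
    """Bisect for the threshold time, assuming the Riemann hypothesis
    fails at time lo and holds at time hi."""
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if obeys_rh(a1, q, mid):
            hi = mid
        else:
            lo = mid
    return hi
```

For ${a_1 = 3}$ and ${q = 5}$ the result should agree with ${\log \frac{3}{2\sqrt{5}}}$ up to rounding error.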

The arguments in my paper with Brad Rodgers (discussed in this previous post) indicate that for a “typical” polynomial ${P}$ of degree ${2g}$ that obeys the Riemann hypothesis, the expected time to relaxation to equilibrium (in which the zeroes are equally spaced) should be comparable to ${1/g}$ , basically because the average spacing is comparable to ${1/g}$ and hence by (3) the typical velocity of the zeroes should be comparable to ${g}$ , and the diameter of the unit circle is comparable to ${1}$ , thus requiring time comparable to ${1/g}$ to reach equilibrium. Taking contrapositives, this suggests that the de Bruijn-Newman constant ${\Lambda}$ should typically take on values comparable to ${-1/g}$ (since typically one would not expect the initial configuration of zeroes to be close to evenly spaced). I have not attempted to formalise or prove this claim, but presumably one could do some numerics (perhaps using some of the examples of ${P}$ given previously) to explore this further.

18 comments

7 June, 2018 at 6:10 am

Anonymous

It seems that Riemann zeta function (whose zeros are pseudo-randomly distributed) can’t be interpreted (in any sense) as some “limiting case” of the above local zeta functions (having uniformly distributed zeros.)

8 June, 2018 at 1:56 pm

Joseph

I would have the intuition that maybe the value of $\Lambda$ should be closer to zero than $-1/g$ because of the zeros which are close to each other: if the closest pair of zeros has distance $d$ and if we run the flow backwards, then they attract each other with speed about $1/d$ , and then would collide at time about $d^2$ if we neglect the other zeros (which are at much larger distance). For characteristic polynomial of $CUE$ where $d$ has order $g^{-4/3}$ , I would expect $\Lambda$ of about $-g^{-8/3}$ if my intuition is correct.

9 June, 2018 at 6:54 am

Terence Tao

Hmm, I think you’re right; the time to relaxation to equilibrium gives a lower bound on $\Lambda$ but it is far from optimal.

In the case of zeroes on the real line, there is a criterion of Csordas, Smith and Varga (see Theorem 1 of https://link.springer.com/article/10.1007/BF01205170 ) that says, roughly speaking, that if one has a pair of zeroes at separation $d$ , and the other zeroes are significantly further away from this pair than $d$ , then $\Lambda \geq -c d^2$ . It may well be that an analogous result holds on the circle.

9 June, 2018 at 1:01 pm

Joseph

I think it is equivalent to have points on the circle or all the determinations of their arguments on the real line (which gives a $2 \pi$ -periodic set), because of the formula $(1/2) \cot (x/2) = \sum_{k \in \mathbb{Z}} 1/(x-2\pi k)$ .

9 June, 2018 at 8:35 am

Aula

The third display above (3) is too wide.

[Corrected, thanks – T.]

9 June, 2018 at 10:04 am

Will Sawin

It should be possible to obtain the “time to relaxation” upper bound using only the middle coefficient. It has variance $\lfloor g/2\rfloor +1$ , so it is presumably typically of size $\approx \sqrt{g/2}$ , meaning at time $-2 \log 2 /g$ , it is of size $\approx 2^{ 2 g}$ compared to the first and last coefficients, compared to the bound ${2g \choose g} \leq 2^{2g}$ for polynomials with all roots on the unit circle.

18 June, 2018 at 9:22 am

Anonymous

Dear Terry,
I am not an expert mathematician. But I think Riemann’s level of difficulty is 5 times that of Poincaré. Because one only needs one way to go to the destination, Riemann must have 5 ways done: straight and then go back, attack the middle point, the first point and then the final point. And that is not enough: you must go with the first step on the old road many times. I know Terry very well. Someday, before long, the community of maths will have amusing news from Terry.
Best wishes,
(Company of Pro.Tao -two decades)

2 July, 2018 at 3:16 pm

curious

Does the entropy here have anything do with information?

16 August, 2018 at 9:19 am

Tatenda Kubalalika

Dear Prof. Tao, according to equation $(15)$ of your paper with Prof. Rodgers, “the de Bruijn-Newman constant is nonnegative”, the de Bruijn-Newman constant seems to be equal to $0$ .

Indeed, for $t<0$ , we have

$\displaystyle H_{t}(z)=\frac{1}{\sqrt{4\pi}}\int_{\mathbb{R}} e^{-r^{2}/4}H_{0}(z+r|t|^{1/2}) \mathrm{d}r. \quad (1)$

Notice that the right-hand side is invariant under the transformation $t\mapsto -t$ , thus we have

$\displaystyle H_{t}(z)=H_{-t}(z) \quad (2)$

for all $t\neq 0$ . Suppose that $H_{y}(z)=0$ for some real $y\neq 0$ ; hence $H_{-y}(z)=0$ . But we know by a result of Rodgers and Tao that $H_{Y}(z) \neq 0$ for any $Y<0$ and $z\in \mathbb{R}$ . Thus we arrive at a contradiction, which entails that our supposition must be false, and the desired result follows. $\square$

16 August, 2018 at 2:30 pm

Terence Tao

Unfortunately, the identity (1) is only valid for $t \leq 0$ (as you point out), hence the identity (2) is only valid when $t, -t \leq 0$ , that is to say it is only established in the case $t=0$ (where it is trivial). Actually, one can check numerically that (2) is false in general.

16 August, 2018 at 11:59 pm

Anonymous

Since in (1) (for $t < 0$ ) $|t|$ can be replaced by $-t$ , is it possible that this modification of (1) (with the resulting branch point at $t=0$ due to $(-t)^{1/2}$ ) still holds for some analytic continuation (with respect to $t$ ) of the modified integral ?

17 August, 2018 at 6:23 pm

Terence Tao

For positive t, one has

$\displaystyle H_t(z) = \frac{1}{\sqrt{4\pi}} \int_{\bf R} e^{-r^2/4} H_0(z + ir|t|^{1/2})\ dr;$

(see the equation before (35) in https://github.com/km-git-acc/dbn_upper_bound/blob/master/Writeup/debruijn.pdf ).

16 August, 2018 at 2:50 pm

Tatenda Kubalalika

Thank you for your response. Do you have any specific $t$ (or t’s) in mind for which (2) is false ? Because it seems to me that $H_{t}(z)$ is the Fourier transform of $\phi(t)e^{zt^2}$, which is an even function. This seems to imply that $H_t$ should also be even. That is, $H_{t}(z)=H_{-t}(z)$ for all real $t$. Of course, my reasoning could be flawed.

16 August, 2018 at 4:28 pm

My mistake indeed in the last comment: I made a typo in the above Fourier transform, which invalidates the argument.

24 August, 2018 at 2:45 am

Tate. I. K

Indeed, the de Bruijn-Newman constant could be equal to zero.

Suppose there exists some pair $(T, z)$ of real numbers with $T\neq 0$ , such that
$H_{T}(z)=0.$ We shall refer to this as equation $(1)$ . It is a classical fact that $H_0$ has many real zeros, and let $z+a$ be one such zero, where $a$ is some real number. That is, $H_{0}(z+a)=0.$ We shall refer to this as equation $(2)$ . Combining equations $(1)$ and $(2)$ yields
$H_{T}(z)=H_{0}(z+a).$ We shall refer to this as equation $(3)$ . As noted in Rodgers and Tao’s paper (page 3), one can view $H_{T}(z)$ as the evolution of $H_{0}(z)$ under the backwards heat equation $\partial_{T}H_{T}(z)=-\partial_{zz}H_{T}(z)$ , where $T$ denotes the time. Hence from equation $(3)$ we deduce that “one can view $H_{0}(z+a)$ as the EVOLUTION of $H_{0}(z)\cdots$ ”. But this quoted statement is meaningless, since both $H_{0}(z)$ and $H_{0}(z+a)$ represent the same time $T=0$ .

We therefore conclude that our supposition must be false, and the desired result follows. $\square$

24 August, 2018 at 2:20 pm

Terence Tao

(a) There certainly do exist pairs $(T,z)$ of real numbers with $T \neq 0$ and $H_T(z)=0$ . In fact, it is a result of Ki, Kim and Lee that for $T>0$ , there are infinitely many zeroes of $H_T$ , all but finitely many of which are real.

(b) You have only shown that the equation $H_T(z) = H_0(z+a)$ holds for a single value of $z$ , not for all $z$ . The evolution of the heat equation does not depend only on the pointwise value of the initial data at a single value of $z$ , but on the values at all other positions as well (the heat equation has infinite speed of propagation).

24 August, 2018 at 2:57 am

Anonymous commenter.

@Tate. I. K, it really seems that you have demonstrated that $\Lambda=0$ . However, your argument is suspiciously short. It will be truly strange if such a long-standing problem as the RH could have such a short solution. I suggest that you submit your work to a formal journal, and good luck!

28 April, 2022 at 12:43 pm

Brian C. Hall

In Section 2.3 of a recent preprint of mine with Ching-Wei Ho, we studied (essentially) this evolution from a random matrix point of view. Suppose, for example, that you start with the characteristic polynomial of a Brownian motion in the unitary group and then evolve toward negative time. We conjecture that the roots will rapidly move off the unit circle and that they will eventually resemble the eigenvalues of a Brownian motion in the general linear group. See https://arxiv.org/abs/2202.09660
