The Riemann zeta function , defined for by the formula
is the Riemann Xi function,
is the Gamma factor at infinity, and the Gamma function is defined for by
and extended meromorphically to other values of s by analytic continuation.
There are many proofs known of the functional equation (2). One of them (dating back to Riemann himself) relies on the Poisson summation formula
is the Fourier transform on , with being the standard character on . (The reason for this rather strange notation for the real line and its associated structures will be made clearer shortly.) Applying this formula to the (Archimedean) Gaussian function
which is its own (additive) Fourier transform, and then applying the multiplicative Fourier transform (i.e. the Mellin transform), one soon obtains (2). (Riemann also had another proof of the functional equation relying primarily on contour integration, which I will not discuss here.) One can “clean up” this proof a bit by replacing the Gaussian by a Dirac delta function, although one now has to work formally and “renormalise” by throwing away some infinite terms. (One can use the theory of distributions to make this latter approach rigorous, but I will not discuss this here.) Note how this proof combines the additive Fourier transform with the multiplicative Fourier transform. [Continuing with this theme, the Gamma function (5) is an inner product between an additive character and a multiplicative character , and the zeta function (1) can be viewed both additively, as a sum over n, or multiplicatively, as an Euler product.]
In the famous thesis of Tate, the above argument was reinterpreted using the language of the adele ring , with the Poisson summation formula (4) on replaced by the Poisson summation formula
on , where is the rationals, , and f is now a Schwartz-Bruhat function on . Applying this formula to the adelic (or global) Gaussian function , which is its own Fourier transform, and then using the adelic Mellin transform, one again obtains (2). Again, the proof can be cleaned up by replacing the Gaussian with a Dirac mass, at the cost of making the computations formal (or requiring the theory of distributions).
In this post I will write down both Riemann’s proof and Tate’s proof together (but omitting some technical details), to emphasise the fact that they are, in some sense, the same proof. However, Tate’s proof gives a high-level clarity to the situation (in particular, explaining more adequately why the Gamma factor at infinity (4) fits seamlessly with the Riemann zeta function (1) to form the Xi function (2)), and allows one to generalise the functional equation relatively painlessly to other zeta-functions and L-functions, such as Dedekind zeta functions and Hecke L-functions.
[Note: the material here is very standard in modern algebraic number theory; the post here is partially for my own benefit, as most treatments of this topic in the literature tend to operate in far higher levels of generality than I would prefer.]
– Riemann’s proof –
Applying the Poisson summation formula (2) for to the Schwartz function (8), we see that the theta function
obeys the functional equation
for . In particular, since is rapidly decreasing as , we see that is rapidly decreasing as .
Formally, we can take Mellin transforms of (11) and conclude that
for any s, where is the standard multiplicative Haar measure on . This does not make rigorous sense, because the integrands here diverge at 0 and at infinity (which is ultimately due to the poles of the Riemann Xi function at s=0 and s=1), but let us forge ahead regardless. By making the change of variables and using (3), (4), we see that
and so from (10) and (1) we formally have
If we casually discard the divergent integral and apply (12), we formally obtain the functional equation (2).
Of course, the above computations were totally formal in nature. Nevertheless it is possible to make the argument rigorous. For instance, when , we have a rigorous version of (14), namely
Observe that this expression extends meromorphically to all of s and can thus be taken as a definition of for all , and the functional equation (2) is then manifestly obvious.
Here is a slightly different way to view the above computations. Since the Gaussian (8) is its own Fourier transform, we see for every that the Fourier transform of is . Integrating this fact against on using (13), we obtain (formally) at least that the Fourier transform of is . (Note from scaling considerations it is formally clear that the Fourier transform of must be some sort of constant multiple of ; the Gamma factors can thus be viewed as the normalisation of these multiplicative characters that is compatible with the Fourier transform.) Formally applying the Poisson summation formula (4) to this, and casually discarding the singular terms at the origin, we obtain (2). One can make the above computations rigorous using the theory of distributions, and by using Gaussians to regularise the various integrals and summations appearing here, in which case the computations become essentially equivalent to the previous ones.
– p-adic analogues –
The above “Archimedean” Fourier analysis on has analogues in the p-adic completions of the rationals . (This is analogous to my discussion of dyadic models, although the p-adics are still characteristic 0 and are thus not as dyadic as their function field cousins , .)
Recall that the reals are the metric completion of the rationals with respect to the metric arising from the usual Archimedean absolute value . This absolute value obeys the following basic properties:
- Positivity: we have for all x, with equality if and only if x=0.
- Multiplicativity: we have for all x, y.
- Triangle inequality: We have for all x, y.
A function from k to with the above three properties is known as an absolute value (or valuation) on k. In addition to the Archimedean absolute value, each prime p defines a p-adic absolute value on k, defined by the formula , where n is the number of times p divides x. (This number could be negative if the denominator of the rational number x contains factors of p.) Equivalently, is the unique valuation such that and whenever n is an integer coprime to p.
One easily verifies that is an absolute value; in fact it not only obeys the triangle inequality, but also the ultra-triangle inequality , making the p-adic absolute value a non-Archimedean absolute value.
A classical theorem of Ostrowski asserts that the Archimedean absolute value and the p-adic absolute values are in fact the only absolute values on the rationals k, up to the renormalisation of replacing an absolute value |x| with a power . If we define a place to be an absolute value up to renormalisation, we thus see that the rationals k have one Archimedean (or infinite) place , together with one non-Archimedean (or finite) place p for every prime.
One could have set to some other value between 0 and 1 than 1/p (thus replacing with some power ) and still get an absolute value; but this normalisation is natural because it allows one to write the fundamental theorem of arithmetic in the appealing form
for all (17)
where ranges over all places, and is the multiplicative group of k.
If one takes the metric completion of the rationals using a p-adic absolute value rather than the Archimedean one, one obtains the p-adic field . One can view this field as a kind of inverted version of the real field , in which p has been inverted to be small rather than large. Some illustrations of this inversion:
- In , the sequence goes to infinity as and goes to zero as ; in , it is the other way around.
- Elements of can be expressed base p as strings of digits that need not terminate to the right of the decimal point, but must terminate to the left. In , it is the other way around. (The famous ambiguity in does not occur in the p-adic field , because the latter has the topology of a Cantor space rather than a continuum.)
- In , the integers is closed and forms a discrete cocompact additive subgroup. In , the integers are not closed, but their closure (the ring of p-adic integers) forms a compact codiscrete additive subgroup.
Despite this inversion, we can obtain analogues of most of the additive and multiplicative Fourier analytic computations of the previous section for the p-adics.
Let’s first begin with the additive Fourier structure. By the theory of Haar measures, there is a unique translation-invariant measure on which assigns a unit mass to the compact codiscrete subgroup . One can check that this measure interacts with dilations in the expected manner, thus
for all absolutely integrable f and all invertible .
Just as has a standard character , we can define a standard character as the unique character (i.e. continuous homomorphism from to ) such that for all integers n (in particular, is trivial on the integers, just as is). One easily verifies that this is indeed a character. From this and the additive Haar measure , we can now define the p-adic Fourier transform
for reasonable (e.g. absolutely integrable) f, and it is a routine matter to verify all the usual Fourier-analytic identities for this transform (or one can appeal to the general theory of Fourier analysis on locally compact abelian groups).
In , we have the Gaussian function (8), which is its own Fourier transform. In , the analogous Gaussian function is given by the formula
i.e. the p-adic Gaussian is just the indicator function of the p-adic integers. One easily verifies that this function is also its own Fourier transform.
Now we turn to the multiplicative Fourier theory for . The natural multiplicative Haar measure on is given by the formula ; the normalisation factor is natural in order for the group of units to have unit mass.
In , we see from (13) that the Gamma factor at infinity can be expressed (for ) as the Mellin transform of the Gaussian:
In analogy with this, we can define the Gamma factor at p by the formula
Due to the simple and explicit nature of all the expressions on the right-hand side, it is a straightforward matter to compute this factor explicitly; it becomes
for , at least; of course, one can then extend meromorphically in the obvious manner.
In , we showed (formally, at least) that and were Fourier transforms of each other. One can similarly show that and are Fourier transforms of each other in .
On the other hand, there is no obvious analogue of the Poisson summation formula manipulations for , because (unlike ), lacks a discrete cocompact subgroup.
– Tate’s proof –
We have just performed some “local” additive and multiplicative Fourier analysis at a single place. (This use of “local” may seem unrelated to the topological or analytical notion of “local”, as in “in the vicinity of a single point”, but it is actually much the same concept; compare for instance the formal power series in p for a p-adic number with the Taylor series expansion in of a function f(t) around a point . Indeed one can view local analysis at a place p as being the analysis of the integers or rationals when p is “close to zero”; one can make this precise using the language of schemes, but we will not do so here.)
In his famous thesis, Tate observed that all these local Fourier-analytic computations could be unified into a single global Fourier-analytic computation, using the languge of the adele ring . This ring is the set of all tuples , where ranges over places and , and furthermore all but finitely many of the lie in their associated ring of integers . (Equivalently, the adele ring is the tensor product of the rationals with the ring of integral adeles .) This restriction that the consists mostly of integers in is important for a large variety of analytic and algebraic reasons; for instance, it keeps the adele ring -compact.
Many of the structures and objects on the local fields can be multiplied together to form corresponding global structures on the adele ring. For instance:
- The commutative ring structures on the multiply together to give a commutative ring structure on .
- The locally compact Hausdorff structures on the multiply together to give a locally compact Hausdorff structure on .
- The local additive Haar measures on the multiply together to give a global additive Haar measure on .
- The local characters on the multiply together to give a global character (here it is essential that most components of an adele are integers, and so are trivial with respect to their local character).
- The local additive Fourier transforms on the then multiply to form a global additive Fourier transform on , defined as for reasonable f.
- The local absolute values on multiply to form a global absolute value on , though with the important caveat that |x| can vanish for non-zero x (indeed, a simple calculation using Euler’s observation shows that almost every x does this, with respect to additive Haar measure). The x for which |x| is non-zero are invertible and known as ideles, and form a multiplicative group ; the ideles have measure zero inside the adeles.
- The local gaussians multiply together to form a global gaussian , which is its own Fourier transform.
- The embeddings at each place multiply together to form a diagonal embedding . This embedding is both discrete (by the fundamental theorem of arithmetic) and cocompact (this is basically because the integers are cocompact in the adelic integers).
- The local multiplicative Haar measures multiply together to form a global multiplicative Haar measure , though one should caution that this measure is supported on the ideles rather than the adeles .
- The local Gamma factors for each place multiply together to form the Riemann Xi function (3) (for at least), thanks to the Euler product formula .
Recall that the local Gamma factors were the local Mellin transforms of the local Gaussians. Multiplying this together, we see that the Riemann Xi function is the global Mellin transform of the global Gaussian:
Our derivation of (24) used the Euler product formula. Another way to establish (24) using the original form (1) of the zeta function is to observe (thanks to the fundamental theorem of arithmetic) that the set is a fundamental domain for the action of on , thus
Partitioning (24) using (25) and then using (13) and (1) one can give an alternate derivation of (24). (The two derivations are ultimately the same, of course, since the Euler product formula is itself essentially a restatement of the fundamental theorem of arithmetic.)
In his thesis, Tate established the Poisson summation formula (9) for the adeles for all sufficiently nice f (e.g. any f in the Schwartz-Bruhat class would do). Applying this to the global gaussian g, we conclude that the global Theta function obeys the functional equation
for all ideles t. This formally implies that
which on applying (25) and (24) (and casually discarding the singular contributions of ) yields the functional equation (2). One can make this formal computation rigorous in exactly the same way that Riemann’s proof was made rigorous in previous sections.
Recall that Riemann’s proof could also be established by inspecting the Fourier transforms of . A similar approach can work here. If we (very formally!) apply the Poisson summation formula (9) to the measure , one obtains
Unpacking this summation using (25) (and (17)), and casually discarding the a=0 term, we formally conclude that
Rescaling this, we formally conclude that the Fourier transform of is . Inserting this into (24), the functional equation (2) formally follows from Parseval’s theorem; alternatively, one can derive it by multiplying together all the local facts that the Fourier transform of in is . These arguments can be made rigorous using the theory of distributions (and a lot of care), but we will not do so here.