The prime number theorem can be expressed as the assertion
as , where
is the von Mangoldt function. It is a basic result in analytic number theory, but requires a bit of effort to prove. One “elementary” proof of this theorem proceeds through the Selberg symmetry formula
where the second von Mangoldt function is defined by the formula
(We are avoiding the use of the symbol here to denote Dirichlet convolution, as we will need this symbol to denote ordinary convolution shortly.) For the convenience of the reader, we give a proof of the Selberg symmetry formula below the fold. Actually, for the purposes of proving the prime number theorem, the weaker estimate
suffices.
In this post I would like to record a somewhat “soft analysis” reformulation of the elementary proof of the prime number theorem in terms of Banach algebras, and specifically in Banach algebra structures on (completions of) the space of compactly supported continuous functions
equipped with the convolution operation
This soft argument does not easily give any quantitative decay rate in the prime number theorem, but by the same token it avoids many of the quantitative calculations in the traditional proofs of this theorem. Ultimately, the key “soft analysis” fact used is the spectral radius formula
for any element of a unital commutative Banach algebra
, where
is the space of characters (i.e., continuous unital algebra homomorphisms from
to
) of
. This formula is due to Gelfand and may be found in any text on Banach algebras; for sake of completeness we prove it below the fold.
The connection between prime numbers and Banach algebras is given by the following consequence of the Selberg symmetry formula.
Theorem 1 (Construction of a Banach algebra norm) For any
, let
denote the quantity
Then
is a seminorm on
with the bound
for all
. Furthermore, we have the Banach algebra bound
for all
.
We prove this theorem below the fold. The prime number theorem then follows from Theorem 1 and the following two assertions. The first is an application of the spectral radius formula (6) and some basic Fourier analysis (in particular, the observation that contains a plentiful supply of local units):
Theorem 2 (Non-trivial Banach algebras with many local units have non-trivial spectrum) Let
be a seminorm on
obeying (7), (8). Suppose that
is not identically zero. Then there exists
such that
for all
. In particular, by (7), one has
whenever
is a non-negative function.
The second is a consequence of the Selberg symmetry formula and the fact that is real (as well as Mertens’ theorem, in the
case), and is closely related to the non-vanishing of the Riemann zeta function
on the line
:
Theorem 3 (Breaking the parity barrier) Let
. Then there exists
such that
is non-negative, and
Assuming Theorems 1, 2, 3, we may now quickly establish the prime number theorem as follows. Theorem 2 and Theorem 3 imply that the seminorm constructed in Theorem 1 is trivial, and thus
as for any Schwartz function
(the decay rate in
may depend on
). Specialising to functions of the form
for some smooth compactly supported
on
, we conclude that
as ; by the smooth Urysohn lemma this implies that
as for any fixed
, and the prime number theorem then follows by a telescoping series argument.
The same argument also yields the prime number theorem in arithmetic progressions, or equivalently that
for any fixed Dirichlet character ; the one difference is that the use of Mertens’ theorem is replaced by the basic fact that the quantity
is non-vanishing.
— 1. Proof of Selberg symmetry formula —
We now prove (2). From (3) we have
From the integral test we have the estimates
for some absolute constants whose exact value is unimportant for us, and for any
. We conclude that
for some further absolute constants . Replacing
by
and inserting this into (9), one obtains
The error term can be computed to be . The main term simplifies by Möbius inversion to
, and the claim follows.
— 2. Constructing the Banach algebra —
We now prove Theorem 1. It is convenient to transform the situation from the classical context of arithmetic functions on (such as
or
) to the more Fourier-analytic context of Radon measures on the real line
. Define the discrete Radon measure
and for any , let
denote the left translate of the measure
by
, thus
for any continuous compactly supported . We note in passing that the prime number theorem (1) is equivalent to the assertion that the translates
converge in the vague topology to Lebesgue measure
as
.
where is the convolution of the Radon measures
, and
is the measure
multiplied by the identity function
. From (4) one has
We claim that the Selberg symmetry formula (5) implies (in fact, it is equivalent to) the assertion that the translates converge in the vague topology to
. Indeed, (5) implies for any fixed
that
or equivalently that
which we rewrite as
Since for
, we thus have
which implies that converges vaguely to
, and the claim follows.
Now we begin the proof of Theorem 1. Observe that the quantity can be rewritten as
and converges vaguely to
, we see that the measures
are precompact in the vague topology, thanks to the Helly selection principle or Prokhorov theorem. In particular, we have
for some limit point of the translates
in the vague topology. From (12) we have
Finally, we prove (8). By(11), it suffices to show that
for any , where the
decay errors are allowed to depend on
. Since
converges vaguely to
, we already have from (10) that
so it suffices to show that
Let be Lebesgue measure on the half-line
. Then
, so
converges vaguely to
. The measure
is equal to
times the function
, so by Mertens’ theorem this function also converges vaguely to
. We conclude that
converges vaguely to , and so it suffices to show that
We rewrite this as
On the support of , we have
, so it suffices to show that
(The error term in can be controlled by using (15) with
replaced by
, and modifying the preceding arguments to replace
by
.)
From Fubini’s theorem we have
The integrand vanishes unless
. By (11), we have
and
and the claim (15) follows.
— 3. Non-trivial algebras with many local units have non-trivial spectrum —
We now prove Theorem 2. Let be the Banach algebra completion of
under the seminorm
(thus
is the space of Cauchy sequences in
, quotiented out by the sequences that go to zero in the seminorm
). Since
is not identically zero,
is a non-trivial commutative Banach algebra (but it is not necessarily unital).
It is convenient to adjoin a unit to
to create a unital commutative Banach algebra
with the extended norm
for and
; one easily verifies that
is a unital commutative Banach algebra.
Suppose that all elements of have zero spectral radius (as defined in (6)). Let
be a Schwartz function with compactly supported Fourier transform. Then we can find another Schwartz function
with compactly supported Fourier transform such that
(by ensuring that
on the support of
; thus
is a “local unit” on the Fourier support of
). Thus
for all
. But
has spectral radius zero, thus
is zero in
. By density this implies that
is trivial, a contradiction.
Thus there is an element of with positive spectral radius. Then by (6), there is a character
that is does not vanish identically on
. Suppose that for each
there exists
in the kernel of
whose Fourier coefficient
is non-vanishing. Since the kernel of
is a space closed with respect to convolutions by
functions, some Fourier analysis and a smooth partition of unity then shows that the kernel of
contains any Schwartz function with compactly supported Fourier transform, and thus by density
is trivial, a contradiction. Thus there must exist
such that
contains all test functions with Fourier coefficient vanishing at
. From this we conclude that
on
is a constant multiple of the Fourier coefficient map
; being a non-trivial algebra homomorphism on
, we thus have
for all . Since characters have norm at most
(as can be seen for instance from (6)), we obtain the claim.
— 4. Breaking the parity barrier —
We now prove Theorem 3. We divide into two cases, depending on whether or
. If
, we let
be a continuous function that equals
on
and is supported on
for some large
. From Mertens’ theorem we have
for sufficiently large depending on
, and thus
The claim then follows by taking sufficiently large.
Now suppose . In the language of Section 2, we have
for some limit point of the
. We can write the right-hand side as
for some phase . From (14),
is a real measure between
and
, so by the triangle inequality we have
Now we set , where
is as before. Then
Since is periodic with period
and has mean value strictly less than
(in fact, it has mean
), we thus have
if is sufficiently large depending on
. The claim follows.
— 5. The prime number theorem in arithmetic progressions —
Let be a non-principal Dirichlet character of some period
. We allow all implied constants in the
notation to depend on
. In this section we sketch the changes to the above arguments needed to establish
which gives the prime number in arithmetic progressions by the usual Fourier expansion into Dirichlet characters.
We have the twisted versions
and
of (3), (4). Since has mean zero, a decomposition into intervals of length
reveals that
from which we obtain the twisted Selberg symmetry formula
If we define the twisted measures
and
then
and hence converges weakly to zero as
. Introducing the twisted norms
we may verify that obeys the conclusions of Theorem 1.
By repeating the previous arguments, it will suffice that the analogue of Theorem 3 for holds. When
, we can argue as in Section 4, where the role of Mertens’ theorem is replaced by Dirichlet’s theorem
which is ultimately a consequence of the non-vanishing of .
For , the argument in Section 4 works with minimal changes if
is real-valued. If
is complex valued, it still takes only a finite number of values
in the unit disk. Then the limit measures
appearing in Section 4 are equal to Lebesgue measure
times a density taking values in the convex hull
of this finite set of values, which is a polygon in the unit disk. One can then modify the arguments in Section 4 to bound
for some phase . If we set
as before, we again observe that the function
is periodic and has mean strictly less than one, and so we can again establish the required bound
if
is large enough.
— 6. Proof of Gelfand formula —
We now prove (6).
If is a character, then it has an operator norm:
But we may eliminate this norm by using the “tensor power trick”: replacing with
and then taking
roots we conclude that
and then on sending we have
Replacing by
again, taking
roots, and sending
we conclude that
(The limit exists because is submultiplicative.) This gives one direction of (6). To give the other direction, suppose for sake of contradiction that we could find an
such that
for some real number .
There are two cases, depending on whether we can find a complex number with
and
non-invertible. First suppose that such a
exists; then
generates an ideal of
, which by Zorn’s lemma is contained in a maximal ideal
, whose quotient is then a field. By Neumann series, any element of
sufficiently close to the identity is invertible and thus not in
; since
is a field, we conclude that the complement of
is open, and so
is closed. This makes
a Banach algebra as well as a field. If
is not a multiple of the identity, then
is invertible for every
and so (by Neumann series)
is an analytic function from
to
which goes to zero at infinity, contradicting Liouville’s theorem. Thus
is one-dimensional (this is the Banach-Mazur theorem) and thus isomorphic to
; this gives a continuous unital algebra homomorphism
with
in the kernel, thus
, contradicting the second inequality in (16).
Now suppose that is invertible for all
. Then, as in the preceding argument,
is an analytic function from
to
which decays to zero at infinity, so we have the Cauchy integral formula
for any natural number . From the triangle inequality we conclude in particular that
which contradicts the first inequality in (16).
11 comments
Comments feed for this article
25 October, 2014 at 1:37 pm
voloch
What’s the role of $\epsilon$ in theorem 3?
[Oops, that shouldn’t be there – deleted, thanks – T.]
25 October, 2014 at 2:03 pm
wubr2000
Reblogged this on Importantish.
26 October, 2014 at 3:08 am
Mats Granvik
26 October, 2014 at 9:39 am
Matt R.
Reblogged this on Math by Matt.
26 October, 2014 at 1:25 pm
Anonymous
Seeing double. There is a second copy at the end.
[Corrected, thanks – T.]
28 October, 2014 at 7:43 am
Yemon Choi
The title of Section 3 seems a little misleading, since it’s perfectly possible to have infinite-dimensional commutative Banach algebras in which every element has spectral radius zero, e.g. the quotient of the convolution algebra $L^1(R_+)$ by the ideal of functions supported on $[1,\infty)$. (This quotient algebra even has a bounded approximate identity; of course it doesn’t have local units on a dense subalgebra, otherwise one could run the argument given in your post.)
Of course this is merely a quibble over terminology and doesn’t affect the actual mathematics in the post!
[Terminology changed, thanks – T.]
5 November, 2014 at 8:34 am
Tony Huynh
The link to the Selberg Symmetry Formula is the same as the link to the von Mangoldt function, which I suspect is unintended.
[Corrected, thanks – T.]
5 November, 2014 at 7:18 pm
peter
i would love to understand this article but I am to dumb for that also my iq is only 2 digit but I want to have 3 digit iq. Can u help me ?
29 November, 2014 at 7:11 pm
zhenyuliao
Reblogged this on zhenyuliao and commented:
Fun proof.
16 January, 2015 at 12:40 pm
hiklicepleh
You can say something about the non-trivial estimates
?
22 September, 2016 at 8:45 am
254A, Notes 1: Elementary multiplicative number theory | What's new
[…] for various cutoff functions , such as smooth, compactly supported functions. See for instance this previous blog post for an instance of such an approach. Another related approach is the “pretentious” […]