Richard Oberlin, Andreas Seeger, Christoph Thiele, Jim Wright, and I have just uploaded to the arXiv our paper “A variation norm Carleson theorem“, submitted to J. Europ. Math. Soc..
The celebrated Carleson-Hunt theorem asserts that if is an
function for some
, then the partial Fourier series
of converge to
almost everywhere. (The claim fails for
, as shown by a famous counterexample of Kolmogorov.) The theorem follows easily from the inequality
(1)
where , and
depends only on
. Indeed, one first verifies Carleson’s theorem for a dense subclass of
(e.g. the space of test functions) and then uses a standard limiting argument involving (1) (this is an example of the trick “give yourself an epsilon of room“).
The Carleson-Hunt theorem shows that converges as
for almost every
, but does not say much more about the nature of that convergence. One way to measure the strength of the convergence is to introduce the variational norms
for various . For
this is the Carleson maximal function
; for
this is the total variation of the sequence
, which one can verify to be the
norm of
.
Our main result is to obtain the following variational strengthening of (1)
(2)
whenever and
; these conditions on
are optimal. (For those readers familiar with martingales, the relationship of (2) to (1) is analogous to the relationship between Lepingle’s inequality (a variant of the more well known Doob’s inequality) and the Hardy-Littlewood maximal inequality.)
Because a sequence with finite -variation for some finite
is necessarily convergent, this leads to a new proof of the Carleson-Hunt theorem without the need for a dense subclass. In particular, we obtain ergodic theory analogues of this result, in the case where no obvious dense subclass is available; more precisely, we obtain a new (and more “quantitative”) proof of a Wiener-Wintner-type theorem (first obtained by Lacey and Terwilleger), namely that given any measure-preserving group
on a measure space
, and a function
for some
, one has for almost every
that for every real number
, the integrals
converge as
for every
(not merely almost every
).
The estimate (2) also provides a new proof of a result of Christ and Kiselev on the almost everywhere boundedness of eigenfunctions of Schrodinger operators with potentials with
. Unfortunately, due to various endpoint issues, this barely fails to settle the endpoint case
, a conjecture known as the nonlinear Carleson conjecture (discussed in this previous post).
The approach here follows the Lacey-Thiele approach to Carleson’s theorem (which is in turn based on an earlier approach of Fefferman), based on linearising the Carleson maximal function by picking the integer which attains the supremum
, dividing phase space into “tiles”, and organising these tiles into “trees” and then into “forests” based on the distribution of the phase space “energy” of
, together with the “mass” distribution of the graph of the function
. One then needs to combine various “Bessel” type bounds on the energy, “Vitali-type” bounds on the mass, and “Calderon-Zygmund” type estimates on the trees together to obtain the result.
In our setting, the main new difficulty is that there are multiple integers associated to each point rather than one, which requires a more detailed analysis of the “multiplicity” of forests that was not present in earlier work. (Also, the Calderon-Zygmund estimates need to be replaced with Lepingle type estimates, though this is a relatively standard change, being first introduced in a paper of Bourgain.)

11 comments
Comments feed for this article
12 October, 2009 at 4:12 am
Andrew Bailey
Small typo in your definition of the variational norm: | -> (.
Nice paper. Could you shed a little more light on your comment about the analysis of the “multiplicity” of forests?
By the way, should you feel inclined to write anything more on this topic, I for one would be interested in reading it. :-)
12 October, 2009 at 8:21 am
Terence Tao
Thanks for the correction!
A forest is a collection of trees T which are basically disjoint in phase space, but possibly overlapping in physical space; in particular, the physical support
of the trees
in a forest may overlap, so the multiplicity function
may be larger than 1 on occasion. Previous work in this area measured the size of a forest by the
norm of its multiplicity function (i.e. the sum of the widths
of its constituent trees); for our work we also need to consider other
norms (actually we rely mostly on the
norm and the BMO norm, and then interpolate to get everything else.) The basic point is that if one needs to sample the Fourier series at k points to get the maximum variation, then one expects the most important forests to have multiplicity between 1 and k (higher multiplicity forests are also present, but have a diminished contribution to the final estimate compared to the lower-multiplicity forests).
13 October, 2009 at 4:46 am
Andrew Bailey
Interesting. I hadn’t thought of
in this way before. Thanks.
13 October, 2009 at 4:30 am
shannon7774
Is there a relationship between Carleson’s theorem and almost-sure convergence of an L^p martingale? You alluded to martingales, but I wasn’t sure if this is the connection point?
13 October, 2009 at 8:47 am
Terence Tao
Yes, the latter is roughly speaking the lacunary special case of the former. The lacunary partial Fourier series
, where
is a power of two, is roughly analogous to the projection of f to the
-algebra of functions on the unit circle which are constant on dyadic intervals of length
. (To make this analogy more accurate, one would have to smooth the Fourier series, leading to a Littlewood-Paley projection.) Pointwise convergence of the (smoothed out) lacunary partial Fourier series can be established by basically the same methods used to establish almost everywhere convergence for martingale projections. Whilst the entire Carleson maximal function is modeled by using all the tiles in phase space, smoothed out lacunary partial Fourier series are basically modeled by a single tree of tiles, with base frequency zero. (A tree with non-zero base frequency corresponds to a modulated lacunary partial Fourier series, such as
.)
13 October, 2009 at 9:48 am
Anonymous
Is there a reason you choose to work on the circle in this situation? Typically the Lacey-Thiele methods (including an earlier paper of yours) are carried out on the line.
13 October, 2009 at 10:46 am
Terence Tao
Well, there is a correspondence between the two: a suitably “scale-invariant” estimate on the circle can be rescaled to produce as a limiting case the corresponding estimate on the line, and in the converse direction there is a somewhat trickier transference argument (using things like Poisson summation) that recovers the circle case from the line case. Because the latter implication is a bit harder than the other, the circle result is slightly “deeper” and so we decided to make it our headline result, but the way we prove it is by first recalling the transference argument to reduce things to the line case, and then establish the estimate on the line by the usual time-frequency methods.
14 October, 2009 at 6:32 am
Andrew Bailey
Working directly on the circle in the Lacey/Thiele paper does throw up a few interesting features. One that particularly intrigues me is that if you re-cast certain quantities viewed in the paper on the “time” side as being on the frequency side, you get oscillatory integrals which look like reasonably natural candidates for the the kinds of estimates that were obtained on the time side. However, try to do the same thing on the circle and you get oscillatory sums and suddenly things are less clear.
1 November, 2009 at 9:53 am
Vorlesung Funktionalanalysis: Erste Etappe « UGroh's Weblog
[...] einer Menge vom Lebesgueschen Maß konvergiert. Mehr zum Theorem von Carleson findet sich auf dem Blog von T. Tao. Im Gegensatz hierzu gibt es Funktionen, deren Forurierreihe nicht punktweise konvergiert. Dieses [...]
16 November, 2009 at 4:09 pm
Mark Lewko
Can you say anything about the (double) endpoint case when r=p=2? I understand that the inequality as stated in (2) fails, but is it possible that the failure is, say, only logarithmic? That is, could there be an inequality of the form || ||s[f](n,x)||_{v^2}||_{2} << ln(N)^{b} ||f||_{2} for f(x) = \sum_{n=-N}^{N} \hat{f}(n) e(nx)?
16 November, 2009 at 4:23 pm
Terence Tao
Yes, this should be the case for some b (perhaps even b=1). Generally, the only sources of divergence in these sorts of estimates come from a pileup of scales; if one only considers frequency scales from 1 to N then there are only a logarithmic number of scales in play, and each scale should only contribute a bounded amount. (Note for instance that Carleson’s theorem with a logarithmic loss is quite easy to show, basically because the Dirichlet kernel has an L^1 norm which is logarithmic in nature (which is closely related to the fact that the L^1 norm at each dyadic scale is bounded).)