You are currently browsing the monthly archive for March 2020.
This set of notes focuses on the restriction problem in Fourier analysis. Introduced by Elias Stein in the 1970s, the restriction problem is a key model problem for understanding more general oscillatory integral operators, and which has turned out to be connected to many questions in geometric measure theory, harmonic analysis, combinatorics, number theory, and PDE. Only partial results on the problem are known, but these partial results have already proven to be very useful or influential in many applications.
We work in a Euclidean space . Recall that
is the space of
-power integrable functions
, quotiented out by almost everywhere equivalence, with the usual modifications when
. If
then the Fourier transform
will be defined in this course by the formula
From the dominated convergence theorem we see that is a continuous function; from the Riemann-Lebesgue lemma we see that it goes to zero at infinity. Thus
lies in the space
of continuous functions that go to zero at infinity, which is a subspace of
. Indeed, from the triangle inequality it is obvious that
If , then Plancherel’s theorem tells us that we have the identity
Because of this, there is a unique way to extend the Fourier transform from
to
, in such a way that it becomes a unitary map from
to itself. By abuse of notation we continue to denote this extension of the Fourier transform by
. Strictly speaking, this extension is no longer defined in a pointwise sense by the formula (1) (indeed, the integral on the RHS ceases to be absolutely integrable once
leaves
; we will return to the (surprisingly difficult) question of whether pointwise convergence continues to hold (at least in an almost everywhere sense) later in this course, when we discuss Carleson’s theorem. On the other hand, the formula (1) remains valid in the sense of distributions, and in practice most of the identities and inequalities one can show about the Fourier transform of “nice” functions (e.g., functions in
, or in the Schwartz class
, or test function class
) can be extended to functions in “rough” function spaces such as
by standard limiting arguments.
By (2), (3), and the Riesz-Thorin interpolation theorem, we also obtain the Hausdorff-Young inequality
for all and
, where
is the dual exponent to
, defined by the usual formula
. (One can improve this inequality by a constant factor, with the optimal constant worked out by Beckner, but the focus in these notes will not be on optimal constants.) As a consequence, the Fourier transform can also be uniquely extended as a continuous linear map from
. (The situation with
is much worse; see below the fold.)
The restriction problem asks, for a given exponent and a subset
of
, whether it is possible to meaningfully restrict the Fourier transform
of a function
to the set
. If the set
has positive Lebesgue measure, then the answer is yes, since
lies in
and therefore has a meaningful restriction to
even though functions in
are only defined up to sets of measure zero. But what if
has measure zero? If
, then
is continuous and therefore can be meaningfully restricted to any set
. At the other extreme, if
and
is an arbitrary function in
, then by Plancherel’s theorem,
is also an arbitrary function in
, and thus has no well-defined restriction to any set
of measure zero.
It was observed by Stein (as reported in the Ph.D. thesis of Charlie Fefferman) that for certain measure zero subsets of
, such as the sphere
, one can obtain meaningful restrictions of the Fourier transforms of functions
for certain
between
and
, thus demonstrating that the Fourier transform of such functions retains more structure than a typical element of
:
Theorem 1 (Preliminary
restriction theorem) If
and
, then one has the estimate
for all Schwartz functions
, where
denotes surface measure on the sphere
. In particular, the restriction
can be meaningfully defined by continuous linear extension to an element of
.
Proof: Fix . We expand out
From (1) and Fubini’s theorem, the right-hand side may be expanded as
where the inverse Fourier transform of the measure
is defined by the formula
In other words, we have the identity
using the Hermitian inner product . Since the sphere
have bounded measure, we have from the triangle inequality that
Also, from the method of stationary phase (as covered in the previous class 247A), or Bessel function asymptotics, we have the decay
for any (note that the bound already follows from (6) unless
). We remark that the exponent
here can be seen geometrically from the following considerations. For
, the phase
on the sphere is stationary at the two antipodal points
of the sphere, and constant on the tangent hyperplanes to the sphere at these points. The wavelength of this phase is proportional to
, so the phase would be approximately stationary on a cap formed by intersecting the sphere with a
neighbourhood of the tangent hyperplane to one of the stationary points. As the sphere is tangent to second order at these points, this cap will have diameter
in the directions of the
-dimensional tangent space, so the cap will have surface measure
, which leads to the prediction (7). We combine (6), (7) into the unified estimate
where the “Japanese bracket” is defined as
. Since
lies in
precisely when
, we conclude that
Applying Young’s convolution inequality, we conclude (after some arithmetic) that
whenever , and the claim now follows from (5) and Hölder’s inequality.
Remark 2 By using the Hardy-Littlewood-Sobolev inequality in place of Young’s convolution inequality, one can also establish this result for
.
Motivated by this result, given any Radon measure on
and any exponents
, we use
to denote the claim that the restriction estimate
for all Schwartz functions ; if
is a
-dimensional submanifold of
(possibly with boundary), we write
for
where
is the
-dimensional surface measure on
. Thus, for instance, we trivially always have
, while Theorem 1 asserts that
holds whenever
. We will not give a comprehensive survey of restriction theory in these notes, but instead focus on some model results that showcase some of the basic techniques in the field. (I have a more detailed survey on this topic from 2003, but it is somewhat out of date.)
Read the rest of this entry »
After some discussion with the applied math research groups here at UCLA (in particular the groups led by Andrea Bertozzi and Deanna Needell), one of the members of these groups, Chris Strohmeier, has produced a proposal for a Polymath project to crowdsource in a single repository (a) a collection of public data sets relating to the COVID-19 pandemic, (b) requests for such data sets, (c) requests for data cleaning of such sets, and (d) submissions of cleaned data sets. (The proposal can be viewed as a PDF, and is also available on Overleaf). As mentioned in the proposal, this database would be slightly different in focus than existing data sets such as the COVID-19 data sets hosted on Kaggle, with a focus on producing high quality cleaned data sets. (Another relevant data set that I am aware of is the SafeGraph aggregated foot traffic data, although this data set, while open, is not quite public as it requires a non-commercial agreement to execute. Feel free to mention further relevant data sets in the comments.)
This seems like a very interesting and timely proposal to me and I would like to open it up for discussion, for instance by proposing some seed requests for data and data cleaning and to discuss possible platforms that such a repository could be built on. In the spirit of “building the plane while flying it”, one could begin by creating a basic github repository as a prototype and use the comments in this blog post to handle requests, and then migrate to a more high quality platform once it becomes clear what direction this project might move in. (For instance one might eventually move beyond data cleaning to more sophisticated types of data analysis.)
UPDATE, Mar 25: a prototype page for such a clearinghouse is now up at this wiki page.
UPDATE, Mar 27: the data cleaning aspect of this project largely duplicates the existing efforts at the United against COVID-19 project, so we are redirecting requests of this type to that project (and specifically to their data discourse page). The polymath proposal will now refocus on crowdsourcing a list of public data sets relating to the COVID-19 pandemic.
At the most recent MSRI board of trustees meeting on Mar 7 (conducted online, naturally), Nicolas Jewell (a Professor of Biostatistics and Statistics at Berkeley, also affiliated with the Berkeley School of Public Health and the London School of Health and Tropical Disease), gave a presentation on the current coronavirus epidemic entitled “2019-2020 Novel Coronavirus outbreak: mathematics of epidemics, and what it can and cannot tell us”. The presentation (updated with Mar 18 data), hosted by David Eisenbud (the director of MSRI), together with a question and answer session, is now on Youtube:
(I am on this board, but could not make it to this particular meeting; I caught up on the presentation later, and thought it would of interest to several readers of this blog.) While there is some mathematics in the presentation, it is relatively non-technical.
Just a short post to note that this year’s Abel prize has been awarded jointly to Hillel Furstenberg and Grigory Margulis for “for pioneering the use of methods from probability and dynamics in group theory, number theory and combinatorics”. I was not involved in the decision making process of the Abel committee this year, but I certainly feel that the contributions of both mathematicians are worthy of the prize. Certainly both mathematicians have influenced my own work (for instance, Furstenberg’s proof of Szemeredi’s theorem ended up being a key influence in my result with Ben Green that the primes contain arbitrarily long arithmetic progressions); see for instance these blog posts mentioning Furstenberg, and these blog posts mentioning Margulis.
Next quarter, starting March 30, I will be teaching “Math 247B: Classical Fourier Analysis” here at UCLA. (The course should more accurately be named “Modern real-variable harmonic analysis”, but we have not gotten around to implementing such a name change.) This class (a continuation of Math 247A from previous quarter, taught by my colleague, Monica Visan) will cover the following topics:
- Restriction theory and Strichartz estimates
- Decoupling estimates and applications
- Paraproducts; time frequency analysis; Carleson’s theorem
As usual, lecture notes will be made available on this blog.
Unlike previous courses, this one will be given online as part of UCLA’s social distancing efforts. In particular, the course will be open to anyone with an internet connection (no UCLA affiliation is required), though non-UCLA participants will not have full access to all aspects of the course, and there is the possibility that some restrictions on participation may be imposed if there are significant disruptions to class activity. For more information, see the course description. UPDATE: due to time limitations, I will not be able to respond to personal email inquiries about this class from non-UCLA participants in the course. Please use the comment thread to this blog post for such inquiries. I will also update the course description throughout the course to reflect the latest information about the course, both for UCLA students enrolled in the course and for non-UCLA participants.
Just a short note that the memorial article “Analysis and applications: The mathematical work of Elias Stein” has just been published in the Bulletin of the American Mathematical Society. This article was a collective effort led by Charlie Fefferman, Alex Ionescu, Steve Wainger and myself to describe the various mathematical contributions of Elias Stein, who passed away in December 2018; it also features contributions from Loredana Lanzani, Akos Magyar, Mariusz Mirek, Alexander Nagel, Duong Phong, Lillian Pierce, Fulvio Ricci, Christopher Sogge, and Brian Street. (My contribution was mostly focused on Stein’s contribution to restriction theory.)
As part of social distancing efforts to slow down the spread of the novel coronavirus, several universities have now transitioned, or begun transitioning, to online teaching models. (My home university of UCLA has not yet done so, but is certainly considering the option. UPDATE: we are transitioning.) As a consequence, I thought it might be an appropriate time to start a discussion on the pros and cons of various technologies for giving talks and lectures online, particularly in the context of mathematical talks where there may be special considerations coming for instance for the need to do mathematical computations on a blackboard or equivalent. My own institution is for instance recommending the use of Zoom for lectures and Respondus for giving finals, and has a limited number of classrooms set up for high quality video and audio casting, as well as a platform for discussion forums and course materials for each class. For smaller meetings, such as one-on-one meetings with graduate students, one can of course improvise using off-the-shelf tools such as Skype. I would be interested in knowing what other options are available and what success lecturers have had with them.
The same goes for giving mathematical talks. I learned recently (from Jordan Ellenberg) that Rachel Preis has recently launched a “virtual math seminar on open conjectures in number theory in arithmetic geometry” (VaNTAGe) that is run using the BlueJeans platform. And for many years there has been a regular joint math seminar between UC Berkeley, U. Paris-Nord, U. Zurich, and U. Bonn (see e.g., this calendar), and nowadays many mathematical institutes stream their talks or at least videotape them to place them online later. Our own department does not have a dedicated lecture hall for videocasting, so I would be interested in knowing of any successful ways to improvise such casting with more portable technology. (Skype in principle could work here, but I have found this to be clunky even for smaller meetings involving just a handful of partcipants.)
EDIT: in addition to lectures and talks, it would also be topical to discuss online options for office hours, midterms, and final exams.
In the modern theory of additive combinatorics, a large role is played by the Gowers uniformity norms , where
,
is a finite abelian group, and
is a function (one can also consider these norms in finite approximate groups such as
instead of finite groups, but we will focus on the group case here for simplicity). These norms can be defined by the formula
where we use the averaging notation
for any non-empty finite set (with
denoting the cardinality of
), and
is the multiplicative discrete derivative operator
One reason why these norms play an important role is that they control various multilinear averages. We give two sample examples here:
We establish these claims a little later in this post.
In some more recent literature (e.g., this paper of Conlon, Fox, and Zhao), the role of Gowers norms have been replaced by (generalisations) of the cut norm, a concept originating from graph theory. In this blog post, it will be convenient to define these cut norms in the language of probability theory (using boldface to denote random variables).
Definition 2 (Cut norm) Let
be independent random variables with
; to avoid minor technicalities we assume that these random variables are discrete and take values in a finite set. Given a random variable
of these independent random variables, we define the cut norm
where the supremum ranges over all choices
of random variables
that are
-bounded (thus
surely), and such that
does not depend on
.
If
, we abbreviate
as
.
Strictly speaking, the cut norm is only a cut semi-norm when , but we will abuse notation by referring to it as a norm nevertheless.
Example 3 If
is a bipartite graph, and
,
are independent random variables chosen uniformly from
respectively, then
where the supremum ranges over all
-bounded functions
,
. The right hand side is essentially the cut norm of the graph
, as defined for instance by Frieze and Kannan.
The cut norm is basically an expectation when :
Example 4 If
, we see from definition that
If
, one easily checks that
where
is the conditional expectation of
to the
-algebra generated by all the variables other than
, i.e., the
-algebra generated by
. In particular, if
are independent random variables drawn uniformly from
respectively, then
Here are some basic properties of the cut norm:
Lemma 5 (Basic properties of cut norm) Let
be independent discrete random variables, and
a function of these variables.
- (i) (Permutation invariance) The cut norm
is invariant with respect to permutations of the
, or permutations of the
.
- (ii) (Conditioning) One has
where on the right-hand side we view, for each realisation
of
,
as a function
of the random variables
alone, thus the right-hand side may be expanded as
- (iii) (Monotonicity) If
, we have
- (iv) (Multiplicative invariances) If
is a
-bounded function that does not depend on one of the
, then
In particular, if we additionally assume
, then
- (v) (Cauchy-Schwarz) If
, one has
where
is a copy of
that is independent of
and
is the random variable
- (vi) (Averaging) If
and
, where
is another random variable independent of
, and
is a random variable depending on both
and
, then
Proof: The claims (i), (ii) are clear from expanding out all the definitions. The claim (iii) also easily follows from the definitions (the left-hand side involves a supremum over a more general class of multipliers , while the right-hand side omits the
multiplier), as does (iv) (the multiplier
can be absorbed into one of the multipliers in the definition of the cut norm). The claim (vi) follows by expanding out the definitions, and observing that all of the terms in the supremum appearing in the left-hand side also appear as terms in the supremum on the right-hand side. It remains to prove (v). By definition, the left-hand side is the supremum over all quantities of the form
where the are
-bounded functions of
that do not depend on
. We average out in the
direction (that is, we condition out the variables
), and pull out the factor
(which does not depend on
), to write this as
which by Cauchy-Schwarz is bounded by
which can be expanded using the copy as
Expanding
and noting that each is
-bounded and independent of
for
, we obtain the claim.
Now we can relate the cut norm to Gowers uniformity norms:
Lemma 6 Let
be a finite abelian group, let
be independent random variables uniformly drawn from
for some
, and let
. Then
If
is additionally assumed to be
-bounded, we have the converse inequalities
Proof: Applying Lemma 5(v) times, we can bound
where are independent copies of
that are also independent of
. The expression inside the norm can also be written as
so by Example 4 one can write (6) as
which after some change of variables simplifies to
which by Cauchy-Schwarz is bounded by
which one can rearrange as
giving (2). A similar argument bounds
by
which gives (3).
For (4), we can reverse the above steps and expand as
which we can write as
for some -bounded function
. This can in turn be expanded as
for some -bounded functions
that do not depend on
. By Example 4, this can be written as
which by several applications of Theorem 5(iii) and then Theorem 5(iv) can be bounded by
giving (4). A similar argument gives (5).
Now we can prove Proposition 1. We begin with part (i). By permutation we may assume , then by translation we may assume
. Replacing
by
and
by
, we can write the left-hand side of (1) as
where
is a -bounded function that does not depend on
. Taking
to be independent random variables drawn uniformly from
, the left-hand side of (1) can then be written as
which by Example 4 is bounded in magnitude by
After many applications of Lemma 5(iii), (iv), this is bounded by
By Lemma 5(ii) we may drop the variable, and then the claim follows from Lemma 6.
For part (ii), we replace by
and
by
to write the left-hand side as
the point here is that the first factor does not involve , the second factor does not involve
, and the third factor has no quadratic terms in
. Letting
be independent variables drawn uniformly from
, we can use Example 4 to bound this in magnitude by
which by Lemma 5(i),(iii),(iv) is bounded by
and then by Lemma 5(v) we may bound this by
which by Example 4 is
Now the expression inside the expectation is the product of four factors, each of which is or
applied to an affine form
where
depends on
and
is one of
,
,
,
. With probability
, the four different values of
are distinct, and then by part (i) we have
When they are not distinct, we can instead bound this quantity by . Taking expectations in
, we obtain the claim.
The analogue of the inverse theorem for cut norms is the following claim (which I learned from Ben Green):
Lemma 7 (
-type inverse theorem) Let
be independent random variables drawn from a finite abelian group
, and let
be
-bounded. Then we have
where
is the group of homomorphisms
is a homomorphism from
to
, and
.
Proof: Suppose first that for some
, then by definition
for some -bounded
. By Fourier expansion, the left-hand side is also
where . From Plancherel’s theorem we have
hence by Hölder’s inequality one has for some
, and hence
Conversely, suppose (7) holds. Then there is such that
which on substitution and Example 4 implies
The term splits into the product of a factor
not depending on
, and a factor
not depending on
. Applying Lemma 5(iii), (iv) we conclude that
The claim follows.
The higher order inverse theorems are much less trivial (and the optimal quantitative bounds are not currently known). However, there is a useful degree lowering argument, due to Peluse and Prendiville, that can allow one to lower the order of a uniformity norm in some cases. We give a simple version of this argument here:
Lemma 8 (Degree lowering argument, special case) Let
be a finite abelian group, let
be a non-empty finite set, and let
be a function of the form
for some
-bounded functions
indexed by
. Suppose that
for some
and
. Then one of the following claims hold (with implied constants allowed to depend on
):
- (i) (Degree lowering) one has
.
- (ii) (Non-zero frequency) There exist
and non-zero
such that
There are more sophisticated versions of this argument in which the frequency is “minor arc” rather than “zero frequency”, and then the Gowers norms are localised to suitable large arithmetic progressions; this is implicit in the above-mentioned paper of Peluse and Prendiville.
Proof: One can write
and hence we conclude that
for a set of tuples
of density
. Applying Lemma 6 and Lemma 7, we see that for each such tuple, there exists
such that
where is drawn uniformly from
.
Let us adopt the convention that vanishes for
not in
, then from Lemma 5(ii) we have
where are independent random variables drawn uniformly from
and also independent of
. By repeated application of Lemma 5(iii) we then have
Expanding out and using Lemma 5(iv) repeatedly we conclude that
From definition of we then have
By Lemma 5(vi), we see that the left-hand side is less than
where is drawn uniformly from
, independently of
. By repeated application of Lemma 5(i), (v) repeatedly, we conclude that
where are independent copies of
that are also independent of
,
. By Lemma 5(ii) and Example 4 we conclude that
with probability .
The left-hand side can be rewritten as
where is the additive version of
, thus
Translating , we can simplify this a little to
If the frequency is ever non-vanishing in the event (9) then conclusion (ii) applies. We conclude that
with probability . In particular, by the pigeonhole principle, there exist
such that
with probability . Expanding this out, we obtain a representation of the form
holding with probability , where the
are functions that do not depend on the
coordinate. From (8) we conclude that
for of the tuples
. Thus by Lemma 5(ii)
By repeated application of Lemma 5(iii) we then have
and then by repeated application of Lemma 5(iv)
and then the conclusion (i) follows from Lemma 6.
As an application of degree lowering, we give an inverse theorem for the average in Proposition 1(ii), first established by Bourgain-Chang and later reproved by Peluse (by different methods from those given here):
Proposition 9 Let
be a cyclic group of prime order. Suppose that one has
-bounded functions
such that
for some
. Then either
, or one has
We remark that a modification of the arguments below also give .
Proof: The left-hand side of (10) can be written as
where is the dual function
By Cauchy-Schwarz one thus has
and hence by Proposition 1, we either have (in which case we are done) or
Writing with
, we conclude that either
, or that
for some and non-zero
. The left-hand side can be rewritten as
where and
. We can rewrite this in turn as
which is bounded by
where are independent random variables drawn uniformly from
. Applying Lemma 5(v), we conclude that
However, a routine Gauss sum calculation reveals that the left-hand side is for some absolute constant
because
is non-zero, so that
. The only remaining case to consider is when
Repeating the above arguments we then conclude that
and then
The left-hand side can be computed to equal , and the claim follows.
This argument was given for the cyclic group setting, but the argument can also be applied to the integers (see Peluse-Prendiville) and can also be used to establish an analogue over the reals (that was first obtained by Bourgain).
The National Academies of Sciences, Engineering, and Medicine have initiated a project on “Illustrating the Impact of the Mathematical Sciences“, in which various media will be produced to showcase how mathematics impacts the modern world. (I am serving on the committee for creating this media, which has been an interesting experience; the first time for instance that I have had to seriously interact with graphic designers.) One of the first products is a “webinar” series on the ten topics our committee have chosen to focus on, that is currently running weekly on Tuesdays. Last week I moderated the first such webinar, titled “From Solving to Seeing”, in which Profs. Gunther Uhlmann and Anna Gilbert presented ways in which inverse problems, compressed sensing, and other modern mathematical techniques have been used to obtain images (such as MRI images) that would not otherwise be accessible. Next week I will moderate another webinar, titled “Abstract Geometry, Concrete Impact”, in which Profs. Katherine Stange and Jordan Ellenberg will discuss how modern abstract geometries are used in modern applications such as cryptography. The full list of webinars and the latest information on the speakers can be found at this website. (Past webinars can be viewed directly from the web site; live webinars require a (free) registration, and offer the ability to submit text questions to the speakers via the moderator.)
We are currently in the process of designing posters (and possibly even a more interactive online resource) for each of the ten topics listed in the webinars; hopefully these will be available in a few months.
Recent Comments