You are currently browsing the monthly archive for January 2020.
I just heard the news that Louis Nirenberg died a few days ago, aged 94. Nirenberg made a vast number of contributions to analysis and PDE (and his work has come up repeatedly on my own blog); I wrote about his beautiful moving planes argument with Gidas and Ni to establish symmetry of ground states in this post on the occasion of him receiving the Chern medal, and on how his extremely useful interpolation inequality with Gagliardo (generalising a previous inequality of Ladyzhenskaya) can be viewed as an amplification of the usual Sobolev inequality in this post. Another fundamentally useful inequality of Nirenberg is the John-Nirenberg inequality established with Fritz John: if a (locally integrable) function (which for simplicity of exposition we place in one dimension) obeys the bounded mean oscillation property
for all intervals , where
is the average value of
on
, then one has exponentially good large deviation estimates
for all and some absolute constant
. This can be compared with Markov’s inequality, which only gives the far weaker decay
The point is that (1) is assumed to hold not just for a given interval , but also all subintervals of
, and this is a much more powerful hypothesis, allowing one for instance to use the standard Calderon-Zygmund technique of stopping time arguments to “amplify” (3) to (2). Basically, for any given interval
, one can use (1) and repeated halving of the interval
until significant deviation from the mean is encountered to locate some disjoint exceptional subintervals
where
deviates from
by
, with the total measure of the
being a small fraction of that of
(thanks to a variant of (3)), and with
staying within
of
at almost every point of
outside of these exceptional intervals. One can then establish (2) by an induction on
. (There are other proofs of this inequality also, e.g., one can use Bellman functions, as discussed in this old set of notes of mine.) Informally, the John-Nirenberg inequality asserts that functions of bounded mean oscillation are “almost as good” as bounded functions, in that they almost always stay within a bounded distance from their mean, and in fact the space BMO of functions of bounded mean oscillation ends up being superior to the space
of bounded measurable functions for many harmonic analysis purposes (among other things, the space is more stable with respect to singular integral operators).
I met Louis a few times in my career; even in his later years when he was wheelchair-bound, he would often come to conferences and talks, and ask very insightful questions at the end of the lecture (even when it looked like he was asleep during much of the actual talk!). I have a vague memory of him asking me some questions in one of the early talks I gave as a postdoc; I unfortunately do not remember exactly what the topic was (some sort of PDE, I think), but I was struck by how kindly the questions were posed, and how patiently he would listen to my excited chattering about my own work.
Define the Collatz map on the natural numbers
by setting
to equal
when
is odd and
when
is even, and let
denote the forward Collatz orbit of
. The notorious Collatz conjecture asserts that
for all
. Equivalently, if we define the backwards Collatz orbit
to be all the natural numbers
that encounter
in their forward Collatz orbit, then the Collatz conjecture asserts that
. As a partial result towards this latter statement, Krasikov and Lagarias in 2003 established the bound
for all and
. (This improved upon previous values of
obtained by Applegate and Lagarias in 1995,
by Applegate and Lagarias in 1995 by a different method,
by Wirsching in 1993,
by Krasikov in 1989,
by Sander in 1990, and some
by Crandall in 1978.) This is still the largest value of
for which (1) has been established. Of course, the Collatz conjecture would imply that we can take
equal to
, which is the assertion that a positive density set of natural numbers obeys the Collatz conjecture. This is not yet established, although the results in my previous paper do at least imply that a positive density set of natural numbers iterates to an (explicitly computable) bounded set, so in principle the
case of (1) could now be verified by an (enormous) finite computation in which one verifies that every number in this explicit bounded set iterates to
. In this post I would like to record a possible alternate route to this problem that depends on the distribution of a certain family of random variables that appeared in my previous paper, that I called Syracuse random variables.
Definition 1 (Syracuse random variables) For any natural number
, a Syracuse random variable
on the cyclic group
is defined as a random variable of the form
where
are independent copies of a geometric random variable
on the natural numbers with mean
, thus
} for
. In (2) the arithmetic is performed in the ring
.
Thus for instance
and so forth. After reversing the labeling of the , one could also view
as the mod
reduction of a
-adic random variable
The probability density function of the Syracuse random variable can be explicitly computed by a recursive formula (see Lemma 1.12 of my previous paper). For instance, when
,
is equal to
for
respectively, while when
,
is equal to
when respectively.
The relationship of these random variables to the Collatz problem can be explained as follows. Let denote the odd natural numbers, and define the Syracuse map
by
where the –valuation
is the number of times
divides
. We can define the forward orbit
and backward orbit
of the Syracuse map as before. It is not difficult to then see that the Collatz conjecture is equivalent to the assertion
, and that the assertion (1) for a given
is equivalent to the assertion
for all , where
is now understood to range over odd natural numbers. A brief calculation then shows that for any odd natural number
and natural number
, one has
where the natural numbers are defined by the formula
so in particular
Heuristically, one expects the -valuation
of a typical odd number
to be approximately distributed according to the geometric distribution
, so one therefore expects the residue class
to be distributed approximately according to the random variable
.
The Syracuse random variables will always avoid multiples of three (this reflects the fact that
is never a multiple of three), but attains any non-multiple of three in
with positive probability. For any natural number
, set
Equivalently, is the greatest quantity for which we have the inequality
for all integers not divisible by three, where
is the set of all tuples
for which
Thus for instance ,
, and
. On the other hand, since all the probabilities
sum to
as
ranges over the non-multiples of
, we have the trivial upper bound
There is also an easy submultiplicativity result:
Lemma 2 For any natural numbers
, we have
Proof: Let be an integer not divisible by
, then by (4) we have
If we let denote the set of tuples
that can be formed from the tuples in
by deleting the final component
from each tuple, then we have
with an integer not divisible by three. By definition of
and a relabeling, we then have
for all . For such tuples we then have
so that . Since
for each , the claim follows.
From this lemma we see that for some absolute constant
. Heuristically, we expect the Syracuse random variables to be somewhat approximately equidistributed amongst the multiples of
(in Proposition 1.4 of my previous paper I prove a fine scale mixing result that supports this heuristic). As a consequence it is natural to conjecture that
. I cannot prove this, but I can show that this conjecture would imply that we can take the exponent
in (1), (3) arbitrarily close to one:
Proposition 3 Suppose that
(that is to say,
as
). Then
as
, or equivalently
I prove this proposition below the fold. A variant of the argument shows that for any value of , (1), (3) holds whenever
, where
is an explicitly computable function with
as
. In principle, one could then improve the Krasikov-Lagarias result
by getting a sufficiently good upper bound on
, which is in principle achievable numerically (note for instance that Lemma 2 implies the bound
for any
, since
for any
).
Just a brief post to record some notable papers in my fields of interest that appeared on the arXiv recently.
- “A sharp square function estimate for the cone in
“, by Larry Guth, Hong Wang, and Ruixiang Zhang. This paper establishes an optimal (up to epsilon losses) square function estimate for the three-dimensional light cone that was essentially conjectured by Mockenhaupt, Seeger, and Sogge, which has a number of other consequences including Sogge’s local smoothing conjecture for the wave equation in two spatial dimensions, which in turn implies the (already known) Bochner-Riesz, restriction, and Kakeya conjectures in two dimensions. Interestingly, modern techniques such as polynomial partitioning and decoupling estimates are not used in this argument; instead, the authors mostly rely on an induction on scales argument and Kakeya type estimates. Many previous authors (including myself) were able to get weaker estimates of this type by an induction on scales method, but there were always significant inefficiencies in doing so; in particular knowing the sharp square function estimate at smaller scales did not imply the sharp square function estimate at the given larger scale. The authors here get around this issue by finding an even stronger estimate that implies the square function estimate, but behaves significantly better with respect to induction on scales.
- “On the Chowla and twin primes conjectures over
“, by Will Sawin and Mark Shusterman. This paper resolves a number of well known open conjectures in analytic number theory, such as the Chowla conjecture and the twin prime conjecture (in the strong form conjectured by Hardy and Littlewood), in the case of function fields where the field is a prime power
which is fixed (in contrast to a number of existing results in the “large
” limit) but has a large exponent
. The techniques here are orthogonal to those used in recent progress towards the Chowla conjecture over the integers (e.g., in this previous paper of mine); the starting point is an algebraic observation that in certain function fields, the Mobius function behaves like a quadratic Dirichlet character along certain arithmetic progressions. In principle, this reduces problems such as Chowla’s conjecture to problems about estimating sums of Dirichlet characters, for which more is known; but the task is still far from trivial.
- “Bounds for sets with no polynomial progressions“, by Sarah Peluse. This paper can be viewed as part of a larger project to obtain quantitative density Ramsey theorems of Szemeredi type. For instance, Gowers famously established a relatively good quantitative bound for Szemeredi’s theorem that all dense subsets of integers contain arbitrarily long arithmetic progressions
. The corresponding question for polynomial progressions
is considered more difficult for a number of reasons. One of them is that dilation invariance is lost; a dilation of an arithmetic progression is again an arithmetic progression, but a dilation of a polynomial progression will in general not be a polynomial progression with the same polynomials
. Another issue is that the ranges of the two parameters
are now at different scales. Peluse gets around these difficulties in the case when all the polynomials
have distinct degrees, which is in some sense the opposite case to that considered by Gowers (in particular, she avoids the need to obtain quantitative inverse theorems for high order Gowers norms; which was recently obtained in this integer setting by Manners but with bounds that are probably not strong enough to for the bounds in Peluse’s results, due to a degree lowering argument that is available in this case). To resolve the first difficulty one has to make all the estimates rather uniform in the coefficients of the polynomials
, so that one can still run a density increment argument efficiently. To resolve the second difficulty one needs to find a quantitative concatenation theorem for Gowers uniformity norms. Many of these ideas were developed in previous papers of Peluse and Peluse-Prendiville in simpler settings.
- “On blow up for the energy super critical defocusing non linear Schrödinger equations“, by Frank Merle, Pierre Raphael, Igor Rodnianski, and Jeremie Szeftel. This paper (when combined with two companion papers) resolves a long-standing problem as to whether finite time blowup occurs for the defocusing supercritical nonlinear Schrödinger equation (at least in certain dimensions and nonlinearities). I had a previous paper establishing a result like this if one “cheated” by replacing the nonlinear Schrodinger equation by a system of such equations, but remarkably they are able to tackle the original equation itself without any such cheating. Given the very analogous situation with Navier-Stokes, where again one can create finite time blowup by “cheating” and modifying the equation, it does raise hope that finite time blowup for the incompressible Navier-Stokes and Euler equations can be established… In fact the connection may not just be at the level of analogy; a surprising key ingredient in the proofs here is the observation that a certain blowup ansatz for the nonlinear Schrodinger equation is governed by solutions to the (compressible) Euler equation, and finite time blowup examples for the latter can be used to construct finite time blowup examples for the former.
Recent Comments