This is a blog version of a talk I recently gave at the IPAM workshop on “The Kakeya Problem, Restriction Problem, and Sum-product Theory”.
Note: the discussion here will be highly non-rigorous in nature, being extremely loose in particular with asymptotic notation and with the notion of dimension. Caveat emptor.
One of the most infamous unsolved problems at the intersection of geometric measure theory, incidence combinatorics, and real-variable harmonic analysis is the Kakeya set conjecture. We will focus on the following three-dimensional case of the conjecture, stated informally as follows:
Conjecture 1 (Kakeya conjecture) Let
be a subset of
that contains a unit line segment in every direction. Then
.
This conjecture is not precisely formulated here, because we have not specified exactly what type of set is (e.g. measurable, Borel, compact, etc.) and what notion of dimension we are using. We will deliberately ignore these technical details in this post. It is slightly more convenient for us here to work with lines instead of unit line segments, so we work with the following slight variant of the conjecture (which is essentially equivalent):
Conjecture 2 (Kakeya conjecture, again) Let
be a family of lines in
that meet
and contain a line in each direction. Let
be the union of the restriction
to
of every line
in
. Then
.
As the space of all directions in is two-dimensional, we thus see that
is an (at least) two-dimensional subset of the four-dimensional space of lines in
(actually, it lies in a compact subset of this space, since we have constrained the lines to meet
). One could then ask if this is the only property of
that is needed to establish the Kakeya conjecture, that is to say if any subset of
which contains a two-dimensional family of lines (restricted to
, and meeting
) is necessarily three-dimensional. Here we have an easy counterexample, namely a plane in
(passing through the origin), which contains a two-dimensional collection of lines. However, we can exclude this case by adding an additional axiom, leading to what one might call a “strong” Kakeya conjecture:
Conjecture 3 (Strong Kakeya conjecture) Let
be a two-dimensional family of lines in
that meet
, and assume the Wolff axiom that no (affine) plane contains more than a one-dimensional family of lines in
. Let
be the union of the restriction
of every line
in
. Then
.
Actually, to make things work out we need a more quantitative version of the Wolff axiom in which we constrain the metric entropy (and not just dimension) of lines that lie close to a plane, rather than exactly on the plane. However, for the informal discussion here we will ignore these technical details. Families of lines that lie in different directions will obey the Wolff axiom, but the converse is not true in general.
In 1995, Wolff established the important lower bound (for various notions of dimension, e.g. Hausdorff dimension) for sets
in Conjecture 3 (and hence also for the other forms of the Kakeya problem). However, there is a key obstruction to going beyond the
barrier, coming from the possible existence of half-dimensional (approximate) subfields of the reals
. To explain this problem, it easiest to first discuss the complex version of the strong Kakeya conjecture, in which all relevant (real) dimensions are doubled:
Conjecture 4 (Strong Kakeya conjecture over
) Let
be a four (real) dimensional family of complex lines in
that meet the unit ball
in
, and assume the Wolff axiom that no four (real) dimensional (affine) subspace contains more than a two (real) dimensional family of complex lines in
. Let
be the union of the restriction
of every complex line
in
. Then
has real dimension
.
The argument of Wolff can be adapted to the complex case to show that all sets occuring in Conjecture 4 have real dimension at least
. Unfortunately, this is sharp, due to the following fundamental counterexample:
Proposition 5 (Heisenberg group counterexample) Let
be the Heisenberg group
and let
be the family of complex lines
with
and
. Then
is a five (real) dimensional subset of
that contains every line in the four (real) dimensional set
; however each four real dimensional (affine) subspace contains at most a two (real) dimensional set of lines in
. In particular, the strong Kakeya conjecture over the complex numbers is false.
This proposition is proven by a routine computation, which we omit here. The group structure on is given by the group law
giving the structure of a
-step simply-connected nilpotent Lie group, isomorphic to the usual Heisenberg group over
. Note that while the Heisenberg group is a counterexample to the complex strong Kakeya conjecture, it is not a counterexample to the complex form of the original Kakeya conjecture, because the complex lines
in the Heisenberg counterexample do not point in distinct directions, but instead only point in a three (real) dimensional subset of the four (real) dimensional space of available directions for complex lines. For instance, one has the one real-dimensional family of parallel lines
with ; multiplying this family of lines on the right by a group element in
gives other families of parallel lines, which in fact sweep out all of
.
The Heisenberg counterexample ultimately arises from the “half-dimensional” (and hence degree two) subfield of
, which induces an involution
which can then be used to define the Heisenberg group
through the formula
Analogous Heisenberg counterexamples can also be constructed if one works over finite fields that contain a “half-dimensional” subfield
; we leave the details to the interested reader. Morally speaking, if
in turn contained a subfield of dimension
(or even a subring or “approximate subring”), then one ought to be able to use this field to generate a counterexample to the strong Kakeya conjecture over the reals. Fortunately, such subfields do not exist; this was a conjecture of Erdos and Volkmann that was proven by Edgar and Miller, and more quantitatively by Bourgain (answering a question of Nets Katz and myself). However, this fact is not entirely trivial to prove, being a key example of the sum-product phenomenon.
We thus see that to go beyond the dimension bound of Wolff for the 3D Kakeya problem over the reals, one must do at least one of two things:
- (a) Exploit the distinct directions of the lines in
in a way that goes beyond the Wolff axiom; or
- (b) Exploit the fact that
does not contain half-dimensional subfields (or more generally, intermediate-dimensional approximate subrings).
(The situation is more complicated in higher dimensions, as there are more obstructions than the Heisenberg group; for instance, in four dimensions quadric surfaces are an important obstruction, as discussed in this paper of mine.)
Various partial or complete results on the Kakeya problem over various fields have been obtained through route (a) or route (b). For instance, in 2000, Nets Katz, Izabella Laba and myself used route (a) to improve Wolff’s lower bound of for Kakeya sets very slightly to
(for a weak notion of dimension, namely upper Minkowski dimension). In 2004, Bourgain, Katz, and myself established a sum-product estimate which (among other things) ruled out approximate intermediate-dimensional subrings of
, and then pursued route (b) to obtain a corresponding improvement
to the Kakeya conjecture over finite fields of prime order. The analogous (discretised) sum-product estimate over the reals was established by Bourgain in 2003, which in principle would allow one to extend the result of Katz, Laba and myself to the strong Kakeya setting, but this has not been carried out in the literature. Finally, in 2009, Dvir used route (a) and introduced the polynomial method (as discussed previously here) to completely settle the Kakeya conjecture in finite fields.
Below the fold, I present a heuristic argument of Nets Katz and myself, which in principle would use route (b) to establish the full (strong) Kakeya conjecture. In broad terms, the strategy is as follows:
- Assume that the (strong) Kakeya conjecture fails, so that there are sets
of the form in Conjecture 3 of dimension
for some
. Assume that
is “optimal”, in the sense that
is as large as possible.
- Use the optimality of
(and suitable non-isotropic rescalings) to establish strong forms of standard structural properties expected of such sets
, namely “stickiness”, “planiness”, “local graininess” and “global graininess” (we will roughly describe these properties below). Heuristically, these properties are constraining
to “behave like” a putative Heisenberg group counterexample.
- By playing all these structural properties off of each other, show that
can be parameterised locally by a one-dimensional set which generates a counterexample to Bourgain’s sum-product theorem. This contradiction establishes the Kakeya conjecture.
Nets and I have had an informal version of argument for many years, but were never able to make a satisfactory theorem (or even a partial Kakeya result) out of it, because we could not rigorously establish anywhere near enough of the necessary structural properties (stickiness, planiness, etc.) on the optimal set for a large number of reasons (one of which being that we did not have a good notion of dimension that did everything that we wished to demand of it). However, there is beginning to be movement in these directions (e.g. in this recent result of Guth using the polynomial method obtaining a weak version of local graininess on certain Kakeya sets). In view of this (and given that neither Nets or I have been actively working in this direction for some time now, due to many other projects), we’ve decided to distribute these ideas more widely than before, and in particular on this blog.
— 1. Stickiness —
Let be a set of the form in Conjecture 3 that has a “minimal dimension”
for some
, whatever that means. (If the infimal dimension cannot be attained exactly, it may be enough to work with a set
whose dimension is sufficiently close to the infimum.)
The first step – and, somewhat annoyingly, the one for which we have been least able to make rigorous – is to establish a property on the family of lines that is known as “stickiness”, which roughly speaking asserts that each line in
is close to as many other lines in
as is permitted by dimensionality considerations. Indeed, if one places a suitable metric on the space of lines in
(or more precisely, on the compact portion of that space consisting of lines that meet
), then stickiness asserts that
is “genuinely two-dimensional” in the sense that for any scale
,
should be covered by about
balls of radius
(where we ignore all logarithmic factors in the scale, and also pretend for simplicity that all sets under consideration are roughly “uniform” in the sense that the multiplicities and other combinatorial statistics of the set are essentially constant across the set; there are standard techniques such as dyadic pigeonholing that often allow one to reduce to this case, but we will ignore these technical issues here). In particular, if one has two scales
, then each of the
-balls covering
should essentially contain about
of the
-balls covering
.
In my previous paper with Nets and Izabella, we were able to establish such stickiness for sets of dimension close to the Wolff exponent
by using an x-ray estimate established in a separate paper of Wolff, but we do not know how to establish stickiness more generally, and I view this as the largest single obstacle to making this program to attack the Kakeya conjecture a success.
Translating the stickiness property back into the physical space , we find that the
-neighbourhood
of
should consist of the union of about
-tubes
(that is, solid cylinders of length
and radius
), and similarly the larger
-neighbourhood
should consist of the union of about
-tubes
. We’ll refer to the
-tubes
as “thin tubes” and
-tubes
as “fat” tubes. Each thin tube
should lie in essentially one parent fat tube
, and conversely each fat tube
should essentially contain about
thin tubes
.
This creates a somewhat self-similar structure to the set as follows. Let
denote the collection of fat tubes. Then for each fat tube
, we can form the subset
of
by taking the union of all the thin tubes inside of
, thus we have the decomposition
This is very much not a disjoint union; indeed, the strong overlap of the is going to be an important source for the other structural properties we will suppose
to have.
AS has dimension
, we expect
to be essentially the union of about
-cubes
, and
to be the union of about
-cubes
. Thus, each large cube
should contain about
small cubes
. This in turn implies that each fat tube
should contain about
small cubes, and so we expect the volume
of
to be
Now consider the set , which is a subset of
, so that
The set is the union of about
-tubes inside a
-tube. If we perform a non-isotropic scaling to rescale a
-tube to become a
-tube, then the
-tubes rescale to be
-tubes, and we obtain a rescaled set
with
This set is the union of about
-tubes obeying the Wolff axiom, and so by definition of
should have volume at least
:
The estimates (1)–(4) fit together in a perfect circle, forcing all of the inequalities to be approximate equalities; thus we heuristically have that to be an optimal Kakeya configuration (much like
or
), and to have
for every fat tube . Informally, we have an affine self-similar structure: the portion of
inside every
-tube is a rescaled version of
.
Example 1 Let us return to the complex setting, and take
to be the Heisenberg group
(with
), so that
The Heisenberg group contains the line
, so the set
would essentially be a
-tube
in
, and
Applying the rescaling
we then have
so in this case we have essentially perfect self-similarity:
— 2. Planiness and local graininess —
The next stage is fairly well understood, being mapped out in my previous paper with Nets and Izabella. We now take . From (5) we know that if a
-cube
lies in a
-tube
, then
On the other hand, with the choice , elementary geometry reveals that
is essentially the union of
-tubes oriented to be parallel to
. We conclude that
resembles the union of
-tubes oriented parallel to
. But if
is another
-tube passing through
, then
must also resemble
-tubes oriented parallel to
. If
are widely separated in angle (which we can morally ensure through a “bilinear reduction”, which we will not discuss here), this forces the following local graininess property: for every
-cube,
resembles the union of
parallel slabs (or “local grains”); since
contains about
-cubes, we see that the number of such slabs is about
.
Local graininess also forces coarse-scale planiness: the -tubes passing through
must all essentially lie within
of a common plane, parallel to the plane in which the local grains are oriented. The reason for this is that if the tubes were any less coplanar than this, then there would be local graininess in more than one direction, and this would start forcing
to contain cubes that are much larger than
in size, contradicting the
-dimensionality of
.
Rescaling to be
, we then conclude fine-scale planiness: the
-tubes passing through a
-cube in
all essentially lie within
of a common plane.
Example 2 We return to the complex Heisenberg group from the previous example. The intersection of
with the (complex)
-cube
takes the form
which is the union of about
![]()
(complex) slabs of the form
where
ranges over a
-separated subset of the real interval
. This illustrates the local graininess of the Heisenberg group in one cube
; local graininess in other cubes can then be established by applying the group law (or by direct computation).
Similarly, the lines in the Heisenberg group passing through the origin
take the form
with
, which all lie in the complex plane
. Thus, (many of) the
-tubes passing through the
-cube
take the form
for some real
, and these lie within
of the same complex plane, exhibiting planiness at scale
(and similarly at other scales, such as
); note also the relevant plane
is parallel to the slabs (8). Again, by the group law, a similar geometry holds at other points of
.
Remark 1 One should also be able to establish a weak form of planiness via the multilinear Kakeya estimate of Bennett, Carbery, and myself. Recently, Guth has used the polynomial method to also establish a weak form of local graininess. One may hope that a sufficient development of these new techniques may reduce the need to establish strong stickiness properties in this argument, which is currently the largest missing technical piece needed to make the argument rigorous.
— 3. Global graininess —
By applying a non-isotropic rescaling, we can add a further structural property to the stickiness, planiness, and graininess properties already obtained. Namely, take again and consider a single
-tube
in
, which we will normalise to be
Applying the rescaling , we obtain a rescaled set
, which as discussed before obeys similar properties to
, and in particular should continue to have the sticky, plany, and grainy properties that the original set
had. Now consider a cube
in
, thus
for some . By local graininess,
is the union of
slabs oriented in some plane parallel to
; thus we have some slope
with the property that
is the union of about
sets of the form
for various . Rescaling this, we conclude that the horizontal slice
of is the union of about
rectangles of the form
for various . We refer to this property as global graininess at scale
. Relabeling
as
again, we arrive at a set
that obeys the same stickiness, planiness, and local graininess property as
, but also obeys the global graininess property that for every
, the slice
is the union of about rectangles (or “global grains”) of the form
for various , and some function
. Some multiscale analysis lets one establish some additional “stickiness” properties on
, which roughly speaking means that
behaves like a mostly Lipschitz function, but we will not discuss this further here. We also note that the local and global grains have to be compatible, in the sense that the local grains are parallel to the global grains when they meet, for reasons similar to the compatibility of the planes and local grains mentioned previously. In particular, the planes must also be compatible with the global grains.
Note that we have broken the rotational symmetry of the Kakeya problem here by performing a rescaling that privileges the vertical axis over the horizontal
directions; one cannot expect to have global graininess in every orientation, only in an orientation that one designates to be the horizontal.
Example 3 We continue the running example of the Heisenberg group. Due to the perfect self-similarity (7), we essentially have
, so that
already has the desired global graininess property. And indeed, from (6) we see that for any
, the set
is of the form
and is thus the union of about
complex rectangles of the form
with
purely imaginary and
. In particular,
, the global grains are oriented along the
axis, which is compatible with the planes and local grains at
, which are oriented along the
plane. Similarly at other locations on the Heisenberg group, thanks to the group law.
— 4. Reduction to a 2D structure —
Now we put together all the various structures that we have. Consider a single global grain of , which we normalise to be at the slice
and to be the rectangle
so that . We will work in a
-neighbourhood of this grain, namely a “fat grain”
By global graininess, the slice of
takes the form
for some set in the real line which is the union of
-intervals. Meanwhile local graininess forces the
slice
of to be the union of parallel
-rectangles aligned with the
-axis, which we can renormalise (by a shear respecting the
slice) to be vertically oriented; to be compatible with (9), we see that this slice must be equal to
Now we consider a horizontal slice
of for some
. By global graininess (and stickiness of
), this slice is the union of rectangles of the form
for various , where
is a rescaled version of
(and in particular should have similar stickiness properties to
). To be compatible with (10), we conclude that this slice must in fact essentially take the form
Finally, we consider a vertical slice
of for some
. By local graininess, this slice is the union of rectangles of the form
for some slope function , which one can argue to also be sticky. In order to be compatible with (9), this slice must in fact essentially take the form
For this to be compatible with (11), we obtain the crucial “zero-curvature” property
Example 4 In the running example of the Heisenberg group,
is the “half-dimensional set”
and
are complex conjugation, so that
and
are bounded subsets of the imaginary axis.
We can write (12) as a two-dimensional assertion
where are the one-dimensional graphs in
given by
and
and
with denoting the dot product on
. Recall that
is basically the
-neighbourhood of a
-dimensional set.
— 5. Reduction to a 1D structure —
We have just eliminated one dimension from the three-dimensional Kakeya problem. Now we eliminate one more dimension, to reduce to a one-dimensional problem.
The sets are one-dimensional. Viewed in polar coordinates, we can (heuristically, at least) assume that for some
, there is a
-dimensional family of rays through the origin, such that
meets each such ray in a
-dimensional set. Standard projection theorems (or double counting) then tell us that for a typical ray in this
-dimensional family, the projection of the one-dimensional set
to such a ray should have dimension at least
. Because of this, we can conclude that
should contain a product set
, where
are bounded non-empty subsets of
of dimensions
and
respectively. If
is too close to zero or one, this already makes
of dimension larger than
, a contradiction, so we may assume that
is bounded away from zero and one. Dropping the discretisations and informally moving back to the continuous setting, we thus have the one-dimensional setup
with
and
— 6. Contradicting the sum-product theorem —
Finally, we use some additive combinatorics manipulations (inspired by some finite field arguments of Bourgain) to reduce to a setting ruled out by Bourgain’s sum-product theorem.
From iterating (13), we expect all iterated sumsets to have dimension bounded above by
, and thus bounded away from
. Stabilising this, we thus expect
to be contained in (a translate of) an approximate group
in the real line with
Ignoring the translate, we thus see that is contained in
for all
(removing those portions of
that are too close to zero, if needed).
Now let be a probability measure supported on
, then the Fourier transform
should be essentially one on the set
, where
is some sort of dual approximate group to
; thus
should be about
on
, which is a set of dimension at least
. On the other hand, Plancherel’s theorem tells us that the spectrum
has dimension at most
. Thus we have
for any . Iterating this, we conclude that iterated product sets
should be contained inside a dilate of
. On the other hand, we also expect
be approximately closed under addition, and so the iterated sum-product sets
should also be contained inside a dilate of
and thus have dimension bounded away from zero. In particular, these iterated sum-product sets must stabilise at some set
with
and
; but this more or less directly contradicts the sum-product theorem of Bourgain.
9 comments
Comments feed for this article
7 May, 2014 at 7:16 pm
Fan
“In view of this (and given that neither Nets or I have been actively working in this direction for some time now, due to many other projects), we’ve decided to distribute these ideas more widely than before, and in particular on this blog. ” Maybe on polymath?
8 May, 2014 at 5:59 pm
David Roberts
Fan,
I was thinking the same, but perhaps someone else would take it up, if Terry doesn’t.
11 May, 2014 at 1:33 pm
Anonymous
I’m definitely a novice here, but I have a thought for a project in this direction that might be amenable to the polymath approach. Namely, we may be able to see what happens if we go through the reasoning in this post, and take stock of what could be useful to improve the Katz-Laba-Tao result. For example:
What kind of improvements to stickiness estimates would be useful? How might Guth’s “local graininess” arguments help?
If it’s not clear how Guth’s results help, can we at least work with a plausible strengthening of them?
The idea being that instead of actually trying to “make progress”, we can use the arguments explained here to present a new set of directions for work on the Kakeya problem. Maybe this is a project best pursued by individual researchers, or best not pursued at all, but I thought I’d sketch an idea of what a polymath might look like.
9 May, 2014 at 9:18 am
Anonymous
I think a Polymath about this problem is a great idea.
10 May, 2014 at 6:47 am
Mark Lewko
What are the obstructions to carrying out an argument in this spirit to obtain something close to the full 3-d Kakeya conjecture in finite fields (as opposed to “just” the Bourgain-Katz-Tao $5/2+\delta$ estimate)? It seems that many of the multi-scale issues (such as those related to stickiness which you consider the most serious obstruction) should be absent in that setting.
10 May, 2014 at 9:00 am
Terence Tao
It’s precisely the absence of multi-scale analysis that makes this entire line of argument useless in finite fields; what we are doing here is basically to repeatedly rescale or “Taylor expand” the Kakeya set until one arrives at a quadratic structure, namely (12) or (13). As far as I can tell, no analogue of these rescalings are available in finite fields (except possibly in the case of finite fields with lots and lots of subfields, e.g.
, although I haven’t thought too carefully about this).
10 May, 2014 at 3:58 pm
murphmath
A p-adic variant might be a reasonable alternative. Thickened lines over Z_p correspond to “lines” over Z/p^k. In this setting the combinatorics are very nice. For instance, if you consider the quotient map from (Z/p^2)^3 to (Z/p)^3, lines over Z/p lift to “fat lines” over Z/p^2, with each (“thin”) line over Z/p^2 belonging to a unique “fat line”. Rescaling by 1/p turns the family of thin lines belonging to a single fat line into a family of lines over Z/p.
Since there are good incidence theorems available over Z/p, you might be able to get good stickiness results (e.g. I(P,L)<|P|^{1/3}|L| under "true 3D" conditions on L is in Guth's class notes, which is like a strong x-ray estimate; this was proven by Elekes, Kaplan, and Sharir over R, but they use the Szemeredi-Trotter theorem in part of the argument, while Guth just uses degree reduction).
16 January, 2018 at 7:35 pm
Anton Rasmussen
Just wanted to post that the link to the video was broken; but, I was able to hunt it down via WebArchive to the following link:https://ipam.wistia.com/medias/hhsl7oam61
8 February, 2020 at 10:43 pm
The largest clique in the Paley Graph: unexpected significant progress and surprising connections. | Combinatorics and more
[…] Of course, the sum-product phenomena is a huge area with amazing applications (See, here, here, here, and here; I recall that Jean Bourgain referred to it as a gold mine or something like that). Now, […]