You are currently browsing the category archive for the ‘math.CO’ category.
Ben Green and I have just uploaded to the arXiv our paper “New bounds for Szemeredi’s theorem, Ia: Progressions of length 4 in finite field geometries revisited“, submitted to Proc. Lond. Math. Soc.. This is both an erratum to, and a replacement for, our previous paper “New bounds for Szemeredi’s theorem. I. Progressions of length 4 in finite field geometries“. The main objective in both papers is to bound the quantity for a vector space
over a finite field
of characteristic greater than
, where
is defined as the cardinality of the largest subset of
that does not contain an arithmetic progression of length
. In our earlier paper, we gave two arguments that bounded
in the regime when the field
was fixed and
was large. The first “cheap” argument gave the bound
and the more complicated “expensive” argument gave the improvement
depending only on
.
Unfortunately, while the cheap argument is correct, we discovered a subtle but serious gap in our expensive argument in the original paper. Roughly speaking, the strategy in that argument is to employ the density increment method: one begins with a large subset of
that has no arithmetic progressions of length
, and seeks to locate a subspace on which
has a significantly increased density. Then, by using a “Koopman-von Neumann theorem”, ultimately based on an iteration of the inverse
theorem of Ben and myself (and also independently by Samorodnitsky), one approximates
by a “quadratically structured” function
, which is (locally) a combination of a bounded number of quadratic phase functions, which one can prepare to be in a certain “locally equidistributed” or “locally high rank” form. (It is this reduction to the high rank case that distinguishes the “expensive” argument from the “cheap” one.) Because
has no progressions of length
, the count of progressions of length
weighted by
will also be small; by combining this with the theory of equidistribution of quadratic phase functions, one can then conclude that there will be a subspace on which
has increased density.
The error in the paper was to conclude from this that the original function also had increased density on the same subspace; it turns out that the manner in which
approximates
is not strong enough to deduce this latter conclusion from the former. (One can strengthen the nature of approximation until one restores such a conclusion, but only at the price of deteriorating the quantitative bounds on
one gets at the end of the day to be worse than the cheap argument.)
After trying unsuccessfully to repair this error, we eventually found an alternate argument, based on earlier papers of ourselves and of Bergelson-Host-Kra, that avoided the density increment method entirely and ended up giving a simpler proof of a stronger result than (1), and also gives the explicit value of for the exponent
in (1). In fact, it gives the following stronger result:
Theorem 1 Let
be a subset of
of density at least
, and let
. Then there is a subspace
of
of codimension
such that the number of (possibly degenerate) progressions
in
is at least
.
The bound (1) is an easy consequence of this theorem after choosing and removing the degenerate progressions from the conclusion of the theorem.
The main new idea is to work with a local Koopman-von Neumann theorem rather than a global one, trading a relatively weak global approximation to with a significantly stronger local approximation to
on a subspace
. This is somewhat analogous to how sometimes in graph theory it is more efficient (from the point of view of quantative estimates) to work with a local version of the Szemerédi regularity lemma which gives just a single regular pair of cells, rather than attempting to regularise almost all of the cells. This local approach is well adapted to the inverse
theorem we use (which also has this local aspect), and also makes the reduction to the high rank case much cleaner. At the end of the day, one ends up with a fairly large subspace
on which
is quite dense (of density
) and which can be well approximated by a “pure quadratic” object, namely a function of a small number of quadratic phases obeying a high rank condition. One can then exploit a special positivity property of the count of length four progressions weighted by pure quadratic objects, essentially due to Bergelson-Host-Kra, which then gives the required lower bound.
A few days ago, Endre Szemerédi was awarded the 2012 Abel prize “for his fundamental contributions to discrete mathematics and theoretical computer science, and in recognition of the profound and lasting impact of these contributions on additive number theory and ergodic theory.” The full citation for the prize may be found here, and the written notes for a talk given by Tim Gowers on Endre’s work at the announcement may be found here (and video of the talk can be found here).
As I was on the Abel prize committee this year, I won’t comment further on the prize, but will instead focus on what is arguably Endre’s most well known result, namely Szemerédi’s theorem on arithmetic progressions:
Theorem 1 (Szemerédi’s theorem) Let
be a set of integers of positive upper density, thus
, where
. Then
contains an arithmetic progression of length
for any
.
Szemerédi’s original proof of this theorem is a remarkably intricate piece of combinatorial reasoning. Most proofs of theorems in mathematics – even long and difficult ones – generally come with a reasonably compact “high-level” overview, in which the proof is (conceptually, at least) broken down into simpler pieces. There may well be technical difficulties in formulating and then proving each of the component pieces, and then in fitting the pieces together, but usually the “big picture” is reasonably clear. To give just one example, the overall strategy of Perelman’s proof of the Poincaré conjecture can be briefly summarised as follows: to show that a simply connected three-dimensional manifold is homeomorphic to a sphere, place a Riemannian metric on it and perform Ricci flow, excising any singularities that arise by surgery, until the entire manifold becomes extinct. By reversing the flow and analysing the surgeries performed, obtain enough control on the topology of the original manifold to establish that it is a topological sphere.
In contrast, the pieces of Szemerédi’s proof are highly interlocking, particularly with regard to all the epsilon-type parameters involved; it takes quite a bit of notational setup and foundational lemmas before the key steps of the proof can even be stated, let alone proved. Szemerédi’s original paper contains a logical diagram of the proof (reproduced in Gowers’ recent talk) which already gives a fair indication of this interlocking structure. (Many years ago I tried to present the proof, but I was unable to find much of a simplification, and my exposition is probably not that much clearer than the original text.) Even the use of nonstandard analysis, which is often helpful in cleaning up armies of epsilons, turns out to be a bit tricky to apply here. (In typical applications of nonstandard analysis, one can get by with a single nonstandard universe, constructed as an ultrapower of the standard universe; but to correctly model all the epsilons occuring in Szemerédi’s argument, one needs to repeatedly perform the ultrapower construction to obtain a (finite) sequence of increasingly nonstandard (and increasingly saturated) universes, each one containing unbounded quantities that are far larger than any quantity that appears in the preceding universe, as discussed at the end of this previous blog post. This sequence of universes does end up concealing all the epsilons, but it is not so clear that this is a net gain in clarity for the proof; I may return to the nonstandard presentation of Szemeredi’s argument at some future juncture.)
Instead of trying to describe the entire argument here, I thought I would instead show some key components of it, with only the slightest hint as to how to assemble the components together to form the whole proof. In particular, I would like to show how two particular ingredients in the proof – namely van der Waerden’s theorem and the Szemerédi regularity lemma – become useful. For reasons that will hopefully become clearer later, it is convenient not only to work with ordinary progressions , but also progressions of progressions
, progressions of progressions of progressions, and so forth. (In additive combinatorics, these objects are known as generalised arithmetic progressions of rank one, two, three, etc., and play a central role in the subject, although the way they are used in Szemerédi’s proof is somewhat different from the way that they are normally used in additive combinatorics.) Very roughly speaking, Szemerédi’s proof begins by building an enormous generalised arithmetic progression of high rank containing many elements of the set
(arranged in a “near-maximal-density” configuration), and then steadily prunes this progression to improve the combinatorial properties of the configuration, until one ends up with a single rank one progression of length
that consists entirely of elements of
.
To illustrate some of the basic ideas, let us first consider a situation in which we have located a progression of progressions of length
, with each progression
,
being quite long, and containing a near-maximal amount of elements of
, thus
where is the “maximal density” of
along arithmetic progressions. (There are a lot of subtleties in the argument about exactly how good the error terms are in various approximations, but we will ignore these issues for the sake of this discussion and just use the imprecise symbols such as
instead.) By hypothesis,
is positive. The objective is then to locate a progression
in
, with each
in
for
. It may help to view the progression of progressions
as a tall thin rectangle
.
If we write for
, then the problem is equivalent to finding a (possibly degenerate) arithmetic progression
, with each
in
.
By hypothesis, we know already that each set has density about
in
:
, which roughly speaking asserts that
of
of density
of a certain form to be specified shortly. This is a plausible type of assumption if one believes
to behave like a random set, and if the sets
are constructed “independently” of the
in some sense. Of course, we do not expect such an assumption to be valid all of the time, but we will postpone consideration of this point until later. Let us now see how this sort of weakly mixing hypothesis could help one count progressions
of the desired form.
We will inductively consider the following (nonrigorously defined) sequence of claims for each
:
-
: For most choices of
, there are
arithmetic progressions
in
with the specified choice of
, such that
for all
.
(Actually, to avoid boundary issues one should restrict to lie in the middle third of
, rather than near the edges, but let us ignore this minor technical detail.) The quantity
is natural here, given that there are
arithmetic progressions
in
that pass through
in the
position, and that each one ought to have a probability of
or so that the events
simultaneously hold.) If one has the claim
, then by selecting a typical
in
, we obtain a progression
with
for all
, as required. (In fact, we obtain about
such progressions by this method.)
We can heuristically justify the claims by induction on
. For
, the claims
are clear just from direct counting of progressions (as long as we keep
away from the edges of
). Now suppose that
, and the claims
have already been proven. For any
and for most
, we have from hypothesis that there are
progressions
in
through
with
. Let
be the set of all the values of
attained by these progressions, then
. Invoking the weak mixing hypothesis, we (heuristically, at least) conclude that for most choices of
, we have
which then gives the desired claim .
The observant reader will note that we only needed the claim in the case
for the above argument, but for technical reasons, the full proof requires one to work with more general values of
(also the claim
needs to be replaced by a more complicated version of itself, but let’s ignore this for sake of discussion).
We now return to the question of how to justify the weak mixing hypothesis (2). For a single block of
, one can easily concoct a scenario in which this hypothesis fails, by choosing
to overlap with
too strongly, or to be too disjoint from
. However, one can do better if one can select
from a long progression of blocks. The starting point is the following simple double counting observation that gives the right upper bound:
Proposition 2 (Single upper bound) Let
be a progression of progressions
for some large
. Suppose that for each
, the set
has density
in
(i.e. (1) holds). Let
be a subset of
of density
. Then (if
is large enough) one can find an
such that
Proof: The key is the double counting identity
Because has maximal density
and
is large, we have
for each , and thus
The claim then follows from the pigeonhole principle.
Now suppose we want to obtain weak mixing not just for a single set , but for a small number
of such sets, i.e. we wish to find an
for which
, where
is the density of
in
. The above proposition gives, for each
, a choice of
for which (3) holds, but it could be a different
for each
, and so it is not immediately obvious how to use Proposition 2 to find an
for which (3) holds simultaneously for all
. However, it turns out that the van der Waerden theorem is the perfect tool for this amplification:
Proposition 3 (Multiple upper bound) Let
be a progression of progressions
for some large
. Suppose that for each
, the set
has density
in
(i.e. (1) holds). For each
, let
be a subset of
of density
. Then (if
is large enough depending on
) one can find an
such that
simultaneously for all
.
Proof: Suppose that the claim failed (for some suitably large ). Then, for each
, there exists
such that
This can be viewed as a colouring of the interval by
colours. If we take
large compared to
, van der Waerden’s theorem allows us to then find a long subprogression of
which is monochromatic, so that
is constant on this progression. But then this will furnish a counterexample to Proposition 2.
One nice thing about this proposition is that the upper bounds can be automatically upgraded to an asymptotic:
Proposition 4 (Multiple mixing) Let
be a progression of progressions
for some large
. Suppose that for each
, the set
has density
in
(i.e. (1) holds). For each
, let
be a subset of
of density
. Then (if
is large enough depending on
) one can find an
such that
simultaneously for all
.
Proof: By applying the previous proposition to the collection of sets and their complements
(thus replacing
with
, one can find an
for which
and
which gives the claim.
However, this improvement of Proposition 2 turns out to not be strong enough for applications. The reason is that the number of sets
for which mixing is established is too small compared with the length
of the progression one has to use in order to obtain that mixing. However, thanks to the magic of the Szemerédi regularity lemma, one can amplify the above proposition even further, to allow for a huge number of
to be mixed (at the cost of excluding a small fraction of exceptions):
Proposition 5 (Really multiple mixing) Let
be a progression of progressions
for some large
. Suppose that for each
, the set
has density
in
(i.e. (1) holds). For each
in some (large) finite set
, let
be a subset of
of density
. Then (if
is large enough, but not dependent on the size of
) one can find an
such that
simultaneously for almost all
.
Proof: We build a bipartite graph connecting the progression
to the finite set
by placing an edge
between an element
and an element
whenever
. The number
can then be interpreted as the degree of
in this graph, while the number
is the number of neighbours of
that land in
.
We now apply the regularity lemma to this graph . Roughly speaking, what this lemma does is to partition
and
into almost equally sized cells
and
such that for most pairs
of cells, the graph
resembles a random bipartite graph of some density
between these two cells. The key point is that the number
of cells here is bounded uniformly in the size of
and
. As a consequence of this lemma, one can show that for most vertices
in a typical cell
, the number
is approximately equal to
and the number is approximately equal to
The point here is that the different statistics
are now controlled by a mere
statistics
(this is not unlike the use of principal component analysis in statistics, incidentally, but that is another story). Now, we invoke Proposition 4 to find an
for which
simultaneously for all , and the claim follows.
This proposition now suggests a way forward to establish the type of mixing properties (2) needed for the preceding attempt at proving Szemerédi’s theorem to actually work. Whereas in that attempt, we were working with a single progression of progressions of progressions containing a near-maximal density of elements of
, we will now have to work with a family
of such progression of progressions, where
ranges over some suitably large parameter set. Furthermore, in order to invoke Proposition 5, this family must be “well-arranged” in some arithmetic sense; in particular, for a given
, it should be possible to find many reasonably large subfamilies of this family for which the
terms
of the progression of progressions in this subfamily are themselves in arithmetic progression. (Also, for technical reasons having to do with the fact that the sets
in Proposition 5 are not allowed to depend on
, one also needs the progressions
for any given
to be “similar” in the sense that they intersect
in the same fashion (thus the sets
as
varies need to be translates of each other).) If one has this sort of family, then Proposition 5 allows us to “spend” some of the degrees of freedom of the parameter set
in order to gain good mixing properties for at least one of the sets
in the progression of progressions.
Of course, we still have to figure out how to get such large families of well-arranged progressions of progressions. Szemerédi’s solution was to begin by working with generalised progressions of a much larger rank than the rank
progressions considered here; roughly speaking, to prove Szemerédi’s theorem for length
progressions, one has to consider generalised progressions of rank as high as
. It is possible by a reasonably straightforward (though somewhat delicate) “density increment argument” to locate a huge generalised progression of this rank which is “saturated” by
in a certain rather technical sense (related to the concept of “near maximal density” used previously). Then, by another reasonably elementary argument, it is possible to locate inside a suitable large generalised progression of some rank
, a family of large generalised progressions of rank
which inherit many of the good properties of the original generalised progression, and which have the arithmetic structure needed for Proposition 5 to be applicable, at least for one value of
. (But getting this sort of property for all values of
simultaneously is tricky, and requires many careful iterations of the above scheme; there is also the problem that by obtaining good behaviour for one index
, one may lose good behaviour at previous indices, leading to a sort of “Tower of Hanoi” situation which may help explain the exponential factor in the rank
that is ultimately needed. It is an extremely delicate argument; all the parameters and definitions have to be set very precisely in order for the argument to work at all, and it is really quite remarkable that Endre was able to see it through to the end.)
This is an addendum to last quarter’s course notes on Hilbert’s fifth problem, which I am in the process of reviewing in order to transcribe them into a book (as was done similarly for several other sets of lecture notes on this blog). When reviewing the zeroth set of notes in particular, I found that I had made a claim (Proposition 11 from those notes) which asserted, roughly speaking, that any sufficiently large nilprogression was an approximate group, and promised to prove it later in the course when we had developed the ability to calculate efficiently in nilpotent groups. As it turned out, I managed finish the course without the need to develop these calculations, and so the proposition remained unproven. In order to rectify this, I will use this post to lay out some of the basic algebra of nilpotent groups, and use it to prove the above proposition, which turns out to be a bit tricky. (In my paper with Breuillard and Green, we avoid the need for this proposition by restricting attention to a special type of nilprogression, which we call a nilprogression in -normal form, for which the computations are simpler.)
There are several ways to think about nilpotent groups; for instance one can use the model example of the Heisenberg group
over an arbitrary ring (which need not be commutative), or more generally any matrix group consisting of unipotent upper triangular matrices, and view a general nilpotent group as being an abstract generalisation of such concrete groups. (In the case of nilpotent Lie groups, at least, this is quite an accurate intuition, thanks to Engel’s theorem.) Or, one can adopt a Lie-theoretic viewpoint and try to think of nilpotent groups as somehow arising from nilpotent Lie algebras; this intuition is rigorous when working with nilpotent Lie groups (at least when the characteristic is large, in order to avoid issues coming from the denominators in the Baker-Campbell-Hausdorff formula), but also retains some conceptual value in the non-Lie setting. In particular, nilpotent groups (particularly finitely generated ones) can be viewed in some sense as “nilpotent Lie groups over
“, even though Lie theory does not quite work perfectly when the underlying scalars merely form an integral domain instead of a field.
Another point of view, which arises naturally both in analysis and in algebraic geometry, is to view nilpotent groups as modeling “infinitesimal” perturbations of the identity, where the infinitesimals have a certain finite order. For instance, given a (not necessarily commutative) ring without identity (representing all the “small” elements of some larger ring or algebra), we can form the powers
for
, defined as the ring generated by
-fold products
of elements
in
; this is an ideal of
which represents the elements which are “
order” in some sense. If one then formally adjoins an identity
onto the ring
, then for any
, the multiplicative group
is a nilpotent group of step at most
. For instance, if
is the ring of strictly upper
matrices (over some base ring), then
vanishes and
becomes the group of unipotent upper triangular matrices over the same ring, thus recovering the previous matrix-based example. In analysis applications,
might be a ring of operators which are somehow of “order”
or
for some small parameter
or
, and one wishes to perform Taylor expansions up to order
or
, thus discarding (i.e. quotienting out) all errors in
.
From a dynamical or group-theoretic perspective, one can also view nilpotent groups as towers of central extensions of a trivial group. Finitely generated nilpotent groups can also be profitably viewed as a special type of polycylic group; this is the perspective taken in this previous blog post. Last, but not least, one can view nilpotent groups from a combinatorial group theory perspective, as being words from some set of generators of various “degrees” subject to some commutation relations, with commutators of two low-degree generators being expressed in terms of higher degree objects, and all commutators of a sufficiently high degree vanishing. In particular, generators of a given degree can be moved freely around a word, as long as one is willing to generate commutator errors of higher degree.
With this last perspective, in particular, one can start computing in nilpotent groups by adopting the philosophy that the lowest order terms should be attended to first, without much initial concern for the higher order errors generated in the process of organising the lower order terms. Only after the lower order terms are in place should attention then turn to higher order terms, working successively up the hierarchy of degrees until all terms are dealt with. This turns out to be a relatively straightforward philosophy to implement in many cases (particularly if one is not interested in explicit expressions and constants, being content instead with qualitative expansions of controlled complexity), but the arguments are necessarily recursive in nature and as such can become a bit messy, and require a fair amount of notation to express precisely. So, unfortunately, the arguments here will be somewhat cumbersome and notation-heavy, even if the underlying methods of proof are relatively simple.
In the previous set of notes, we saw that one could derive expansion of Cayley graphs from three ingredients: non-concentration, product theorems, and quasirandomness. Quasirandomness was discussed in Notes 3. In the current set of notes, we discuss product theorems. Roughly speaking, these theorems assert that in certain circumstances, a finite subset of a group
either exhibits expansion (in the sense that
, say, is significantly larger than
), or is somehow “close to” or “trapped” by a genuine group.
Theorem 1 (Product theorem in
) Let
, let
be a finite field, and let
be a finite subset of
. Let
be sufficiently small depending on
. Then at least one of the following statements holds:
- (Expansion) One has
.
- (Close to
) One has
.
- (Trapping)
is contained in a proper subgroup of
.
We will prove this theorem (which was proven first in the cases for fields
of prime order by Helfgott, and then for
and general
by Dinai, and finally to general
and
independently by Pyber-Szabo and by Breuillard-Green-Tao) later in this notes. A more qualitative version of this proposition was also previously obtained by Hrushovski. There are also generalisations of the product theorem of importance to number theory, in which the field
is replaced by a cyclic ring
(with
not necessarily prime); this was achieved first for
and
square-free by Bourgain, Gamburd, and Sarnak, by Varju for general
and
square-free, and finally by this paper of Bourgain and Varju for arbitrary
and
.
Exercise 1 (Girth bound) Assuming Theorem 1, show that whenever
is a symmetric set of generators of
for some finite field
and some
, then any element of
can be expressed as the product of
elements from
. (Equivalently, if we add the identity element to
, then
for some
.) This is a special case of a conjecture of Babai and Seress, who conjectured that the bound should hold uniformly for all finite simple groups (in particular, the implied constants here should not actually depend on
. The methods used to handle the
case can handle other finite groups of Lie type of bounded rank, but at present we do not have bounds that are independent of the rank. On the other hand, a recent paper of Helfgott and Seress has almost resolved the conjecture for the permutation groups
.
A key tool to establish product theorems is an argument which is sometimes referred to as the pivot argument. To illustrate this argument, let us first discuss a much simpler (and older) theorem, essentially due to Freiman, which has a much weaker conclusion but is valid in any group :
Theorem 2 (Baby product theorem) Let
be a group, and let
be a finite non-empty subset of
. Then one of the following statements hold:
- (Expansion) One has
.
- (Close to a subgroup)
is contained in a left-coset of a group
with
.
To prove this theorem, we suppose that the first conclusion does not hold, thus . Our task is then to place
inside the left-coset of a fairly small group
.
To do this, we take a group element , and consider the intersection
. A priori, the size of this set could range from anywhere from
to
. However, we can use the hypothesis
to obtain an important dichotomy, reminiscent of the classical fact that two cosets
of a subgroup
of
are either identical or disjoint:
Proposition 3 (Dichotomy) If
, then exactly one of the following occurs:
- (Non-involved case)
is empty.
- (Involved case)
.
Proof: Suppose we are not in the pivot case, so that is non-empty. Let
be an element of
, then
and
both lie in
. The sets
and
then both lie in
. As these sets have cardinality
and lie in
, which has cardinality less than
, we conclude from the inclusion-exclusion formula that
But the left-hand side is equal to , and the claim follows.
The above proposition provides a clear separation between two types of elements : the “non-involved” elements, which have nothing to do with
(in the sense that
, and the “involved” elements, which have a lot to do with
(in the sense that
. The key point is that there is a significant “gap” between the non-involved and involved elements; there are no elements that are only “slightly involved”, in that
and
intersect a little but not a lot. It is this gap that will allow us to upgrade approximate structure to exact structure. Namely,
Proposition 4 The set
of involved elements is a finite group, and is equal to
.
Proof: It is clear that the identity element is involved, and that if
is involved then so is
(since
. Now suppose that
are both involved. Then
and
have cardinality greater than
and are both subsets of
, and so have non-empty intersection. In particular,
is non-empty, and so
is non-empty. By Proposition 3, this makes
involved. It is then clear that
is a group.
If , then
is non-empty, and so from Proposition 3
is involved. Conversely, if
is involved, then
. Thus we have
as claimed. In particular,
is finite.
Now we can quickly wrap up the proof of Theorem 2. By construction, for all
,which by double counting shows that
. As
, we see that
is contained in a right coset
of
; setting
, we conclude that
is contained in a left coset
of
.
is a conjugate of
, and so
. If
, then
and
both lie in
and have cardinality
, so must overlap; and so
. Thus
, and so
, and Theorem 2 follows.
Exercise 2 Show that the constant
in Theorem 2 cannot be replaced by any larger constant.
Exercise 3 Let
be a finite non-empty set such that
. Show that
. (Hint: If
, show that
for some
.)
Exercise 4 Let
be a finite non-empty set such that
. Show that there is a finite group
with
and a group element
such that
and
.
Below the fold, we give further examples of the pivot argument in other group-like situations, including Theorem 2 and also the “sum-product theorem” of Bourgain-Katz-Tao and Bourgain-Glibichuk-Konyagin.
Let be an element of the unit circle, let
, and let
. We define the (rank one) Bohr set
to be the set
where is the distance to the origin in the unit circle (or equivalently, the distance to the nearest integer, after lifting up to
). These sets play an important role in additive combinatorics and in additive number theory. For instance, they arise naturally when applying the circle method, because Bohr sets describe the oscillation of exponential phases such as
.
Observe that Bohr sets enjoy the doubling property
thus doubling the Bohr set doubles both the length parameter and the radius parameter
. As such, these Bohr sets resemble two-dimensional balls (or boxes). Indeed, one can view
as the preimage of the two-dimensional box
under the homomorphism
.
Another class of finite set with two-dimensional behaviour is the class of (rank two) generalised arithmetic progressions
with and
Indeed, we have
and so we see, as with the Bohr set, that doubling the generalised arithmetic progressions doubles the two defining parameters of that progression.
More generally, there is an analogy between rank Bohr sets
and the rank generalised arithmetic progressions
One of the aims of additive combinatorics is to formalise analogies such as the one given above. By using some arguments from the geometry of numbers, for instance, one can show that for any rank Bohr set
, there is a rank
generalised arithmetic progression
for which one has the containments
for some explicit depending only on
(in fact one can take
); this is (a slight modification of) Lemma 4.22 of my book with Van Vu.
In the special case when , one can make a significantly more detailed description of the link between rank one Bohr sets and rank two generalised arithmetic progressions, by using the classical theory of continued fractions, which among other things gives a fairly precise formula for the generators
and lengths
of the generalised arithmetic progression associated to a rank one Bohr set
. While this connection is already implicit in the continued fraction literature (for instance, in the classic text of Hardy and Wright), I thought it would be a good exercise to work it out explicitly and write it up, which I will do below the fold.
It is unfortunate that the theory of continued fractions is restricted to the rank one setting (it relies very heavily on the total ordering of one-dimensional sets such as or
). A higher rank version of the theory could potentially help with questions such as the Littlewood conjecture, which remains open despite a substantial amount of effort and partial progress on the problem. At the end of this post I discuss how one can use the rank one theory to rephrase the Littlewood conjecture as a conjecture about a doubly indexed family of rank four progressions, which can be used to heuristically justify why this conjecture should be true, but does not otherwise seem to shed much light on the problem.
In 1964, Kemperman established the following result:
Theorem 1 Let
be a compact connected group, with a Haar probability measure
. Let
be compact subsets of
. Then
Remark 1 The estimate is sharp, as can be seen by considering the case when
is a unit circle, and
are arcs; similarly if
is any compact connected group that projects onto the circle. The connectedness hypothesis is essential, as can be seen by considering what happens if
and
are a non-trivial open subgroup of
. For locally compact connected groups which are unimodular but not compact, there is an analogous statement, but with
now a Haar measure instead of a Haar probability measure, and the right-hand side
replaced simply by
. The case when
is a torus is due to Macbeath, and the case when
is a circle is due to Raikov. The theorem is closely related to the Cauchy-Davenport inequality; indeed, it is not difficult to use that inequality to establish the circle case, and the circle case can be used to deduce the torus case by considering increasingly dense circle subgroups of the torus (alternatively, one can also use Kneser’s theorem).
By inner regularity, the hypothesis that
are compact can be replaced with Borel measurability, so long as one then adds the additional hypothesis that
is also Borel measurable.
A short proof of Kemperman’s theorem was given by Ruzsa. In this post I wanted to record how this argument can be used to establish the following more “robust” version of Kemperman’s theorem, which not only lower bounds , but gives many elements of
some multiplicity:
Theorem 2 Let
be a compact connected group, with a Haar probability measure
. Let
be compact subsets of
. Then for any
, one has
Indeed, Theorem 1 can be deduced from Theorem 2 by dividing (1) by and then taking limits as
. The bound in (1) is sharp, as can again be seen by considering the case when
are arcs in a circle. The analogous claim for cyclic groups for prime order was established by Pollard, and for general abelian groups by Green and Ruzsa.
Let us now prove Theorem 2. It uses a submodularity argument related to one discussed in this previous post. We fix and
with
, and define the quantity
for any compact set . Our task is to establish that
whenever
.
We first verify the extreme cases. If , then
, and so
in this case. At the other extreme, if
, then from the inclusion-exclusion principle we see that
, and so again
in this case (since
).
To handle the intermediate regime when lies between
and
, we rely on the submodularity inequality
. This inequality comes from the obvious pointwise identity
whence
and thus (noting that the quantities on the left are closer to each other than the quantities on the right)
at which point (2) follows by integrating over and then using the inclusion-exclusion principle.
Now introduce the function
for . From the preceding discussion
vanishes at the endpoints
; our task is to show that
is non-negative in the interior region
. Suppose for contradiction that this was not the case. It is easy to see that
is continuous (indeed, it is even Lipschitz continuous), so there must be
at which
is a local minimum and not locally constant. In particular,
. But for any
with
, we have the translation-invariance
, and hence by (2)
Note that depends continuously on
, equals
when
is the identity, and has an average value of
. As
is connected, we thus see from the intermediate value theorem that for any
, we can find
such that
, and thus by inclusion-exclusion
. By definition of
, we thus have
Taking infima in (and noting that the hypotheses on
are independent of
) we conclude that
for all . As
is a local minimum and
is arbitrarily small, this implies that
is locally constant, a contradiction. This establishes Theorem 2.
We observe the following corollary:
Corollary 3 Let
be a compact connected group, with a Haar probability measure
. Let
be compact subsets of
, and let
. Then one has the pointwise estimate
if
, and
if
.
Once again, the bounds are completely sharp, as can be seen by computing when
are arcs of a circle. For quasirandom
, one can do much better than these bounds, as discussed in this recent blog post; thus, the abelian case is morally the worst case here, although it seems difficult to convert this intuition into a rigorous reduction.
Proof: By cyclic permutation we may take . For any
we can bound
where we used Theorem 2 to obtain the third line. Optimising in , we obtain the claim.
Emmanuel Breuillard, Ben Green and I have just uploaded to the arXiv the short paper “A nilpotent Freiman dimension lemma“, submitted to the special volume of the European Journal of Combinatorics in honour of Yahya Ould Hamidoune. This paper is a nonabelian (or more precisely, nilpotent) variant of the following additive combinatorics lemma of Freiman:
Freiman’s lemma. Let A be a finite subset of a Euclidean space with
. Then A is contained in an affine subspace of dimension at most
.
This can be viewed as a “cheap” version of the more well known theorem of Freiman that places sets of small doubling in a torsion-free abelian group inside a generalised arithmetic progression. The advantage here is that the bound on the dimension is extremely explicit.
Our main result is
Theorem. Let A be a finite subset of a simply-connected nilpotent Lie group G which is a K-approximate group (i.e. A is symmetric, contains the identity, and
can be covered by up to K left translates of A. Then A can be covered by at most
left-translates of a closed connected Lie subgroup of dimension at most
.
We remark that our previous paper established a similar result, in which the dimension bound was improved to , but at the cost of worsening the covering number to
, and with a much more complicated proof (91 pages instead of 8). Furthermore, the bound on
is ineffective, due to the use of ultraproducts in the argument (though it is likely that some extremely lousy explicit bound could eventually be squeezed out of the argument by finitising everything). Note that the step of the ambient nilpotent group G does not influence the final bounds in the theorem, although we do of course need this step to be finite. A simple quotienting argument allows one to deduce a corollary of the above theorem in which the ambient group is assumed to be residually torsion-free nilpotent instead of being a simply connected nilpotent Lie group, but we omit the statement of this corollary here.
To motivate the proof of this theorem, let us first show a simple case of an argument of Gleason, which is very much in the spirit of Freiman’s lemma:
Gleason Lemma (special case). Let
be a finite symmetric subset of a Euclidean space, and let
be a sequence of subspaces in this space, such that the sets
are strictly increasing in i for
. Then
, where
.
Proof. By hypothesis, for each , the projection
of
to
is non-trivial, finite, and symmetric. In particular, since the vector space
is torsion-free,
is strictly larger than
. Equivalently, one can find
in
that does not lie in
; in particular,
and
is disjoint from
. As a consequence, the
are disjoint and lie in 5A, whence the claim.
Note that by combining the contrapositive of this lemma with a greedy algorithm, one can show that any K-approximate group in a Euclidean space is contained in a subspace of dimension at most , which is a weak version of Freiman’s lemma.
To extend the argument to the nilpotent setting we use the following idea. Observe that any non-trivial genuine subgroup H of a nilpotent group G will contain at least one non-trivial central element; indeed, by intersecting H with the lower central series of G, and considering the last intersection
which is non-trivial, one obtains the claim. It turns out that one can adapt this argument to approximate groups, so that any sufficiently large K-approximate subgroup A of G will contain a non-trivial element that centralises a large fraction of A. Passing to this large fraction and quotienting out the central element, we obtain a new approximate group. If, after a bounded number of steps, this procedure gives an approximate group of bounded size, we are basically done. If, however, the process continues, then by using some Lie group theory, one can find a long sequence
of connected Lie subgroups of G, such that the sets
are strictly increasing in i. Using some Lie group theory and the hypotheses on G, one can deduce that the group
generated by
is much larger than
, in the sense that the latter group has infinite index in the former. It then turns out that the Gleason argument mentioned above can be adapted to this setting.

Recent Comments