You are currently browsing the tag archive for the ‘additive combinatorics’ tag.
Emmanuel Breuillard, Ben Green, and I have just uploaded to the arXiv our survey “Small doubling in groups“, for the proceedings of the upcoming Erdos Centennial. This is a short survey of the known results on classifying finite subsets of an (abelian) additive group
or a (not necessarily abelian) multiplicative group
that have small doubling in the sense that the sum set
or product set
is small. Such sets behave approximately like finite subgroups of
(and there is a closely related notion of an approximate group in which the analogy is even tighter) , and so this subject can be viewed as a sort of approximate version of finite group theory. (Unfortunately, thus far the theory does not have much new to say about the classification of actual finite groups; progress has been largely made instead on classifying the (highly restricted) number of ways in which approximate groups can differ from a genuine group.)
In the classical case when is the integers
, these sets were classified (in a qualitative sense, at least) by a celebrated theorem of Freiman, which roughly speaking says that such sets
are necessarily “commensurate” in some sense with a (generalised) arithmetic progression
of bounded rank. There are a number of essentially equivalent ways to define what “commensurate” means here; for instance, in the original formulation of the theorem, one asks that
be a dense subset of
, but in modern formulations it is often more convenient to require instead that
be of comparable size to
and be covered by a bounded number of translates of
, or that
and
have an intersection that is of comparable size to both
and
(cf. the notion of commensurability in group theory).
Freiman’s original theorem was extended to more general abelian groups in a sequence of papers culminating in the paper of Green and Ruzsa that handled arbitrary abelian groups. As such groups now contain non-trivial finite subgroups, the conclusion of the theorem must be modified by allowing for “coset progressions” , which can be viewed as “extensions” of generalized arithmetic progressions
by genuine finite groups
.
The proof methods in these abelian results were Fourier-analytic in nature (except in the cases of sets of very small doubling, in which more combinatorial approaches can be applied, and there were also some geometric or combinatorial methods that gave some weaker structural results). As such, it was a challenge to extend these results to nonabelian groups, although for various important special types of ambient group (such as an linear group over a finite or infinite field) it turns out that one can use tools exploiting the special structure of those groups (e.g. for linear groups one would use tools from Lie theory and algebraic geometry) to obtain quite satisfactory results; see e.g. this survey of Pyber and Szabo for the linear case. When the ambient group
is completely arbitrary, it turns out the problem is closely related to the classical Hilbert’s fifth problem of determining the minimal requirements of a topological group in order for such groups to have Lie structure; this connection was first observed and exploited by Hrushovski, and then used by Breuillard, Green, and myself to obtain the analogue of Freiman’s theorem for an arbitrary nonabelian group.
This survey is too short to discuss in much detail the proof techniques used in these results (although the abelian case is discussed in this book of mine with Vu, and the nonabelian case discussed in this more recent book of mine), but instead focuses on the statements of the various known results, as well as some remaining open questions in the subject (in particular, there is substantial work left to be done in making the estimates more quantitative, particularly in the nonabelian setting).
Let be an element of the unit circle, let
, and let
. We define the (rank one) Bohr set
to be the set
where is the distance to the origin in the unit circle (or equivalently, the distance to the nearest integer, after lifting up to
). These sets play an important role in additive combinatorics and in additive number theory. For instance, they arise naturally when applying the circle method, because Bohr sets describe the oscillation of exponential phases such as
.
Observe that Bohr sets enjoy the doubling property
thus doubling the Bohr set doubles both the length parameter and the radius parameter
. As such, these Bohr sets resemble two-dimensional balls (or boxes). Indeed, one can view
as the preimage of the two-dimensional box
under the homomorphism
.
Another class of finite set with two-dimensional behaviour is the class of (rank two) generalised arithmetic progressions
with and
Indeed, we have
and so we see, as with the Bohr set, that doubling the generalised arithmetic progressions doubles the two defining parameters of that progression.
More generally, there is an analogy between rank Bohr sets
and the rank generalised arithmetic progressions
One of the aims of additive combinatorics is to formalise analogies such as the one given above. By using some arguments from the geometry of numbers, for instance, one can show that for any rank Bohr set
, there is a rank
generalised arithmetic progression
for which one has the containments
for some explicit depending only on
(in fact one can take
); this is (a slight modification of) Lemma 4.22 of my book with Van Vu.
In the special case when , one can make a significantly more detailed description of the link between rank one Bohr sets and rank two generalised arithmetic progressions, by using the classical theory of continued fractions, which among other things gives a fairly precise formula for the generators
and lengths
of the generalised arithmetic progression associated to a rank one Bohr set
. While this connection is already implicit in the continued fraction literature (for instance, in the classic text of Hardy and Wright), I thought it would be a good exercise to work it out explicitly and write it up, which I will do below the fold.
It is unfortunate that the theory of continued fractions is restricted to the rank one setting (it relies very heavily on the total ordering of one-dimensional sets such as or
). A higher rank version of the theory could potentially help with questions such as the Littlewood conjecture, which remains open despite a substantial amount of effort and partial progress on the problem. At the end of this post I discuss how one can use the rank one theory to rephrase the Littlewood conjecture as a conjecture about a doubly indexed family of rank four progressions, which can be used to heuristically justify why this conjecture should be true, but does not otherwise seem to shed much light on the problem.
In 1964, Kemperman established the following result:
Theorem 1 Let
be a compact connected group, with a Haar probability measure
. Let
be compact subsets of
. Then
Remark 1 The estimate is sharp, as can be seen by considering the case when
is a unit circle, and
are arcs; similarly if
is any compact connected group that projects onto the circle. The connectedness hypothesis is essential, as can be seen by considering what happens if
and
are a non-trivial open subgroup of
. For locally compact connected groups which are unimodular but not compact, there is an analogous statement, but with
now a Haar measure instead of a Haar probability measure, and the right-hand side
replaced simply by
. The case when
is a torus is due to Macbeath, and the case when
is a circle is due to Raikov. The theorem is closely related to the Cauchy-Davenport inequality; indeed, it is not difficult to use that inequality to establish the circle case, and the circle case can be used to deduce the torus case by considering increasingly dense circle subgroups of the torus (alternatively, one can also use Kneser’s theorem).
By inner regularity, the hypothesis that
are compact can be replaced with Borel measurability, so long as one then adds the additional hypothesis that
is also Borel measurable.
A short proof of Kemperman’s theorem was given by Ruzsa. In this post I wanted to record how this argument can be used to establish the following more “robust” version of Kemperman’s theorem, which not only lower bounds , but gives many elements of
some multiplicity:
Theorem 2 Let
be a compact connected group, with a Haar probability measure
. Let
be compact subsets of
. Then for any
, one has
Indeed, Theorem 1 can be deduced from Theorem 2 by dividing (1) by and then taking limits as
. The bound in (1) is sharp, as can again be seen by considering the case when
are arcs in a circle. The analogous claim for cyclic groups for prime order was established by Pollard, and for general abelian groups by Green and Ruzsa.
Let us now prove Theorem 2. It uses a submodularity argument related to one discussed in this previous post. We fix and
with
, and define the quantity
for any compact set . Our task is to establish that
whenever
.
We first verify the extreme cases. If , then
, and so
in this case. At the other extreme, if
, then from the inclusion-exclusion principle we see that
, and so again
in this case (since
).
To handle the intermediate regime when lies between
and
, we rely on the submodularity inequality
. This inequality comes from the obvious pointwise identity
whence
and thus (noting that the quantities on the left are closer to each other than the quantities on the right)
at which point (2) follows by integrating over and then using the inclusion-exclusion principle.
Now introduce the function
for . From the preceding discussion
vanishes at the endpoints
; our task is to show that
is non-negative in the interior region
. Suppose for contradiction that this was not the case. It is easy to see that
is continuous (indeed, it is even Lipschitz continuous), so there must be
at which
is a local minimum and not locally constant. In particular,
. But for any
with
, we have the translation-invariance
, and hence by (2)
Note that depends continuously on
, equals
when
is the identity, and has an average value of
. As
is connected, we thus see from the intermediate value theorem that for any
, we can find
such that
, and thus by inclusion-exclusion
. By definition of
, we thus have
Taking infima in (and noting that the hypotheses on
are independent of
) we conclude that
for all . As
is a local minimum and
is arbitrarily small, this implies that
is locally constant, a contradiction. This establishes Theorem 2.
We observe the following corollary:
Corollary 3 Let
be a compact connected group, with a Haar probability measure
. Let
be compact subsets of
, and let
. Then one has the pointwise estimate
if
, and
if
.
Once again, the bounds are completely sharp, as can be seen by computing when
are arcs of a circle. For quasirandom
, one can do much better than these bounds, as discussed in this recent blog post; thus, the abelian case is morally the worst case here, although it seems difficult to convert this intuition into a rigorous reduction.
Proof: By cyclic permutation we may take . For any
we can bound
where we used Theorem 2 to obtain the third line. Optimising in , we obtain the claim.
A few days ago, I received the sad news that Yahya Ould Hamidoune had recently died. Hamidoune worked in additive combinatorics, and had recently solved a question on noncommutative Freiman-Kneser theorems posed by myself on this blog last year. Namely, Hamidoune showed
Theorem 1 (Noncommutative Freiman-Kneser theorem for small doubling) Let
, and let
be a finite non-empty subset of a multiplicative group
such that
for some finite set
of cardinality
at least
, where
is the product set of
and
. Then there exists a finite subgroup
of
with cardinality
, such that
is covered by at most
right-cosets
of
, where
depend only on
.
One can of course specialise here to the case , and view this theorem as a classification of those sets
of doubling constant at most
.
In fact Hamidoune’s argument, which is completely elementary, gives the very nice explicit constants and
, which are essentially optimal except for factors of
(as can be seen by considering an arithmetic progression in an additive group). This result was also independently established (in the
case) by Tom Sanders (unpublished) by a more Fourier-analytic method, in particular drawing on Sanders’ deep results on the Wiener algebra
on arbitrary non-commutative groups
.
This type of result had previously been known when was less than the golden ratio
, as first observed by Freiman; see my previous blog post for more discussion.
Theorem 1 is not, strictly speaking, contained in Hamidoune’s paper, but can be extracted from his arguments, which share some similarity with the recent simple proof of the Ruzsa-Plünnecke inequality by Petridis (as discussed by Tim Gowers here), and this is what I would like to do below the fold. I also include (with permission) Sanders’ unpublished argument, which proceeds instead by Fourier-analytic methods. Read the rest of this entry »
In Notes 3, we saw that the number of additive patterns in a given set was (in principle, at least) controlled by the Gowers uniformity norms of functions associated to that set.
Such norms can be defined on any finite additive group (and also on some other types of domains, though we will not discuss this point here). In particular, they can be defined on the finite-dimensional vector spaces over a finite field
.
In this case, the Gowers norms are closely tied to the space
of polynomials of degree at most
. Indeed, as noted in Exercise 20 of Notes 4, a function
of
norm
has
norm equal to
if and only if
for some
; thus polynomials solve the “
inverse problem” for the trivial inequality
. They are also a crucial component of the solution to the “
inverse problem” and “
inverse problem”. For the former, we will soon show:
Proposition 1 (
inverse theorem for
) Let
be such that
and
for some
. Then there exists
such that
, where
is a constant depending only on
.
Thus, for the Gowers norm to be almost completely saturated, one must be very close to a polynomial. The converse assertion is easily established:
Exercise 1 (Converse to
inverse theorem for
) If
and
for some
, then
, where
is a constant depending only on
.
In the world, one no longer expects to be close to a polynomial. Instead, one expects to correlate with a polynomial. Indeed, one has
Lemma 2 (Converse to the
inverse theorem for
) If
and
are such that
, where
, then
.
Proof: From the definition of the norm (equation (18) from Notes 3), the monotonicity of the Gowers norms (Exercise 19 of Notes 3), and the polynomial phase modulation invariance of the Gowers norms (Exercise 21 of Notes 3), one has
and the claim follows.
In the high characteristic case at least, this can be reversed:
Theorem 3 (
inverse theorem for
) Suppose that
. If
is such that
and
, then there exists
such that
.
This result is sometimes referred to as the inverse conjecture for the Gowers norm (in high, but bounded, characteristic). For small , the claim is easy:
Exercise 2 Verify the cases
of this theorem. (Hint: to verify the
case, use the Fourier-analytic identities
and
, where
is the space of all homomorphisms
from
to
, and
are the Fourier coefficients of
.)
This conjecture for larger values of are more difficult to establish. The
case of the theorem was established by Ben Green and myself in the high characteristic case
; the low characteristic case
was independently and simultaneously established by Samorodnitsky. The cases
in the high characteristic case was established in two stages, firstly using a modification of the Furstenberg correspondence principle, due to Ziegler and myself. to convert the problem to an ergodic theory counterpart, and then using a modification of the methods of Host-Kra and Ziegler to solve that counterpart, as done in this paper of Bergelson, Ziegler, and myself.
The situation with the low characteristic case in general is still unclear. In the high characteristic case, we saw from Notes 4 that one could replace the space of non-classical polynomials in the above conjecture with the essentially equivalent space of classical polynomials
. However, as we shall see below, this turns out not to be the case in certain low characteristic cases (a fact first observed by Lovett, Meshulam, and Samorodnitsky, and independently by Ben Green and myself), for instance if
and
; this is ultimately due to the existence in those cases of non-classical polynomials which exhibit no significant correlation with classical polynomials of equal or lesser degree. This distinction between classical and non-classical polynomials appears to be a rather non-trivial obstruction to understanding the low characteristic setting; it may be necessary to obtain a more complete theory of non-classical polynomials in order to fully settle this issue.
The inverse conjecture has a number of consequences. For instance, it can be used to establish the analogue of Szemerédi’s theorem in this setting:
Theorem 4 (Szemerédi’s theorem for finite fields) Let
be a finite field, let
, and let
be such that
. If
is sufficiently large depending on
, then
contains an (affine) line
for some
with
.
Exercise 3 Use Theorem 4 to establish the following generalisation: with the notation as above, if
and
is sufficiently large depending on
, then
contains an affine
-dimensional subspace.
We will prove this theorem in two different ways, one using a density increment method, and the other using an energy increment method. We discuss some other applications below the fold.
A handy inequality in additive combinatorics is the Plünnecke-Ruzsa inequality:
Theorem 1 (Plünnecke-Ruzsa inequality) Let
be finite non-empty subsets of an additive group
, such that
for all
and some scalars
. Then there exists a subset
of
such that
.
The proof uses graph-theoretic techniques. Setting , we obtain a useful corollary: if
has small doubling in the sense that
, then we have
for all
, where
is the sum of
copies of
.
In a recent paper, I adapted a number of sum set estimates to the entropy setting, in which finite sets such as in
are replaced with discrete random variables
taking values in
, and (the logarithm of) cardinality
of a set
is replaced by Shannon entropy
of a random variable
. (Throughout this note I assume all entropies to be finite.) However, at the time, I was unable to find an entropy analogue of the Plünnecke-Ruzsa inequality, because I did not know how to adapt the graph theory argument to the entropy setting.
I recently discovered, however, that buried in a classic paper of Kaimonovich and Vershik (implicitly in Proposition 1.3, to be precise) there was the following analogue of Theorem 1:
Theorem 2 (Entropy Plünnecke-Ruzsa inequality) Let
be independent random variables of finite entropy taking values in an additive group
, such that
for all
and some scalars
. Then
.
In fact Theorem 2 is a bit “better” than Theorem 1 in the sense that Theorem 1 needed to refine the original set to a subset
, but no such refinement is needed in Theorem 2. One corollary of Theorem 2 is that if
, then
for all
, where
are independent copies of
; this improves slightly over the analogous combinatorial inequality. Indeed, the function
is concave (this can be seen by using the
version of Theorem 2 (or (2) below) to show that the quantity
is decreasing in
).
Theorem 2 is actually a quick consequence of the submodularity inequality
are discrete random variables such that
and
each determine
(i.e.
is a function of
, and also a function of
), and
and
jointly determine
(i.e
is a function of
and
). To apply this, let
be independent discrete random variables taking values in
. Observe that the pairs
and
each determine
, and jointly determine
. Applying (1) we conclude that
which after using the independence of simplifies to the sumset submodularity inequality
case of Theorem 2). As a corollary of this inequality, we see that if
, then
and Theorem 2 follows by telescoping series.
The proof of Theorem 2 seems to be genuinely different from the graph-theoretic proof of Theorem 1. It would be interesting to see if the above argument can be somehow adapted to give a stronger version of Theorem 1. Note also that both Theorem 1 and Theorem 2 have extensions to more general combinations of than
; see this paper and this paper respectively.
Below the fold is a version of my talk “Recent progress on the Kakeya conjecture” that I gave at the Fefferman conference.
Additive combinatorics is largely focused on the additive properties of finite subsets A of an additive group . This group can be finite or infinite, but there is a very convenient trick, the Ruzsa projection trick, which allows one to reduce the latter case to the former. For instance, consider the set
inside the integers
. The integers of course form an infinite group, but if we are only interested in sums of at most two elements of A at a time, we can embed A ininside the finite cyclic group
without losing any combinatorial information. More precisely, there is a Freiman isomorphism of order 2 between the set
in
and the set
in
. One can view the latter version of
as a model for the former version of
. More generally, it turns out that any finite set A in an additive group can be modeled in the above set by an equivalent set in a finite group, and in fact one can ensure that this ambient modeling group is not much larger than A itself if A has some additive structure; see this paper of Ruzsa (or Lemma 5.26 of my book with Van Vu) for a precise statement. This projection trick has a number of important uses in additive combinatorics, most notably in Ruzsa’s simplified proof of Freiman’s theorem.
Given the interest in non-commutative analogues of Freiman’s theorem, it is natural to ask whether one can similarly model finite sets A in multiplicative (and non-commutative) groups using finite models. Unfortunately (as I learned recently from Akshay Venkatesh, via Ben Green), this turns out to be impossible in general, due to an old example of Higman. More precisely, Higman shows:
Theorem 1. There exists an infinite group G generated by four distinct elements a,b,c,d that obey the relations
; (1)
in fact, a and c generate the free group in G. On the other hand, if G’ is a finite group containing four elements a,b,c,d obeying (1), then a,b,c,d are all trivial.
As a consequence, the finite set in G has no model (in the sense of Freiman isomorphisms) in a finite group.
Theorem 1 is proven by a small amount of elementary group theory and number theory, and it was neat enough that I thought I would reproduce it here.

Recent Comments