You are currently browsing the tag archive for the ‘Freiman’s theorem’ tag.
This fall (starting Monday, September 26), I will be teaching a graduate topics course which I have entitled “Hilbert’s fifth problem and related topics.” The course is going to focus on three related topics:
- Hilbert’s fifth problem on the topological description of Lie groups, as well as the closely related (local) classification of locally compact groups (the Gleason-Yamabe theorem).
- Approximate groups in nonabelian groups, and their classification via the Gleason-Yamabe theorem (this is very recent work of Emmanuel Breuillard, Ben Green, Tom Sanders, and myself, building upon earlier work of Hrushovski);
- Gromov’s theorem on groups of polynomial growth, as proven via the classification of approximate groups (as well as some consequences to fundamental groups of Riemannian manifolds).
I have already blogged about these topics repeatedly in the past (particularly with regard to Hilbert’s fifth problem), and I intend to recycle some of that material in the lecture notes for this course.
The above three families of results exemplify two broad principles (part of what I like to call “the dichotomy between structure and randomness“):
- (Rigidity) If a group-like object exhibits a weak amount of regularity, then it (or a large portion thereof) often automatically exhibits a strong amount of regularity as well;
- (Structure) This strong regularity manifests itself either as Lie type structure (in continuous settings) or nilpotent type structure (in discrete settings). (In some cases, “nilpotent” should be replaced by sister properties such as “abelian“, “solvable“, or “polycyclic“.)
Let me illustrate what I mean by these two principles with two simple examples, one in the continuous setting and one in the discrete setting. We begin with a continuous example. Given an complex matrix
, define the matrix exponential
of
by the formula
which can easily be verified to be an absolutely convergent series.
Exercise 1 Show that the map
is a real analytic (and even complex analytic) map from
to
, and obeys the restricted homomorphism property
for all
and
.
Proposition 1 (Rigidity and structure of matrix homomorphisms) Let
be a natural number. Let
be the group of invertible
complex matrices. Let
be a map obeying two properties:
- (Group-like object)
is a homomorphism, thus
for all
.
- (Weak regularity) The map
is continuous.
Then:
- (Strong regularity) The map
is smooth (i.e. infinitely differentiable). In fact it is even real analytic.
- (Lie-type structure) There exists a (unique) complex
matrix
such that
for all
.
Proof: Let be as above. Let
be a small number (depending only on
). By the homomorphism property,
(where we use
here to denote the identity element of
), and so by continuity we may find a small
such that
for all
(we use some arbitrary norm here on the space of
matrices, and allow implied constants in the
notation to depend on
).
The map is real analytic and (by the inverse function theorem) is a diffeomorphism near
. Thus, by the inverse function theorem, we can (if
is small enough) find a matrix
of size
such that
. By the homomorphism property and (1), we thus have
On the other hand, by another application of the inverse function theorem we see that the squaring map is a diffeomorphism near
in
, and thus (if
is small enough)
We may iterate this argument (for a fixed, but small, value of ) and conclude that
for all . By the homomorphism property and (1) we thus have
whenever is a dyadic rational, i.e. a rational of the form
for some integer
and natural number
. By continuity we thus have
for all real . Setting
we conclude that
for all real , which gives existence of the representation and also real analyticity and smoothness. Finally, uniqueness of the representation
follows from the identity
Exercise 2 Generalise Proposition 1 by replacing the hypothesis that
is continuous with the hypothesis that
is Lebesgue measurable (Hint: use the Steinhaus theorem.). Show that the proposition fails (assuming the axiom of choice) if this hypothesis is omitted entirely.
Note how one needs both the group-like structure and the weak regularity in combination in order to ensure the strong regularity; neither is sufficient on its own. We will see variants of the above basic argument throughout the course. Here, the task of obtaining smooth (or real analytic structure) was relatively easy, because we could borrow the smooth (or real analytic) structure of the domain and range
; but, somewhat remarkably, we shall see that one can still build such smooth or analytic structures even when none of the original objects have any such structure to begin with.
Now we turn to a second illustration of the above principles, namely Jordan’s theorem, which uses a discreteness hypothesis to upgrade Lie type structure to nilpotent (and in this case, abelian) structure. We shall formulate Jordan’s theorem in a slightly stilted fashion in order to emphasise the adherence to the above-mentioned principles.
Theorem 2 (Jordan’s theorem) Let
be an object with the following properties:
- (Group-like object)
is a group.
- (Discreteness)
is finite.
- (Lie-type structure)
is contained in
(the group of unitary
matrices) for some
.
Then there is a subgroup
of
such that
- (
is close to
) The index
of
in
is
(i.e. bounded by
for some quantity
depending only on
).
- (Nilpotent-type structure)
is abelian.
A key observation in the proof of Jordan’s theorem is that if two unitary elements are close to the identity, then their commutator
is even closer to the identity (in, say, the operator norm
). Indeed, since multiplication on the left or right by unitary elements does not affect the operator norm, we have
and so by the triangle inequality
Now we can prove Jordan’s theorem.
Proof: We induct on , the case
being trivial. Suppose first that
contains a central element
which is not a multiple of the identity. Then, by definition,
is contained in the centraliser
of
, which by the spectral theorem is isomorphic to a product
of smaller unitary groups. Projecting
to each of these factor groups and applying the induction hypothesis, we obtain the claim.
Thus we may assume that contains no central elements other than multiples of the identity. Now pick a small
(one could take
in fact) and consider the subgroup
of
generated by those elements of
that are within
of the identity (in the operator norm). By considering a maximal
-net of
we see that
has index at most
in
. By arguing as before, we may assume that
has no central elements other than multiples of the identity.
If consists only of multiples of the identity, then we are done. If not, take an element
of
that is not a multiple of the identity, and which is as close as possible to the identity (here is where we crucially use that
is finite). By (2), we see that if
is sufficiently small depending on
, and if
is one of the generators of
, then
lies in
and is closer to the identity than
, and is thus a multiple of the identity. On the other hand,
has determinant
. Given that it is so close to the identity, it must therefore be the identity (if
is small enough). In other words,
is central in
, and is thus a multiple of the identity. But this contradicts the hypothesis that there are no central elements other than multiples of the identity, and we are done.
Commutator estimates such as (2) will play a fundamental role in many of the arguments we will see in this course; as we saw above, such estimates combine very well with a discreteness hypothesis, but will also be very useful in the continuous setting.
Exercise 3 Generalise Jordan’s theorem to the case when
is a finite subgroup of
rather than of
. (Hint: The elements of
are not necessarily unitary, and thus do not necessarily preserve the standard Hilbert inner product of
. However, if one averages that inner product by the finite group
, one obtains a new inner product on
that is preserved by
, which allows one to conjugate
to a subgroup of
. This averaging trick is (a small) part of Weyl’s unitary trick in representation theory.)
Exercise 4 (Inability to discretise nonabelian Lie groups) Show that if
, then the orthogonal group
cannot contain arbitrarily dense finite subgroups, in the sense that there exists an
depending only on
such that for every finite subgroup
of
, there exists a ball of radius
in
(with, say, the operator norm metric) that is disjoint from
. What happens in the
case?
Remark 1 More precise classifications of the finite subgroups of
are known, particularly in low dimensions. For instance, one can show that the only finite subgroups of
(which
is a double cover of) are isomorphic to either a cyclic group, a dihedral group, or the symmetry group of one of the Platonic solids.
A few days ago, I received the sad news that Yahya Ould Hamidoune had recently died. Hamidoune worked in additive combinatorics, and had recently solved a question on noncommutative Freiman-Kneser theorems posed by myself on this blog last year. Namely, Hamidoune showed
Theorem 1 (Noncommutative Freiman-Kneser theorem for small doubling) Let
, and let
be a finite non-empty subset of a multiplicative group
such that
for some finite set
of cardinality
at least
, where
is the product set of
and
. Then there exists a finite subgroup
of
with cardinality
, such that
is covered by at most
right-cosets
of
, where
depend only on
.
One can of course specialise here to the case , and view this theorem as a classification of those sets
of doubling constant at most
.
In fact Hamidoune’s argument, which is completely elementary, gives the very nice explicit constants and
, which are essentially optimal except for factors of
(as can be seen by considering an arithmetic progression in an additive group). This result was also independently established (in the
case) by Tom Sanders (unpublished) by a more Fourier-analytic method, in particular drawing on Sanders’ deep results on the Wiener algebra
on arbitrary non-commutative groups
.
This type of result had previously been known when was less than the golden ratio
, as first observed by Freiman; see my previous blog post for more discussion.
Theorem 1 is not, strictly speaking, contained in Hamidoune’s paper, but can be extracted from his arguments, which share some similarity with the recent simple proof of the Ruzsa-Plünnecke inequality by Petridis (as discussed by Tim Gowers here), and this is what I would like to do below the fold. I also include (with permission) Sanders’ unpublished argument, which proceeds instead by Fourier-analytic methods. Read the rest of this entry »
Let be a finite subset of a non-commutative group
. As mentioned previously on this blog (as well as in the current logic reading seminar), there is some interest in classifying those
which obey small doubling conditions such as
or
. A full classification here has still not been established. However, I wanted to record here an elementary argument (based on Exercise 2.6.5 of my book with Van Vu, which in turn is based on this paper of Izabella Laba) that handles the case when
is very close to
:
Proposition 1 If
, then
and
are both finite groups, which are conjugate to each other. In particular,
is contained in the right-coset (or left-coset) of a group of order less than
.
Remark 1 The constant
is completely sharp; consider the case when
where
is the identity and
is an element of order larger than
. This is a small example, but one can make it as large as one pleases by taking the direct product of
and
with any finite group. In the converse direction, we see that whenever
is contained in the right-coset
(resp. left-coset
) of a group of order less than
, then
(resp.
) is necessarily equal to all of
, by the inclusion-exclusion principle (see the proof below for a related argument).
Proof: We begin by showing that is a group. As
is symmetric and contains the identity, it suffices to show that this set is closed under addition.
Let . Then we can write
and
for
. If
were equal to
, then
and we would be done. Of course, there is no reason why
should equal
; but we can use the hypothesis
to boost this as follows. Observe that
and
both have cardinality
and lie inside
, which has cardinality strictly less than
. By the inclusion-exclusion principle, this forces
to have cardinality greater than
. In other words, there exist more than
pairs
such that
, which implies that
. Thus there are more than
elements
such that
for some
(since
is uniquely determined by
); similarly, there exists more than
elements
such that
for some
. Again by inclusion-exclusion, we can thus find
in
for which one has simultaneous representations
and
, and so
, and the claim follows.
In the course of the above argument we showed that every element of the group has more than
representations of the form
for
. But there are only
pairs
available, and thus
.
Now let be any element of
. Since
, we have
, and so
. Conversely, every element of
has exactly
representations of the form
where
. Since
occupies more than half of
, we thus see from the inclusion-exclusion principle, there is thus at least one representation
for which
both lie in
. In other words,
, and the claim follows.
To relate this to the classical doubling constants , we first make an easy observation:
Again, this is sharp; consider equal to
where
generate a free group.
Proof: Suppose that is an element of
for some
. Then the sets
and
have cardinality
and lie in
, so by the inclusion-exclusion principle, the two sets intersect. Thus there exist
such that
, thus
. This shows that
is contained in
. The converse inclusion is proven similarly.
Proposition 3 If
, then
is a finite group of order
, and
for some
in the normaliser of
.
The factor is sharp, by the same example used to show sharpness of Proposition 1. However, there seems to be some room for further improvement if one weakens the conclusion a bit; see below the fold.
Proof: Let (the two sets being equal by Lemma 2). By the argument used to prove Lemma 2, every element of
has more than
representations of the form
for
. By the argument used to prove Proposition 1, this shows that
is a group; also, since there are only
pairs
, we also see that
.
Pick any ; then
, and so
. Because every element of
has
representations of the form
with
,
, and
occupies more than half of
and of
, we conclude that each element of
lies in
, and so
and
.
The intersection of the groups and
contains
, which is more than half the size of
, and so we must have
, i.e.
normalises
, and the proposition follows.
Because the arguments here are so elementary, they extend easily to the infinitary setting in which is now an infinite set, but has finite measure with respect to some translation-invariant Kiesler measure
. We omit the details. (I am hoping that this observation may help simplify some of the theory in that setting.)
One of my favorite open problems, which I have blogged about in the past, is that of establishing (or even correctly formulating) a non-commutative analogue of Freiman’s theorem. Roughly speaking, the question is this: given a finite set in a non-commutative group
which is of small doubling in the sense that the product set
is not much larger than
(e.g.
for some
), what does this say about the structure of
? (For various technical reasons one may wish to replace small doubling by, say, small tripling (i.e.
), and one may also wish to assume that
contains the identity and is symmetric,
, but these are relatively minor details.)
Sets of small doubling (or tripling), etc. can be thought of as “approximate groups”, since groups themselves have a doubling constant equal to one. Another obvious example of an approximate group is that of an arithmetic progression in an additive group, and more generally of a ball (in the word metric) in a nilpotent group of bounded rank and step. It is tentatively conjectured that in fact all examples can somehow be “generated” out of these basic examples, although it is not fully clear at present what “generated” should mean.
A weaker conjecture along the same lines is that if is a set of small doubling, then there should be some sort of “pseudo-metric”
on
which is left-invariant, and for which
is controlled (in some suitable sense) by the unit ball in this metric. (For instance, if
was a subgroup of
, one would take the metric which identified all the left cosets of
to a point, but was otherwise a discrete metric; if
were a ball in a nilpotent group, one would use some rescaled version of the word metric, and so forth.) Actually for technical reasons one would like to work with a slightly weaker notion than a pseudo-metric, namely a Bourgain system, but let us again ignore this technicality here.
Recently, using some powerful tools from model theory combined with the theory of topological groups, Ehud Hrushovski has apparently achieved some breakthroughs on this problem, obtaining new structural control on sets of small doubling in arbitrary groups that was not previously accessible to the known combinatorial methods. The precise results are technical to state, but here are informal versions of two typical theorems. The first applies to sets of small tripling in an arbitrary group:
Theorem 1 (Rough version of Hrushovski Theorem 1.1) Let
be a set of small tripling, then one can find a long sequence of nested symmetric sets
, all of size comparable to
and contained in
, which are somewhat closed under multiplication in the sense that
for all
, and which are fairly well closed under commutation in the sense that
. (There are also some additional statements to the effect that the
efficiently cover each other, and also cover
, but I will omit those here.)
This nested sequence is somewhat analogous to a Bourgain system, though it is not quite the same notion.
If one assumes that is “perfect” in a certain sense, which roughly means that there is no non-trivial abelian quotient, then one can do significantly better:
Theorem 2 (Rough version of Hrushovski Corollary 1.2) Let
be a set of small tripling, let
, and suppose that for almost all
-tuples
(where
), the conjugacy classes
generate most of
in the sense that
. Then a large part of
is contained in a subgroup of size comparable to
.
Note that if one quotiented out by the commutator , then all of the conjugacy classes
would collapse to points. So the hypothesis here is basically a strong quantitative assertion to the effect that the commutator
is extremely large, and rapidly fills out most of
itself.
Here at UCLA, a group of logicians and I (consisting of Matthias Aschenbrenner, Isaac Goldbring, Greg Hjorth, Henry Towsner, Anush Tserunyan, and possibly others) have just started a weekly reading seminar to come to grips with the various combinatorial, logical, and group-theoretic notions in Hrushovski’s paper, of which we only have a partial understanding at present. The seminar is a physical one, rather than an online one, but I am going to try to put some notes on the seminar on this blog as it progresses, as I know that there are a couple of other mathematicians who are interested in these developments.
So far there have been two meetings of the seminar. In the first, I surveyed the state of knowledge of the noncommutative Freiman theorem, covering broadly the material in my previous blog post. In the second meeting, Isaac reviewed some key notions of model theory used in Hrushovski’s paper, in particular the notions of definability and type, which I will review below. It is not yet clear how these are going to be connected with the combinatorial side of things, but this is something which we will hopefully develop in future seminars. The near-term objective is to understand the statement of the main theorem on the model-theoretic side (Theorem 3.4 of Hrushovski), and then understand some of its easier combinatorial consequences, before going back and trying to understand the proof of that theorem.
[Update, Oct 19: Given the level of interest in this paper, readers are encouraged to discuss any aspect of that paper in the comments below, even if they are not currently being covered by the UCLA seminar.]
It turns out to be a favourable week or two for me to finally finish a number of papers that had been at a nearly completed stage for a while. I have just uploaded to the arXiv my article “Sumset and inverse sumset theorems for Shannon entropy“, submitted to Combinatorics, Probability, and Computing. This paper evolved from a “deleted scene” in my book with Van Vu entitled “Entropy sumset estimates“. In those notes, we developed analogues of the standard Plünnecke-Ruzsa sumset estimates (which relate quantities such as the cardinalities of the sum and difference sets of two finite sets
in an additive group
to each other), to the entropy setting, in which the finite sets
are replaced instead with discrete random variables
taking values in that group G, and the (logarithm of the) cardinality |A| is replaced with the Shannon entropy
This quantity measures the information content of X; for instance, if , then it will take k bits on the average to store the value of X (thus a string of n independent copies of X will require about nk bits of storage in the asymptotic limit
). The relationship between entropy and cardinality is that if X is the uniform distribution on a finite non-empty set A, then
. If instead X is non-uniformly distributed on A, one has
, thanks to Jensen’s inequality.
It turns out that many estimates on sumsets have entropy analogues, which resemble the “logarithm” of the sumset estimates. For instance, the trivial bounds
have the entropy analogue
whenever X, Y are independent discrete random variables in an additive group; this is not difficult to deduce from standard entropy inequalities. Slightly more non-trivially, the sum set estimate
established by Ruzsa, has an entropy analogue
,
and similarly for a number of other standard sumset inequalities in the literature (e.g. the Rusza triangle inequality, the Plünnecke-Rusza inequality, and the Balog-Szemeredi-Gowers theorem, though the entropy analogue of the latter requires a little bit of care to state). These inequalities can actually be deduced fairly easily from elementary arithmetic identities, together with standard entropy inequalities, most notably the submodularity inequality
whenever X,Y,Z,W are discrete random variables such that X and Y each determine W separately (thus for some deterministic functions f, g) and X and Y determine Z jointly (thus
for some deterministic function f). For instance, if X,Y,Z are independent discrete random variables in an additive group G, then
and
each determine
separately, and determine
jointly, leading to the inequality
which soon leads to the entropy Rusza triangle inequality
which is an analogue of the combinatorial Ruzsa triangle inequality
All of this was already in the unpublished notes with Van, though I include it in this paper in order to place it in the literature. The main novelty of the paper, though, is to consider the entropy analogue of Freiman’s theorem, which classifies those sets A for which . Here, the analogous problem is to classify the random variables
such that
, where
are independent copies of X. Let us say that X has small doubling if this is the case.
For instance, the uniform distribution U on a finite subgroup H of G has small doubling (in fact in this case). In a similar spirit, the uniform distribution on a (generalised) arithmetic progression P also has small doubling, as does the uniform distribution on a coset progression H+P. Also, if X has small doubling, and Y has bounded entropy, then X+Y also has small doubling, even if Y and X are not independent. The main theorem is that these are the only cases:
Theorem 1. (Informal statement) X has small doubling if and only if
for some uniform distribution U on a coset progression (of bounded rank), and Y has bounded entropy.
For instance, suppose that X was the uniform distribution on a dense subset A of a finite group G. Then Theorem 1 asserts that X is close in a “transport metric” sense to the uniform distribution U on G, in the sense that it is possible to rearrange or transport the probability distribution of X to the probability distribution of U (or vice versa) by shifting each component of the mass of X by an amount Y which has bounded entropy (which basically means that it primarily ranges inside a set of bounded cardinality). The way one shows this is by randomly translating the mass of X around by a few random shifts to approximately uniformise the distribution, and then deal with the residual fluctuation in the distribution by hand. Theorem 1 as a whole is established by using the Freiman theorem in the combinatorial setting combined with various elementary convexity and entropy inequality arguments to reduce matters to the above model case when X is supported inside a finite group G and has near-maximal entropy.
I also show a variant of the above statement: if X, Y are independent and , then we have
(i.e. X has the same distribution as Y+Z for some Z of bounded entropy (not necessarily independent of X or Y). Thus if two random variables are additively related to each other, then they can be additively transported to each other by using a bounded amount of entropy.
In the last part of the paper I relate these discrete entropies to their continuous counterparts
where X is now a continuous random variable on the real line with density function . There are a number of sum set inequalities known in this setting, for instance
,
for independent copies of a finite entropy random variable X, with equality if and only if X is a Gaussian. Using this inequality and Theorem 1, I show a discrete version, namely that
,
whenever and
are independent copies of a random variable in
(or any other torsion-free abelian group) whose entropy is sufficiently large depending on
. This is somewhat analogous to the classical sumset inequality
though notice that we have a gain of just rather than
here, the point being that there is a Gaussian counterexample in the entropy setting which does not have a combinatorial analogue (except perhaps in the high-dimensional limit). The main idea is to use Theorem 1 to trap most of X inside a coset progression, at which point one can use Fourier-analytic additive combinatorial tools to show that the distribution
is “smooth” in some non-trivial direction r, which can then be used to approximate the discrete distribution by a continuous one.
I also conjecture more generally that the entropy monotonicity inequalities established by Artstein, Barthe, Ball, and Naor in the continuous case also hold in the above sense in the discrete case, though my method of proof breaks down because I no longer can assume small doubling.
I’m continuing the stream of uploaded papers this week with my paper “Freiman’s theorem for solvable groups“, submitted to Contrib. Disc. Math.. This paper concerns the problem (discussed in this earlier blog post) of determining the correct analogue of Freiman’s theorem in a general non-abelian group . Specifically, if
is a finite set that obeys the doubling condition
for some bounded K, what does this tell us about A? Heuristically, we expect A to behave like a finite subgroup of G (or perhaps a coset of such a subgroup).
When G is the integers (with the additive group operation), Freiman’s theorem then tells us that A is controlled by a generalised arithmetic progression P, where I say that one set A is controlled by another P if they have comparable size, and the former can be covered by a finite number of translates of the latter. (One can view generalised arithmetic progressions as an approximate version of a subgroup, in which one only uses the generators of the progression for a finite amount of time before stopping, as opposed to groups which allow words of unbounded length in the generators.) For more general abelian groups, the Freiman theorem of Green and Ruzsa tells us that a set of bounded doubling is controlled by a generalised coset progression , i.e. the sum of a generalised arithmetic progression P and a finite subgroup H of G. (Of course, if G is torsion-free, the finite subgroup H must be trivial.)
In this paper we address the case when G is a solvable group of bounded derived length. The main result is that if a subset of G has small doubing, then it is controlled by an object which I call a “coset nilprogression”, which is a certain technical generalisation of a coset progression, in which the generators do not quite commute, but have commutator expressible in terms of “higher order” generators. This is essentially a sharp characterisation of such sets, except for the fact that one would like a more explicit description of these coset nilprogressions. In the torsion-free case, a more explicit description (analogous to the Mal’cev basis description of nilpotent groups) has appeared in a very recent paper of Breulliard and Green; in the case of monomial groups (a class of groups that overlaps to a large extent with solvable groups), and assuming a polynomial growth condition rather than a doubling condition, a related result controlling A by balls in a suitable type of metric has appeared in very recent work of Sanders. In the nilpotent case there is also a nice recent argument of Fisher, Peng, and Katz which shows that sets of small doubling remain of small doubling with respect to the Lie algebra operations of addition and Lie bracket, and thus are amenable to the abelian Freiman theorems.
The conclusion of my paper is easiest to state (and easiest to prove) in the model case of the lamplighter group , where
is the additive group of doubly infinite sequences in the finite field
with only finitely many non-zero entries, and
acts on this space by translations. This is a solvable group of derived length two. The main result here is
Theorem 1. (Freiman’s theorem for the lamplighter group) If
has bounded doubling, then A is controlled either by a finite subspace of the “vertical” group
, or else by a set of the form
, where
is a generalised arithmetic progression, and
obeys the Freiman isomorphism property
whenever
and
.
This result, incidentally, recovers an earlier result of Lindenstrauss that the lamplighter group does not contain a Følner sequence of sets of uniformly bounded doubling. It is a good exercise to establish the “exact” version of this theorem, in which one classifies subgroups of the lamplighter group rather than sets of small doubling; indeed, the proof of this the above theorem follows fairly closely the natural proof of the exact version.
One application of the solvable Freiman theorem is the following quantitative version of a classical result of Milnor and of Wolf, which asserts that any solvable group of polynomial growth is virtually nilpotent:
Theorem 2. (Quantitative Milnor-Wolf theorem) Let G be a solvable group of derived length O(1), let S be a set of generators for G, and suppose one has the polynomial growth condition
for some d = O(1), where
is the set of all words generated by S of length at most R. If R is sufficiently large, then this implies that G is virtually nilpotent; more precisely, G contains a nilpotent subgroup of step O(1) and index
.
The key points here are that one only needs polynomial growth at a single scale R, rather than on many scales, and that the index of the nilpotent subgroup has polynomial size.
The proofs are based on an induction on the derived length. After some standard manipulations (basically, splitting A by an approximate version of a short exact sequence), the problem boils down to that of understanding the action of some finite set A on a set E in an additive group. If one assumes that E has small doubling and that the action of A leaves E approximately invariant, then one can show that E is a coset progression, and the action of A can be described efficiently using the generators of that progression (after refining the set A a bit).
In the course of the proof we need two new additive combinatorial results which may be of independent interest. The first is a variant of a well-known theorem of Sárközy, which asserts that if A is a large subset of an arithmetic progression P, then an iterated sumset kA of A for some itself contains a long arithmetic progression. Here, we need the related fact that if A is a large subset of a coset progression, then an iterated subset kA for
contains a large coset progression Q, and furthermore this inclusion is “robust” in the sense that all elements the elements of Q have a large number of representations as sums of elements of A. We also need a new (non-commutative) variant of the Balog-Szemerédi(-Gowers) lemma, which asserts that if A has small doubling, then A (or more precisely
) contains a large “core” subset D such that almost all of a large iterated subset kD of D still lies inside
). (This may not look like the usual Balog-Szemerédi lemma, but the proof of the lemma is almost identical to the original proof of Balog and Szemerédi, in particular relying on the Szemerédi regularity lemma.
For a number of reasons, including the start of the summer break for me and my coauthors, a number of papers that we have been working on are being released this week. For instance, Ben Green and I have just uploaded to the arXiv our paper “An equivalence between inverse sumset theorems and inverse conjectures for the norm“, submitted to Math. Proc. Camb. Phil. Soc.. The main result of this short paper (which was briefly announced in this earlier post) is a connection between two types of inverse theorems in additive combinatorics, namely the inverse sumset theorems of Freiman type, and inverse theorems for the Gowers uniformity norm, and more specifically, for the
norm
on finite additive group G, where is a complex-valued function.
As usual, the connection is easiest to state in a finite field model such as . In this case, we have the following inverse sumset theorem of Ruzsa:
Theorem 1. If
is such that
, then A can be covered by a translate of a subspace of
of cardinality at most
.
The constant has been improved for large
in a sequence of papers, from
by Ruzsa,
by Green-Ruzsa,
by Sanders,
by Green and myself, and finally
by Konyagin (private communication) which is sharp except for the precise value of the O() implied constant (as can be seen by considering the example when A consists of about 2K independent elements). However, it is conjectured that the polynomial loss can be removed entirely if one modifies the conclusion slightly:
Conjecture 1. (Polynomial Freiman-Ruzsa conjecture for
.) If
is such that
, then A can be covered by
translates of subspaces of
of cardinality at most |A|.
This conjecture was verified for downsets by Green and myself, but is open in general. This conjecture has a number of equivalent formulations; see this paper of Green for more discussion. In this previous post we show that a stronger version of this conjecture fails.
Meanwhile, for the Gowers norm, we have the following inverse theorem, due to Samorodnitsky:
Theorem 2. Let
be a function whose
norm is at least 1/K. Then there exists a quadratic polynomial
such that
.
Note that the quadratic phases are the only functions taking values in [-1,1] whose
norm attains its maximal value of 1.
It is conjectured that the exponentially weak correlation here can be strengthened to a polynomial one:
Conjecture 2. (Polynomial inverse conjecture for the
norm). Let
be a function whose
norm is at least 1/K. Then there exists a quadratic polynomial
such that
.
The first main result of this paper is
Theorem 3. Conjecture 1 and Conjecture 2 are equivalent.
This result was also independently observed by Shachar Lovett (private communication). We also establish an analogous result for the cyclic group , in which the notion of polynomial is replaced by that of a subexponential
, and in which the notion of a quadratic polynomial is replaced by a 2-step nilsequence; the precise statement is a bit technical and will not be given here. We also observe a partial partial analogue of the correpsondence between inverse sumset theorems and Gowers norms in the higher order case, in particular observing that
inverse theorems imply a certain rigidity result for “Freiman-quadratic polynomials” (a quadratic version of Conjecture 3 below).
Below the fold, we sketch the proof of Theorem 3.
[This post is authored by Ben Green, who has kindly “guest blogged” this week’s “open problem of the week”. – T.]
In an earlier blog post Terry discussed Freiman’s theorem. The name of Freiman is attached to a growing body of theorems which take some rather “combinatorial” hypothesis, such that the sumset |A+A| of some set A is small, and deduce from it rather “algebraic” information (such that A is contained in a subspace or a grid).
The easiest place to talk about Freiman’s theorem is in the finite field model (see my survey article on this subject for a full discussion). Here it was shown by Ruzsa that if |A+A| is at most
then A is contained in a subspace of size no more than about
. The exponent has been improved a few times since Ruzsa’s paper, the best result currently in print being due to Sanders, who obtains an upper bound of
. Terry and I are in the process of writing a paper which obtains
, which is best possible in view of the example
where
; this set has doubling roughly K but is not contained in a subspace of dimension smaller than 2K.
This result has an air of finality (except for the true nature of the o(K) term, which represents an interesting open problem). This is something of an illusion, however. Even using this theorem, one loses an exponential every time one tries to transition between “combinatorial” structure and “algebraic” structure and back again. Indeed if one knows that A is contained in a subspace of size then the strongest assertion one can make about the doubling of A is that it is at most
.
The Polynomial Freiman-Ruzsa conjecture (PFR), in , hypothesises a more precise structure theorem for sets with small doubling. Using this
conjecture, one may flit back and forth between combinatorial and algebraic structure with only polynomial losses. Ruzsa attributes the conjecture to
Marton: it states that if A has doubling at most K then A is contained in the union of translates of some subspace H of size at most |A|.
This is another one of my favourite open problems, falling under the heading of inverse theorems in arithmetic combinatorics. “Direct” theorems in arithmetic combinatorics take a finite set A in a group or ring and study things like the size of its sum set or product set
. For example, a typical result in this area is the sum-product theorem, which asserts that whenever
is a subset of a finite field of prime order with
, then
for some . (This particular theorem was first proven here, with an earlier partial result here; more recent and elementary proofs with civilised bounds can be found here, here or here. It has a number of applications.)
In contrast, inverse theorems in this subject start with a hypothesis that, say, the sum set A+A of an unknown set A is small, and try to deduce structural information about A. A typical goal is to completely classify all sets A for which A+A has comparable size with A. In the case of finite subsets of integers, this is Freiman’s theorem, which roughly speaking asserts that if , if and only if A is a dense subset of a generalised arithmetic progression P of rank O(1), where we say that A is a dense subset of B if
and
. (The “if and only if” has to be interpreted properly; in either the “if” or the “only if” direction, the implicit constants in the conclusion depends on the implicit constants in the hypothesis, but these dependencies are not inverses of each other.) In the case of finite subsets A of an arbitrary abelian group, we have the Freiman-Green-Ruzsa theorem, which asserts that
if and only if A is a dense subset of a sum P+H of a finite subgroup H and a generalised arithmetic progression P of rank O(1).
Recent Comments