Emmanuel Breuillard, Ben Green, and I have just uploaded to the arXiv our announcement “Linear approximate groups“, submitted to Electronic Research Announcements.
The main result is a step towards the classification of -approximate groups, in the specific setting of simple and semisimple Lie groups (with some partial results for more general Lie groups). For , define a -approximate group to be a finite subset of a group which is a symmetric neighbourhood of the origin (thus and is equal to ), and such that the product set is covered by left-translates (or equivalently, right-translates) of . For , this is the same concept as a finite subgroup of , but for larger values of , one also gets some interesting objects which are close to, but not exactly groups, such as geometric progressions for some and .
The expectation is that -approximate groups are -controlled by “structured” objects, such as actual groups and progressions, though the precise formulation of this has not yet been finalised. (We say that one finite set -controls another if is at most times larger than in cardinality, and can be covered by at most left translates or right translates of .) The task of stating and proving this statement is the noncommutative Freiman theorem problem, discussed in these earlier blog posts.
While this problem remains unsolved for general groups, significant progress has been made in special groups, notably abelian, nilpotent, and solvable groups. Furthermore, the work of Chang (over ) and Helfgott (over ) has established the important special cases of the special linear groups and :
Theorem 1 (Helfgott’s theorem) Let and let be either or for some prime . Let be a -approximate subgroup of .
- If generates the entire group (which is only possible in the finite case ), then is either controlled by the trivial group or the whole group.
- If , then is -controlled by a solvable -approximate subgroup of , or by itself. If , the latter possibility cannot occur, and must be abelian.
Our main result is an extension of Helfgott’s theorem to for general . In fact, we obtain an analogous result for any simple (or almost simple) Chevalley group over an arbitrary finite field (not necessarily of prime order), or over . (Standard embedding arguments then allow us to in fact handle arbitrary fields.) The results from simple groups can also be extended to (almost) semisimple Lie groups by an approximate version of Goursat’s lemma. Given that general Lie groups are known to split as extensions of (almost) semisimple Lie groups by solvable Lie groups, and Freiman-type theorems are known for solvable groups also, this in principle gives a Freiman-type theorem for arbitrary Lie groups; we have already established this in the characteristic zero case , but there are some technical issues in the finite characteristic case that we are currently in the process of resolving.
We remark that a qualitative version of this result (with the polynomial bounds replaced by an ineffective bound ) was also recently obtained by Hrushovski.
Our arguments are based in part on Helfgott’s arguments, in particular maximal tori play a major role in our arguments for much the same reason they do in Helfgott’s arguments. Our main new ingredient is a surprisingly simple argument, which we call the pivot argument, which is an analogue of a corresponding argument of Konyagin and Bourgain-Glibichuk-Konyagin that was used to prove a sum-product estimate. Indeed, it seems that Helfgott-type results in these groups can be viewed as a manifestation of a product-conjugation phenomenon analogous to the sum-product phenomenon. Namely, the sum-product phenomenon asserts that it is difficult for a subset of a field to be simultaneously approximately closed under sums and products, without being close to an actual field; similarly, the product-conjugation phenomenon asserts that it is difficult for a union of (subsets of) tori to be simultaneously approximately closed under products and conjugations, unless it is coming from a genuine group. In both cases, the key is to exploit a sizeable gap between the behaviour of two types of “pivots” (which are scaling parameters in the sum-product case, and tori in the product-conjugation case): ones which interact strongly with the underlying set , and ones which do not interact at all. The point is that there is no middle ground of pivots which only interact weakly with the set. This separation between interacting (or “involved”) and non-interacting (or “non-involved”) pivots can then be exploited to bootstrap approximate algebraic structure into exact algebraic structure. (Curiously, a similar argument is used all the time in PDE, where it goes under the name of the “bootstrap argument”.)
Below the fold we give more details of this crucial pivot argument.
One piece of trivia about the writing of this paper: this was the first time any of us had used modern version control software to collaboratively write a paper; specifically, we used Subversion, with the repository being hosted online by xp-dev. (See this post at the Secret Blogging Seminar for how to get started with this software.) There were a certain number of technical glitches in getting everything to install and run smoothly, but once it was set up, it was significantly easier to use than our traditional system of emailing draft versions of the paper back and forth, as one could simply download and upload the most recent versions whenever one wished, with all changes merged successfully. I had a positive impression of this software and am likely to try it again in future collaborations, particularly those involving at least three people. (It would also work well for polymath projects, modulo the technical barrier of every participant having to install some software.)
— 1. The pivot argument —
For simplicity let us work in , which is slightly simpler because all semisimple (which, in this linear context, simply means diagonalisable) elements other than are regular, which in the case of linear groups just means that the eigenvalues are distinct. Every regular element of the three-dimensional then generates a one-dimensional maximal torus , which is also the centraliser of (the set of all matrices in that commute with ).
Let be a -approximate group that generates , where we think of as being small, say to simplify the discussion (of course, in the full argument we will need to track the dependence on and keep it polynomial in nature). We may assume that is not too small (more precisely, for some large ). As lives in the three-dimensional group , it is reasonable to expect that the intersection of with a one-dimensional subset, such as a maximal torus, would be of size about . And indeed this is true, as was observed by Helfgott:
Lemma 2 (Intersection with torus) If a maximal torus intersects (say) outside of , then (say) has cardinality .
For more general Lie groups, one can establish a similar upper bound (for more general algebraic varieties than just a torus) by using the Larsen-Pink inequality, which I discussed in this previous blog post. The lower bound is more important to us; it comes from noting that the conjugacy class lies in and also in a two-dimensional subset of , and so should have cardinality .
This lemma gives an important gap property: as soon as a maximal torus encounters just one regular element of , it in fact has to absorb quite a lot of elements of the slightly larger set . It is this gap which we exploit as follows. Let us say that a maximal torus is involved if it intersects outside of .
Lemma 3 (Pivot rotation lemma) If is an involved torus, then is also an involved torus for any .
Proof: (Sketch) We conjugate by a further element , and then multiply it on the left to get a new torus , where . On the one hand, we can think of this torus as of the form , where and . From Lemma 2 we see that there are only values of and , and so there are tori here. On the other hand, there are choices of and . Hence there must be lots of collisions of the form
Taking quotients, we see that
and thus lies in the normaliser of . But this is only twice as large as (the quotient of by is the Weyl group of , which in this two-dimensional case has cardinality .) But because there are so many collisions, it is not hard to use a pigeonhole argument to find a non-trivial pair where lies in the torus itself, and is not equal to . This makes an involved torus as required.
Now suppose that generates all of , then the set of involved tori is then invariant under conjugation by arbitrary elements of . But all maximal tori in are conjugate to each other, and it is not hard to show that any large must intersect at least one maximal torus non-trivially (using something called an escape from subvarieties argument), and so every maximal torus is an involved torus. But then there are such tori; this is only consistent with Lemma 2 if , at which point one is done. A slightly more sophisticated version of this argument also works when does not generate all of .
It is instructive to compare the above argument to the analogous sum-product argument. Let be an approximate subring of a field (let us not define this concept precisely here). We say that a non-zero field element is involved with if the set of sums are not distinct, i.e. . The analogue of Lemma 2 here is that if is involved, then in fact must lie in the quotient set of , just by using a collision to solving for . The analogue of Lemma 3 is then the observation that if and lie in , then and are still sufficiently involved with that one can bound or to be strictly smaller than , so that the sum and product of any two involved elements is again involved; this can be deduced from the so-called Katz-Tao lemma, which roughly asserts that if a set is approximately closed under sums and products, then it (or a large portion thereof) is closed under more complicated polynomial and rational operations as well.
16 comments
Comments feed for this article
27 January, 2010 at 9:34 am
Ben Green
Terry,
I don’t think you’ve quoted Helfgott’s theorem quite correctly in the case d = 3. For example, A could be controlled by a copy of SL_2(F_p). Even over C, there are 2-step nilpotent, nonabelian, examples in SL_3.
Best
Ben
27 January, 2010 at 9:36 am
Ben Green
Terry,
Another comment: I think we *do* get the full Freiman theorem for Lie groups (over R, or over C) but not at this stage for groups of Lie type (such as SL_d(F_p)).
Ben
27 January, 2010 at 11:10 am
Terence Tao
Oops, right. Thanks for the corrections!
27 January, 2010 at 11:20 am
Scott Morrison
Regarding subversion: congratulations on making the jump. I really think this is a good piece of technology for mathematicians.
These days, however, subversion isn’t really considered “modern version control”. The newer generation (mercurial, git, bzr) allowed “distributed version control”, and in particular let you commit changes even while offline. This at first sounds silly — after all your collaborators can’t see those changes until you get online. The nice thing is that it encourages “granular” commits, and in particular I find I write better “commit messages” (clues to others as to what changes you’re making) when I commit early and often. It also ends up being easier to set up a distributed system, as you never have to lock in to a particular hosting setup. If mathematicians want help setting up mercurial for a paper or collaboration, I’m happy to help — email me or follow the link to the secret blogging seminar post above.
27 January, 2010 at 12:07 pm
Laci Pyber
Dear Terence Tao,
looking at arxiv:1001.4556 you will find another research announcement
“Growth in finite simple groups of Lie type” by Endre Szabó and myself.
As the title indicates the two research announcements have a very substantial overlap. It is a rather curious coincidence that these announcements appeared side by side on the arxiv.
Best
Laci Pyber
27 January, 2010 at 1:58 pm
Terence Tao
Yes, this is quite a remarkable coincidence. I’ve seen simultaneous releases of closely related results before (and been involved in some of them personally), but this is the first time I know of that the announcements were released on the same day. (We’ve been announcing these results in talks and in discussion with colleagues over the past few weeks, though.)
It seems that both of our papers rely crucially on the same underlying pivot argument and on the Larsen-Pink type inequality (although the arguments are arranged a bit differently in the fine details). I suppose that in retrospect this is the “natural” way to proceed, especially given how easily these arguments establish the sum-product phenomenon (which used to be quite difficult from a psychological point of view, before the easy proofs were found).
27 January, 2010 at 3:42 pm
Terence Tao
Yes, I’ve already been informed privately by those more technologically up-to-date than I that a late 1990s era piece of software doesn’t count as “modern” nowadays, so I’m retracting that particular term :-)
For the current scale of projects (with 2-3 authors) it seems that the Subversion system works “well enough”, which is often a sufficient standard for these sorts of things. If one were to do a much larger-scale project, such as a polymath writeup, I could imagine that a distributed VCS offer further real advantages. In any case, either system is a definite advance over email-based methods.
27 January, 2010 at 3:52 pm
Ben Green
Terry, a further point: I think you’re using the word “pivot” in a somewhat opposite sense to the way it used in, for example, Helfgott’s paper on SL_3. The world “involved” as used in our announcement seems more descriptive.
27 January, 2010 at 4:03 pm
Terence Tao
Ah, good point. (I was wondering where that term came from, actually.) I’ve reworded the argument accordingly.
28 January, 2010 at 2:54 am
Link Starbureiy
Terry-
An even better alternative might be to use Google Code Project Hosting (code.google.com/p). It also utilizes subversion/mercurial. Depending on how large the commit group is, it can be pretty generous (and versatile) for storage. I’ve been using it for my own projects for about a year now.
28 January, 2010 at 6:07 am
Harald
A few words on “pivots” and the like.
It seems to me that the proof you have sketched for Lemma 3 is a little more complicated than it needs to be. In particular, the use of double-sided cosets aTb is unnecessary, as is the use of the Weyl group. Emmanuel was talking to me about this; if I understood him correctly, he proceeded guided by his reading of Proposition 3.1 in my SL_3 paper (where I used the term “pivot”). The idea there was to get an element xi (the “pivot”) of a group G (acted on by another group \Upsilon) such that the map (g,y)\mapsto g\cdot y(xi) would be injective when restricted to a set A\times Y \subset G\times \Upsilon. One could obtain this pivot as follows: if there is no pivot, then A and Y are very large and the argument is easy; if there is a pivot, there has to be in some sense a last “non-pivot”, which is one step away from being a pivot under the action of A or Y; this non-pivot is shown to be in some sense constructible (or “involved”, to use your term) and thus one can construct a pivot using one more step. (As I wrote back then and as you state above, this argument can be seen as an abstraction of an argument of Konyagin and Glibichuk in the sum-product setting; what I had in SL_3 was simply the first such statement in the setting of groups.) It is clear then why Emmanuel wanted a map (g,g’)\mapsto gTg’.
In fact, this is unnecessary in this setting; one can work entirely with left cosets a T. Here’s a simpler complete proof of your Lemma 3 (which works for every simple group, just like the proof in your paper will work for every simple group).
Proof.- Suppose T’=g T g^(-1) is not involved. Look at the map phi:G\mapsto G/T’ given by g\mapsto g T’. If g and g’ map to the same element, then g^(-1) g’ lies in T’; since T’ is not involved, it must be a non-regular element of T’. Since there are at most <>|A|^{1-(dim(T)-1)/dim(G))}. Pick a set R\subset A of representatives of every preimage phi^{-1}(x), x\in phi(A).
Evidently, |R|=|phi(A)|>>|A|^{1-(dim(T)-1)/dim(G))}.
Now, because T is involved, there is a regular element x in A^(-1) A \cap T(K). Hence g x g^(-1) in A A^{-1} A A^{-1} is regular, and so |A_8\cap T(K)|>>|A|^(dim(T)/dim(G)). Now the map (g,t)->gt restricted to R\times
(A_8\times T(K)) will (a) be injective (and thus have image of size
>> |A|^{1+1/dim(G)}) and (b) have image contained in A_9. Hence |A_9|>>|A|^(1+1/dim(G)).
End of proof.
28 January, 2010 at 6:18 am
Harald
One other thing. My post-doc (Nick Gill) and I wrote a draft some time ago proving that a set A generating SL_n(F_p) grows, provided that |A| << p^(n+1-epsilon). I had talked on the subject already (e.g. at Luminy last November).We were planning to put a full paper out on the arxiv very soon; we will still do so. Hopefully the techniques used therein will prove useful elsewhere.
Congratulations for a beautiful result.
4 February, 2010 at 12:48 pm
Harald
Right before Lemma 2, “would be of size about {|A|^3}.” should be
“would be of size about |A|^{1/3}”. Also, my congratulations were intended for both teams, obviously.
12 February, 2010 at 10:46 pm
An arithmetic regularity lemma, an associated counting lemma, and applications « What’s new
[…] last “production note”. Like our previous paper with Emmanuel Breuillard, we used Subversion to write this paper, which turned out to be a significant efficiency boost as […]
6 May, 2010 at 8:33 am
Suzuki groups as expanders « What’s new
[…] by Pyber-Szábo and can also be obtained by the methods of proof of the results announced by ourselves. (These arguments are in turn based on an earlier result of Helfgott establishing the analogous […]
12 May, 2010 at 1:30 pm
Approximate subgroups of linear subgroups « What’s new
[…] of linear groups“, submitted to GAFA. This paper contains (the first part) of the results announced previously by us; the second part of these results, concerning expander groups, will appear subsequently. The […]