You are currently browsing the monthly archive for January 2011.
Last week I gave a talk at the Trinity Mathematical Society at Trinity College, Cambridge UK. As the audience was primarily undergraduate, I gave a fairly non-technical talk on the universality phenomenon, based on this blog article of mine on the same topic. It was a quite light and informal affair, and this is reflected in the talk slides (which, in particular, play up quite strongly the role of former students and Fellows of Trinity College in this story). There was some interest in making these slides available publicly, so I have placed them on this site here. (Note: copyright for the images in these slides has not been secured.)
As I have done in the last three years, I am spending some time at the beginning of this year converting some of my posts on this blog into book format. This time round, the situation is a bit different because the majority of mathematical posts last year came from three courses I have taught: random matrices, higher-order Fourier analysis, and measure theory. These topics are sufficiently unrelated to each other, and to the other mathematical posts from 2010, that I am thinking of having as many as four distinct books this time around, though my plans are not yet definite in this regard.
In any event, I have started the process by converting the measure theory notes to book form, a draft copy of which is now available here. I have also started up a stub of a book page for this text, though it has little content at present beyond that link. I will be continuing to work on it in parallel with the rest of the conversion process. As always, any comments and corrections are very welcome.
Let be a compact group. (Throughout this post, all topological groups are assumed to be Hausdorff.) Then
has a number of unitary representations, i.e. continuous homomorphisms
to the group
of unitary operators on a Hilbert space
, equipped with the strong operator topology. In particular, one has the left-regular representation
, where we equip
with its normalised Haar measure
(and the Borel
-algebra) to form the Hilbert space
, and
is the translation operation
We call two unitary representations and
isomorphic if one has
for some unitary transformation
, in which case we write
.
Given two unitary representations and
, one can form their direct sum
in the obvious manner:
. Conversely, if a unitary representation
has a closed invariant subspace
of
(thus
for all
), then the orthogonal complement
is also invariant, leading to a decomposition
of
into the subrepresentations
,
. Accordingly, we will call a unitary representation
irreducible if
is nontrivial (i.e.
) and there are no nontrivial invariant subspaces (i.e. no invariant subspaces other than
and
); the irreducible representations play a role in the subject analogous to those of prime numbers in multiplicative number theory. By the principle of infinite descent, every finite-dimensional unitary representation is then expressible (perhaps non-uniquely) as the direct sum of irreducible representations.
The Peter-Weyl theorem asserts, among other things, that the same claim is true for the regular representation:
Theorem 1 (Peter-Weyl theorem) Let
be a compact group. Then the regular representation
is isomorphic to the direct sum of irreducible representations. In fact, one has
, where
is an enumeration of the irreducible finite-dimensional unitary representations
of
(up to isomorphism). (It is not difficult to see that such an enumeration exists.)
In the case when is abelian, the Peter-Weyl theorem is a consequence of the Plancherel theorem; in that case, the irreducible representations are all one dimensional, and are thus indexed by the space
of characters
(i.e. continuous homomorphisms into the unit circle
), known as the Pontryagin dual of
. (See for instance my lecture notes on the Fourier transform.) Conversely, the Peter-Weyl theorem can be used to deduce the Plancherel theorem for compact groups, as well as other basic results in Fourier analysis on these groups, such as the Fourier inversion formula.
Because the regular representation is faithful (i.e. injective), a corollary of the Peter-Weyl theorem (and a classical theorem of Cartan) is that every compact group can be expressed as the inverse limit of Lie groups, leading to a solution to Hilbert’s fifth problem in the compact case. Furthermore, the compact case is then an important building block in the more general theory surrounding Hilbert’s fifth problem, and in particular a result of Yamabe that any locally compact group contains an open subgroup that is the inverse limit of Lie groups.
I’ve recently become interested in the theory around Hilbert’s fifth problem, due to the existence of a correspondence principle between locally compact groups and approximate groups, which play a fundamental role in arithmetic combinatorics. I hope to elaborate upon this correspondence in a subsequent post, but I will mention that versions of this principle play a crucial role in Gromov’s proof of his theorem on groups of polynomial growth (discussed previously on this blog), and in a more recent paper of Hrushovski on approximate groups (also discussed previously). It is also analogous in many ways to the more well-known Furstenberg correspondence principle between ergodic theory and combinatorics (also discussed previously).
Because of the above motivation, I have decided to write some notes on how the Peter-Weyl theorem is proven. This is utterly standard stuff in abstract harmonic analysis; these notes are primarily for my own benefit, but perhaps they may be of interest to some readers also.
Emmanuel Breuillard, Ben Green, and I have just uploaded to the arXiv our paper “A note on approximate subgroups of and uniformly nonamenable groups“. In this short note, we obtain a new proof of a “noncommutative Freiman” type theorem in linear groups
. As discussed in earlier blog posts, a general question in additive (or multiplicative) combinatorics is to understand the structure of approximate groups – subsets
of genuine groups
which are a symmetric neighbourhood the identity (thus
and
whenever
), and such that the product set
is covered by
left (or right) translates of
for some bounded
. (The case
corresponds to the case of a genuine group.) Most of the focus in multiplicative combinatorics has been on the “discrete” case when
is a finite set, though continuous cases are also of interest (for instance, small balls around the identity in a Lie group are approximate groups).
In the discrete case, examples of approximate groups include:
- Finite groups;
- Balls in a discrete abelian group, or more generally a discrete nilpotent group, with boundedly many generators;
- Extensions of the latter type of balls by finite groups;
- Approximate groups
that are controlled by one of the previous examples
, in the sense that
has comparable cardinality to
, and can be covered by boundedly many translates of
.
It was conjectured independently by Helfgott and Lindenstrauss (private communication) that these are in fact the only examples of finite approximate groups. This conjecture is not yet settled in general (although we, with Tom Sanders, are making progress on this problem that we hope to be able to report on soon). However, many partial results are known. In particular, as part of the recent paper of Hrushovski in which model-theoretic techniques were introduced to study approximate groups, the following result was established:
Theorem 1 If
, then every approximate subgroup of
is controlled by a nilpotent approximate subgroup.
This result can be compared with Jordan’s theorem (discussed earlier on this blog) that every finite subgroup of is virtually abelian (with a uniform bound on the index of the abelian subgroup), or the special case of Gromov’s theorem for linear groups (which follows easily from the Tits alternative and the work of Milnor and of Wolf) that every finitely generated subgroup in
of polynomial growth is virtually nilpotent.
Hrushovski’s proof of the above argument was quite sophisticated; one first transplants the problem using model-theoretic techniques to an infinitary setting, in which the approximate group induces a locally compact topological group structure, which can be played off against the Lie group structure of using the machinery of a paper of Larsen and Pink, as discussed in this previous blog article.
Two further proofs of this theorem were obtained by ourselves, as well as in the most recent version of a similar preprint by Pyber and Szabo. The arguments used here are variants of those used in earlier papers of Helfgott, and are based on establishing expansion of sets that generated Zariski-dense subgroups of various Lie groups (such as ). Again, the machinery of Larsen and Pink (which controls how such approximate subgroups intersect with algebraic subgroups) plays a central role.
In this note we give a new proof of this theorem, based primarily on a different tool, namely the uniform Tits alternative of Breuillard. Recall that the Tits alternative asserts that a finitely generated subgroup of is either virtually solvable, or contains a copy of a free group on two generators. In other words, if
is a finite symmetric neighbourhood of the identity of
, then either
generates a virtually solvable subgroup, or else some power
of
contains two elements
that generate a free group. As stated,
may depend on
. However, the uniform Tits alternative makes the stronger assertion that one can take
to be uniform in
, and depend only on the dimension parameter
.
To use this alternative, we have the following simple observation, that asserts that multiplication by two elements that generate a free group forces a small amount of expansion:
Lemma 2 Let
be finite sets, such that
is symmetric and contains two elements
that generate a free group
. Then
.
We remark that this lemma immediately establishes the classical fact that any group that contains a copy of is not amenable, an observation initially made by von Neumann.
Proof: By foliating into cosets of
and translating, we may assume without loss of generality that
. Observe that for every element
in
, at least three of the four elements
has a longer word length than
, while lying in
. Furthermore, all such elements generated in this fashion are distinct (as one can recover the initial word
from the longer word by truncation). The claim follows.
This can be combined with a lemma of Sanders (also independently established by Croot and Sisask), that asserts that for any approximate group , and any
, one can find a smaller version
of
– also a symmetric neighbourhood of the identity – with the property that
, while
remains of comparable size to
. (One should think of
as being like a ball of some radius
, in which case
is analogous to a ball of radius
). In particular,
still has size comparable to
. Inspecting the size of the sets
, we conclude (if
is large enough) from the above lemma that
cannot contain two elements that generate a free group. Indeed, a slight modification of this argument shows that for any
, if we take
sufficiently large depending on
, that
does not contain two elements that generate a free group. Applying the uniform Tits alternative, this shows that
generates a virtually solvable subgroup of
. From the known product theory for such groups (due to Breuillard and Green),
(and hence
) is therefore controlled by a virtually nilpotent group, as desired.
Tamar Ziegler and I have just uploaded to the arXiv our paper “The inverse conjecture for the Gowers norm over finite fields in low characteristic“, submitted to Annals of Combinatorics. This paper completes another case of the inverse conjecture for the Gowers norm, this time for vector spaces over a fixed finite field
of prime order; with Vitaly Bergelson, we had previously established this claim when the characteristic of the field was large, so the main new result here is the extension to the low characteristic case. (The case of a cyclic group
or interval
was established by Ben Green and ourselves in another recent paper. For an arbitrary abelian (or nilpotent) group, a general but less explicit description of the obstructions to Gowers uniformity was recently obtained by Szegedy; the latter result recovers the high-characteristic case of our result (as was done in a subsequent paper of Szegedy), as well as our results with Green, but it is not immediately evident whether Szegedy’s description of the obstructions matches up with the one predicted by the inverse conjecture in low characteristic.)
The statement of the main theorem is as follows. Given a finite-dimensional vector space and a function
, and an integer
, one can define the Gowers uniformity norm
by the formula
where . If
is bounded in magnitude by
, it is easy to see that
is bounded by
also, with equality if and only if
for some non-classical polynomial
of degree at most
, where
, and a non-classical polynomial of degree at most
is a function whose
“derivatives” vanish in the sense that
for all , where
. Our result generalises this to the case when the uniformity norm is not equal to
, but is still bounded away from zero:
Theorem 1 (Inverse conjecture) Let
be bounded by
with
for some
. Then there exists a non-classical polynomial
of degree at most
such that
, where
is a positive quantity depending only on the indicated parameters.
This theorem is trivial for , and follows easily from Fourier analysis for
. The case
was done in odd characteristic by Ben Green and myself, and in even characteristic by Samorodnitsky. In two papers, one with Vitaly Bergelson, we established this theorem in the “high characteristic” case when the characteristic
of
was greater than
(in which case there is essentially no distinction between non-classical polynomials and their classical counterparts, as discussed previously on this blog). The need to deal with genuinely non-classical polynomials is the main new difficulty in this paper that was not dealt with in previous literature.
In our previous paper with Bergelson, a “weak” version of the above theorem was proven, in which the polynomial in the conclusion had bounded degree
, rather than being of degree at most
. In the current paper, we use this weak inverse theorem to reduce the inverse conjecture to a statement purely about polynomials:
Theorem 2 (Inverse conjecture for polynomials) Let
, and let
be a non-classical polynomial of degree at most
such that
. Then
has bounded rank in the sense that
is a function of
polynomials of degree at most
.
This type of inverse theorem was first introduced by Bogdanov and Viola. The deduction of Theorem 1 from Theorem 2 and the weak inverse Gowers conjecture is fairly standard, so the main difficulty is to show Theorem 2.
The quantity of a polynomial
of degree at most
was denoted the analytic rank of
by Gowers and Wolf. They observed that the analytic rank of
was closely related to the rank of
, defined as the least number of degree
polynomials needed to express
. For instance, in the quadratic case
the two ranks are identical (in odd characteristic, at least). For general
, it was easy to see that bounded rank implied bounded analytic rank; Theorem 2 is the converse statement.
We tried a number of ways to show that bounded analytic rank implied bounded rank, in particular spending a lot of time on ergodic-theoretic approaches, but eventually we settled on a “brute force” approach that relies on classifying those polynomials of bounded analytic rank as precisely as possible. The argument splits up into establishing three separate facts:
- (Classical case) If a classical polynomial has bounded analytic rank, then it has bounded rank.
- (Multiplication by
) If a non-classical polynomial
(of degree at most
) has bounded analytic rank, then
(which can be shown to have degree at most
) also has bounded analytic rank.
- (Division by
) If
is a non-clsasical polynomial of degree
of bounded rank, then there is a non-classical polynomial
of degree at most
of bounded rank such that
.
The multiplication by and division by
facts allow one to easily extend the classical case of the theorem to the non-classical case of the theorem, basically because classical polynomials are the kernel of the multiplication-by-
homomorphism. Indeed, if
is a non-classical polynomial of bounded analytic rank of the right degree, then the multiplication by
claim tells us that
also has bounded analytic rank, which by an induction hypothesis implies that
has bounded rank. Applying the division by
claim, we find a bounded rank polynomial
such that
, thus
differs from
by a classical polynomial, which necessarily has bounded analytic rank, hence bounded rank by the classical claim, and the claim follows.
Of the three claims, the multiplication-by- claim is the easiest to prove using known results; after a bit of Fourier analysis, it turns out to follow more or less immediately from the multidimensional Szemerédi theorem over finite fields of Bergelson, Leibman, and McCutcheon (one can also use the density Hales-Jewett theorem here if one desires).
The next easiest claim is the classical case. Here, the idea is to analyse a degree classical polynomial
via its derivative
, defined by the formula
for any (the RHS is independent of
as
has degree
). This is a multilinear form, and if
has bounded analytic rank, this form is biased (in the sense that the mean of
is large). Applying a general equidistribution theorem of Kaufman and Lovett (based on this earlier paper of Green and myself) this implies that
is a function of a bounded number of multilinear forms of lower degree. Using some “regularity lemma” theory to clean up these forms so that they have good equidistribution properties, it is possible to understand exactly how the original multilinear form
depends on these lower degree forms; indeed, the description one eventually obtains is so explicit that one can write down by inspection another bounded rank polynomial
such that
is equal to
. Thus
differs from the bounded rank polynomial
by a lower degree error, which is automatically of bounded rank also, and the claim follows.
The trickiest thing to establish is the division by claim. The polynomial
is some function
of lower degree polynomials
. Ideally, one would like to find a function
of the same polynomials with
, such that
has the correct degree; however, we have counterexamples that show that this is not always possible. (These counterexamples are the main obstruction to making the ergodic theory approach work: in ergodic theory, one is only allowed to work with “measurable” functions, which are roughly analogous in this context to functions of the indicated polynomials
and their shifts.) To get around this we have to first apply a regularity lemma to place
in a suitably equidistributed form (although the fact that
may be non-classical leads to a rather messy and technical description of this equidistribution), and then we have to extend each
to a higher degree polynomial
with
. There is a crucial “exact roots” property of polynomials that allows one to do this, with
having degree exactly
higher than
. It turns out that it is possible to find a function
of these extended polynomials that have the right degree and which solves the required equation
; this is established by classifying completely all functions of the equidistributed polynomials
or
that are of a given degree.
Recent Comments