You are currently browsing the monthly archive for June 2011.
Following the results from the recent poll on this blog, the mini-polymath3 project (which will focus on one of the problems from the 2011 IMO) will start at July 19 8pm UTC, and be run concurrently on this blog, on the polymath wiki, and on the polymath blog.
Over the past few months or so, I have been brushing up on my Lie group theory, as part of my project to fully understand the theory surrounding Hilbert’s fifth problem. Every so often, I encounter a basic fact in Lie theory which requires a slightly non-trivial “trick” to prove; I am recording two of them here, so that I can find these tricks again when I need to.
The first fact concerns the exponential map from a Lie algebra
of a Lie group
to that group. (For this discussion we will only consider finite-dimensional Lie groups and Lie algebras over the reals
.) A basic fact in the subject is that the exponential map is locally a homeomorphism: there is a neighbourhood of the origin in
that is mapped homeomorphically by the exponential map to a neighbourhood of the identity in
. This local homeomorphism property is the foundation of an important dictionary between Lie groups and Lie algebras.
It is natural to ask whether the exponential map is globally a homeomorphism, and not just locally: in particular, whether the exponential map remains both injective and surjective. For instance, this is the case for connected, simply connected, nilpotent Lie groups (as can be seen from the Baker-Campbell-Hausdorff formula.)
The circle group , which has
as its Lie algebra, already shows that global injectivity fails for any group that contains a circle subgroup, which is a huge class of examples (including, for instance, the positive dimensional compact Lie groups, or non-simply-connected Lie groups). Surjectivity also obviously fails for disconnected groups, since the Lie algebra is necessarily connected, and so the image under the exponential map must be connected also. However, even for connected Lie groups, surjectivity can fail. To see this, first observe that if the exponential map was surjective, then every group element
has a square root (i.e. an element
with
), since
has
as a square root for any
. However, there exist elements in connected Lie groups without square roots. A simple example is provided by the matrix
in the connected Lie group . This matrix has eigenvalues
,
. Thus, if
is a square root of
, we see (from the Jordan normal form) that it must have at least one eigenvalue in
, and at least one eigenvalue in
. On the other hand, as
has real coefficients, the complex eigenvalues must come in conjugate pairs
. Since
can only have at most
eigenvalues, we obtain a contradiction.
However, there is an important case where surjectivity is recovered:
Proposition 1 If
is a compact connected Lie group, then the exponential map is surjective.
Proof: The idea here is to relate the exponential map in Lie theory to the exponential map in Riemannian geometry. We first observe that every compact Lie group can be given the structure of a Riemannian manifold with a bi-invariant metric. This can be seen in one of two ways. Firstly, one can put an arbitrary positive definite inner product on
and average it against the adjoint action of
using Haar probability measure (which is available since
is compact); this gives an ad-invariant positive-definite inner product on
that one can then translate by either left or right translation to give a bi-invariant Riemannian structure on
. Alternatively, one can use the Peter-Weyl theorem to embed
in a unitary group
, at which point one can induce a bi-invariant metric on
from the one on the space
of
complex matrices.
As is connected and compact and thus complete, we can apply the Hopf-Rinow theorem and conclude that any two points are connected by at least one geodesic, so that the Riemannian exponential map from
to
formed by following geodesics from the origin is surjective. But one can check that the Lie exponential map and Riemannian exponential map agree; for instance, this can be seen by noting that the group structure naturally defines a connection on the tangent bundle which is both torsion-free and preserves the bi-invariant metric, and must therefore agree with the Levi-Civita metric. (Alternatively, one can embed into a unitary group
and observe that
is totally geodesic inside
, because the geodesics in
can be described explicitly in terms of one-parameter subgroups.) The claim follows.
Remark 1 While it is quite nice to see Riemannian geometry come in to prove this proposition, I am curious to know if there is any other proof of surjectivity for compact connected Lie groups that does not require explicit introduction of Riemannian geometry concepts.
The other basic fact I learned recently concerns the algebraic nature of Lie groups and Lie algebras. An important family of examples of Lie groups are the algebraic groups – algebraic varieties with a group law given by algebraic maps. Given that one can always automatically upgrade the smooth structure on a Lie group to analytic structure (by using the Baker-Campbell-Hausdorff formula), it is natural to ask whether one can upgrade the structure further to an algebraic structure. Unfortunately, this is not always the case. A prototypical example of this is given by the one-parameter subgroup
of . This is a Lie group for any exponent
, but if
is irrational, then the curve that
traces out is not an algebraic subset of
(as one can see by playing around with Puiseux series).
This is not a true counterexample to the claim that every Lie group can be given the structure of an algebraic group, because one can give a different algebraic structure than one inherited from the ambient group
. Indeed,
is clearly isomorphic to the additive group
, which is of course an algebraic group. However, a modification of the above construction works:
Proposition 2 There exists a Lie group
that cannot be given the structure of an algebraic group.
Proof: We use an example from the text of Tauvel and Yu (that I found via this MathOverflow posting). We consider the subgroup
of , with
an irrational number. This is a three-dimensional (metabelian) Lie group, whose Lie algebra
is spanned by the elements
with the Lie bracket given by
As such, we see that if we use the basis to identify
to
, then adjoint representation of
is the identity map.
If is an algebraic group, it is easy to see that the adjoint representation
is also algebraic, and so
is algebraic in
. Specialising to our specific example, in which adjoint representation is the identity, we conclude that if
has any algebraic structure, then it must also be an algebraic subgroup of
; but
projects to the group (1) which is not algebraic, a contradiction.
A slight modification of the same argument also shows that not every Lie algebra is algebraic, in the sense that it is isomorphic to a Lie algebra of an algebraic group. (However, there are important classes of Lie algebras that are automatically algebraic, such as nilpotent or semisimple Lie algebras.)
Let be a Lie group with Lie algebra
. As is well known, the exponential map
is a local homeomorphism near the identity. As such, the group law on
can be locally pulled back to an operation
defined on a neighbourhood
of the identity in
, defined as
where is the local inverse of the exponential map. One can view
as the group law expressed in local exponential coordinates around the origin.
An asymptotic expansion for is provided by the Baker-Campbell-Hausdorff (BCH) formula
for all sufficiently small , where
is the Lie bracket. More explicitly, one has the Baker-Campbell-Hausdorff-Dynkin formula
for all sufficiently small , where
,
is the adjoint representation
, and
is the function
which is real analytic near and can thus be applied to linear operators sufficiently close to the identity. One corollary of this is that the multiplication operation
is real analytic in local coordinates, and so every smooth Lie group is in fact a real analytic Lie group.
It turns out that one does not need the full force of the smoothness hypothesis to obtain these conclusions. It is, for instance, a classical result that regularity of the group operations is already enough to obtain the Baker-Campbell-Hausdorff formula. Actually, it turns out that we can weaken this a bit, and show that even
regularity (i.e. that the group operations are continuously differentiable, and the derivatives are locally Lipschitz) is enough to make the classical derivation of the Baker-Campbell-Hausdorff formula work. More precisely, we have
Theorem 1 (
Baker-Campbell-Hausdorff formula) Let
be a finite-dimensional vector space, and suppose one has a continuous operation
defined on a neighbourhood
around the origin, which obeys the following three axioms:
- (Approximate additivity) For
sufficiently close to the origin, one has
(In particular,
for
sufficiently close to the origin.)
- (Associativity) For
sufficiently close to the origin,
.
- (Radial homogeneity) For
sufficiently close to the origin, one has
for all
. (In particular,
for all
sufficiently close to the origin.)
Then
is real analytic (and in particular, smooth) near the origin. (In particular,
gives a neighbourhood of the origin the structure of a local Lie group.)
Indeed, we will recover the Baker-Campbell-Hausdorff-Dynkin formula (after defining appropriately) in this setting; see below the fold.
The reason that we call this a Baker-Campbell-Hausdorff formula is that if the group operation
has
regularity, and has
as an identity element, then Taylor expansion already gives (2), and in exponential coordinates (which, as it turns out, can be defined without much difficulty in the
category) one automatically has (3).
We will record the proof of Theorem 1 below the fold; it largely follows the classical derivation of the BCH formula, but due to the low regularity one will rely on tools such as telescoping series and Riemann sums rather than on the fundamental theorem of calculus. As an application of this theorem, we can give an alternate derivation of one of the components of the solution to Hilbert’s fifth problem, namely the construction of a Lie group structure from a Gleason metric, which was covered in the previous post; we discuss this at the end of this article. With this approach, one can avoid any appeal to von Neumann’s theorem and Cartan’s theorem (discussed in this post), or the Kuranishi-Gleason extension theorem (discussed in this post).
Hilbert’s fifth problem asks to clarify the extent that the assumption on a differentiable or smooth structure is actually needed in the theory of Lie groups and their actions. While this question is not precisely formulated and is thus open to some interpretation, the following result of Gleason and Montgomery-Zippin answers at least one aspect of this question:
Theorem 1 (Hilbert’s fifth problem) Let
be a topological group which is locally Euclidean (i.e. it is a topological manifold). Then
is isomorphic to a Lie group.
Theorem 1 can be viewed as an application of the more general structural theory of locally compact groups. In particular, Theorem 1 can be deduced from the following structural theorem of Gleason and Yamabe:
Theorem 2 (Gleason-Yamabe theorem) Let
be a locally compact group, and let
be an open neighbourhood of the identity in
. Then there exists an open subgroup
of
, and a compact subgroup
of
contained in
, such that
is isomorphic to a Lie group.
The deduction of Theorem 1 from Theorem 2 proceeds using the Brouwer invariance of domain theorem and is discussed in this previous post. In this post, I would like to discuss the proof of Theorem 2. We can split this proof into three parts, by introducing two additional concepts. The first is the property of having no small subgroups:
Definition 3 (NSS) A topological group
is said to have no small subgroups, or is NSS for short, if there is an open neighbourhood
of the identity in
that contains no subgroups of
other than the trivial subgroup
.
An equivalent definition of an NSS group is one which has an open neighbourhood of the identity that every non-identity element
escapes in finite time, in the sense that
for some positive integer
. It is easy to see that all Lie groups are NSS; we shall shortly see that the converse statement (in the locally compact case) is also true, though significantly harder to prove.
Another useful property is that of having what I will call a Gleason metric:
Definition 4 Let
be a topological group. A Gleason metric on
is a left-invariant metric
which generates the topology on
and obeys the following properties for some constant
, writing
for
:
- (Escape property) If
and
is such that
, then
.
- (Commutator estimate) If
are such that
, then
where
is the commutator of
and
.
For instance, the unitary group with the operator norm metric
can easily verified to be a Gleason metric, with the commutator estimate (1) coming from the inequality
Similarly, any left-invariant Riemannian metric on a (connected) Lie group can be verified to be a Gleason metric. From the escape property one easily sees that all groups with Gleason metrics are NSS; again, we shall see that there is a partial converse.
Remark 1 The escape and commutator properties are meant to capture “Euclidean-like” structure of the group. Other metrics, such as Carnot-Carathéodory metrics on Carnot Lie groups such as the Heisenberg group, usually fail one or both of these properties.
The proof of Theorem 2 can then be split into three subtheorems:
Theorem 5 (Reduction to the NSS case) Let
be a locally compact group, and let
be an open neighbourhood of the identity in
. Then there exists an open subgroup
of
, and a compact subgroup
of
contained in
, such that
is NSS, locally compact, and metrisable.
Theorem 6 (Gleason’s lemma) Let
be a locally compact metrisable NSS group. Then
has a Gleason metric.
Theorem 7 (Building a Lie structure) Let
be a locally compact group with a Gleason metric. Then
is isomorphic to a Lie group.
Clearly, by combining Theorem 5, Theorem 6, and Theorem 7 one obtains Theorem 2 (and hence Theorem 1).
Theorem 5 and Theorem 6 proceed by some elementary combinatorial analysis, together with the use of Haar measure (to build convolutions, and thence to build “smooth” bump functions with which to create a metric, in a variant of the analysis used to prove the Birkhoff-Kakutani theorem); Theorem 5 also requires Peter-Weyl theorem (to dispose of certain compact subgroups that arise en route to the reduction to the NSS case), which was discussed previously on this blog.
In this post I would like to detail the final component to the proof of Theorem 2, namely Theorem 7. (I plan to discuss the other two steps, Theorem 5 and Theorem 6, in a separate post.) The strategy is similar to that used to prove von Neumann’s theorem, as discussed in this previous post (and von Neumann’s theorem is also used in the proof), but with the Gleason metric serving as a substitute for the faithful linear representation. Namely, one first gives the space of one-parameter subgroups of
enough of a structure that it can serve as a proxy for the “Lie algebra” of
; specifically, it needs to be a vector space, and the “exponential map” needs to cover an open neighbourhood of the identity. This is enough to set up an “adjoint” representation of
, whose image is a Lie group by von Neumann’s theorem; the kernel is essentially the centre of
, which is abelian and can also be shown to be a Lie group by a similar analysis. To finish the job one needs to use arguments of Kuranishi and of Gleason, as discussed in this previous post.
The arguments here can be phrased either in the standard analysis setting (using sequences, and passing to subsequences often) or in the nonstandard analysis setting (selecting an ultrafilter, and then working with infinitesimals). In my view, the two approaches have roughly the same level of complexity in this case, and I have elected for the standard analysis approach.
Remark 2 From Theorem 7 we see that a Gleason metric structure is a good enough substitute for smooth structure that it can actually be used to reconstruct the entire smooth structure; roughly speaking, the commutator estimate (1) allows for enough “Taylor expansion” of expressions such as
that one can simulate the fundamentals of Lie theory (in particular, construction of the Lie algebra and the exponential map, and its basic properties. The advantage of working with a Gleason metric rather than a smoother structure, though, is that it is relatively undemanding with regards to regularity; in particular, the commutator estimate (1) is roughly comparable to the imposition
structure on the group
, as this is the minimal regularity to get the type of Taylor approximation (with quadratic errors) that would be needed to obtain a bound of the form (1). We will return to this point in a later post.
We recall Brouwer’s famous fixed point theorem:
Theorem 1 (Brouwer fixed point theorem) Let
be a continuous function on the unit ball
in a Euclidean space
. Then
has at least one fixed point, thus there exists
with
.
This theorem has many proofs, most of which revolve (either explicitly or implicitly) around the notion of the degree of a continuous map of the unit sphere
to itself, and more precisely around the stability of degree with respect to homotopy. (Indeed, one can view the Brouwer fixed point theorem as an assertion that some non-trivial degree-like invariant must exist, or more abstractly that the homotopy group
is non-trivial.)
One of the many applications of this result is to prove Brouwer’s invariance of domain theorem:
Theorem 2 (Brouwer invariance of domain theorem) Let
be an open subset of
, and let
be a continuous injective map. Then
is also open.
This theorem in turn has an important corollary:
Corollary 3 (Topological invariance of dimension) If
, and
is a non-empty open subset of
, then there is no continuous injective mapping from
to
. In particular,
and
are not homeomorphic.
This corollary is intuitively obvious, but note that topological intuition is not always rigorous. For instance, it is intuitively plausible that there should be no continuous surjection from to
for
, but such surjections always exist, thanks to variants of the Peano curve construction.
Theorem 2 or Corollary 3 can be proven by simple ad hoc means for small values of or
(for instance, by noting that removing a point from
will disconnect
when
, but not for
), but I do not know of any proof of these results in general dimension that does not require algebraic topology machinery that is at least as sophisticated as the Brouwer fixed point theorem. (Lebesgue, for instance, famously failed to establish the above corollary rigorously, although he did end up discovering the important concept of Lebesgue covering dimension as a result of his efforts.)
Nowadays, the invariance of domain theorem is usually proven using the machinery of singular homology. In this post I would like to record a short proof of Theorem 2 using Theorem 1 that I discovered in a paper of Kulpa, which avoids any use of algebraic topology tools beyond the fixed point theorem, though it is more ad hoc in its approach than the systematic singular homology approach.
Remark 1 A heuristic explanation as to why the Brouwer fixed point theorem is more or less a necessary ingredient in the proof of the invariance of domain theorem is that a counterexample to the former result could conceivably be used to create a counterexample to the latter one. Indeed, if the Brouwer fixed point theorem failed, then (as is well known) one would be able to find a continuous function
that was the identity on
(indeed, one could take
to be the first point in which the ray from
through
hits
). If one then considered the function
defined by
, then this would be a continuous function which avoids the interior of
, but which maps the origin
to a point on the sphere
(and maps
to the dilate
). This could conceivably be a counterexample to Theorem 2, except that
is not necessarily injective. I do not know if there is a more rigorous way to formulate this connection.
The reason I was looking for a proof of the invariance of domain theorem was that it comes up in the very last stage of the solution to Hilbert’s fifth problem, namely to establish the following fact:
Theorem 4 (Hilbert’s fifth problem) Every locally Euclidean group is isomorphic to a Lie group.
Recall that a locally Euclidean group is a topological group which is locally homeomorphic to an open subset of a Euclidean space , i.e. it is a continuous manifold. Note in contrast that a Lie group is a topological group which is locally diffeomorphic to an open subset of
, it is a smooth manifold. Thus, Hilbert’s fifth problem is a manifestation of the “rigidity” of algebraic structure (in this case, group structure), which turns weak regularity (continuity) into strong regularity (smoothness).
It is plausible that something like Corollary 3 would need to be invoked in order to solve Hilbert’s fifth problem. After all, if Euclidean spaces ,
of different dimension were homeomorphic to each other, then the property of being locally Euclidean loses a lot of meaning, and would thus not be a particularly powerful hypothesis. Note also that it is clear that two Lie groups can only be isomorphic if they have the same dimension, so in view of Theorem 4, it becomes plausible that two Euclidean spaces can only be homeomorphic if they have the same dimension, although I do not know of a way to rigorously deduce this claim from Theorem 4.
Interestingly, Corollary 3 is the only place where algebraic topology enters into the solution of Hilbert’s fifth problem (although its cousin, point-set topology, is used all over the place). There are results closely related to Theorem 4, such as the Gleason-Yamabe theorem mentioned in a recent post, which do not use the notion of being locally Euclidean, and do not require algebraic topological methods in their proof. Indeed, one can deduce Theorem 4 from the Gleason-Yamabe theorem and invariance of domain; we sketch a proof of this (following Montgomery and Zippin) below the fold.
In the last two years, I ran a “mini-polymath” project to solve one of the problems of that year’s International Mathematical Olympiad (IMO). This year, the IMO is being held in the Netherlands, with the problems being released on July 18 and 19, and I am planning to once again select a question (most likely the last question Q6, but I’ll exercise my discretion on which problem to select once I see all of them).
The format of the last year’s mini-polymath project seemed to work well, so I am inclined to simply repeat that format without much modification this time around, in order to collect a consistent set of data about these projects. Thus, unles the plan changes, the project will start at a pre-arranged time and date, with plenty of advance notice, and be run simultaneously on three different sites: a “research thread” over at the polymath blog for the problem solving process, a “discussion thread” over at this blog for any meta-discussion about the project, and a wiki page at the polymath wiki to record the progress already made at the research thread. (Incidentally, there is a current discussion at the wiki about the logo for that site; please feel free to chip in your opinion on the various proposed icons.) The project will follow the usual polymath rules (as summarised for instance in the 2010 mini-polymath thread).
There are some kinks with our format that still need to be worked out, unfortunately; the two main ones that keep recurring in previous feedback are (a) there is no way to edit or preview comments without the intervention of one of the blog maintainers, and (b) even with comment threading, it is difficult to keep track of all the multiple discussions going on at once. It is conceivable that we could use a different forum than the WordPress-based blogs we have been using for previous projects for this mini-polymath to experiment with other software that may help ameliorate (a) and (b) (though any alternative site should definitely have the ability to support some sort of TeX, and should be easily accessible by polymath participants, without the need for a cumbersome registration process); if there are any suggestions for such alternatives, I would be happy to hear about them in the comments to this post. (Of course, any other comments germane to the polymath or mini-polymath projects would also be appropriate for the comment thread.)
The other thing to do at this early stage is set up a poll for the start time for the project (and also to gauge interest in participation). For ease of comparison I am going to use the same four-hour time slots as for the 2010 poll. All times are in Coordinated Universal Time (UTC), which is essentially the same as GMT; conversions between UTC and local time zones can for instance be found on this web site. For instance, the Netherlands are at UTC+2, and so July 19 4m UTC (say) would be July 19 6pm in Netherlands local time. (I myself will be at UTC-7.)
In the last few months, I have been working my way through the theory behind the solution to Hilbert’s fifth problem, as I (together with Emmanuel Breuillard, Ben Green, and Tom Sanders) have found this theory to be useful in obtaining noncommutative inverse sumset theorems in arbitrary groups; I hope to be able to report on this connection at some later point on this blog. Among other things, this theory achieves the remarkable feat of creating a smooth Lie group structure out of what is ostensibly a much weaker structure, namely the structure of a locally compact group. The ability of algebraic structure (in this case, group structure) to upgrade weak regularity (in this case, continuous structure) to strong regularity (in this case, smooth and even analytic structure) seems to be a recurring theme in mathematics, and an important part of what I like to call the “dichotomy between structure and randomness”.
The theory of Hilbert’s fifth problem sprawls across many subfields of mathematics: Lie theory, representation theory, group theory, nonabelian Fourier analysis, point-set topology, and even a little bit of group cohomology. The latter aspect of this theory is what I want to focus on today. The general question that comes into play here is the extension problem: given two (topological or Lie) groups and
, what is the structure of the possible groups
that are formed by extending
by
. In other words, given a short exact sequence
to what extent is the structure of determined by that of
and
?
As an example of why understanding the extension problem would help in structural theory, let us consider the task of classifying the structure of a Lie group . Firstly, we factor out the connected component
of the identity as
as Lie groups are locally connected, is discrete. Thus, to understand general Lie groups, it suffices to understand the extensions of discrete groups by connected Lie groups.
Next, to study a connected Lie group , we can consider the conjugation action
on the Lie algebra
, which gives the adjoint representation
. The kernel of this representation consists of all the group elements
that commute with all elements of the Lie algebra, and thus (by connectedness) is the center
of
. The adjoint representation is then faithful on the quotient
. The short exact sequence
then describes as a central extension (by the abelian Lie group
) of
, which is a connected Lie group with a faithful finite-dimensional linear representation.
This suggests a route to Hilbert’s fifth problem, at least in the case of connected groups . Let
be a connected locally compact group that we hope to demonstrate is isomorphic to a Lie group. As discussed in a previous post, we first form the space
of one-parameter subgroups of
(which should, eventually, become the Lie algebra of
). Hopefully,
has the structure of a vector space. The group
acts on
by conjugation; this action should be both continuous and linear, giving an “adjoint representation”
. The kernel of this representation should then be the center
of
. The quotient
is locally compact and has a faithful linear representation, and is thus a Lie group by von Neumann’s version of Cartan’s theorem (discussed in this previous post). The group
is locally compact abelian, and so it should be a relatively easy task to establish that it is also a Lie group. To finish the job, one needs the following result:
Theorem 1 (Central extensions of Lie are Lie) Let
be a locally compact group which is a central extension of a Lie group
by an abelian Lie group
. Then
is also isomorphic to a Lie group.
This result can be obtained by combining a result of Kuranishi with a result of Gleason; I am recording this argument below the fold. The point here is that while is initially only a topological group, the smooth structures of
and
can be combined (after a little bit of cohomology) to create the smooth structure on
required to upgrade
from a topological group to a Lie group. One of the main ideas here is to improve the behaviour of a cocycle by averaging it; this basic trick is helpful elsewhere in the theory, resolving a number of cohomological issues in topological group theory. The result can be generalised to show in fact that arbitrary (topological) extensions of Lie groups by Lie groups remain Lie; this was shown by Gleason. However, the above special case of this result is already sufficient (in conjunction with the rest of the theory, of course) to resolve Hilbert’s fifth problem.
Remark 1 We have shown in the above discussion that every connected Lie group is a central extension (by an abelian Lie group) of a Lie group with a faithful continuous linear representation. It is natural to ask whether this central extension is necessary. Unfortunately, not every connected Lie group admits a faithful continuous linear representation. An example (due to Birkhoff) is the Heisenberg-Weyl group
Indeed, if we consider the group elements
and
for some prime
, then one easily verifies that
has order
and is central, and that
is conjugate to
. If we have a faithful linear representation
of
, then
must have at least one eigenvalue
that is a primitive
root of unity. If
is the eigenspace associated to
, then
must preserve
, and be conjugate to
on this space. This forces
to have at least
distinct eigenvalues on
, and hence
(and thus
) must have dimension at least
. Letting
we obtain a contradiction. (On the other hand,
is certainly isomorphic to the extension of the linear group
by the abelian group
.)
In 1977, Furstenberg established his multiple recurrence theorem:
Theorem 1 (Furstenberg multiple recurrence) Let
be a measure-preserving system, thus
is a probability space and
is a measure-preserving bijection such that
and
are both measurable. Let
be a measurable subset of
of positive measure
. Then for any
, there exists
such that
Equivalently, there exists
and
such that
As is well known, the Furstenberg multiple recurrence theorem is equivalent to Szemerédi’s theorem, thanks to the Furstenberg correspondence principle; see for instance these lecture notes of mine.
The multiple recurrence theorem is proven, roughly speaking, by an induction on the “complexity” of the system . Indeed, for very simple systems, such as periodic systems (in which
is the identity for some
, which is for instance the case for the circle shift
,
with a rational shift
), the theorem is trivial; at a slightly more advanced level, almost periodic (or compact) systems (in which
is a precompact subset of
for every
, which is for instance the case for irrational circle shifts), is also quite easy. One then shows that the multiple recurrence property is preserved under various extension operations (specifically, compact extensions, weakly mixing extensions, and limits of chains of extensions), which then gives the multiple recurrence theorem as a consequence of the Furstenberg-Zimmer structure theorem for measure-preserving systems. See these lecture notes for further discussion.
From a high-level perspective, this is still one of the most conceptual proofs known of Szemerédi’s theorem. However, the individual components of the proof are still somewhat intricate. Perhaps the most difficult step is the demonstration that the multiple recurrence property is preserved under compact extensions; see for instance these lecture notes, which is devoted entirely to this step. This step requires quite a bit of measure-theoretic and/or functional analytic machinery, such as the theory of disintegrations, relatively almost periodic functions, or Hilbert modules.
However, I recently realised that there is a special case of the compact extension step – namely that of finite extensions – which avoids almost all of these technical issues while still capturing the essence of the argument (and in particular, the key idea of using van der Waerden’s theorem). As such, this may serve as a pedagogical device for motivating this step of the proof of the multiple recurrence theorem.
Let us first explain what a finite extension is. Given a measure-preserving system , a finite set
, and a measurable map
from
to the permutation group of
, one can form the finite extension
which as a probability space is the product of with the finite probability space
(with the discrete
-algebra and uniform probability measure), and with shift map
One easily verifies that this is indeed a measure-preserving system. We refer to as the cocycle of the system.
An example of finite extensions comes from group theory. Suppose we have a short exact sequence
of finite groups. Let be a group element of
, and let
be its projection in
. Then the shift map
on
(with the discrete
-algebra and uniform probability measure) can be viewed as a finite extension of the shift map
on
(again with the discrete
-algebra and uniform probability measure), by arbitrarily selecting a section
that inverts the projection map, identifying
with
by identifying
with
for
, and using the cocycle
Thus, for instance, the unit shift on
can be thought of as a finite extension of the unit shift
on
whenever
is a multiple of
.
Another example comes from Riemannian geometry. If is a Riemannian manifold that is a finite cover of another Riemannian manifold
(with the metric on
being the pullback of that on
), then (unit time) geodesic flow on the cosphere bundle of
is a finite extension of the corresponding flow on
.
Here, then, is the finite extension special case of the compact extension step in the proof of the multiple recurrence theorem:
Proposition 2 (Finite extensions) Let
be a finite extension of a measure-preserving system
. If
obeys the conclusion of the Furstenberg multiple recurrence theorem, then so does
.
Before we prove this proposition, let us first give the combinatorial analogue.
Lemma 3 Let
be a subset of the integers that contains arbitrarily long arithmetic progressions, and let
be a colouring of
by
colours (or equivalently, a partition of
into
colour classes
). Then at least one of the
contains arbitrarily long arithmetic progressions.
Proof: By the infinite pigeonhole principle, it suffices to show that for each , one of the colour classes
contains an arithmetic progression of length
.
Let be a large integer (depending on
and
) to be chosen later. Then
contains an arithmetic progression of length
, which may be identified with
. The colouring of
then induces a colouring on
into
colour classes. Applying (the finitary form of) van der Waerden’s theorem, we conclude that if
is sufficiently large depending on
and
, then one of these colouring classes contains an arithmetic progression of length
; undoing the identification, we conclude that one of the
contains an arithmetic progression of length
, as desired.
Of course, by specialising to the case , we see that the above Lemma is in fact equivalent to van der Waerden’s theorem.
Now we prove Proposition 2.
Proof: Fix . Let
be a positive measure subset of
. By Fubini’s theorem, we have
where and
is the fibre of
at
. Since
is positive, we conclude that the set
is a positive measure subset of . Note for each
, we can find an element
such that
. While not strictly necessary for this argument, one can ensure if one wishes that the function
is measurable by totally ordering
, and then letting
the minimal element of
for which
.
Let be a large integer (which will depend on
and the cardinality
of
) to be chosen later. Because
obeys the multiple recurrence theorem, we can find a positive integer
and
such that
Now consider the sequence of points
for . From (1), we see that
for some sequence . This can be viewed as a colouring of
by
colours, where
is the cardinality of
. Applying van der Waerden’s theorem, we conclude (if
is sufficiently large depending on
and
) that there is an arithmetic progression
in
with
such that
for some . If we then let
, we see from (2) that
for all , and the claim follows.
Remark 1 The precise connection between Lemma 3 and Proposition 2 arises from the following observation: with
as in the proof of Proposition 2, and
, the set
can be partitioned into the classes
where
is the graph of
. The multiple recurrence property for
ensures that
contains arbitrarily long arithmetic progressions, and so therefore one of the
must also, which gives the multiple recurrence property for
.
Remark 2 Compact extensions can be viewed as a generalisation of finite extensions, in which the fibres are no longer finite sets, but are themselves measure spaces obeying an additional property, which roughly speaking asserts that for many functions
on the extension, the shifts
of
behave in an almost periodic fashion on most fibres, so that the orbits
become totally bounded on each fibre. This total boundedness allows one to obtain an analogue of the above colouring map
to which van der Waerden’s theorem can be applied.
Recent Comments