You are currently browsing the monthly archive for June 2011.

Following the results from the recent poll on this blog, the mini-polymath3 project (which will focus on one of the problems from the 2011 IMO) will start at July 19 8pm UTC, and be run concurrently on this blog, on the polymath wiki, and on the polymath blog.

Over the past few months or so, I have been brushing up on my Lie group theory, as part of my project to fully understand the theory surrounding Hilbert’s fifth problem. Every so often, I encounter a basic fact in Lie theory which requires a slightly non-trivial “trick” to prove; I am recording two of them here, so that I can find these tricks again when I need to.

The first fact concerns the exponential map $\exp: \mathfrak{g} \to G$ from a Lie algebra $\mathfrak{g}$ of a Lie group $G$ to that group. (For this discussion we will only consider finite-dimensional Lie groups and Lie algebras over the reals $\mathbf{R}$.) A basic fact in the subject is that the exponential map is *locally* a homeomorphism: there is a neighbourhood of the origin in $\mathfrak{g}$ that is mapped homeomorphically by the exponential map to a neighbourhood of the identity in $G$. This local homeomorphism property is the foundation of an important dictionary between Lie groups and Lie algebras.

It is natural to ask whether the exponential map is globally a homeomorphism, and not just locally: in particular, whether the exponential map remains both injective and surjective. For instance, this is the case for connected, simply connected, nilpotent Lie groups (as can be seen from the Baker-Campbell-Hausdorff formula.)

The circle group $S^1$, which has $\mathbf{R}$ as its Lie algebra, already shows that global injectivity fails for any group that contains a circle subgroup, which is a huge class of examples (including, for instance, the positive dimensional compact Lie groups, or non-simply-connected Lie groups). Surjectivity also obviously fails for disconnected groups, since the Lie algebra is necessarily connected, and so the image under the exponential map must be connected also. However, even for connected Lie groups, surjectivity can fail. To see this, first observe that if the exponential map were surjective, then every group element $g \in G$ would have a square root (i.e. an element $h \in G$ with $h^2 = g$), since $\exp(x)$ has $\exp(x/2)$ as a square root for any $x \in \mathfrak{g}$. However, there exist elements in connected Lie groups without square roots. A simple example is provided by the matrix

$$ g = \begin{pmatrix} -4 & 0 \\ 0 & -1/4 \end{pmatrix} $$

in the connected Lie group $SL_2(\mathbf{R})$. This matrix has eigenvalues $-4$, $-1/4$. Thus, if $h \in SL_2(\mathbf{R})$ is a square root of $g$, we see (from the Jordan normal form) that it must have at least one eigenvalue in $\{-2i, +2i\}$, and at least one eigenvalue in $\{-i/2, +i/2\}$. On the other hand, as $h$ has real coefficients, the complex eigenvalues must come in conjugate pairs $\{a+bi, a-bi\}$. Since $h$ can only have at most $2$ eigenvalues, we obtain a contradiction.
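As a side remark (not the argument used above), the absence of a square root can also be read off from the trace. The following is a short derivation from the Cayley-Hamilton theorem for $h \in SL_2(\mathbf{R})$; the sample matrix $\mathrm{diag}(-4, -1/4)$ in the comment is taken as the working example.

```latex
% Cayley-Hamilton for h in SL_2(R), using det(h) = 1:
h^2 - \operatorname{tr}(h)\, h + I = 0
\quad\Longrightarrow\quad
\operatorname{tr}(h^2) = \operatorname{tr}(h)^2 - 2 \;\geq\; -2 .
% Hence any g in SL_2(R) with tr(g) < -2 has no square root in SL_2(R).
% Example: g = diag(-4, -1/4) has tr(g) = -17/4 < -2.
```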

However, there is an important case where surjectivity is recovered:

Proposition 1. If $G$ is a compact connected Lie group, then the exponential map $\exp: \mathfrak{g} \to G$ is surjective.

*Proof:* The idea here is to relate the exponential map in Lie theory to the exponential map in Riemannian geometry. We first observe that every compact Lie group $G$ can be given the structure of a Riemannian manifold with a bi-invariant metric. This can be seen in one of two ways. Firstly, one can put an arbitrary positive definite inner product on $\mathfrak{g}$ and average it against the adjoint action of $G$ using Haar probability measure (which is available since $G$ is compact); this gives an ad-invariant positive-definite inner product on $\mathfrak{g}$ that one can then translate by either left or right translation to give a bi-invariant Riemannian structure on $G$. Alternatively, one can use the Peter-Weyl theorem to embed $G$ in a unitary group $U(n)$, at which point one can induce a bi-invariant metric on $G$ from the one on the space $M_n(\mathbf{C})$ of $n \times n$ complex matrices.

As $G$ is connected and compact and thus complete, we can apply the Hopf-Rinow theorem and conclude that any two points are connected by at least one geodesic, so that the *Riemannian* exponential map from $\mathfrak{g}$ to $G$ formed by following geodesics from the origin is surjective. But one can check that the Lie exponential map and Riemannian exponential map agree; for instance, this can be seen by noting that the group structure naturally defines a connection on the tangent bundle which is both torsion-free and preserves the bi-invariant metric, and must therefore agree with the Levi-Civita connection. (Alternatively, one can embed $G$ into a unitary group $U(n)$ and observe that $G$ is totally geodesic inside $U(n)$, because the geodesics in $U(n)$ can be described explicitly in terms of one-parameter subgroups.) The claim follows.
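To make the surjectivity statement concrete in one familiar case, here is a minimal numerical sketch for the compact connected group $SO(3)$: given a rotation matrix, it produces a skew-symmetric matrix whose exponential is that rotation. The helper names (`hat`, `exp_so3`, `log_so3`) are hypothetical, Rodrigues' formula is used in place of a general matrix exponential, and the inverse is only implemented for rotation angles strictly between $0$ and $\pi$.

```python
import numpy as np

def hat(v):
    """Map a vector in R^3 to the corresponding skew-symmetric matrix in so(3)."""
    x, y, z = v
    return np.array([[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]])

def exp_so3(X):
    """Rodrigues' formula for exp(X) when X is a 3x3 skew-symmetric matrix."""
    theta = np.sqrt(0.5 * np.sum(X * X))  # equals |v| when X = hat(v)
    if theta < 1e-12:
        return np.eye(3) + X
    return (np.eye(3)
            + (np.sin(theta) / theta) * X
            + ((1.0 - np.cos(theta)) / theta**2) * (X @ X))

def log_so3(R):
    """A preimage of R under exp, assuming the rotation angle lies in (0, pi)."""
    theta = np.arccos(np.clip((np.trace(R) - 1.0) / 2.0, -1.0, 1.0))
    return (theta / (2.0 * np.sin(theta))) * (R - R.T)

# A sample rotation built as a product of three axis rotations.
R = (exp_so3(hat([0.3, 0.0, 0.0]))
     @ exp_so3(hat([0.0, 1.1, 0.0]))
     @ exp_so3(hat([0.0, 0.0, -0.7])))

X = log_so3(R)                                   # element of the Lie algebra so(3)
roundtrip_error = np.linalg.norm(exp_so3(X) - R)  # should be at rounding level
```

This only illustrates the proposition in a single example; it is not a substitute for the geodesic argument, which is what handles a general compact connected group.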

Remark 1. While it is quite nice to see Riemannian geometry come in to prove this proposition, I am curious to know if there is any other proof of surjectivity for compact connected Lie groups that does not require explicit introduction of Riemannian geometry concepts.

The other basic fact I learned recently concerns the algebraic nature of Lie groups and Lie algebras. An important family of examples of Lie groups are the algebraic groups – algebraic varieties with a group law given by algebraic maps. Given that one can always automatically upgrade the smooth structure on a Lie group to analytic structure (by using the Baker-Campbell-Hausdorff formula), it is natural to ask whether one can upgrade the structure further to an algebraic structure. Unfortunately, this is not always the case. A prototypical example of this is given by the one-parameter subgroup

$$ R := \left\{ \begin{pmatrix} t & 0 \\ 0 & t^\alpha \end{pmatrix} : t \in \mathbf{R}^+ \right\} \qquad (1) $$

of $GL_2(\mathbf{R})$. This is a Lie group for any exponent $\alpha \in \mathbf{R}$, but if $\alpha$ is irrational, then the curve that $R$ traces out is not an algebraic subset of $GL_2(\mathbf{R})$ (as one can see by playing around with Puiseux series).

This is not a true counterexample to the claim that every Lie group can be given the structure of an algebraic group, because one can give $R$ a different algebraic structure than the one inherited from the ambient group $GL_2(\mathbf{R})$. Indeed, $R$ is clearly isomorphic to the additive group $\mathbf{R}$, which is of course an algebraic group. However, a modification of the above construction works:

Proposition 2. There exists a Lie group that cannot be given the structure of an algebraic group.

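*Proof:* We use an example from the text of Tauvel and Yu (that I found via this MathOverflow posting). We consider the subgroup

$$ G := \left\{ \begin{pmatrix} 1 & 0 & 0 \\ x & t & 0 \\ y & 0 & t^\alpha \end{pmatrix} : x, y \in \mathbf{R};\ t \in \mathbf{R}^+ \right\} $$

of $GL_3(\mathbf{R})$, with $\alpha$ an irrational number. This is a three-dimensional (metabelian) Lie group, whose Lie algebra $\mathfrak{g} \subset \mathfrak{gl}_3(\mathbf{R})$ is spanned by the elements

$$ X := \begin{pmatrix} 0 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & \alpha \end{pmatrix}, \quad Y := \begin{pmatrix} 0 & 0 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad Z := \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ -\alpha & 0 & 0 \end{pmatrix} $$

with the Lie bracket given by

$$ [Y,X] = -Y; \quad [Z,X] = -\alpha Z; \quad [Y,Z] = 0. $$

As such, we see that if we use the basis $X, Y, Z$ to identify $\mathfrak{g}$ with $\mathbf{R}^3$, then the adjoint representation $\mathrm{Ad}: G \to GL(\mathfrak{g}) \equiv GL_3(\mathbf{R})$ of $G$ is the identity map.

If $G$ is an algebraic group, it is easy to see that the adjoint representation $\mathrm{Ad}: G \to GL(\mathfrak{g})$ is also algebraic, and so $\mathrm{Ad}(G)$ is algebraic in $GL(\mathfrak{g})$. Specialising to our specific example, in which the adjoint representation is the identity, we conclude that if $G$ has *any* algebraic structure, then it must also be an algebraic subgroup of $GL_3(\mathbf{R})$; but $G$ projects to the group (1) which is not algebraic, a contradiction.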

A slight modification of the same argument also shows that not every Lie algebra is *algebraic*, in the sense that it is isomorphic to a Lie algebra of an algebraic group. (However, there are important classes of Lie algebras that are automatically algebraic, such as nilpotent or semisimple Lie algebras.)

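Let $G$ be a Lie group with Lie algebra $\mathfrak{g}$. As is well known, the exponential map $\exp: \mathfrak{g} \to G$ is a local homeomorphism near the identity. As such, the group law on $G$ can be locally pulled back to an operation $\ast: U \times U \to \mathfrak{g}$ defined on a neighbourhood $U$ of the identity in $\mathfrak{g}$, defined as

$$ x \ast y := \log( \exp(x) \exp(y) ) $$

where $\log$ is the local inverse of the exponential map. One can view $\ast$ as the group law expressed in local exponential coordinates around the origin.

An asymptotic expansion for $x \ast y$ is provided by the Baker-Campbell-Hausdorff (BCH) formula

$$ x \ast y = x + y + \frac{1}{2} [x,y] + \frac{1}{12} [x,[x,y]] - \frac{1}{12} [y,[x,y]] + \ldots $$

for all sufficiently small $x, y$, where $[,]: \mathfrak{g} \times \mathfrak{g} \to \mathfrak{g}$ is the Lie bracket. More explicitly, one has the *Baker-Campbell-Hausdorff-Dynkin formula*

$$ x \ast y = x + \int_0^1 F( \mathrm{Ad}_x \mathrm{Ad}_{ty} ) y\ dt \qquad (1) $$

for all sufficiently small $x, y$, where $\mathrm{Ad}_x = \exp( \mathrm{ad}_x )$, $\mathrm{ad}: \mathfrak{g} \to \mathrm{End}(\mathfrak{g})$ is the adjoint representation $\mathrm{ad}_x(y) := [x,y]$, and $F$ is the function

$$ F(z) := \frac{z \log z}{z-1} $$

which is real analytic near $z = 1$ and can thus be applied to linear operators sufficiently close to the identity. One corollary of this is that the multiplication operation $\ast$ is real analytic in local coordinates, and so every smooth Lie group is in fact a real analytic Lie group.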
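As a sanity check on the first few terms of the Baker-Campbell-Hausdorff expansion, here is a minimal numerical sketch for $2 \times 2$ matrices. It assumes the standard expansion $x + y + \frac{1}{2}[x,y] + \frac{1}{12}[x,[x,y]] - \frac{1}{12}[y,[x,y]]$; the truncated power series `expm_series` and `logm_series` are hypothetical helpers that are only adequate for matrices of small norm.

```python
import numpy as np

def expm_series(A, terms=40):
    """Matrix exponential via its Taylor series (adequate for small matrices)."""
    result = np.eye(A.shape[0])
    term = np.eye(A.shape[0])
    for k in range(1, terms):
        term = term @ A / k
        result = result + term
    return result

def logm_series(M, terms=80):
    """Matrix logarithm via log(I + A) = A - A^2/2 + A^3/3 - ..., for small A."""
    A = M - np.eye(M.shape[0])
    result = np.zeros_like(A)
    power = np.eye(M.shape[0])
    for k in range(1, terms):
        power = power @ A
        result = result + ((-1) ** (k + 1)) * power / k
    return result

def bracket(a, b):
    """Matrix commutator [a, b] = ab - ba."""
    return a @ b - b @ a

# Two small non-commuting elements of sl_2(R).
x = 0.1 * np.array([[0.0, 1.0], [0.0, 0.0]])
y = 0.1 * np.array([[0.0, 0.0], [1.0, 0.0]])

exact = logm_series(expm_series(x) @ expm_series(y))   # x * y in exponential coordinates
first_order = x + y
third_order = (x + y + 0.5 * bracket(x, y)
               + (bracket(x, bracket(x, y)) - bracket(y, bracket(x, y))) / 12.0)

err_first = np.linalg.norm(exact - first_order)   # dominated by the [x, y] / 2 term
err_third = np.linalg.norm(exact - third_order)   # dominated by fourth-order terms
```

With inputs of size about $0.1$, one expects `err_first` to be of order $10^{-2}$ and `err_third` to be several orders of magnitude smaller, consistent with the next term in the expansion being of fourth order.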

It turns out that one does not need the full force of the smoothness hypothesis to obtain these conclusions. It is, for instance, a classical result that $C^2$ regularity of the group operations is already enough to obtain the Baker-Campbell-Hausdorff formula. Actually, it turns out that we can weaken this a bit, and show that even $C^{1,1}$ regularity (i.e. that the group operations are continuously differentiable, and the derivatives are locally Lipschitz) is enough to make the classical derivation of the Baker-Campbell-Hausdorff formula work. More precisely, we have

Theorem 1 ($C^{1,1}$ Baker-Campbell-Hausdorff formula). Let $\mathbf{R}^d$ be a finite-dimensional vector space, and suppose one has a continuous operation $\ast: U \times U \to \mathbf{R}^d$ defined on a neighbourhood $U$ around the origin, which obeys the following three axioms:

- (Approximate additivity) For $x, y$ sufficiently close to the origin, one has
  $$ x \ast y = x + y + O(|x| |y|). \qquad (2) $$
- (Associativity) For $x, y, z$ sufficiently close to the origin, $(x \ast y) \ast z = x \ast (y \ast z)$.
- (Radial homogeneity) For $x$ sufficiently close to the origin, one has
  $$ (sx) \ast (tx) = (s+t) x \qquad (3) $$
  for all $s, t \in [-1,1]$. (In particular, $x \ast (-x) = (-x) \ast x = 0$ for all $x$ sufficiently close to the origin.)

Then $\ast$ is real analytic (and in particular, smooth) near the origin. (In particular, $\ast$ gives a neighbourhood of the origin the structure of a local Lie group.)

Indeed, we will recover the Baker-Campbell-Hausdorff-Dynkin formula (after defining $\mathrm{Ad}_x$ appropriately) in this setting; see below the fold.

The reason that we call this a $C^{1,1}$ Baker-Campbell-Hausdorff formula is that if the group operation $\ast$ has $C^{1,1}$ regularity, and has $0$ as an identity element, then Taylor expansion already gives (2), and in exponential coordinates (which, as it turns out, can be defined without much difficulty in the $C^{1,1}$ category) one automatically has (3).

We will record the proof of Theorem 1 below the fold; it largely follows the classical derivation of the BCH formula, but due to the low regularity one will rely on tools such as telescoping series and Riemann sums rather than on the fundamental theorem of calculus. As an application of this theorem, we can give an alternate derivation of one of the components of the solution to Hilbert’s fifth problem, namely the construction of a Lie group structure from a Gleason metric, which was covered in the previous post; we discuss this at the end of this article. With this approach, one can avoid any appeal to von Neumann’s theorem and Cartan’s theorem (discussed in this post), or the Kuranishi-Gleason extension theorem (discussed in this post).

Hilbert’s fifth problem asks to clarify the extent that the assumption on a differentiable or smooth structure is actually needed in the theory of Lie groups and their actions. While this question is not precisely formulated and is thus open to some interpretation, the following result of Gleason and Montgomery-Zippin answers at least one aspect of this question:

Theorem 1 (Hilbert’s fifth problem). Let $G$ be a topological group which is locally Euclidean (i.e. it is a topological manifold). Then $G$ is isomorphic to a Lie group.

Theorem 1 can be viewed as an application of the more general structural theory of locally compact groups. In particular, Theorem 1 can be deduced from the following structural theorem of Gleason and Yamabe:

Theorem 2 (Gleason-Yamabe theorem). Let $G$ be a locally compact group, and let $U$ be an open neighbourhood of the identity in $G$. Then there exists an open subgroup $G'$ of $G$, and a compact subgroup $N$ of $G'$ contained in $U$, such that $G'/N$ is isomorphic to a Lie group.

The deduction of Theorem 1 from Theorem 2 proceeds using the Brouwer invariance of domain theorem and is discussed in this previous post. In this post, I would like to discuss the proof of Theorem 2. We can split this proof into three parts, by introducing two additional concepts. The first is the property of having no small subgroups:

Definition 3 (NSS). A topological group $G$ is said to have *no small subgroups*, or is *NSS* for short, if there is an open neighbourhood $U$ of the identity in $G$ that contains no subgroups of $G$ other than the trivial subgroup $\{\mathrm{id}\}$.

An equivalent definition of an NSS group is one which has an open neighbourhood $U$ of the identity that every non-identity element $g \in G \setminus \{\mathrm{id}\}$ *escapes* in finite time, in the sense that $g^n \not\in U$ for some positive integer $n$. It is easy to see that all Lie groups are NSS; we shall shortly see that the converse statement (in the locally compact case) is also true, though significantly harder to prove.

Another useful property is that of having what I will call a *Gleason metric*:

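Definition 4. Let $G$ be a topological group. A *Gleason metric* on $G$ is a left-invariant metric $d: G \times G \to \mathbf{R}^+$ which generates the topology on $G$ and obeys the following properties for some constant $C > 0$, writing $\|g\|$ for $d(g, \mathrm{id})$:

- (Escape property) If $g \in G$ and $n \geq 1$ is such that $n \|g\| \leq \frac{1}{C}$, then $\|g^n\| \geq \frac{1}{C} n \|g\|$.
- (Commutator estimate) If $g, h \in G$ are such that $\|g\|, \|h\| \leq \frac{1}{C}$, then
  $$ \|[g,h]\| \leq C \|g\| \|h\|, \qquad (1) $$
  where $[g,h] := g^{-1} h^{-1} g h$ is the commutator of $g$ and $h$.

For instance, the operator norm metric $d(g,h) := \|g-h\|_{op}$ on the unitary group $U(n)$ can easily be verified to be a Gleason metric, with the commutator estimate (1) coming from the inequality

$$ \|[g,h] - 1\|_{op} = \|gh - hg\|_{op} = \|(g-1)(h-1) - (h-1)(g-1)\|_{op} \leq 2 \|g-1\|_{op} \|h-1\|_{op}. $$

Similarly, any left-invariant Riemannian metric on a (connected) Lie group can be verified to be a Gleason metric. From the escape property one easily sees that all groups with Gleason metrics are NSS; again, we shall see that there is a partial converse.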
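The commutator estimate for the unitary group can be spot-checked numerically. The sketch below assumes the operator norm metric $d(g,h) = \|g-h\|_{op}$ and the commutator convention $[g,h] = g^{-1} h^{-1} g h$, and tests the bound $\|[g,h] - 1\|_{op} \leq 2 \|g-1\|_{op} \|h-1\|_{op}$ on random unitaries near the identity; the sampling scheme and the helper names are illustrative choices only.

```python
import numpy as np

def random_unitary_near_identity(n, scale, rng):
    """Return exp(iH) for a random Hermitian H whose size is controlled by `scale`."""
    B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    H = scale * (B + B.conj().T) / 2.0            # Hermitian matrix
    eigvals, V = np.linalg.eigh(H)
    return V @ np.diag(np.exp(1j * eigvals)) @ V.conj().T

def op_norm(M):
    """Operator norm, i.e. the largest singular value."""
    return np.linalg.norm(M, 2)

rng = np.random.default_rng(0)
identity = np.eye(4)
worst_ratio = 0.0
for _ in range(200):
    g = random_unitary_near_identity(4, 0.3, rng)
    h = random_unitary_near_identity(4, 0.3, rng)
    # For unitary matrices the inverse is the conjugate transpose.
    commutator = g.conj().T @ h.conj().T @ g @ h
    lhs = op_norm(commutator - identity)
    rhs = 2.0 * op_norm(g - identity) * op_norm(h - identity)
    worst_ratio = max(worst_ratio, lhs / rhs)
```

Since the inequality is an exact consequence of the identity $gh - hg = (g-1)(h-1) - (h-1)(g-1)$ and the unitary invariance of the operator norm, `worst_ratio` should never exceed $1$ up to rounding.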

Remark 1. The escape and commutator properties are meant to capture “Euclidean-like” structure of the group. Other metrics, such as Carnot-Carathéodory metrics on Carnot Lie groups such as the Heisenberg group, usually fail one or both of these properties.

The proof of Theorem 2 can then be split into three subtheorems:

Theorem 5 (Reduction to the NSS case). Let $G$ be a locally compact group, and let $U$ be an open neighbourhood of the identity in $G$. Then there exists an open subgroup $G'$ of $G$, and a compact subgroup $N$ of $G'$ contained in $U$, such that $G'/N$ is NSS, locally compact, and metrisable.

Theorem 6 (Gleason’s lemma). Let $G$ be a locally compact metrisable NSS group. Then $G$ has a Gleason metric.

Theorem 7 (Building a Lie structure). Let $G$ be a locally compact group with a Gleason metric. Then $G$ is isomorphic to a Lie group.

Clearly, by combining Theorem 5, Theorem 6, and Theorem 7 one obtains Theorem 2 (and hence Theorem 1).

Theorem 5 and Theorem 6 proceed by some elementary combinatorial analysis, together with the use of Haar measure (to build convolutions, and thence to build “smooth” bump functions with which to create a metric, in a variant of the analysis used to prove the Birkhoff-Kakutani theorem); Theorem 5 also requires the Peter-Weyl theorem (to dispose of certain compact subgroups that arise en route to the reduction to the NSS case), which was discussed previously on this blog.

In this post I would like to detail the final component to the proof of Theorem 2, namely Theorem 7. (I plan to discuss the other two steps, Theorem 5 and Theorem 6, in a separate post.) The strategy is similar to that used to prove von Neumann’s theorem, as discussed in this previous post (and von Neumann’s theorem is also used in the proof), but with the Gleason metric serving as a substitute for the faithful linear representation. Namely, one first gives the space $L(G)$ of one-parameter subgroups of $G$ enough of a structure that it can serve as a proxy for the “Lie algebra” of $G$; specifically, it needs to be a vector space, and the “exponential map” needs to cover an open neighbourhood of the identity. This is enough to set up an “adjoint” representation of $G$, whose image is a Lie group by von Neumann’s theorem; the kernel is essentially the centre of $G$, which is abelian and can also be shown to be a Lie group by a similar analysis. To finish the job one needs to use arguments of Kuranishi and of Gleason, as discussed in this previous post.

The arguments here can be phrased either in the standard analysis setting (using sequences, and passing to subsequences often) or in the nonstandard analysis setting (selecting an ultrafilter, and then working with infinitesimals). In my view, the two approaches have roughly the same level of complexity in this case, and I have elected for the standard analysis approach.

Remark 2. From Theorem 7 we see that a Gleason metric structure is a good enough substitute for smooth structure that it can actually be used to reconstruct the entire smooth structure; roughly speaking, the commutator estimate (1) allows for enough “Taylor expansion” of group expressions that one can simulate the fundamentals of Lie theory (in particular, construction of the Lie algebra and the exponential map, and its basic properties). The advantage of working with a Gleason metric rather than a smoother structure, though, is that it is relatively undemanding with regards to regularity; in particular, the commutator estimate (1) is roughly comparable to the imposition of $C^{1,1}$ structure on the group $G$, as this is the minimal regularity to get the type of Taylor approximation (with quadratic errors) that would be needed to obtain a bound of the form (1). We will return to this point in a later post.

We recall Brouwer’s famous fixed point theorem:

Theorem 1 (Brouwer fixed point theorem). Let $f: B^n \to B^n$ be a continuous function on the unit ball $B^n := \{ x \in \mathbf{R}^n: \|x\| \leq 1 \}$ in a Euclidean space $\mathbf{R}^n$. Then $f$ has at least one fixed point, thus there exists $x \in B^n$ with $f(x) = x$.

This theorem has many proofs, most of which revolve (either explicitly or implicitly) around the notion of the degree of a continuous map $f: S^{n-1} \to S^{n-1}$ of the unit sphere $S^{n-1} := \{ x \in \mathbf{R}^n: \|x\| = 1 \}$ to itself, and more precisely around the *stability* of degree with respect to homotopy. (Indeed, one can view the Brouwer fixed point theorem as an assertion that some non-trivial degree-like invariant must exist, or more abstractly that the homotopy group $\pi_{n-1}(S^{n-1})$ is non-trivial.)

One of the many applications of this result is to prove Brouwer’s invariance of domain theorem:

Theorem 2 (Brouwer invariance of domain theorem). Let $U$ be an open subset of $\mathbf{R}^n$, and let $f: U \to \mathbf{R}^n$ be a continuous injective map. Then $f(U)$ is also open.

This theorem in turn has an important corollary:

Corollary 3 (Topological invariance of dimension). If $n > m$, and $U$ is a non-empty open subset of $\mathbf{R}^n$, then there is no continuous injective mapping from $U$ to $\mathbf{R}^m$. In particular, $\mathbf{R}^n$ and $\mathbf{R}^m$ are not homeomorphic.

This corollary is intuitively obvious, but note that topological intuition is not always rigorous. For instance, it is intuitively plausible that there should be no continuous surjection from $\mathbf{R}^m$ to $\mathbf{R}^n$ for $n > m$, but such surjections always exist, thanks to variants of the Peano curve construction.

Theorem 2 or Corollary 3 can be proven by simple *ad hoc* means for small values of $n$ or $m$ (for instance, by noting that removing a point from $\mathbf{R}^n$ will disconnect $\mathbf{R}^n$ when $n = 1$, but not for $n > 1$), but I do not know of any proof of these results in general dimension that does not require algebraic topology machinery that is at least as sophisticated as the Brouwer fixed point theorem. (Lebesgue, for instance, famously failed to establish the above corollary rigorously, although he did end up discovering the important concept of Lebesgue covering dimension as a result of his efforts.)

Nowadays, the invariance of domain theorem is usually proven using the machinery of singular homology. In this post I would like to record a short proof of Theorem 2 using Theorem 1 that I discovered in a paper of Kulpa, which avoids any use of algebraic topology tools beyond the fixed point theorem, though it is more *ad hoc* in its approach than the systematic singular homology approach.

Remark 1. A heuristic explanation as to why the Brouwer fixed point theorem is more or less a necessary ingredient in the proof of the invariance of domain theorem is that a counterexample to the former result could conceivably be used to create a counterexample to the latter one. Indeed, if the Brouwer fixed point theorem failed for some $f: B^n \to B^n$, then (as is well known) one would be able to find a continuous function $r: B^n \to S^{n-1}$ that was the identity on $S^{n-1}$ (indeed, one could take $r(x)$ to be the first point in which the ray from $f(x)$ through $x$ hits $S^{n-1}$). If one then considered the function $h: B^n \to \mathbf{R}^n$ defined by $h(x) := (1 + \|x\|) r(x)$, then this would be a continuous function which avoids the interior of $B^n$, but which maps the origin to a point on the sphere $S^{n-1}$ (and maps $S^{n-1}$ to the dilate $2 \cdot S^{n-1}$). This could conceivably be a counterexample to Theorem 2, except that $h$ is not necessarily injective. I do not know if there is a more rigorous way to formulate this connection.

The reason I was looking for a proof of the invariance of domain theorem was that it comes up in the very last stage of the solution to Hilbert’s fifth problem, namely to establish the following fact:

Theorem 4 (Hilbert’s fifth problem)Every locally Euclidean group is isomorphic to a Lie group.

Recall that a *locally Euclidean* group is a topological group which is locally homeomorphic to an open subset of a Euclidean space $\mathbf{R}^n$, i.e. it is a continuous manifold. Note in contrast that a Lie group is a topological group which is locally *diffeomorphic* to an open subset of $\mathbf{R}^n$, i.e. it is a *smooth* manifold. Thus, Hilbert’s fifth problem is a manifestation of the “rigidity” of algebraic structure (in this case, group structure), which turns weak regularity (continuity) into strong regularity (smoothness).

It is plausible that something like Corollary 3 would need to be invoked in order to solve Hilbert’s fifth problem. After all, if Euclidean spaces $\mathbf{R}^n$, $\mathbf{R}^m$ of different dimension were homeomorphic to each other, then the property of being locally Euclidean would lose a lot of meaning, and would thus not be a particularly powerful hypothesis. Note also that it is clear that two Lie groups can only be isomorphic if they have the same dimension, so in view of Theorem 4, it becomes plausible that two Euclidean spaces can only be homeomorphic if they have the same dimension, although I do not know of a way to rigorously deduce this claim from Theorem 4.

Interestingly, Corollary 3 is the only place where algebraic topology enters into the solution of Hilbert’s fifth problem (although its cousin, *point-set topology*, is used all over the place). There are results closely related to Theorem 4, such as the Gleason-Yamabe theorem mentioned in a recent post, which do not use the notion of being locally Euclidean, and do not require algebraic topological methods in their proof. Indeed, one can deduce Theorem 4 from the Gleason-Yamabe theorem and invariance of domain; we sketch a proof of this (following Montgomery and Zippin) below the fold.

In the last two years, I ran a “mini-polymath” project to solve one of the problems of that year’s International Mathematical Olympiad (IMO). This year, the IMO is being held in the Netherlands, with the problems being released on July 18 and 19, and I am planning to once again select a question (most likely the last question Q6, but I’ll exercise my discretion on which problem to select once I see all of them).

The format of last year’s mini-polymath project seemed to work well, so I am inclined to simply repeat that format without much modification this time around, in order to collect a consistent set of data about these projects. Thus, unless the plan changes, the project will start at a pre-arranged time and date, with plenty of advance notice, and be run simultaneously on three different sites: a “research thread” over at the polymath blog for the problem solving process, a “discussion thread” over at this blog for any meta-discussion about the project, and a wiki page at the polymath wiki to record the progress already made at the research thread. (Incidentally, there is a current discussion at the wiki about the logo for that site; please feel free to chip in your opinion on the various proposed icons.) The project will follow the usual polymath rules (as summarised for instance in the 2010 mini-polymath thread).

There are some kinks with our format that still need to be worked out, unfortunately; the two main ones that keep recurring in previous feedback are (a) there is no way to edit or preview comments without the intervention of one of the blog maintainers, and (b) even with comment threading, it is difficult to keep track of all the multiple discussions going on at once. It is conceivable that we could use a different forum than the WordPress-based blogs we have been using for previous projects for this mini-polymath to experiment with other software that may help ameliorate (a) and (b) (though any alternative site should definitely have the ability to support some sort of TeX, and should be easily accessible by polymath participants, without the need for a cumbersome registration process); if there are any suggestions for such alternatives, I would be happy to hear about them in the comments to this post. (Of course, any other comments germane to the polymath or mini-polymath projects would also be appropriate for the comment thread.)

The other thing to do at this early stage is set up a poll for the start time for the project (and also to gauge interest in participation). For ease of comparison I am going to use the same four-hour time slots as for the 2010 poll. All times are in Coordinated Universal Time (UTC), which is essentially the same as GMT; conversions between UTC and local time zones can for instance be found on this web site. For instance, the Netherlands are at UTC+2, and so July 19 4pm UTC (say) would be July 19 6pm in Netherlands local time. (I myself will be at UTC-7.)

In the last few months, I have been working my way through the theory behind the solution to Hilbert’s fifth problem, as I (together with Emmanuel Breuillard, Ben Green, and Tom Sanders) have found this theory to be useful in obtaining noncommutative inverse sumset theorems in arbitrary groups; I hope to be able to report on this connection at some later point on this blog. Among other things, this theory achieves the remarkable feat of creating a smooth Lie group structure out of what is ostensibly a much weaker structure, namely the structure of a locally compact group. The ability of algebraic structure (in this case, group structure) to upgrade weak regularity (in this case, continuous structure) to strong regularity (in this case, smooth and even analytic structure) seems to be a recurring theme in mathematics, and an important part of what I like to call the “dichotomy between structure and randomness”.

The theory of Hilbert’s fifth problem sprawls across many subfields of mathematics: Lie theory, representation theory, group theory, nonabelian Fourier analysis, point-set topology, and even a little bit of group cohomology. The latter aspect of this theory is what I want to focus on today. The general question that comes into play here is the *extension problem*: given two (topological or Lie) groups $H$ and $K$, what is the structure of the possible groups $G$ that are formed by extending $H$ by $K$? In other words, given a short exact sequence

$$ 0 \to K \to G \to H \to 0, $$

to what extent is the structure of $G$ determined by that of $H$ and $K$?

As an example of why understanding the extension problem would help in structural theory, let us consider the task of classifying the structure of a Lie group $G$. Firstly, we factor out the connected component $G^\circ$ of the identity as

$$ 0 \to G^\circ \to G \to G/G^\circ \to 0; $$

as Lie groups are locally connected, $G/G^\circ$ is discrete. Thus, to understand general Lie groups, it suffices to understand the extensions of discrete groups by connected Lie groups.

Next, to study a connected Lie group $G$, we can consider the conjugation action on the Lie algebra $\mathfrak{g}$, which gives the adjoint representation $\mathrm{Ad}: G \to GL(\mathfrak{g})$. The kernel of this representation consists of all the group elements that commute with all elements of the Lie algebra, and thus (by connectedness) is the center $Z(G)$ of $G$. The adjoint representation is then faithful on the quotient $G/Z(G)$. The short exact sequence

$$ 0 \to Z(G) \to G \to G/Z(G) \to 0 $$

then describes $G$ as a central extension (by the abelian Lie group $Z(G)$) of $G/Z(G)$, which is a connected Lie group with a faithful finite-dimensional linear representation.

This suggests a route to Hilbert’s fifth problem, at least in the case of connected groups $G$. Let $G$ be a connected locally compact group that we hope to demonstrate is isomorphic to a Lie group. As discussed in a previous post, we first form the space $L(G)$ of one-parameter subgroups of $G$ (which should, eventually, become the Lie algebra of $G$). Hopefully, $L(G)$ has the structure of a vector space. The group $G$ acts on $L(G)$ by conjugation; this action should be both continuous and linear, giving an “adjoint representation” $\mathrm{Ad}: G \to GL(L(G))$. The kernel of this representation should then be the center $Z(G)$ of $G$. The quotient $G/Z(G)$ is locally compact and has a faithful linear representation, and is thus a Lie group by von Neumann’s version of Cartan’s theorem (discussed in this previous post). The group $Z(G)$ is locally compact abelian, and so it should be a relatively easy task to establish that it is also a Lie group. To finish the job, one needs the following result:

Theorem 1 (Central extensions of Lie are Lie). Let $G$ be a locally compact group which is a central extension of a Lie group $H$ by an abelian Lie group $K$. Then $G$ is also isomorphic to a Lie group.

This result can be obtained by combining a result of Kuranishi with a result of Gleason; I am recording this argument below the fold. The point here is that while $G$ is initially only a topological group, the smooth structures of $H$ and $K$ can be combined (after a little bit of cohomology) to create the smooth structure on $G$ required to upgrade $G$ from a topological group to a Lie group. One of the main ideas here is to improve the behaviour of a cocycle by averaging it; this basic trick is helpful elsewhere in the theory, resolving a number of cohomological issues in topological group theory. The result can be generalised to show in fact that arbitrary (topological) extensions of Lie groups by Lie groups remain Lie; this was shown by Gleason. However, the above special case of this result is already sufficient (in conjunction with the rest of the theory, of course) to resolve Hilbert’s fifth problem.

Remark 1. We have shown in the above discussion that every connected Lie group is a central extension (by an abelian Lie group) of a Lie group with a faithful continuous linear representation. It is natural to ask whether this central extension is necessary. Unfortunately, not every connected Lie group admits a faithful continuous linear representation. An example (due to Birkhoff) is the Heisenberg-Weyl group

$$ G := \begin{pmatrix} 1 & \mathbf{R} & \mathbf{R}/\mathbf{Z} \\ 0 & 1 & \mathbf{R} \\ 0 & 0 & 1 \end{pmatrix}. $$

Indeed, if we consider the group elements

$$ A := \begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} $$

and

$$ B := \begin{pmatrix} 1 & 0 & 1/p \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} $$

for some prime $p$, then one easily verifies that $B$ has order $p$ and is central, and that $AB$ is conjugate to $A$. If we have a faithful linear representation $\rho: G \to GL_n(\mathbf{C})$ of $G$, then $\rho(B)$ must have at least one eigenvalue $\alpha$ that is a primitive $p^{th}$ root of unity. If $V$ is the eigenspace associated to $\alpha$, then $\rho(A)$ must preserve $V$, and be conjugate to $\alpha \rho(A)$ on this space. This forces $\rho(A)$ to have at least $p$ distinct eigenvalues on $V$, and hence $V$ (and thus $\mathbf{C}^n$) must have dimension at least $p$. Letting $p \to \infty$ we obtain a contradiction. (On the other hand, $G$ is certainly isomorphic to the extension of the linear group $\mathbf{R}^2$ by the abelian group $\mathbf{R}/\mathbf{Z}$.)
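The group-theoretic facts used in this remark can be checked with exact rational arithmetic. The sketch below assumes the Heisenberg-Weyl group is realised as upper unitriangular $3 \times 3$ real matrices with the top-right entry taken modulo $1$, with $A$ the elementary matrix with a $1$ in position $(1,2)$ and $B$ the one with $1/p$ in position $(1,3)$; the helper names and the choice $p = 7$ are illustrative.

```python
from fractions import Fraction

def heis(a, b, c):
    """Upper unitriangular matrix with entries a (1,2), b (2,3), c (1,3)."""
    one, zero = Fraction(1), Fraction(0)
    return [[one, Fraction(a), Fraction(c)],
            [zero, one, Fraction(b)],
            [zero, zero, one]]

def matmul(P, Q):
    """Exact 3x3 matrix product."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def reduce_mod_centre(M):
    """Pass to the quotient by reducing the top-right entry modulo 1."""
    R = [row[:] for row in M]
    R[0][2] = R[0][2] % 1
    return R

p = 7
A = heis(1, 0, 0)
B = heis(0, 0, Fraction(1, p))
C = heis(0, Fraction(1, p), 0)
C_inv = heis(0, Fraction(-1, p), 0)

# Powers B^1, ..., B^p in the quotient group.
powers_of_B = []
current = heis(0, 0, 0)
for _ in range(p):
    current = reduce_mod_centre(matmul(current, B))
    powers_of_B.append(current)

identity = heis(0, 0, 0)
conjugate_of_A = reduce_mod_centre(matmul(matmul(C_inv, A), C))   # C^{-1} A C
A_times_B = reduce_mod_centre(matmul(A, B))
generic = heis(Fraction(2, 3), Fraction(-5, 4), Fraction(1, 9))
B_is_central_here = matmul(generic, B) == matmul(B, generic)
```

Here `powers_of_B[-1]` should be the identity while no earlier power is, so $B$ has order exactly $p$ in the quotient, and `conjugate_of_A` should agree with `A_times_B`.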

In 1977, Furstenberg established his multiple recurrence theorem:

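Theorem 1 (Furstenberg multiple recurrence). Let $(X, \mathcal{X}, \mu, T)$ be a measure-preserving system, thus $(X, \mathcal{X}, \mu)$ is a probability space and $T: X \to X$ is a measure-preserving bijection such that $T$ and $T^{-1}$ are both measurable. Let $E$ be a measurable subset of $X$ of positive measure $\mu(E) > 0$. Then for any $k \geq 1$, there exists $n > 0$ such that

$$ E \cap T^{-n} E \cap \ldots \cap T^{-(k-1)n} E \neq \emptyset. $$

Equivalently, there exists $n > 0$ and $x \in X$ such that

$$ x, T^n x, \ldots, T^{(k-1)n} x \in E. $$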

As is well known, the Furstenberg multiple recurrence theorem is equivalent to Szemerédi’s theorem, thanks to the Furstenberg correspondence principle; see for instance these lecture notes of mine.

The multiple recurrence theorem is proven, roughly speaking, by an induction on the “complexity” of the system $(X, \mathcal{X}, \mu, T)$. Indeed, for very simple systems, such as periodic systems (in which $T^n$ is the identity for some $n > 0$, which is for instance the case for the circle shift $X = \mathbf{R}/\mathbf{Z}$, $Tx := x + \alpha$ with a rational shift $\alpha$), the theorem is trivial; at a slightly more advanced level, the case of *almost periodic* (or *compact*) systems (in which $\{ T^n f: n \in \mathbf{Z} \}$ is a precompact subset of $L^2(X)$ for every $f \in L^2(X)$, which is for instance the case for irrational circle shifts) is also quite easy. One then shows that the multiple recurrence property is preserved under various *extension* operations (specifically, compact extensions, weakly mixing extensions, and limits of chains of extensions), which then gives the multiple recurrence theorem as a consequence of the *Furstenberg-Zimmer structure theorem* for measure-preserving systems. See these lecture notes for further discussion.

From a high-level perspective, this is still one of the most conceptual proofs known of Szemerédi’s theorem. However, the individual components of the proof are still somewhat intricate. Perhaps the most difficult step is the demonstration that the multiple recurrence property is preserved under *compact extensions*; see for instance these lecture notes, which are devoted entirely to this step. This step requires quite a bit of measure-theoretic and/or functional analytic machinery, such as the theory of disintegrations, relatively almost periodic functions, or Hilbert modules.

However, I recently realised that there is a special case of the compact extension step – namely that of *finite* extensions – which avoids almost all of these technical issues while still capturing the essence of the argument (and in particular, the key idea of using van der Waerden’s theorem). As such, this may serve as a pedagogical device for motivating this step of the proof of the multiple recurrence theorem.

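Let us first explain what a finite extension is. Given a measure-preserving system $X = (X, \mathcal{X}, \mu, T)$, a finite set $Y$, and a measurable map $\rho: X \to \mathrm{Sym}(Y)$ from $X$ to the permutation group of $Y$, one can form the *finite extension*

$$X \ltimes_\rho Y = (X \times Y, \mathcal{X} \times \mathcal{Y}, \mu \times \nu, S),$$

which as a probability space is the product of $(X, \mathcal{X}, \mu)$ with the finite probability space $Y = (Y, \mathcal{Y}, \nu)$ (with the discrete $\sigma$-algebra and uniform probability measure), and with shift map

$$S(x, y) := (Tx, \rho(x) y). \qquad (1)$$

One easily verifies that this is indeed a measure-preserving system. We refer to $\rho$ as the *cocycle* of the system.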
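To make the definition concrete, here is a minimal sketch of a finite extension over a finite base. The base (the unit shift on $\mathbf{Z}/4\mathbf{Z}$), the three-point fibre, and the particular permutations chosen for the cocycle are all hypothetical toy data, not taken from the discussion above. On a finite space with uniform measure, being measure-preserving amounts to being a bijection, which is what the final check tests.

```python
from itertools import product

def finite_extension(T, rho, X, Y):
    """Return the skew-product shift S(x, y) = (T(x), rho(x)(y)) as a dict on X x Y."""
    return {(x, y): (T[x], rho[x][y]) for x, y in product(X, Y)}

# Hypothetical toy data: base X = Z/4Z with the unit shift, fibre Y = {0, 1, 2}.
X = range(4)
Y = range(3)
T = {x: (x + 1) % 4 for x in X}

# rho[x] is a permutation of Y, stored as a tuple: y -> rho[x][y].
rho = {0: (1, 2, 0), 1: (0, 1, 2), 2: (2, 1, 0), 3: (1, 0, 2)}

S = finite_extension(T, rho, X, Y)

# With uniform measure on a finite set, measure-preserving means bijective.
is_bijection = sorted(S.values()) == sorted(S.keys())
print(is_bijection)
```

The check succeeds for any bijective base map and any choice of permutations, since each fibre is mapped bijectively onto the fibre over the image point.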

An example of finite extensions comes from group theory. Suppose we have a short exact sequence

$$0 \to K \to G \to H \to 0$$

of finite groups. Let $g$ be a group element of $G$, and let $h$ be its projection in $H$. Then the shift map $x \mapsto gx$ on $G$ (with the discrete $\sigma$-algebra and uniform probability measure) can be viewed as a finite extension of the shift map $y \mapsto hy$ on $H$ (again with the discrete $\sigma$-algebra and uniform probability measure), by arbitrarily selecting a section $\phi: H \to G$ that inverts the projection map, identifying $G$ with $H \times K$ by identifying $\phi(y) k$ with $(y, k)$ for $y \in H$, $k \in K$, and using the cocycle

$$\rho(y) k := \phi(hy)^{-1} g \phi(y) k.$$

Thus, for instance, the unit shift $x \mapsto x + 1$ on $\mathbf{Z}/N\mathbf{Z}$ can be thought of as a finite extension of the unit shift $x \mapsto x + 1$ on $\mathbf{Z}/M\mathbf{Z}$ whenever $N$ is a multiple of $M$.
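The cyclic group example can be checked by hand or by machine. The sketch below assumes the hypothetical sizes $N = 12$ and $M = 4$, takes the section that sends a residue mod $M$ to its representative in $\{0, \dots, M-1\}$, and writes the cocycle additively as $1 + \phi(y) - \phi(y+1)$; these specific choices are illustrative assumptions. It then verifies that the resulting skew product on $\mathbf{Z}/M\mathbf{Z} \times K$ is carried to the unit shift on $\mathbf{Z}/N\mathbf{Z}$.

```python
N, M = 12, 4                                # hypothetical sizes, M divides N
H = range(M)                                # base Z/MZ
K = [k for k in range(N) if k % M == 0]     # kernel of the projection x -> x mod M

def phi(y):
    """Section H -> G: the representative of y in {0, ..., M-1}."""
    return y % M

def cocycle(y):
    """Additive cocycle 1 + phi(y) - phi(y + 1), an element of K."""
    return (1 + phi(y) - phi(y + 1)) % N

def S(y, k):
    """Skew-product shift on H x K."""
    return ((y + 1) % M, (k + cocycle(y)) % N)

def to_G(y, k):
    """Identification of (y, k) with phi(y) + k in Z/NZ."""
    return (phi(y) + k) % N

# The identification should intertwine S with the unit shift on Z/NZ.
conjugates = all(to_G(*S(y, k)) == (to_G(y, k) + 1) % N for y in H for k in K)
print(conjugates)
```

With this section the cocycle is trivial except at $y = M - 1$, where it adds $M$: the fibre coordinate only moves when the base coordinate wraps around.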

Another example comes from Riemannian geometry. If $\hat M$ is a Riemannian manifold that is a finite cover of another Riemannian manifold $M$ (with the metric on $\hat M$ being the pullback of that on $M$), then (unit time) geodesic flow on the cosphere bundle of $\hat M$ is a finite extension of the corresponding flow on $M$.

Here, then, is the finite extension special case of the compact extension step in the proof of the multiple recurrence theorem:

Proposition 2 (Finite extensions) Let $X \ltimes_\rho Y$ be a finite extension of a measure-preserving system $X$. If $X$ obeys the conclusion of the Furstenberg multiple recurrence theorem, then so does $X \ltimes_\rho Y$.

Before we prove this proposition, let us first give the combinatorial analogue.

Lemma 3 Let $A$ be a subset of the integers that contains arbitrarily long arithmetic progressions, and let $A = A_1 \cup \dots \cup A_M$ be a colouring of $A$ by $M$ colours (or equivalently, a partition of $A$ into $M$ colour classes $A_i$). Then at least one of the $A_i$ contains arbitrarily long arithmetic progressions.

*Proof:* By the infinite pigeonhole principle, it suffices to show that for each $k \geq 1$, one of the colour classes $A_i$ contains an arithmetic progression of length $k$.

Let $N$ be a large integer (depending on $k$ and $M$) to be chosen later. Then $A$ contains an arithmetic progression of length $N$, which may be identified with $\{0, \dots, N-1\}$. The colouring of $A$ then induces a colouring on $\{0, \dots, N-1\}$ into $M$ colour classes. Applying (the finitary form of) van der Waerden’s theorem, we conclude that if $N$ is sufficiently large depending on $M$ and $k$, then one of these colour classes contains an arithmetic progression of length $k$; undoing the identification, we conclude that one of the $A_i$ contains an arithmetic progression of length $k$, as desired.

Of course, by specialising to the case $A = \mathbf{Z}$, we see that the above Lemma is in fact equivalent to van der Waerden’s theorem.
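The finitary form of van der Waerden’s theorem used in this proof can be tested by brute force in the smallest interesting case. The sketch below (function names are my own) checks every colouring of $\{0, \dots, n-1\}$ with two colours for a monochromatic progression of length three; it relies on the known value $W(2,3) = 9$ of the corresponding van der Waerden number.

```python
from itertools import product

def monochromatic_ap(colouring, k):
    """Return (a, r) with r > 0 such that a, a+r, ..., a+(k-1)r all share a colour, else None."""
    n = len(colouring)
    for a in range(n):
        for r in range(1, n):
            if a + (k - 1) * r >= n:
                break  # larger steps only overshoot further
            if all(colouring[a + i * r] == colouring[a] for i in range(k)):
                return a, r
    return None

def every_colouring_has_ap(n, colours, k):
    """Check all colourings of {0, ..., n-1} for a monochromatic k-term progression."""
    return all(
        monochromatic_ap(c, k) is not None
        for c in product(range(colours), repeat=n)
    )

print(every_colouring_has_ap(9, 2, 3))  # True: nine points always suffice
print(every_colouring_has_ap(8, 2, 3))  # False: eight points do not
```

The failure at eight points is witnessed by the colouring `(0, 0, 1, 1, 0, 0, 1, 1)`, which contains no monochromatic three-term progression.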

Now we prove Proposition 2.

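*Proof:* Fix $k \geq 1$. Let $E$ be a positive measure subset of $X \ltimes_\rho Y = (X \times Y, \mathcal{X} \times \mathcal{Y}, \mu \times \nu, S)$. By Fubini’s theorem, we have

$$\mu \times \nu(E) = \int_X f(x)\ d\mu(x)$$

where $f(x) := \nu(E_x)$ and $E_x := \{ y \in Y : (x, y) \in E \}$ is the fibre of $E$ at $x$. Since $\mu \times \nu(E)$ is positive, we conclude that the set

$$F := \{ x \in X : f(x) > 0 \}$$

is a positive measure subset of $X$. Note that for each $x \in F$, we can find an element $g(x) \in Y$ such that $(x, g(x)) \in E$. While not strictly necessary for this argument, one can ensure if one wishes that the function $g$ is measurable by totally ordering $Y$, and then letting $g(x)$ be the minimal element $y$ of $Y$ for which $(x, y) \in E$.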

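Let $K$ be a large integer (which will depend on $k$ and the cardinality $M$ of $Y$) to be chosen later. Because $X$ obeys the multiple recurrence theorem, we can find a positive integer $n$ and $x \in X$ such that

$$x, T^n x, T^{2n} x, \dots, T^{(K-1)n} x \in F.$$

Now consider the sequence of $K$ points

$$S^{-mn}( T^{mn} x, g(T^{mn} x) )$$

for $m = 0, \dots, K-1$. From (1), we see that

$$S^{-mn}( T^{mn} x, g(T^{mn} x) ) = (x, c(m)) \qquad (2)$$

for some sequence $c(0), \dots, c(K-1) \in Y$. This can be viewed as a colouring of $\{0, \dots, K-1\}$ by $M$ colours, where $M$ is the cardinality of $Y$. Applying van der Waerden’s theorem, we conclude (if $K$ is sufficiently large depending on $k$ and $M$) that there is an arithmetic progression $a, a+r, \dots, a+(k-1)r$ in $\{0, \dots, K-1\}$ with $r > 0$ such that

$$c(a) = c(a+r) = \dots = c(a+(k-1)r) = c$$

for some $c \in Y$. If we then let $y := (x, c)$, we see from (2) that

$$S^{an + irn} y = ( T^{(a+ir)n} x, g(T^{(a+ir)n} x) ) \in E$$

for all $i = 0, \dots, k-1$, and the claim follows.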
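The steps of this proof can be traced on a small example. The sketch below is only an illustration under hypothetical choices: the base is the unit shift on $\mathbf{Z}/11\mathbf{Z}$, the fibre has two points, the cocycle swaps them at multiples of 3, and $E$ is taken to be the graph of a selector function, so that every base point has a non-empty fibre. It takes $k = 3$ and uses nine base points, which suffices because $W(2,3) = 9$. Instead of applying an inverse power of the shift, the colour at each step is found by a forward search over the fibre, which gives the same value.

```python
# Hypothetical toy system: X = Z/11Z with T x = x + 1, fibre Y = {0, 1}.
N, Y, k, K = 11, (0, 1), 3, 9     # K = 9 suffices for k = 3 and |Y| = 2

def T(x):
    return (x + 1) % N

def rho(x):
    """Cocycle: swap the two fibre points when x is a multiple of 3, else identity."""
    return (1, 0) if x % 3 == 0 else (0, 1)

def S(x, y):
    """Shift on the extension: S(x, y) = (T x, rho(x) y)."""
    return T(x), rho(x)[y]

def S_iter(x, y, m):
    """Apply S to (x, y) a total of m times."""
    for _ in range(m):
        x, y = S(x, y)
    return x, y

def g(x):
    """Selected fibre point; E is the graph of g, so every base point is usable."""
    return x % 2

E = {(x, g(x)) for x in range(N)}

x0, n = 0, 1   # x0, T^n x0, ..., T^{(K-1)n} x0 all have non-empty fibres

def c(m):
    """The unique y with S^{mn}(x0, y) = (T^{mn} x0, g(T^{mn} x0))."""
    for y in Y:
        xm, ym = S_iter(x0, y, m * n)
        if ym == g(xm):
            return y

colours = [c(m) for m in range(K)]

# Search for a monochromatic progression a, a+r, ..., a+(k-1)r in {0, ..., K-1}.
a, r = next(
    (a, r)
    for a in range(K)
    for r in range(1, K)
    if a + (k - 1) * r < K
    and len({colours[a + i * r] for i in range(k)}) == 1
)
y0 = colours[a]

# The points S^{(a+ir)n}(x0, y0) should all land in E.
hits = [S_iter(x0, y0, (a + i * r) * n) for i in range(k)]
print(all(p in E for p in hits))
```

The search for `(a, r)` cannot fail here, since any two-colouring of nine consecutive integers contains a monochromatic three-term progression.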

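Remark 1 The precise connection between Lemma 3 and Proposition 2 arises from the following observation: with $E, F, g$ as in the proof of Proposition 2, and $x \in X$, the set

$$A := \{ n \in \mathbf{Z} : T^n x \in F \}$$

can be partitioned into the classes

$$A_i := \{ n \in \mathbf{Z} : S^n (x, i) \in E' \}, \quad i \in Y,$$

where $E' := \{ (x', g(x')) : x' \in F \} \subset E$ is the graph of $g$. The multiple recurrence property for $X$ ensures that $A$ contains arbitrarily long arithmetic progressions, and so therefore one of the $A_i$ must also, which gives the multiple recurrence property for $X \ltimes_\rho Y$.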

Remark 2 Compact extensions can be viewed as a generalisation of finite extensions, in which the fibres are no longer finite sets, but are themselves measure spaces obeying an additional property, which roughly speaking asserts that for many functions $f$ on the extension, the shifts $T^n f$ of $f$ behave in an almost periodic fashion on most fibres, so that the orbits $\{ T^n f : n \in \mathbf{Z} \}$ become totally bounded on each fibre. This total boundedness allows one to obtain an analogue of the above colouring map $c$ to which van der Waerden’s theorem can be applied.
