You are currently browsing the tag archive for the ‘Hilbert’s fifth problem’ tag.
In the last few months, I have been working my way through the theory behind the solution to Hilbert’s fifth problem, as I (together with Emmanuel Breuillard, Ben Green, and Tom Sanders) have found this theory to be useful in obtaining noncommutative inverse sumset theorems in arbitrary groups; I hope to be able to report on this connection at some later point on this blog. Among other things, this theory achieves the remarkable feat of creating a smooth Lie group structure out of what is ostensibly a much weaker structure, namely the structure of a locally compact group. The ability of algebraic structure (in this case, group structure) to upgrade weak regularity (in this case, continuous structure) to strong regularity (in this case, smooth and even analytic structure) seems to be a recurring theme in mathematics, and an important part of what I like to call the “dichotomy between structure and randomness”.
The theory of Hilbert’s fifth problem sprawls across many subfields of mathematics: Lie theory, representation theory, group theory, nonabelian Fourier analysis, point-set topology, and even a little bit of group cohomology. The latter aspect of this theory is what I want to focus on today. The general question that comes into play here is the extension problem: given two (topological or Lie) groups and , what is the structure of the possible groups that are formed by extending by . In other words, given a short exact sequence
to what extent is the structure of determined by that of and ?
As an example of why understanding the extension problem would help in structural theory, let us consider the task of classifying the structure of a Lie group . Firstly, we factor out the connected component of the identity as
as Lie groups are locally connected, is discrete. Thus, to understand general Lie groups, it suffices to understand the extensions of discrete groups by connected Lie groups.
Next, to study a connected Lie group , we can consider the conjugation action on the Lie algebra , which gives the adjoint representation . The kernel of this representation consists of all the group elements that commute with all elements of the Lie algebra, and thus (by connectedness) is the center of . The adjoint representation is then faithful on the quotient . The short exact sequence
then describes as a central extension (by the abelian Lie group ) of , which is a connected Lie group with a faithful finite-dimensional linear representation.
This suggests a route to Hilbert’s fifth problem, at least in the case of connected groups . Let be a connected locally compact group that we hope to demonstrate is isomorphic to a Lie group. As discussed in a previous post, we first form the space of one-parameter subgroups of (which should, eventually, become the Lie algebra of ). Hopefully, has the structure of a vector space. The group acts on by conjugation; this action should be both continuous and linear, giving an “adjoint representation” . The kernel of this representation should then be the center of . The quotient is locally compact and has a faithful linear representation, and is thus a Lie group by von Neumann’s version of Cartan’s theorem (discussed in this previous post). The group is locally compact abelian, and so it should be a relatively easy task to establish that it is also a Lie group. To finish the job, one needs the following result:
This result can be obtained by combining a result of Kuranishi with a result of Gleason; I am recording this argument below the fold. The point here is that while is initially only a topological group, the smooth structures of and can be combined (after a little bit of cohomology) to create the smooth structure on required to upgrade from a topological group to a Lie group. One of the main ideas here is to improve the behaviour of a cocycle by averaging it; this basic trick is helpful elsewhere in the theory, resolving a number of cohomological issues in topological group theory. The result can be generalised to show in fact that arbitrary (topological) extensions of Lie groups by Lie groups remain Lie; this was shown by Gleason. However, the above special case of this result is already sufficient (in conjunction with the rest of the theory, of course) to resolve Hilbert’s fifth problem.
Remark 1 We have shown in the above discussion that every connected Lie group is a central extension (by an abelian Lie group) of a Lie group with a faithful continuous linear representation. It is natural to ask whether this central extension is necessary. Unfortunately, not every connected Lie group admits a faithful continuous linear representation. An example (due to Birkhoff) is the Heisenberg-Weyl group
Indeed, if we consider the group elements
for some prime , then one easily verifies that has order and is central, and that is conjugate to . If we have a faithful linear representation of , then must have at least one eigenvalue that is a primitive root of unity. If is the eigenspace associated to , then must preserve , and be conjugate to on this space. This forces to have at least distinct eigenvalues on , and hence (and thus ) must have dimension at least . Letting we obtain a contradiction. (On the other hand, is certainly isomorphic to the extension of the linear group by the abelian group .)
In order to understand the structure of a topological group , a basic strategy is to try to split into two smaller factor groups by exhibiting a short exact sequence
If one has such a sequence, then is an extension of by (which includes direct products and semidirect products as examples, but can be more general than these situations, as discussed in this previous blog post). In principle, the problem of understanding the structure of then splits into three simpler problems:
- (Horizontal structure) Understanding the structure of the “horizontal” group .
- (Vertical structure) Understanding the structure of the “vertical” group .
- (Cohomology) Understanding the ways in which one can extend by .
The “cohomological” aspect to this program can be nontrivial. However, in principle at least, this strategy reduces the study of the large group to the study of the smaller groups . (This type of splitting strategy is not restricted to topological groups, but can also be adapted to many other categories, particularly those of groups or group-like objects.) Typically, splitting alone does not fully kill off a structural classification problem, but it can reduce matters to studying those objects which are somehow “simple” or “irreducible”. For instance, this strategy can often be used to reduce questions about arbitrary finite groups to finite simple groups.
A simple example of splitting is as follows. Given any topological group , one can form the connected component of the identity – the maximal connected set containing the identity. It is not difficult to show that is a closed (and thus also locally compact) normal subgroup of , whose quotient is another locally compact group. Furthermore, due to the maximal connected nature of , is totally disconnected – the only connected sets are the singletons. In particular, is Hausdorff (the identity element is closed). Thus we have obtained a splitting
of an arbitrary locally compact group into a connected locally compact group , and a totally disconnected locally compact group . In principle at least, the study of locally compact groups thus splits into the study of connected locally compact groups, and the study of totally disconnected locally compact groups (though the cohomological issues are not always trivial).
In the structural theory of totally disconnected locally compact groups, the first basic theorem in the subject is van Dantzig’s theorem (which we prove below the fold):
Example 1 Let be a prime. Then the -adic field (with the usual -adic valuation) is totally disconnected locally compact, and the -adic integers are a compact open subgroup.
Of course, this situation is the polar opposite of what occurs in the connected case, in which the only open subgroup is the whole group.
In view of van Dantzig’s theorem, we see that the “local” behaviour of totally disconnected locally compact groups can be modeled by the compact totally disconnected groups, which are better understood (for instance, one can start analysing them using the Peter-Weyl theorem, as discussed in this previous post). The global behaviour however remains more complicated, in part because the compact open subgroup given by van Dantzig’s theorem need not be normal, and so does not necessarily induce a splitting of into compact and discrete factors.
Example 2 Let be a prime, and let be the semi-direct product , where the integers act on by the map , and we give the product of the discrete topology of and the -adic topology on . One easily verifies that is a totally disconnected locally compact group. It certainly has compact open subgroups, such as . However, it is easy to show that has no non-trivial compact normal subgroups (the problem is that the conjugation action of on has all non-trivial orbits unbounded).
Returning to more general locally compact groups, we obtain an immediate corollary:
Indeed, one applies van Dantzig’s theorem to the totally disconnected group , and then pulls back the resulting compact open subgroup.
Now we mention another application of van Dantzig’s theorem, of more direct relevance to Hilbert’s fifth problem. Define a generalised Lie group to be a topological group with the property that given any open neighbourhood of the identity, there exists an open subgroup of and a compact normal subgroup of in such that is isomorphic to a Lie group. It is easy to see that such groups are locally compact. The deep Gleason-Yamabe theorem, which among other things establishes a satisfactory solution to Hilbert’s fifth problem (and which we will not prove here), asserts the converse:
Theorem 3 (Gleason-Yamabe theorem) Every locally compact group is a generalised Lie group.
Example 3 We consider the locally compact group from Example 2. This is of course not a Lie group. However, any open neighbourhood of the identity in will contain the compact subgroup for some integer . The open subgroup then has isomorphic to the discrete finite group , which is certainly a Lie group. Thus is a generalised Lie group.
One important example of generalised Lie groups are those locally compact groups which are an inverse limit (or projective limit) of Lie groups. Indeed, suppose we have a family of Lie groups indexed by partially ordered set which is directed in the sense that every finite subset of has an upper bound, together with continuous homomorphisms for all which form a category in the sense that for all . Then we can form the inverse limit
which is the subgroup of consisting of all tuples which are compatible with the in the sense that for all . If we endow with the product topology, then is a closed subgroup of , and thus has the structure of a topological group, with continuous homomorphisms which are compatible with the in the sense that for all . Such an inverse limit need not be locally compact; for instance, the inverse limit
of Euclidean spaces with the usual coordinate projection maps is isomorphic to the infinite product space with the product topology, which is not locally compact. However, if an inverse limit
of Lie groups is locally compact, it can be easily seen to be a generalised Lie group. Indeed, by local compactness, any open neighbourhood of the identity will contain an open precompact neighbourhood of the identity; by construction of the product topology (and the directed nature of ), this smaller neighbourhood will in turn will contain the kernel of one of the , which will be compact since the preceding neighbourhood was precompact. Quotienting out by this we obtain a locally compact subgroup of the Lie group , which is necessarily again a Lie group by Cartan’s theorem, and the claim follows.
We show Theorem 4 below the fold. Combining this with the (substantially more difficult) Gleason-Yamabe theorem, we obtain quite a satisfactory description of the local structure of locally compact groups. (The situation is particularly simple for connected groups, which have no non-trivial open subgroups; we then conclude that every connected locally compact Hausdorff group is the inverse limit of Lie groups.)
Example 4 The locally compact group is not an inverse limit of Lie groups because (as noted earlier) it has no non-trivial compact normal subgroups, which would contradict the preceding analysis that showed that all locally compact inverse limits of Lie groups were generalised Lie groups. On the other hand, contains the open subgroup , which is the inverse limit of the discrete (and thus Lie) groups for (where we give the usual ordering, and use the obvious projection maps).
This is another post in a series on various components to the solution of Hilbert’s fifth problem. One interpretation of this problem is to ask for a purely topological classification of the topological groups which are isomorphic to Lie groups. (Here we require Lie groups to be finite-dimensional, but allow them to be disconnected.)
There are some obvious necessary conditions on a topological group in order for it to be isomorphic to a Lie group; for instance, it must be Hausdorff and locally compact. These two conditions, by themselves, are not quite enough to force a Lie group structure; consider for instance a -adic field for some prime , which is a locally compact Hausdorff topological group which is not a Lie group (the topology is locally that of a Cantor set). Nevertheless, it turns out that by adding some key additional assumptions on the topological group, one can recover Lie structure. One such result, which is a key component of the full solution to Hilbert’s fifth problem, is the following result of von Neumann:
Theorem 1 Let be a locally compact Hausdorff topological group that has a faithful finite-dimensional linear representation, i.e. an injective continuous homomorphism into some linear group. Then can be given the structure of a Lie group. Furthermore, after giving this Lie structure, becomes smooth (and even analytic) and non-degenerate (the Jacobian always has full rank).
This result is closely related to a theorem of Cartan:
Indeed, Theorem 1 immediately implies Theorem 2 in the important special case when the ambient Lie group is a linear group, and in any event it is not difficult to modify the proof of Theorem 1 to give a proof of Theorem 2. However, Theorem 1 is more general than Theorem 2 in some ways. For instance, let be the real line , which we faithfully represent in the -torus using an irrational embedding for some fixed irrational . The -torus can in turn be embedded in a linear group (e.g. by identifying it with , or ), thus giving a faithful linear representation of . However, the image is not closed (it is a dense subgroup of a -torus), and so Cartan’s theorem does not directly apply ( fails to be a Lie group). Nevertheless, Theorem 1 still applies and guarantees that the original group is a Lie group.
The key to building the Lie group structure on a topological group is to first build the associated Lie algebra structure, by means of one-parameter subgroups.
Definition 3 A one-parameter subgroup of a topological group is a continuous homomorphism from the real line (with the additive group structure) to .
Remark 1 Technically, is a parameterisation of a subgroup , rather than a subgroup itself, but we will abuse notation and refer to as the subgroup.
In a Lie group , the one-parameter subgroups are in one-to-one correspondence with the Lie algebra , with each element giving rise to a one-parameter subgroup , and conversely each one-parameter subgroup giving rise to an element of the Lie algebra; we will establish these basic facts in the special case of linear groups below the fold. On the other hand, the notion of a one-parameter subgroup can be defined in an arbitrary topological group. So this suggests the following strategy if one is to try to represent a topological group as a Lie group:
- First, form the space of one-parameter subgroups of .
- Show that has the structure of a (finite-dimensional) Lie algebra.
- Show that “behaves like” the tangent space of at the identity (in particular, the one-parameter subgroups in should cover a neighbourhood of the identity in ).
- Conclude that has the structure of a Lie group.
It turns out that this strategy indeed works to give Theorem 1 (and variants of this strategy are ubiquitious in the rest of the theory surrounding Hilbert’s fifth problem).
Below the fold, I record the proof of Theorem 1 (based on the exposition of Montgomery and Zippin). I plan to organise these disparate posts surrounding Hilbert’s fifth problem (and its application to related topics, such as Gromov’s theorem or to the classification of approximate groups) at a later date.
Recall that a (real) topological vector space is a real vector space equipped with a topology that makes the vector space operations and continuous. One often restricts attention to Hausdorff topological vector spaces; in practice, this is not a severe restriction because it turns out that any topological vector space can be made Hausdorff by quotienting out the closure of the origin . One can also discuss complex topological vector spaces, and the theory is not significantly different; but for sake of exposition we shall restrict attention here to the real case.
An obvious example of a topological vector space is a finite-dimensional vector space such as with the usual topology. Of course, there are plenty of infinite-dimensional topological vector spaces also, such as infinite-dimensional normed vector spaces (with the strong, weak, or weak-* topologies) or Frechet spaces.
One way to distinguish the finite and infinite dimensional topological vector spaces is via local compactness. Recall that a topological space is locally compact if every point in that space has a compact neighbourhood. From the Heine-Borel theorem, all finite-dimensional vector spaces (with the usual topology) are locally compact. In infinite dimensions, one can trivially make a vector space locally compact by giving it a trivial topology, but once one restricts to the Hausdorff case, it seems impossible to make a space locally compact. For instance, in an infinite-dimensional normed vector space with the strong topology, an iteration of the Riesz lemma shows that the closed unit ball in that space contains an infinite sequence with no convergent subsequence, which (by the Heine-Borel theorem) implies that cannot be locally compact. If one gives the weak-* topology instead, then is now compact by the Banach-Alaoglu theorem, but is no longer a neighbourhood of the identity in this topology. In fact, we have the following result:
The first proof of this theorem that I am aware of is by André Weil. There is also a related result:
As a corollary, every locally compact Hausdorff topological vector space is in fact isomorphic to with the usual topology for some . This can be viewed as a very special case of the theorem of Gleason, which is a key component of the solution to Hilbert’s fifth problem, that a locally compact group with no small subgroups (in the sense that there is a neighbourhood of the identity that contains no non-trivial subgroups) is necessarily isomorphic to a Lie group. Indeed, Theorem 1 is in fact used in the proof of Gleason’s theorem (the rough idea being to first locate a “tangent space” to at the origin, with the tangent vectors described by “one-parameter subgroups” of , and show that this space is a locally compact Hausdorff topological space, and hence finite dimensional by Theorem 1).
Theorem 2 may seem devoid of content, but it does contain some subtleties, as it hinges crucially on the joint continuity of the vector space operations and , and not just on the separate continuity in each coordinate. Consider for instance the one-dimensional vector space with the co-compact topology (a non-empty set is open iff its complement is compact in the usual topology). In this topology, the space is (though not Hausdorff), the scalar multiplication map is jointly continuous as long as the scalar is not zero, and the addition map is continuous in each coordinate (i.e. translations are continuous), but not jointly continuous; for instance, the set does not contain a non-trivial Cartesian product of two sets that are open in the co-compact topology. So this is not a counterexample to Theorem 2. Similarly for the cocountable or cofinite topologies on (the latter topology, incidentally, is the same as the Zariski topology on ).
Another near-counterexample comes from the topology of inherited by pulling back the usual topology on the unit circle . Admittedly, this pullback topology is not quite Hausdorff, but the addition map is jointly continuous. On the other hand, the scalar multiplication map is not continuous at all. A slight variant of this topology comes from pulling back the usual topology on the torus under the map for some irrational ; this restores the Hausdorff property, and addition is still jointly continuous, but multiplication remains discontinuous.
As some final examples, consider with the discrete topology; here, the topology is Hausdorff, addition is jointly continuous, and every dilation is continuous, but multiplication is not jointly continuous. If one instead gives the half-open topology, then again the topology is Hausdorff and addition is jointly continuous, but scalar multiplication is only jointly continuous once one restricts the scalar to be non-negative.
Below the fold, I record the textbook proof of Theorem 2 and Theorem 1. There is nothing particularly original in this presentation, but I wanted to record it here for my own future reference, and perhaps these results will also be of interest to some other readers.
A topological space is said to be metrisable if one can find a metric on it whose open balls generate the topology.
There are some obvious necessary conditions on the space in order for it to be metrisable. For instance, it must be Hausdorff, since all metric spaces are Hausdorff. It must also be first countable, because every point in a metric space has a countable neighbourhood base of balls , .
In the converse direction, being Hausdorff and first countable is not always enough to guarantee metrisability, for a variety of reasons. For instance the long line is not metrisable despite being both Hausdorff and first countable, due to a failure of paracompactness, which prevents one from gluing together the local metric structures on this line into a global one. Even after adding in paracompactness, this is still not enough; the real line with the lower limit topology (also known as the Sorgenfrey line) is Hausdorff, first countable, and paracompact, but still not metrisable (because of a failure of second countability despite being separable).
However, there is one important setting in which the Hausdorff and first countability axioms do suffice to give metrisability, and that is the setting of topological groups:
Theorem 1 (Birkhoff-Kakutani theorem) Let be a topological group (i.e. a topological space that is also a group, such that the group operations and are continuous). Then is metrisable if and only if it is both Hausdorff and first countable.
Remark 1 It is not hard to show that a topological group is Hausdorff if and only if the singleton set is closed. More generally, in an arbitrary topological group, it is a good exercise to show that the closure of is always a closed normal subgroup of , whose quotient is then a Hausdorff topological group. Because of this, the study of topological groups can usually be reduced immediately to the study of Hausdorff topological groups. (Indeed, in many texts, topological groups are automatically understood to be an abbreviation for “Hausdorff topological group”.)
The standard proof of the Birkhoff-Kakutani theorem (which we have taken from this book of Montgomery and Zippin) relies on the following Urysohn-type lemma:
- (Unique maximum) , and for all .
- (Neighbourhood base) The sets for form a neighbourhood base at the identity.
- (Uniform continuity) For every , there exists an open neighbourhood of the identity such that for all and .
Note that if had a left-invariant metric, then the function would suffice for this lemma, which already gives some indication as to why this lemma is relevant to the Birkhoff-Kakutani theorem.
Let us assume Lemma 2 for now and finish the proof of the Birkhoff-Kakutani theorem. We only prove the difficult direction, namely that a Hausdorff first countable topological group is metrisable. We let be the function from Lemma 2, and define the function by the formula
where is the space of bounded continuous functions on (with the supremum norm) and is the left-translation operator .
Clearly obeys the the identity and symmetry axioms, and the triangle inequality is also immediate. This already makes a pseudometric. In order for to be a genuine metric, what is needed is that have no non-trivial translation invariances, i.e. one has for all . But this follows since attains its maximum at exactly one point, namely the group identity .
To put it another way: because has no non-trivial translation invariances, the left translation action gives an embedding , and then inherits a metric from the metric structure on .
Now we have to check whether the metric actually generates the topology. This amounts to verifying two things. Firstly, that every ball in this metric is open; and secondly, that every open neighbourhood of a point contains a ball .
To verify the former claim, it suffices to show that the map from to is continuous, follows from the uniform continuity hypothesis. The second claim follows easily from the neighbourhood base hypothesis, since if then .
Remark 2 The above argument in fact shows that if a group is metrisable, then it admits a left-invariant metric. The idea of using a suitable continuous function to generate a useful metric structure on a topological group is a powerful one, for instance underlying the Gleason lemmas which are fundamental to the solution of Hilbert’s fifth problem. I hope to return to this topic in a future post.
Now we prove Lemma 2. By first countability, we can find a countable neighbourhood base
of the identity. As is Hausdorff, we must have
For every dyadic rational in , we can now define the open sets by setting
for all and .
We now set
with the understanding that if the supremum is over the empty set. One easily verifies using (4) that is continuous, and furthermore obeys the uniform continuity property. The neighbourhood base property follows since the are a neighbourhood base of the identity, and the unique maximum property follows from (3). This proves Lemma 2.
Remark 3 A very similar argument to the one above also establishes that every topological group is completely regular.
Notice that the function constructed in the above argument was localised to the set . As such, it is not difficult to localise the Birkhoff-Kakutani theorem to local groups. A local group is a topological space equipped with an identity , a partially defined inversion operation , and a partially defined product operation , where , are open subsets of and , obeying the following restricted versions of the group axioms:
- (Continuity) and are continuous on their domains of definition.
- (Identity) For any , and are well-defined and equal to .
- (Inverse) For any , and are well-defined and equal to . is well-defined and equal to .
- (Local associativity) If are such that , , , and are all well-defined, then .
Informally, one can view a local group as a topological group in which the closure axiom has been almost completely dropped, but with all the other axioms retained. A basic way to generate a local group is to start with an ordinary topological group and restrict it to an open neighbourhood of the identity, with and . However, this is not quite the only way to generate local groups (ultimately because the local associativity axiom does not necessarily imply a (stronger) global associativity axiom in which one considers two different ways to multiply more than three group elements together).
Remark 4 Another important example of a local group is that of a group chunk, in which the sets and are somehow “generic”; for instance, could be an algebraic variety, Zariski-open, and the group operations birational on their domains of definition. This is somewhat analogous to the notion of a “ group” in additive combinatorics. There are a number of group chunk theorems, starting with a theorem of Weil in the algebraic setting, which roughly speaking assert that a generic portion of a group chunk can be identified with the generic portion of a genuine group.
We then have
Theorem 3 (Birkhoff-Kakutani theorem for local groups) Let be a local group which is Hausdorff and first countable. Then there exists an open neighbourhood of the identity which is metrisable.
Proof: (Sketch) It is not difficult to see that in a local group , one can find a symmetric neighbourhood of the identity such that the product of any (say) elements of (multiplied together in any order) are well-defined, which effectively allows us to treat elements of as if they belonged to a group for the purposes of simple algebraic manipulation, such as applying the cancellation laws for . Inside this , one can then repeat the previous arguments and eventually end up with a continuous function supported in obeying the conclusions of Lemma 2 (but in the uniform continuity conclusion, one has to restrict to, say, , to avoid issues of ill-definedness). The definition (1) then gives a metric on with the required properties, where we make the convention that vanishes for (say) and .
My motivation for studying local groups is that it turns out that there is a correspondence (first observed by Hrushovski) between the concept of an approximate group in additive combinatorics, and a locally compact local group in topological group theory; I hope to discuss this correspondence further in a subsequent post.
(One can of course define Lie algebras over other fields than the complex numbers , but in order to avoid some technical issues we shall work solely with the complex case in this post.)
An important special case of the abstract Lie algebras are the concrete Lie algebras, in which is a vector space of linear transformations on a vector space (which again can be either finite or infinite dimensional), and the bilinear form is given by the usual Lie bracket
It is easy to verify that every concrete Lie algebra is an abstract Lie algebra. In the converse direction, we have
To prove this theorem, we introduce the useful algebraic tool of the universal enveloping algebra of the abstract Lie algebra . This is the free (associative, complex) algebra generated by (viewed as a complex vector space), subject to the constraints
This algebra is described by the Poincaré-Birkhoff-Witt theorem, which asserts that given an ordered basis of as a vector space, that a basis of is given by “monomials” of the form
where is a natural number, the are an increasing sequence of indices in , and the are positive integers. Indeed, given two such monomials, one can express their product as a finite linear combination of further monomials of the form (3) after repeatedly applying (2) (which we rewrite as ) to reorder the terms in this product modulo lower order terms until one all monomials have their indices in the required increasing order. It is then a routine exercise in basic abstract algebra (using all the axioms of an abstract Lie algebra) to verify that this is multiplication rule on monomials does indeed define a complex associative algebra which has the universal properties required of the universal enveloping algebra.
The abstract Lie algebra acts on its universal enveloping algebra by left-multiplication: , thus giving a map from to . It is easy to verify that this map is a Lie algebra homomorphism (so this is indeed an action (or representation) of the Lie algebra), and this action is clearly faithful (i.e. the map from to is injective), since each element of maps the identity element of to a different element of , namely . Thus is isomorphic to its image in , proving Theorem 1.
In the converse direction, every representation of a Lie algebra “factors through” the universal enveloping algebra, in that it extends to an algebra homomorphism from to , which by abuse of notation we shall also call .
One drawback of Theorem 1 is that the space that the concrete Lie algebra acts on will almost always be infinite-dimensional, even when the original Lie algebra is finite-dimensional. However, there is a useful theorem of Ado that rectifies this:
Theorem 2 (Ado’s theorem) Every finite-dimensional abstract Lie algebra is isomorphic to a concrete Lie algebra over a finite-dimensional vector space .
Among other things, this theorem can be used (in conjunction with the Baker-Campbell-Hausdorff formula) to show that every abstract (finite-dimensional) Lie group (or abstract local Lie group) is locally isomorphic to a linear group. (It is well-known, though, that abstract Lie groups are not necessarily globally isomorphic to a linear group, but we will not discuss these global obstructions here.)
Ado’s theorem is surprisingly tricky to prove in general, but some special cases are easy. For instance, one can try using the adjoint representation of on itself, defined by the action ; the Jacobi identity (1) ensures that this indeed a representation of . The kernel of this representation is the centre . This already gives Ado’s theorem in the case when is semisimple, in which case the center is trivial.
The adjoint representation does not suffice, by itself, to prove Ado’s theorem in the non-semisimple case. However, it does provide an important reduction in the proof, namely it reduces matters to showing that every finite-dimensional Lie algebra has a finite-dimensional representation which is faithful on the centre . Indeed, if one has such a representation, one can then take the direct sum of that representation with the adjoint representation to obtain a new finite-dimensional representation which is now faithful on all of , which then gives Ado’s theorem for .
It remains to find a finite-dimensional representation of which is faithful on the centre . In the case when is abelian, so that the centre is all of , this is again easy, because then acts faithfully on by the infinitesimal shear maps . In matrix form, this representation identifies each in this abelian Lie algebra with an “upper-triangular” matrix:
This construction gives a faithful finite-dimensional representation of the centre of any finite-dimensional Lie algebra. The standard proof of Ado’s theorem (which I believe dates back to work of Harish-Chandra) then proceeds by gradually “extending” this representation of the centre to larger and larger sub-algebras of , while preserving the finite-dimensionality of the representation and the faithfulness on , until one obtains a representation on the entire Lie algebra with the required properties. (For technical inductive reasons, one also needs to carry along an additional property of the representation, namely that it maps the nilradical to nilpotent elements, but we will discuss this technicality later.)
This procedure is a little tricky to execute in general, but becomes simpler in the nilpotent case, in which the lower central series becomes trivial for sufficiently large :
Theorem 3 (Ado’s theorem for nilpotent Lie algebras) Let be a finite-dimensional nilpotent Lie algebra. Then there exists a finite-dimensional faithful representation of . Furthermore, there exists a natural number such that , i.e. one has for all .
The second conclusion of Ado’s theorem here is useful for induction purposes. (By Engel’s theorem, this conclusion is also equivalent to the assertion that every element of is nilpotent, but we can prove Theorem 3 without explicitly invoking Engel’s theorem.)
Below the fold, I give a proof of Theorem 3, and then extend the argument to cover the full strength of Ado’s theorem. This is not a new argument – indeed, I am basing this particular presentation from the one in Fulton and Harris – but it was an instructive exercise for me to try to extract the proof of Ado’s theorem from the more general structural theory of Lie algebras (e.g. Engel’s theorem, Lie’s theorem, Levi decomposition, etc.) in which the result is usually placed. (However, the proof I know of still needs Engel’s theorem to establish the solvable case, and the Levi decomposition to then establish the general case.)
Let be a compact group. (Throughout this post, all topological groups are assumed to be Hausdorff.) Then has a number of unitary representations, i.e. continuous homomorphisms to the group of unitary operators on a Hilbert space , equipped with the strong operator topology. In particular, one has the left-regular representation , where we equip with its normalised Haar measure (and the Borel -algebra) to form the Hilbert space , and is the translation operation
We call two unitary representations and isomorphic if one has for some unitary transformation , in which case we write .
Given two unitary representations and , one can form their direct sum in the obvious manner: . Conversely, if a unitary representation has a closed invariant subspace of (thus for all ), then the orthogonal complement is also invariant, leading to a decomposition of into the subrepresentations , . Accordingly, we will call a unitary representation irreducible if is nontrivial (i.e. ) and there are no nontrivial invariant subspaces (i.e. no invariant subspaces other than and ); the irreducible representations play a role in the subject analogous to those of prime numbers in multiplicative number theory. By the principle of infinite descent, every finite-dimensional unitary representation is then expressible (perhaps non-uniquely) as the direct sum of irreducible representations.
The Peter-Weyl theorem asserts, among other things, that the same claim is true for the regular representation:
Theorem 1 (Peter-Weyl theorem) Let be a compact group. Then the regular representation is isomorphic to the direct sum of irreducible representations. In fact, one has , where is an enumeration of the irreducible finite-dimensional unitary representations of (up to isomorphism). (It is not difficult to see that such an enumeration exists.)
In the case when is abelian, the Peter-Weyl theorem is a consequence of the Plancherel theorem; in that case, the irreducible representations are all one dimensional, and are thus indexed by the space of characters (i.e. continuous homomorphisms into the unit circle ), known as the Pontryagin dual of . (See for instance my lecture notes on the Fourier transform.) Conversely, the Peter-Weyl theorem can be used to deduce the Plancherel theorem for compact groups, as well as other basic results in Fourier analysis on these groups, such as the Fourier inversion formula.
Because the regular representation is faithful (i.e. injective), a corollary of the Peter-Weyl theorem (and a classical theorem of Cartan) is that every compact group can be expressed as the inverse limit of Lie groups, leading to a solution to Hilbert’s fifth problem in the compact case. Furthermore, the compact case is then an important building block in the more general theory surrounding Hilbert’s fifth problem, and in particular a result of Yamabe that any locally compact group contains an open subgroup that is the inverse limit of Lie groups.
I’ve recently become interested in the theory around Hilbert’s fifth problem, due to the existence of a correspondence principle between locally compact groups and approximate groups, which play a fundamental role in arithmetic combinatorics. I hope to elaborate upon this correspondence in a subsequent post, but I will mention that versions of this principle play a crucial role in Gromov’s proof of his theorem on groups of polynomial growth (discussed previously on this blog), and in a more recent paper of Hrushovski on approximate groups (also discussed previously). It is also analogous in many ways to the more well-known Furstenberg correspondence principle between ergodic theory and combinatorics (also discussed previously).
Because of the above motivation, I have decided to write some notes on how the Peter-Weyl theorem is proven. This is utterly standard stuff in abstract harmonic analysis; these notes are primarily for my own benefit, but perhaps they may be of interest to some readers also.