You are currently browsing the monthly archive for April 2013.
An abstract finite-dimensional complex Lie algebra, or Lie algebra for short, is a finite-dimensional complex vector space together with an anti-symmetric bilinear form
that obeys the Jacobi identity
for all ; by anti-symmetry one can also rewrite the Jacobi identity as
We will usually omit the subscript from the Lie bracket when this will not cause ambiguity. A homomorphism
between two Lie algebras
is a linear map that respects the Lie bracket, thus
for all
. As with many other classes of mathematical objects, the class of Lie algebras together with their homomorphisms then form a category. One can of course also consider Lie algebras in infinite dimension or over other fields, but we will restrict attention throughout these notes to the finite-dimensional complex case. The trivial, zero-dimensional Lie algebra is denoted
; Lie algebras of positive dimension will be called non-trivial.
Lie algebras come up in many contexts in mathematics, in particular arising as the tangent space of complex Lie groups. It is thus very profitable to think of Lie algebras as being the infinitesimal component of a Lie group, and in particular almost all of the notation and concepts that are applicable to Lie groups (e.g. nilpotence, solvability, extensions, etc.) have infinitesimal counterparts in the category of Lie algebras (often with exactly the same terminology). See this previous blog post for more discussion about the connection between Lie algebras and Lie groups (that post was focused over the reals instead of the complexes, but much of the discussion carries over to the complex case).
A particular example of a Lie algebra is the general linear Lie algebra of linear transformations
on a finite-dimensional complex vector space (or vector space for short)
, with the commutator Lie bracket
; one easily verifies that this is indeed an abstract Lie algebra. We will define a concrete Lie algebra to be a Lie algebra that is a subalgebra of
for some vector space
, and similarly define a representation of a Lie algebra
to be a homomorphism
into a concrete Lie algebra
. It is a deep theorem of Ado (discussed in this previous post) that every abstract Lie algebra is in fact isomorphic to a concrete one (or equivalently, that every abstract Lie algebra has a faithful representation), but we will not need or prove this fact here.
Even without Ado’s theorem, though, the structure of abstract Lie algebras is very well understood. As with objects in many other algebraic categories, a basic way to understand a Lie algebra is to factor it into two simpler algebras
via a short exact sequence
thus one has an injective homomorphism from to
and a surjective homomorphism from
to
such that the image of the former homomorphism is the kernel of the latter. (To be pedantic, a short exact sequence in a general category requires these homomorphisms to be monomorphisms and epimorphisms respectively, but in the category of Lie algebras these turn out to reduce to the more familiar concepts of injectivity and surjectivity respectively.) Given such a sequence, one can (non-uniquely) identify
with the vector space
equipped with a Lie bracket of the form
for some bilinear maps and
that obey some Jacobi-type identities which we will not record here. Understanding exactly what maps
are possible here (up to coordinate change) can be a difficult task (and is one of the key objectives of Lie algebra cohomology), but in principle at least, the problem of understanding
can be reduced to that of understanding that of its factors
. To emphasise this, I will (perhaps idiosyncratically) express the existence of a short exact sequence (3) by the ATLAS-type notation
although one should caution that for given and
, there can be multiple non-isomorphic
that can form a short exact sequence with
, so that
is not a uniquely defined combination of
and
; one could emphasise this by writing
instead of
, though we will not do so here. We will refer to
as an extension of
by
, and read the notation (5) as “
is
-by-
“; confusingly, these two notations reverse the subject and object of “by”, but unfortunately both notations are well entrenched in the literature. We caution that the operation
is not commutative, and it is only partly associative: every Lie algebra of the form
is also of the form
, but the converse is not true (see this previous blog post for some related discussion). As we are working in the infinitesimal world of Lie algebras (which have an additive group operation) rather than Lie groups (in which the group operation is usually written multiplicatively), it may help to think of
as a (twisted) “sum” of
and
rather than a “product”; for instance, we have
and
, and also
.
Special examples of extensions of
by
include the direct sum (or direct product)
(also denoted
), which is given by the construction (4) with
and
both vanishing, and the split extension (or semidirect product)
(also denoted
), which is given by the construction (4) with
vanishing and the bilinear map
taking the form
for some representation of
in the concrete Lie algebra of derivations
of
, that is to say the algebra of linear maps
that obey the Leibniz rule
for all . (The derivation algebra
of a Lie algebra
is analogous to the automorphism group
of a Lie group
, with the two concepts being intertwined by the tangent space functor
from Lie groups to Lie algebras (i.e. the derivation algebra is the infinitesimal version of the automorphism group). Of course, this functor also intertwines the Lie algebra and Lie group versions of most of the other concepts discussed here, such as extensions, semidirect products, etc.)
There are two general ways to factor a Lie algebra as an extension
of a smaller Lie algebra
by another smaller Lie algebra
. One is to locate a Lie algebra ideal (or ideal for short)
in
, thus
, where
denotes the Lie algebra generated by
, and then take
to be the quotient space
in the usual manner; one can check that
,
are also Lie algebras and that we do indeed have a short exact sequence
Conversely, whenever one has a factorisation , one can identify
with an ideal in
, and
with the quotient of
by
.
The other general way to obtain such a factorisation is is to start with a homomorphism of
into another Lie algebra
, take
to be the image
of
, and
to be the kernel
. Again, it is easy to see that this does indeed create a short exact sequence:
Conversely, whenever one has a factorisation , one can identify
with the image of
under some homomorphism, and
with the kernel of that homomorphism. Note that if a representation
is faithful (i.e. injective), then the kernel is trivial and
is isomorphic to
.
Now we consider some examples of factoring some class of Lie algebras into simpler Lie algebras. The easiest examples of Lie algebras to understand are the abelian Lie algebras , in which the Lie bracket identically vanishes. Every one-dimensional Lie algebra is automatically abelian, and thus isomorphic to the scalar algebra
. Conversely, by using an arbitrary linear basis of
, we see that an abelian Lie algebra is isomorphic to the direct sum of one-dimensional algebras. Thus, a Lie algebra is abelian if and only if it is isomorphic to the direct sum of finitely many copies of
.
Now consider a Lie algebra that is not necessarily abelian. We then form the derived algebra
; this algebra is trivial if and only if
is abelian. It is easy to see that
is an ideal whenever
are ideals, so in particular the derived algebra
is an ideal and we thus have the short exact sequence
The algebra is the maximal abelian quotient of
, and is known as the abelianisation of
. If it is trivial, we call the Lie algebra perfect. If instead it is non-trivial, then the derived algebra has strictly smaller dimension than
. From this, it is natural to associate two series to any Lie algebra
, the lower central series
and the derived series
By induction we see that these are both decreasing series of ideals of , with the derived series being slightly smaller (
for all
). We say that a Lie algebra is nilpotent if its lower central series is eventually trivial, and solvable if its derived series eventually becomes trivial. Thus, abelian Lie algebras are nilpotent, and nilpotent Lie algebras are solvable, but the converses are not necessarily true. For instance, in the general linear group
, which can be identified with the Lie algebra of
complex matrices, the subalgebra
of strictly upper triangular matrices is nilpotent (but not abelian for
), while the subalgebra
of upper triangular matrices is solvable (but not nilpotent for
). It is also clear that any subalgebra of a nilpotent algebra is nilpotent, and similarly for solvable or abelian algebras.
From the above discussion we see that a Lie algebra is solvable if and only if it can be represented by a tower of abelian extensions, thus
for some abelian . Similarly, a Lie algebra
is nilpotent if it is expressible as a tower of central extensions (so that in all the extensions
in the above factorisation,
is central in
, where we say that
is central in
if
). We also see that an extension
is solvable if and only of both factors
are solvable. Splitting abelian algebras into cyclic (i.e. one-dimensional) ones, we thus see that a finite-dimensional Lie algebra is solvable if and only if it is polycylic, i.e. it can be represented by a tower of cyclic extensions.
For our next fundamental example of using short exact sequences to split a general Lie algebra into simpler objects, we observe that every abstract Lie algebra has an adjoint representation
, where for each
,
is the linear map
; one easily verifies that this is indeed a representation (indeed, (2) is equivalent to the assertion that
for all
). The kernel of this representation is the center
, which the maximal central subalgebra of
. We thus have the short exact sequence
which, among other things, shows that every abstract Lie algebra is a central extension of a concrete Lie algebra (which can serve as a cheap substitute for Ado’s theorem mentioned earlier).
For our next fundamental decomposition of Lie algebras, we need some more definitions. A Lie algebra is simple if it is non-abelian and has no ideals other than
and
; thus simple Lie algebras cannot be factored
into strictly smaller algebras
. In particular, simple Lie algebras are automatically perfect and centerless. We have the following fundamental theorem:
Theorem 1 (Equivalent definitions of semisimplicity) Let
be a Lie algebra. Then the following are equivalent:
- (i)
does not contain any non-trivial solvable ideal.
- (ii)
does not contain any non-trivial abelian ideal.
- (iii) The Killing form
, defined as the bilinear form
, is non-degenerate on
.
- (iv)
is isomorphic to the direct sum of finitely many non-abelian simple Lie algebras.
We review the proof of this theorem later in these notes. A Lie algebra obeying any (and hence all) of the properties (i)-(iv) is known as a semisimple Lie algebra. The statement (iv) is usually taken as the definition of semisimplicity; the equivalence of (iv) and (i) is a special case of Weyl’s complete reducibility theorem (see Theorem 44), and the equivalence of (iv) and (iii) is known as the Cartan semisimplicity criterion. (The equivalence of (i) and (ii) is easy.)
If and
are solvable ideals of a Lie algebra
, then it is not difficult to see that the vector sum
is also a solvable ideal (because on quotienting by
we see that the derived series of
must eventually fall inside
, and thence must eventually become trivial by the solvability of
). As our Lie algebras are finite dimensional, we conclude that
has a unique maximal solvable ideal, known as the radical
of
. The quotient
is then a Lie algebra with trivial radical, and is thus semisimple by the above theorem, giving the Levi decomposition
expressing an arbitrary Lie algebra as an extension of a semisimple Lie algebra by a solvable algebra
(and it is not hard to see that this is the only possible such extension up to isomorphism). Indeed, a deep theorem of Levi allows one to upgrade this decomposition to a split extension
although we will not need or prove this result here.
In view of the above decompositions, we see that we can factor any Lie algebra (using a suitable combination of direct sums and extensions) into a finite number of simple Lie algebras and the scalar algebra . In principle, this means that one can understand an arbitrary Lie algebra once one understands all the simple Lie algebras (which, being defined over
, are somewhat confusingly referred to as simple complex Lie algebras in the literature). Amazingly, this latter class of algebras are completely classified:
Theorem 2 (Classification of simple Lie algebras) Up to isomorphism, every simple Lie algebra is of one of the following forms:
for some
.
for some
.
for some
.
for some
.
, or
.
.
.
(The precise definition of the classical Lie algebras
and the exceptional Lie algebras
will be recalled later.)
(One can extend the families of classical Lie algebras a little bit to smaller values of
, but the resulting algebras are either isomorphic to other algebras on this list, or cease to be simple; see this previous post for further discussion.)
This classification is a basic starting point for the classification of many other related objects, including Lie algebras and Lie groups over more general fields (e.g. the reals ), as well as finite simple groups. Being so fundamental to the subject, this classification is covered in almost every basic textbook in Lie algebras, and I myself learned it many years ago in an honours undergraduate course back in Australia. The proof is rather lengthy, though, and I have always had difficulty keeping it straight in my head. So I have decided to write some notes on the classification in this blog post, aiming to be self-contained (though moving rapidly). There is no new material in this post, though; it is all drawn from standard reference texts (I relied particularly on Fulton and Harris’s text, which I highly recommend). In fact it seems remarkably hard to deviate from the standard routes given in the literature to the classification; I would be interested in knowing about other ways to reach the classification (or substeps in that classification) that are genuinely different from the orthodox route.
The classification of finite simple groups (CFSG), first announced in 1983 but only fully completed in 2004, is one of the monumental achievements of twentieth century mathematics. Spanning hundreds of papers and tens of thousands of pages, it has been called the “enormous theorem”. A “second generation” proof of the theorem is nearly completed which is a little shorter (estimated at about five thousand pages in length), but currently there is no reasonably sized proof of the classification.
An important precursor of the CFSG is the Feit-Thompson theorem from 1962-1963, which asserts that every finite group of odd order is solvable, or equivalently that every non-abelian finite simple group has even order. This is an immediate consequence of CFSG, and conversely the Feit-Thompson theorem is an essential starting point in the proof of the classification, since it allows one to reduce matters to groups of even order for which key additional tools (such as the Brauer-Fowler theorem) become available. The original proof of the Feit-Thompson theorem is 255 pages long, which is significantly shorter than the proof of the CFSG, but still far from short. While parts of the proof of the Feit-Thompson theorem have been simplified (and it has recently been converted, after six years of effort, into an argument that has been verified by the proof assistant Coq), the available proofs of this theorem are still extremely lengthy by any reasonable standard.
However, there is a significantly simpler special case of the Feit-Thompson theorem that was established previously by Suzuki in 1957, which was influential in the proof of the more general Feit-Thompson theorem (and thus indirectly to the proof of CFSG). Define a CA-group to be a group with the property that the centraliser
of any non-identity element
is abelian; equivalently, the commuting relation
(defined as the relation that holds when
commutes with
, thus
) is an equivalence relation on the non-identity elements
of
. Trivially, every abelian group is CA. A non-abelian example of a CA-group is the
group of invertible affine transformations
on a field
. A little less obviously, the special linear group
over a finite field
is a CA-group when
is a power of two. The finite simple groups of Lie type are not, in general, CA-groups, but when the rank is bounded they tend to behave as if they were “almost CA”; the centraliser of a generic element in
, for instance, when
is bounded and
is large), is typically a maximal torus (because most elements in
are regular semisimple) which is certainly abelian. In view of the CFSG, we thus see that CA or nearly CA groups form an important subclass of the simple groups, and it is thus of interest to study them separately. To this end, we have
Theorem 1 (Suzuki’s theorem on CA-groups) Every finite CA-group of odd order is solvable.
Of course, this theorem is superceded by the more general Feit-Thompson theorem, but Suzuki’s proof is substantially shorter (the original proof is nine pages) and will be given in this post. (See this survey of Solomon for some discussion of the link between Suzuki’s argument and the Feit-Thompson argument.) Suzuki’s analysis can be pushed further to give an essentially complete classification of all the finite CA-groups (of either odd or even order), but we will not pursue these matters here.
Moving even further down the ladder of simple precursors of CSFG is the following theorem of Frobenius from 1901. Define a Frobenius group to be a finite group which has a subgroup
(called the Frobenius complement) with the property that all the non-trivial conjugates
of
for
, intersect
only at the origin. For instance the
group is also a Frobenius group (take
to be the affine transformations that fix a specified point
, e.g. the origin). This example suggests that there is some overlap between the notions of a Frobenius group and a CA group. Indeed, note that if
is a CA-group and
is a maximal abelian subgroup of
, then any conjugate
of
that is not identical to
will intersect
only at the origin (because
and each of its conjugates consist of equivalence classes under the commuting relation
, together with the identity). So if a maximal abelian subgroup
of a CA-group is its own normaliser (thus
is equal to
), then the group is a Frobenius group.
Frobenius’ theorem places an unexpectedly strong amount of structure on a Frobenius group:
Theorem 2 (Frobenius’ theorem) Let
be a Frobenius group with Frobenius complement
. Then there exists a normal subgroup
of
(called the Frobenius kernel of
) such that
is the semi-direct product
of
and
.
Roughly speaking, this theorem indicates that all Frobenius groups “behave” like the example (which is a quintessential example of a semi-direct product).
Note that if every CA-group of odd order was either Frobenius or abelian, then Theorem 2 would imply Theorem 1 by an induction on the order of , since any subgroup of a CA-group is clearly again a CA-group. Indeed, the proof of Suzuki’s theorem does basically proceed by this route (Suzuki’s arguments do indeed imply that CA-groups of odd order are Frobenius or abelian, although we will not quite establish that fact here).
Frobenius’ theorem can be reformulated in the following concrete combinatorial form:
Theorem 3 (Frobenius’ theorem, equivalent version) Let
be a group of permutations acting transitively on a finite set
, with the property that any non-identity permutation in
fixes at most one point in
. Then the set of permutations in
that fix no points in
, together with the identity, is closed under composition.
Again, a good example to keep in mind for this theorem is when is the group of affine permutations on a field
(i.e. the
group for that field), and
is the set of points on that field. In that case, the set of permutations in
that do not fix any points are the non-trivial translations.
To deduce Theorem 3 from Theorem 2, one applies Theorem 2 to the stabiliser of a single point in . Conversely, to deduce Theorem 2 from Theorem 3, set
to be the space of left-cosets of
, with the obvious left
-action; one easily verifies that this action is faithful, transitive, and each non-identity element
of
fixes at most one left-coset of
(basically because it lies in at most one conjugate of
). If we let
be the elements of
that do not fix any point in
, plus the identity, then by Theorem 3
is closed under composition; it is also clearly closed under inverse and conjugation, and is hence a normal subgroup of
. From construction
is the identity plus the complement of all the
conjugates of
, which are all disjoint except at the identity, so by counting elements we see that
As normalises
and is disjoint from
, we thus see that
is all of
, giving Theorem 2.
Despite the appealingly concrete and elementary form of Theorem 3, the only known proofs of that theorem (or equivalently, Theorem 2) in its full generality proceed via the machinery of group characters (which one can think of as a version of Fourier analysis for nonabelian groups). On the other hand, once one establishes the basic theory of these characters (reviewed below the fold), the proof of Frobenius’ theorem is very short, which gives quite a striking example of the power of character theory. The proof of Suzuki’s theorem also proceeds via character theory, and is basically a more involved version of the Frobenius argument; again, no character-free proof of Suzuki’s theorem is currently known. (The proofs of Feit-Thompson and CFSG also involve characters, but those proofs also contain many other arguments of much greater complexity than the character-based portions of the proof.)
It seems to me that the above four theorems (Frobenius, Suzuki, Feit-Thompson, and CFSG) provide a ladder of sorts (with exponentially increasing complexity at each step) to the full classification, and that any new approach to the classification might first begin by revisiting the earlier theorems on this ladder and finding new proofs of these results first (in particular, if one had a “robust” proof of Suzuki’s theorem that also gave non-trivial control on “almost CA-groups” – whatever that means – then this might lead to a new route to classifying the finite simple groups of Lie type and bounded rank). But even for the simplest two results on this ladder – Frobenius and Suzuki – it seems remarkably difficult to find any proof that is not essentially the character-based proof. (Even trying to replace character theory by its close cousin, representation theory, doesn’t seem to work unless one gives in to the temptation to take traces everywhere and put the characters back in; it seems that rather than abandon characters altogether, one needs to find some sort of “robust” generalisation of existing character-based methods.) In any case, I am recording here the standard character-based proofs of the theorems of Frobenius and Suzuki below the fold. There is nothing particularly novel here, but I wanted to collect all the relevant material in one place, largely for my own benefit.
Recent Comments