You are currently browsing the tag archive for the ‘nilpotent matrices’ tag.
Recall that a (complex) abstract Lie algebra is a complex vector space (either finite or infinite dimensional) equipped with a bilinear antisymmetric form
that obeys the Jacobi identity
(One can of course define Lie algebras over other fields than the complex numbers , but in order to avoid some technical issues we shall work solely with the complex case in this post.)
An important special case of the abstract Lie algebras are the concrete Lie algebras, in which is a vector space of linear transformations
on a vector space
(which again can be either finite or infinite dimensional), and the bilinear form is given by the usual Lie bracket
It is easy to verify that every concrete Lie algebra is an abstract Lie algebra. In the converse direction, we have
Theorem 1 Every abstract Lie algebra is isomorphic to a concrete Lie algebra.
To prove this theorem, we introduce the useful algebraic tool of the universal enveloping algebra of the abstract Lie algebra
. This is the free (associative, complex) algebra generated by
(viewed as a complex vector space), subject to the constraints
of
as a vector space, that a basis of
is given by “monomials” of the form
is a natural number, the
are an increasing sequence of indices in
, and the
are positive integers. Indeed, given two such monomials, one can express their product as a finite linear combination of further monomials of the form (3) after repeatedly applying (2) (which we rewrite as
) to reorder the terms in this product modulo lower order terms until one all monomials have their indices in the required increasing order. It is then a routine exercise in basic abstract algebra (using all the axioms of an abstract Lie algebra) to verify that this is multiplication rule on monomials does indeed define a complex associative algebra which has the universal properties required of the universal enveloping algebra.
The abstract Lie algebra acts on its universal enveloping algebra
by left-multiplication:
, thus giving a map from
to
. It is easy to verify that this map is a Lie algebra homomorphism (so this is indeed an action (or representation) of the Lie algebra), and this action is clearly faithful (i.e. the map from
to
is injective), since each element
of
maps the identity element
of
to a different element of
, namely
. Thus
is isomorphic to its image in
, proving Theorem 1.
In the converse direction, every representation of a Lie algebra “factors through” the universal enveloping algebra, in that it extends to an algebra homomorphism from
to
, which by abuse of notation we shall also call
.
One drawback of Theorem 1 is that the space that the concrete Lie algebra acts on will almost always be infinite-dimensional, even when the original Lie algebra
is finite-dimensional. However, there is a useful theorem of Ado that rectifies this:
Theorem 2 (Ado’s theorem) Every finite-dimensional abstract Lie algebra is isomorphic to a concrete Lie algebra over a finite-dimensional vector space
.
Among other things, this theorem can be used (in conjunction with the Baker-Campbell-Hausdorff formula) to show that every abstract (finite-dimensional) Lie group (or abstract local Lie group) is locally isomorphic to a linear group. (It is well-known, though, that abstract Lie groups are not necessarily globally isomorphic to a linear group, but we will not discuss these global obstructions here.)
Ado’s theorem is surprisingly tricky to prove in general, but some special cases are easy. For instance, one can try using the adjoint representation of
on itself, defined by the action
; the Jacobi identity (1) ensures that this indeed a representation of
. The kernel of this representation is the centre
. This already gives Ado’s theorem in the case when
is semisimple, in which case the center is trivial.
The adjoint representation does not suffice, by itself, to prove Ado’s theorem in the non-semisimple case. However, it does provide an important reduction in the proof, namely it reduces matters to showing that every finite-dimensional Lie algebra has a finite-dimensional representation
which is faithful on the centre
. Indeed, if one has such a representation, one can then take the direct sum of that representation with the adjoint representation to obtain a new finite-dimensional representation which is now faithful on all of
, which then gives Ado’s theorem for
.
It remins to find a finite-dimensional representation of which is faithful on the centre
. In the case when
is abelian, so that the centre
is all of
, this is again easy, because
then acts faithfully on
by the infinitesimal shear maps
. In matrix form, this representation identifies each
in this abelian Lie algebra with an “upper-triangular” matrix:
This construction gives a faithful finite-dimensional representation of the centre of any finite-dimensional Lie algebra. The standard proof of Ado’s theorem (which I believe dates back to work of Harish-Chandra) then proceeds by gradually “extending” this representation of the centre
to larger and larger sub-algebras of
, while preserving the finite-dimensionality of the representation and the faithfulness on
, until one obtains a representation on the entire Lie algebra
with the required properties. (For technical inductive reasons, one also needs to carry along an additional property of the representation, namely that it maps the nilradical to nilpotent elements, but we will discuss this technicality later.)
This procedure is a little tricky to execute in general, but becomes simpler in the nilpotent case, in which the lower central series becomes trivial for sufficiently large
:
Theorem 3 (Ado’s theorem for nilpotent Lie algebras) Let
be a finite-dimensional nilpotent Lie algebra. Then there exists a finite-dimensional faithful representation
of
. Furthermore, there exists a natural number
such that
, i.e. one has
for all
.
The second conclusion of Ado’s theorem here is useful for induction purposes. (By Engel’s theorem, this conclusion is also equivalent to the assertion that every element of is nilpotent, but we can prove Theorem 3 without explicitly invoking Engel’s theorem.)
Below the fold, I give a proof of Theorem 3, and then extend the argument to cover the full strength of Ado’s theorem. This is not a new argument – indeed, I am basing this particular presentation from the one in Fulton and Harris – but it was an instructive exercise for me to try to extract the proof of Ado’s theorem from the more general structural theory of Lie algebras (e.g. Engel’s theorem, Lie’s theorem, Levi decomposition, etc.) in which the result is usually placed. (However, the proof I know of still needs Engel’s theorem to establish the solvable case, and the Levi decomposition to then establish the general case.)
In one of my recent posts, I used the Jordan normal form for a matrix in order to justify a couple of arguments. As a student, I learned the derivation of this form twice: firstly (as an undergraduate) by using the minimal polynomial, and secondly (as a graduate) by using the structure theorem for finitely generated modules over a principal ideal domain. I found though that the former proof was too concrete and the latter proof too abstract, and so I never really got a good intuition on how the theorem really worked. So I went back and tried to synthesise a proof that I was happy with, by taking the best bits of both arguments that I knew. I ended up with something which wasn’t too different from the standard proofs (relying primarily on the (extended) Euclidean algorithm and the fundamental theorem of algebra), but seems to get at the heart of the matter fairly quickly, so I thought I’d put it up on this blog anyway.
Before we begin, though, let us recall what the Jordan normal form theorem is. For this post, I’ll take the perspective of abstract linear transformations rather than of concrete matrices. Let be a linear transformation on a finite dimensional complex vector space V, with no preferred coordinate system. We are interested in asking what possible “kinds” of linear transformations V can support (more technically, we want to classify the conjugacy classes of
, the ring of linear endomorphisms of V to itself). Here are some simple examples of linear transformations.
- The right shift. Here,
is a standard vector space, and the right shift
is defined as
, thus all elements are shifted right by one position. (For instance, the 1-dimensional right shift is just the zero operator.)
- The right shift plus a constant. Here we consider an operator
, where
is a right shift, I is the identity on V, and
is a complex number.
- Direct sums. Given two linear transformations
and
, we can form their direct sum
by the formula
.
Our objective is then to prove the
Jordan normal form theorem. Every linear transformation
on a finite dimensional complex vector space V is similar to a direct sum of transformations, each of which is a right shift plus a constant.
(Of course, the same theorem also holds with left shifts instead of right shifts.)

Recent Comments