Ado’s theorem

10 May, 2011 in expository, math.RA, math.RT | Tags: Ado's theorem, Engel's theorem, Hilbert's fifth problem, Levi decomposition, Lie algebras, nilpotent matrices | by Terence Tao

Recall that a (complex) abstract Lie algebra is a complex vector space ${{\mathfrak g}}$ (either finite or infinite dimensional) equipped with a bilinear antisymmetric form ${[]: {\mathfrak g} \times {\mathfrak g} \rightarrow {\mathfrak g}}$ that obeys the Jacobi identity

$\displaystyle [[X,Y],Z] + [[Y,Z],X] + [[Z,X],Y] = 0. \ \ \ \ \ (1)$

(One can of course define Lie algebras over other fields than the complex numbers ${{\bf C}}$ , but in order to avoid some technical issues we shall work solely with the complex case in this post.)

An important special case of the abstract Lie algebras are the concrete Lie algebras, in which ${{\mathfrak g} \subset \hbox{End}(V)}$ is a vector space of linear transformations ${X: V \rightarrow V}$ on a vector space ${V}$ (which again can be either finite or infinite dimensional), and the bilinear form is given by the usual Lie bracket

$\displaystyle [X,Y] := XY-YX.$

It is easy to verify that every concrete Lie algebra is an abstract Lie algebra. In the converse direction, we have

Theorem 1 Every abstract Lie algebra is isomorphic to a concrete Lie algebra.

To prove this theorem, we introduce the useful algebraic tool of the universal enveloping algebra ${U({\mathfrak g})}$ of the abstract Lie algebra ${{\mathfrak g}}$ . This is the free (associative, complex) algebra generated by ${{\mathfrak g}}$ (viewed as a complex vector space), subject to the constraints

$\displaystyle [X,Y] = XY - YX. \ \ \ \ \ (2)$

This algebra is described by the Poincaré-Birkhoff-Witt theorem, which asserts that given an ordered basis ${(X_i)_{i \in I}}$ of ${{\mathfrak g}}$ as a vector space, that a basis of ${U({\mathfrak g})}$ is given by “monomials” of the form

$\displaystyle X_{i_1}^{a_1} \ldots X_{i_m}^{a_m} \ \ \ \ \ (3)$

where ${m}$ is a natural number, the ${i_1 < \ldots < i_m}$ are an increasing sequence of indices in ${I}$ , and the ${a_1,\ldots,a_m}$ are positive integers. Indeed, given two such monomials, one can express their product as a finite linear combination of further monomials of the form (3) after repeatedly applying (2) (which we rewrite as ${XY = YX + [X,Y]}$ ) to reorder the terms in this product modulo lower order terms until one all monomials have their indices in the required increasing order. It is then a routine exercise in basic abstract algebra (using all the axioms of an abstract Lie algebra) to verify that this is multiplication rule on monomials does indeed define a complex associative algebra which has the universal properties required of the universal enveloping algebra.

The abstract Lie algebra ${{\mathfrak g}}$ acts on its universal enveloping algebra ${U({\mathfrak g})}$ by left-multiplication: ${X: M \mapsto XM}$ , thus giving a map from ${{\mathfrak g}}$ to ${\hbox{End}(U({\mathfrak g}))}$ . It is easy to verify that this map is a Lie algebra homomorphism (so this is indeed an action (or representation) of the Lie algebra), and this action is clearly faithful (i.e. the map from ${{\mathfrak g}}$ to ${\hbox{End}(U{\mathfrak g})}$ is injective), since each element ${X}$ of ${{\mathfrak g}}$ maps the identity element ${1}$ of ${U({\mathfrak g})}$ to a different element of ${U({\mathfrak g})}$ , namely ${X}$ . Thus ${{\mathfrak g}}$ is isomorphic to its image in ${\hbox{End}(U({\mathfrak g}))}$ , proving Theorem 1.

In the converse direction, every representation ${\rho: {\mathfrak g} \rightarrow \hbox{End}(V)}$ of a Lie algebra “factors through” the universal enveloping algebra, in that it extends to an algebra homomorphism from ${U({\mathfrak g})}$ to ${\hbox{End}(V)}$ , which by abuse of notation we shall also call ${\rho}$ .

One drawback of Theorem 1 is that the space ${U({\mathfrak g})}$ that the concrete Lie algebra acts on will almost always be infinite-dimensional, even when the original Lie algebra ${{\mathfrak g}}$ is finite-dimensional. However, there is a useful theorem of Ado that rectifies this:

Theorem 2 (Ado’s theorem) Every finite-dimensional abstract Lie algebra is isomorphic to a concrete Lie algebra over a finite-dimensional vector space ${V}$ .

Among other things, this theorem can be used (in conjunction with the Baker-Campbell-Hausdorff formula) to show that every abstract (finite-dimensional) Lie group (or abstract local Lie group) is locally isomorphic to a linear group. (It is well-known, though, that abstract Lie groups are not necessarily globally isomorphic to a linear group, but we will not discuss these global obstructions here.)

Ado’s theorem is surprisingly tricky to prove in general, but some special cases are easy. For instance, one can try using the adjoint representation ${\hbox{ad}: {\mathfrak g} \rightarrow \hbox{End}({\mathfrak g})}$ of ${{\mathfrak g}}$ on itself, defined by the action ${X: Y \mapsto [X,Y]}$ ; the Jacobi identity (1) ensures that this indeed a representation of ${{\mathfrak g}}$ . The kernel of this representation is the centre ${Z({\mathfrak g}) := \{ X \in {\mathfrak g}: [X,Y]=0 \hbox{ for all } Y \in {\mathfrak g}\}}$ . This already gives Ado’s theorem in the case when ${{\mathfrak g}}$ is semisimple, in which case the center is trivial.

The adjoint representation does not suffice, by itself, to prove Ado’s theorem in the non-semisimple case. However, it does provide an important reduction in the proof, namely it reduces matters to showing that every finite-dimensional Lie algebra ${{\mathfrak g}}$ has a finite-dimensional representation ${\rho: {\mathfrak g} \rightarrow \hbox{End}(V)}$ which is faithful on the centre ${Z({\mathfrak g})}$ . Indeed, if one has such a representation, one can then take the direct sum of that representation with the adjoint representation to obtain a new finite-dimensional representation which is now faithful on all of ${{\mathfrak g}}$ , which then gives Ado’s theorem for ${{\mathfrak g}}$ .

It remains to find a finite-dimensional representation of ${{\mathfrak g}}$ which is faithful on the centre ${Z({\mathfrak g})}$ . In the case when ${{\mathfrak g}}$ is abelian, so that the centre ${Z({\mathfrak g})}$ is all of ${{\mathfrak g}}$ , this is again easy, because ${{\mathfrak g}}$ then acts faithfully on ${{\mathfrak g} \times {\bf C}}$ by the infinitesimal shear maps ${X: (Y,t) \mapsto (tX, 0)}$ . In matrix form, this representation identifies each ${X}$ in this abelian Lie algebra with an “upper-triangular” matrix:

$\displaystyle X \equiv \begin{pmatrix} 0 & X \\ 0 & 0 \end{pmatrix}.$

This construction gives a faithful finite-dimensional representation of the centre ${Z({\mathfrak g})}$ of any finite-dimensional Lie algebra. The standard proof of Ado’s theorem (which I believe dates back to work of Harish-Chandra) then proceeds by gradually “extending” this representation of the centre ${Z({\mathfrak g})}$ to larger and larger sub-algebras of ${{\mathfrak g}}$ , while preserving the finite-dimensionality of the representation and the faithfulness on ${Z({\mathfrak g})}$ , until one obtains a representation on the entire Lie algebra ${{\mathfrak g}}$ with the required properties. (For technical inductive reasons, one also needs to carry along an additional property of the representation, namely that it maps the nilradical to nilpotent elements, but we will discuss this technicality later.)

This procedure is a little tricky to execute in general, but becomes simpler in the nilpotent case, in which the lower central series ${{\mathfrak g}_1 := {\mathfrak g}; {\mathfrak g}_{n+1} := [{\mathfrak g}, {\mathfrak g}_n]}$ becomes trivial for sufficiently large ${n}$ :

Theorem 3 (Ado’s theorem for nilpotent Lie algebras) Let ${{\mathfrak n}}$ be a finite-dimensional nilpotent Lie algebra. Then there exists a finite-dimensional faithful representation ${\rho: {\mathfrak n} \rightarrow \hbox{End}(V)}$ of ${{\mathfrak n}}$ . Furthermore, there exists a natural number ${k}$ such that ${\rho({\mathfrak n})^k = \{0\}}$ , i.e. one has ${\rho(X_1) \ldots \rho(X_k)=0}$ for all ${X_1,\ldots,X_k \in {\mathfrak n}}$ .

The second conclusion of Ado’s theorem here is useful for induction purposes. (By Engel’s theorem, this conclusion is also equivalent to the assertion that every element of ${\rho({\mathfrak n})}$ is nilpotent, but we can prove Theorem 3 without explicitly invoking Engel’s theorem.)

Below the fold, I give a proof of Theorem 3, and then extend the argument to cover the full strength of Ado’s theorem. This is not a new argument – indeed, I am basing this particular presentation from the one in Fulton and Harris – but it was an instructive exercise for me to try to extract the proof of Ado’s theorem from the more general structural theory of Lie algebras (e.g. Engel’s theorem, Lie’s theorem, Levi decomposition, etc.) in which the result is usually placed. (However, the proof I know of still needs Engel’s theorem to establish the solvable case, and the Levi decomposition to then establish the general case.)

— 1. The nilpotent case —

We now prove Theorem 3. We achieve this by an induction on the dimension of ${{\mathfrak n}}$ . The claim is trivial for dimension zero, so we assume inductively that ${{\mathfrak n}}$ has positive dimension, and that Theorem 3 has already been proven for all lower-dimensional nilpotent Lie algebras.

As noted earlier, the adjoint representation already verifies the claim except for the fact that it is not faithful on the centre ${Z({\mathfrak n})}$ . (The nilpotency of ${{\mathfrak n}}$ ensures the existence of some ${k}$ for which ${\hbox{ad}({\mathfrak n})^k = 0}$ .) Thus, it will suffice to find a finite-dimensional representation ${\rho: {\mathfrak n} \rightarrow \hbox{End}(V)}$ that is faithful on the center ${Z({\mathfrak n})}$ , and for which ${\rho({\mathfrak n})^k = \{0\}}$ for some ${k}$ .

We have already verified the claim when ${{\mathfrak n}}$ is abelian (and can take ${k=1}$ in this case), so suppose that ${{\mathfrak n}}$ is not abelian; in particular, the centre ${Z({\mathfrak n})}$ has strictly smaller dimension. We then observe that ${{\mathfrak n}}$ must contain a codimension one ideal ${{\mathfrak a}}$ containing ${Z({\mathfrak n})}$ , which will of course again be a nilpotent Lie algebra. This can be seen by passing to the quotient ${{\mathfrak n}' := {\mathfrak n}/Z({\mathfrak n})}$ (which has positive dimension) and then to the abelianisation ${{\mathfrak n'}/[{\mathfrak n'},{\mathfrak n'}]}$ (which still has positive dimension, by nilpotency), arbitrarily selecting a codimension one subspace of that abelianisation, and then passing back to ${{\mathfrak n}}$ .

If we let ${{\mathfrak h}}$ be a one-dimensional complementary subspace of ${{\mathfrak a}}$ , then ${{\mathfrak h}}$ is automatically an abelian Lie algebra, and we have the decomposition

$\displaystyle {\mathfrak n} = {\mathfrak a} \oplus {\mathfrak h}$

as vector spaces. This is not necessarily a direct sum of Lie algebras, though, because ${{\mathfrak a}}$ and ${{\mathfrak h}}$ need not commute. However, as ${{\mathfrak a}}$ is an ideal, we do know that ${[{\mathfrak h},{\mathfrak a}] \subset {\mathfrak a}}$ , thus there is an adjoint action ${\hbox{ad}: {\mathfrak h} \rightarrow \hbox{End}({\mathfrak a})}$ of ${{\mathfrak h}}$ on ${{\mathfrak a}}$ .

By induction hypothesis, we know that there is a faithful representation ${\rho_0: {\mathfrak a} \rightarrow \hbox{End}(V_0)}$ on some finite-dimensional space ${V_0}$ with ${\rho_0({\mathfrak a})^{k_0} = \{0\}}$ for some ${k_0}$ . We would like to somehow “extend” this representation to a finite-dimensional representation ${\rho: {\mathfrak n} \rightarrow \hbox{End}(V)}$ which is still faithful on ${{\mathfrak a}}$ (and hence on ${Z({\mathfrak n})}$ ), and with ${\rho({\mathfrak n})^k = \{0\}}$ for some ${k}$ .

To motivate the construction, let us begin not with ${\rho_0}$ , but with the universal enveloping algebra representation ${A: M \mapsto AM}$ of ${{\mathfrak a}}$ on ${U({\mathfrak a})}$ . The idea is to somehow combine this action ${{\mathfrak a}}$ with an action of ${{\mathfrak h}}$ on the same space ${U({\mathfrak a})}$ to obtain an action of the full Lie algebra ${{\mathfrak n}}$ on ${U({\mathfrak a})}$ . To do this, recall that we have the adjoint action ${\hbox{ad}(H): A \mapsto [H,A]}$ of ${{\mathfrak h}}$ on ${{\mathfrak a}}$ . This extends to an adjoint action of ${{\mathfrak h}}$ on ${U({\mathfrak a})}$ , defined by the Leibniz rule

$\displaystyle [H, A_1 \ldots A_m] := \sum_{i=1}^m A_1 \ldots A_{i-1} [H,A_i] A_{i+1} \ldots A_m; \ \ \ \ \ (4)$

one can easily verify that this is indeed an action (and furthermore that the map ${\hbox{ad}(H): M \mapsto [H,M]}$ is a derivation on ${U({\mathfrak a})}$ ). One can then combine the multiplicative action ${A: M \mapsto AM}$ of ${{\mathfrak a}}$ and the adjoint action ${H: M \mapsto [H,M]}$ of ${{\mathfrak h}}$ into a joint action

$\displaystyle (A+H): M \mapsto AM + [H,M] \ \ \ \ \ (5)$

of ${{\mathfrak a}+{\mathfrak h} = {\mathfrak n}}$ on ${U({\mathfrak a})}$ . A minor algebraic miracle occurs here, namely that this combined action is still a genuine action of the Lie algebra ${{\mathfrak a}+{\mathfrak h}}$ . In particular one has

$\displaystyle [A+H,A'+H'] M = (A+H) (A'+H') M - (A'+H') (A+H) M$

for all ${A+H, A'+H' \in {\mathfrak a}+{\mathfrak h}}$ , as can be verified by a brief computation. Also, because the action of ${{\mathfrak a}}$ was already faithful on ${U({\mathfrak a})}$ , this enlarged action of ${{\mathfrak n}}$ will still be faithful on ${{\mathfrak a}}$ .

Remark 1 One can explain this “miracle” by working first in the larger universal algebra ${U({\mathfrak n})}$ . This space has a left multiplicative action ${X: M \mapsto XM}$ of ${{\mathfrak n}}$ , but also has a right multiplicative action ${X: M \mapsto -MX}$ which commutes with the left multiplicative action. Furthermore, the splitting ${{\mathfrak n} = {\mathfrak a} + {\mathfrak h}}$ induces a projection map ${\pi: {\mathfrak n} \rightarrow {\mathfrak h}}$ which will be a Lie algebra homomorphism because ${{\mathfrak a}}$ is an ideal. This gives a projected right multiplicative action ${X: M \mapsto -M\pi(X)}$ which still commutes with the left multiplicative action. In particular, the combined action ${X: M \mapsto XM - M \pi(X)}$ is still an action of ${{\mathfrak n}}$ . One then observes that this action preserves ${U({\mathfrak a})}$ , and when restricted to that space, becomes precisely (5).

Now we need to project the above construction down to a finite-dimensional representation. It turns out to be convenient not to use the original representation ${\rho_0}$ directly, but only indirectly via the property ${\rho_0({\mathfrak a})^{k_0} = \{0\}}$ . Specifically, we consider the (two-sided) ideal ${I := \langle ({\mathfrak a})^{k_0} \rangle}$ of ${U({\mathfrak a})}$ generated by ${k_0}$ -fold products of elements in ${{\mathfrak a}}$ , and then consider the quotient algebra ${U({\mathfrak a})/I}$ . Because ${\rho_0({\mathfrak a})^{k_0} = \{0\}}$ , this algebra still maps to ${\hbox{End}(V_0)}$ ; since ${{\mathfrak a}}$ was faithful in ${\rho_0}$ , it must also be faithful in ${U({\mathfrak a})/I}$ .

The point of doing this quotienting is that whereas ${U({\mathfrak a})}$ is likely to be infinite-dimensional, the space ${U({\mathfrak a})/I}$ is only finite-dimensional. Indeed, it is generated by those monomials (3) whose total degree ${a_1 + \ldots + a_m}$ is less than ${k_0}$ ; since ${{\mathfrak a}}$ is finite-dimensional, there are only finitely many such monomials.

From the Leibniz rule (4) we see that the adjoint action of ${{\mathfrak h}}$ on ${U({\mathfrak a})}$ preserves ${I}$ , and thus descends to an action on ${U({\mathfrak a})/I}$ . We can combine this action with the multiplicative action of ${{\mathfrak a}}$ as before using (5) to create an action of ${{\mathfrak n}}$ on ${U({\mathfrak a})/I}$ , which is now a finite-dimensional representation which (as observed previously) is faithful on ${{\mathfrak a}}$ .

The only remaining thing to show is that for some sufficiently large ${k}$ , that ${{\mathfrak n}^k}$ annihilates ${U({\mathfrak a})/I}$ . By linearity, it suffices to show that ${X_1 \ldots X_k A_1 \ldots A_m = 0 \hbox{ mod } I}$ whenever ${X_1,\ldots,X_k}$ lie in either ${{\mathfrak a}}$ or ${{\mathfrak h}}$ , and ${A_1 \ldots A_m}$ is a monomial in ${U({\mathfrak a})}$ . But observe that if one multiplies a monomial ${A_1 \ldots A_m}$ by an element ${A}$ of ${{\mathfrak a}}$ , then one gets a monomial ${A A_1 \ldots A_m}$ of one higher degree; and if instead one multiplies a monomial ${A_1 \ldots A_m}$ by an element ${H}$ of ${{\mathfrak h}}$ , then from (4) one gets a sum of monomials in which one of the terms ${A_i}$ has been replaced with the “higher order” term ${[H,A_i]}$ . Using the nilpotency of ${{\mathfrak n}}$ (which implies that all sufficiently long iterated commutators must vanish), we thus see that if ${k}$ is large enough, then ${X_1 \ldots X_k A_1 \ldots A_m}$ will consist only of terms of degree at least ${k_0}$ , which automatically lie in ${I}$ , and the claim follows.

Remark 2 The above argument gives an effective (though not particularly efficient) bound on the dimension of ${V}$ and on the order ${k}$ of ${\rho({\mathfrak n})}$ in terms of the dimension of ${{\mathfrak n}}$ . Similarly for the arguments we give below to prove more general versions of Ado’s theorem.

— 2. The solvable case —

To go beyond the nilpotent case one needs to use more of the structural theory of Lie algebras. In particular, we will use Engel’s theorem:

Theorem 4 (Engel’s theorem) Let ${{\mathfrak g} \subset \hbox{End}(V)}$ be a concrete Lie algebra on a finite-dimensional space ${V}$ which is concretely nilpotent in the sense that every element of ${{\mathfrak g}}$ is a nilpotent operator on ${V}$ . Then there exists a basis of ${V}$ such that ${{\mathfrak g}}$ consists entirely of upper-triangular matrices. In particular, if ${d}$ is the dimension of ${V}$ , then ${{\mathfrak g}^d = \{0\}}$ (i.e. ${X_1 \ldots X_d = 0}$ for all ${X_1,\ldots,X_d \in {\mathfrak g}}$ ).

The proof of Engel’s theorem is not too difficult, but we omit it here as we have nothing to add to the textbook proofs of this theorem. Note that this theorem, combined with Theorem 3, gives a correspondence between concretely nilpotent Lie algebras and abstractly nilpotent Lie algebras.

Engel’s theorem has the following consequence:

Corollary 5 (Nilpotent ideals are null) Let ${{\mathfrak g} \subset \hbox{End}(V)}$ be a concrete Lie algebra on a finite-dimensional space ${V}$ , and let ${{\mathfrak a}}$ be a concretely nilpotent Lie algebra ideal of ${{\mathfrak g}}$ . Then for any ${X_1,\ldots,X_m \in {\mathfrak g}}$ with at least one of the ${X_i}$ in ${{\mathfrak a}}$ , one has ${\hbox{tr}(X_1 \ldots X_m) = 0}$ .

Proof: Set ${V_i := {\mathfrak a}^i V}$ to be the vector space spanned by vectors of the form ${A_1 \ldots A_i v}$ for ${A_1,\ldots,A_i \in {\mathfrak a}}$ and ${v \in V}$ , then ${V = V_0 \supset \ldots \supset V_d = \{0\}}$ is a flag, with ${{\mathfrak a}}$ mapping ${V_i}$ to ${V_{i+1}}$ for each ${i}$ . As ${{\mathfrak a}}$ is an ideal of ${{\mathfrak g}}$ , we see that each of the spaces ${V_i}$ is invariant with respect to ${{\mathfrak g}}$ . Thus ${X_1 \ldots X_m}$ also maps ${V_i}$ to ${V_{i+1}}$ for each ${i}$ , and so must have zero trace. $\Box$

This leads to the following curious algebraic fact:

Corollary 6 Let ${{\mathfrak g} \subset \hbox{End}(V)}$ be a concrete Lie algebra on a finite-dimensional space ${V}$ , and let ${{\mathfrak a}}$ , ${{\mathfrak b}}$ be ideals of ${{\mathfrak g}}$ such that ${{\mathfrak b} \in [{\mathfrak a},{\mathfrak g}]}$ . If ${[{\mathfrak a}, {\mathfrak b}]}$ is concretely nilpotent, then ${{\mathfrak b}}$ is also concretely nilpotent.

Proof: Let ${B \in {\mathfrak b}}$ . We need to show that ${B}$ is nilpotent. By the Cayley-Hamilton theorem, it suffices to show that ${\hbox{tr}(B^k) = 0}$ for each ${k \geq 1}$ . Since ${B \in {\mathfrak b} \subset [{\mathfrak a}, {\mathfrak g}]}$ , it suffices to show that ${\hbox{tr}([A,X] B^{k-1}) = 0}$ for each ${k \geq 1}$ , ${A \in {\mathfrak a}}$ , and ${X \in {\mathfrak g}}$ . But we can use the cyclic property of trace to rearrange

$\displaystyle \hbox{tr}([A,X] B^{k-1}) = - \hbox{tr}(X [A, B^{k-1}] ).$

We can then use the Leibniz rule to expand

$\displaystyle [A,B^{k-1}] = \sum_{i=1}^{k-1} B^{i-1} [A,B] B^{k-i-1}.$

But ${[A,B]}$ lies in ${[{\mathfrak a},{\mathfrak b}]}$ , and the claim now follows from Corollary 5. $\Box$

We can apply this corollary to the radical ${{\mathfrak r}}$ of an (abstract) finite-dimensional Lie algebra ${{\mathfrak g}}$ , defined as the maximal solvable ideal of the Lie algebra. (One can easily verify that the vector space sum of two solvable ideals is again a solvable ideal, which implies that the radical is well-defined.)

Theorem 7 Let ${{\mathfrak g}}$ be a finite-dimensional (abstract) Lie algebra, and let ${{\mathfrak r}}$ be a solvable ideal in ${{\mathfrak g}}$ . Then ${[{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}}$ is an (abstractly) nilpotent ideal of ${{\mathfrak g}}$ .

Proof: Without loss of generality we may take ${{\mathfrak r}}$ to be the radical of ${{\mathfrak g}}$ .

Let us first verify the theorem in the case when ${{\mathfrak g}}$ is a concrete Lie algebra over a finite-dimensional vector space ${V}$ . We let ${{\mathfrak r}^{(i+1)} := [{\mathfrak r}^{(i)},{\mathfrak r}^{(i)}]}$ be the derived series of ${{\mathfrak r}}$ . One easily verifies that each of the ${{\mathfrak r}^{(i)}}$ are ideals of ${{\mathfrak g}}$ . We will show by downward induction on ${i}$ that the ideals ${[{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}^{(i)}}$ are concretely nilpotent. As ${{\mathfrak r}}$ is solvable, this claim is trivial for ${i}$ large enough. Now suppose that ${i}$ is such that the ideal ${[{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}^{(i+1)}}$ is concretely nilpotent. This ideal contains ${[{\mathfrak r}^{(i)}, [{\mathfrak r}^{(i)}, {\mathfrak g}]]}$ , which is then also concretely nilpotent. By Corollary 6, this implies that the ideal ${[{\mathfrak r}^{(i)},{\mathfrak g}]}$ is concretely nilpotent, and hence the smaller ideal ${[{\mathfrak g}, [{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}^{(i)}]}$ is also concretely nilpotent. By a second application of Corollary 6, we conclude that ${[{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}^{(i)}}$ is also concretely nilpotent, closing the induction. This proves the theorem in the concrete case.

To establish the claim in the abstract case, we simply use the adjoint representation, which effectively quotients out the centre ${Z({\mathfrak g})}$ of ${{\mathfrak g}}$ (which will also be an ideal of ${{\mathfrak r}}$ ) to convert an finite-dimensional abstract Lie algebra into a concrete Lie algebra over a finite-dimensional space. We conclude that the quotient of ${[{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}}$ by the central ideal ${Z({\mathfrak g})}$ is nilpotent, which implies that ${[{\mathfrak g},{\mathfrak g}] \cap {\mathfrak r}}$ is nilpotent as required. $\Box$

This has the following corollary. Recall that a derivation ${D: {\mathfrak g} \rightarrow {\mathfrak g}}$ on an abstract Lie algebra ${{\mathfrak g}}$ is a linear map such that ${D[X,Y] = [DX,Y] + [X,DY]}$ for each ${X,Y \in {\mathfrak g}}$ . Examples of derivations include the inner derivations ${DX := [A,X]}$ for some fixed ${A \in {\mathfrak g}}$ , but

Corollary 8 Let ${{\mathfrak g}}$ be a finite-dimensional (abstract) Lie algebra, and let ${{\mathfrak r}}$ be a solvable ideal in ${{\mathfrak g}}$ . Then for any derivation ${D: {\mathfrak g} \rightarrow {\mathfrak g}}$ , ${D {\mathfrak r}}$ is nilpotent.

Proof: If ${D}$ were an inner derivation, the claim would follow easily from Theorem 7. In the non-inner case, the trick is simply to view ${D}$ as an inner derivation coming from an extension of ${{\mathfrak g}}$ . Namely, we work in the enlarged Lie algebra ${{\mathfrak g} \rtimes_D {\bf C}}$ , which is the Cartesian product ${{\mathfrak g} \times {\bf C}}$ with Lie bracket

$\displaystyle [ (X,s), (Y,t)] := ([X,Y] + s Dy - t Dx, 0 ).$

One easily verifies that this is an (abstract) Lie algebra which contains ${{\mathfrak g} \equiv {\mathfrak g} \times \{0\}}$ as a subalgebra. The derivation ${D}$ on ${{\mathfrak g}}$ then arises from the inner derivation on ${{\mathfrak g} \rtimes_D {\bf C}}$ coming from ${(0,1)}$ . The claim then easily follows by applying Theorem 7 to this enlarged algebra. $\Box$

Remark 3 I would be curious to know if there is a more direct proof of Corollary 8 (which in particular did not need the full strength of Engel’s theorem), as this would give a simpler proof of Ado’s theorem in the solvable case (indeed, it seems almost equivalent to that theorem, especially in view of Lie’s theorem).

Define the nilradical of an (abstract) finite-dimensional Lie algebra ${{\mathfrak g}}$ to be the maximal nilpotent ideal in ${{\mathfrak g}}$ . One can verify that the sum of two nilpotent ideals is again a nilpotent ideal, so the nilradical is well-defined. One can verify that the radical ${{\mathfrak r}}$ is characteristic (i.e. preserved by all derivations), and so ${D {\mathfrak r}}$ is also characteristic; in particular, it is an ideal. We conclude that ${D {\mathfrak r}}$ is in fact contained in the nilradical of ${{\mathfrak g}}$ .

We can now prove Ado’s theorem in the solvable case:

Theorem 9 (Ado’s theorem for solvable Lie algebras) Let ${{\mathfrak r}}$ be a finite-dimensional solvable Lie algebra. Then there exists a finite-dimensional faithful representation ${\rho: {\mathfrak r} \rightarrow \hbox{End}(V)}$ of ${{\mathfrak r}}$ . Furthermore, if ${{\mathfrak n}}$ is the nilradical of ${{\mathfrak r}}$ , one can ensure that ${\rho({\mathfrak n})}$ is nilpotent.

Proof: This is similar to the proof of Theorem 3. We induct on the relative dimension ${{\mathfrak r}/{\mathfrak n}}$ of ${{\mathfrak r}}$ relative to its nilradical. When this dimension is zero, the claim follows from Theorem 3, so we assume that this dimension is positive, and the claim has already been proven for lesser dimensions.

Using the adjoint representation as before, it suffices to establish a representation which is nilpotent on ${{\mathfrak n}}$ and faithful on the centre ${Z({\mathfrak r})}$ . As the centre is contained in the nilradical, it will suffice to establish nilpotency and faithfulness on ${{\mathfrak n}}$ .

As in the proof of Theorem 3, we can find a codimension one ideal ${{\mathfrak a}}$ of ${{\mathfrak r}}$ that contains ${{\mathfrak n}}$ , and so can split ${{\mathfrak r} = {\mathfrak a} \oplus {\mathfrak h}}$ as vector spaces, for some abelian one-dimensional Lie algebra ${{\mathfrak h}}$ . Note that the nilradical of ${{\mathfrak a}}$ is characteristic in ${{\mathfrak a}}$ , and hence a nilpotent ideal in ${{\mathfrak r}}$ ; as a consequence, it must be identical to the nilradical ${{\mathfrak n}}$ of ${{\mathfrak r}}$ .

By induction hypothesis, we have a finite-dimensional faithful representation ${\rho_0: {\mathfrak a} \rightarrow \hbox{End}(V_0)}$ which is nilpotent on ${{\mathfrak n}}$ ; we extend this representation to the universal enveloping algebra ${U({\mathfrak a})}$ . In particular (by Engel’s theorem), there exists a natural number ${k}$ such that ${\rho_0( {\mathfrak n}^k ) = 0}$ .

Let ${I}$ be the two-sided ideal in the universal enveloping algebra ${U({\mathfrak a})}$ generated by the kernel ${\hbox{ker}(\rho_0)}$ (in the universal enveloping algebra ${U({\mathfrak a})}$ , not the Lie algebra) and ${{\mathfrak n}}$ . Then the ideal ${I^k}$ is contained in the kernel of ${\rho_0}$ . Since ${\rho_0}$ is faithful on ${{\mathfrak a}}$ , we see that the quotient map from ${U({\mathfrak a})}$ to ${U({\mathfrak a})/I}$ is also faithful on ${{\mathfrak a}}$ .

By the Cayley-Hamilton theorem, for every element ${A}$ of ${{\mathfrak a}}$ there is a monic polynomial ${P(A)}$ of ${A}$ that is annihilated by ${\rho_0}$ and is thus in ${I}$ . The monic polynomial ${P(A)^k}$ is thus in ${I^k}$ . Using this, we see that we can express any monomial (3) with at least one sufficiently large exponent ${a_i}$ in terms of monomials of lower degree modulo ${I^k}$ . From this we see that ${U({\mathfrak a})/I^k}$ can be generated by just finitely many such monomials, and is in particular finite-dimensional.

The Lie algebra ${{\mathfrak h}}$ has an adjoint action on ${{\mathfrak a}}$ . These are derivations on ${{\mathfrak a}}$ , so by Corollary 8, they take values in ${{\mathfrak n}}$ and in particular in ${I}$ . If we extend these derivations to the universal enveloping algebra ${U({\mathfrak a})}$ by the Leibniz rule (cf. (4)), we thus see that these derivations preserve ${I}$ , and thus (by the Leibniz rule) also preserve ${I^k}$ . Thus this also gives an action of ${{\mathfrak h}}$ on ${U({\mathfrak a})/I^k}$ by derivations.

In analogy with (5), we may now combine the actions of ${{\mathfrak a}}$ and ${{\mathfrak h}}$ on ${U({\mathfrak a})/I^k}$ to an action of ${{\mathfrak r}}$ which will remain faithful and nilpotent on ${{\mathfrak n}}$ , and the claim follows. $\Box$

— 3. The general case —

Finally, we handle the general case. Actually we can use basically the same argument as in preceding cases, but we need one additional ingredient, namely Levi’s theorem:

Theorem 10 (Levi’s theorem) Let ${{\mathfrak g}}$ be a finite-dimensional (abstract) Lie algebra. Then there exists a splitting ${{\mathfrak g} = {\mathfrak r} \oplus {\mathfrak h}}$ as vector spaces, where ${{\mathfrak r}}$ is the radical of ${{\mathfrak g}}$ and ${{\mathfrak h}}$ is another Lie algebra.

We remark that ${{\mathfrak h}}$ necessarily has trivial radical and is thus semisimple. It is easy to see that the quotient ${{\mathfrak g}/{\mathfrak r}}$ is semisimple; the entire difficulty of Levi’s theorem is to ensure that one can lift this quotient back up into the original space ${{\mathfrak g}}$ . This requires some knowledge of the structural theory of semisimple Lie algebras. The proof will be omitted, as I have nothing to add to the textbook proofs of this result.

Using Levi’s theorem and Theorem 9, one obtains the full Ado theorem:

Theorem 11 (Ado’s theorem for general Lie algebras) Let ${{\mathfrak g}}$ be a finite-dimensional Lie algebra. Then there exists a finite-dimensional faithful representation ${\rho: {\mathfrak g} \rightarrow \hbox{End}(V)}$ of ${{\mathfrak g}}$ . Furthermore, if ${{\mathfrak n}}$ is the nilradical of ${{\mathfrak r}}$ , one can ensure that ${\rho({\mathfrak n})}$ is nilpotent.

Theorem 11 is deduced from Theorem 9 (and Theorem 10) using almost exactly the same argument used to deduce Theorem 9 from the induction hypothesis, with the key point again being Corollary 8 that places the adjoint action of ${{\mathfrak h}}$ on ${{\mathfrak r}}$ inside ${{\mathfrak n}}$ .

Remark 4 One corollary of Ado’s theorem is Lie’s third theorem, which asserts that every finite-dimensional Lie algebra ${{\mathfrak g}}$ is isomorphic to the Lie algebra of some Lie group. Indeed, without loss of generality we may take ${{\mathfrak g}}$ to be a concrete finite-dimensional Lie algebra in, say, ${\hbox{End}({\bf R}^n) = \mathfrak{gl}_n({\bf R})}$ . We can then consider the (infinite-dimensional) group ${\tilde G}$ of continuously differentiable paths ${\gamma: [0,1] \rightarrow GL_n({\bf R})}$ with ${\gamma'(t) \gamma(t)^{-1} \in {\mathfrak g}}$ for every ${t \in [0,1]}$ , quotiented out by reparameterisation and by right-translation, with the group law given by concatenation, and then the (finite-dimensional) quotient group ${G}$ formed by also quotienting out by homotopies that fix the endpoints. One can give ${G}$ a topology (and a smooth structure) by using neighbourhoods of the origin in ${{\mathfrak g}}$ , pushed forward by the exponential map, as a neighbourhood base for ${G}$ ; one can then verify that this is a Lie group with Lie algebra ${{\mathfrak g}}$ . We omit the details.

27 comments

Comments feed for this article

10 May, 2011 at 11:16 pm

David Roberts

Hi Terry,

there are some funny TeX errors in the paragraph starting “The only remaining thing to show is that for some sufficiently large…”, including some orphaned ‘langle’s and left-pointing turnstiles.

[Corrected, thanks – T.]

11 May, 2011 at 9:14 pm

Leif

So do you expect most people who read your posts to understand them..or only trained mathematicians who understand the jargon..and I’m assuming that is part of the reason you post links within your posts to Wikipedia articles and other links to help explain concepts or theories you refer to?

Thanks in advance for your response,

Leif

12 May, 2011 at 8:04 am

Terence Tao

About

12 May, 2011 at 7:37 am

Someone

On mathoverflow (http://mathoverflow.net/questions/8784/why-is-lies-third-theorem-difficult) there was a discussion about the question “Why is Ado’s theorem difficult?”.

16 May, 2011 at 1:23 am

Vanessa Pérez Arroyo

Hy Terry. On the Theorem 3. Let n be a finite dimensional k-step nilpotent Lie algebra, and set m := min { j : ad(x)^j=0 for all x in n } which is a counterexample to m=k?

16 May, 2011 at 9:13 am

Terence Tao

I believe it is possible to build a Lie algebra for which k > m. Some (lengthy) calculations involving the free 4-step nilpotent Lie algebra on four generators suggest to me that one can enforce the constraint that $\hbox{ad}(x)^3 = 0$ for each x, and still have a non-trivial fourth term in the lower central series.

17 May, 2011 at 12:17 pm

Ramdorai

Hello Vanessa. Such a Lie algebra as you are considering is called an m-Engel Lie algebra (this notion has been very widely studied by Efim Zelmanov in the case of infinite dimensions). I do not know a counterexample in the case of finite dimension.

In <> ( http://www.springerlink.com/content/j7562232386266w8/ ) your question appear as a conjecture.

11 June, 2011 at 4:15 am

Ravi

I think there is a typo in (the statement of) Corollary 5. It should read “… $\text{tr}(X_1\ldots X_m)=0$” (not “… $\text{tr}(AX_1\ldots X_m)=0$”).

Ravi Raghunathan

[Corrected, thanks – T.]

14 June, 2011 at 7:07 am

Wolfgang M.

Some (historical) remarks which may be of interest to some:

(1.) The above proof, in which a representation of a subspace is inductively extended to the whole Lie algebra, indeed seems to be the ‘standard’ proof for Ado’s theorem. A slightly different approach, by Y. Neretin (“A CONSTRUCTION OF FINITE-DIMENSIONAL FAITHFUL
REPRESENTATION OF LIE ALGEBRA,” http://www.ams.org/mathscinet-getitem?mr=1982443), can be found in the literature.

The objects and techniques used in his proof are almost exactly the same but with the following difference: Neretin first embeds the (complex, finite-dimensional) Lie algebra $g$ into a Lie algebra $g'$ that is the semidirect product $p \ltimes h$ of a reductive Lie algebra $p$ and a nilpotent ideal $h$ such that the $p$ acts completely reducibly on the $h$ . It then suffices to find a (finite-dimensional) faithful representation of $g'$ . This is done exactly as in the proof above: by letting $g'$ act on its universal enveloping algebra $U(g')$ through derivations (p) and left multiplications (h), and then considering the quotient $U(g')/I$ , where $I$ is an invariant ideal that is spanned by all elements that are long enough, in some sense.

The embedding theorem follows from what Neretin calls “elementary expansions,” based on the Jordan-Chevalley decomposition for derivations. I suppose this idea is also implicitly contained in the standard proof.

Neretin’s proof is short and elegant, but it also assumes theorem 7, corollary 8, theorem 10, and so on.

(2.) Concerning “Remark 2”: There have been attempts to determine (bounds for) the minimal degree $m(g)$ of a faithful representation of a given Lie algebra $g$ in terms of other natural invariants associated to $g$ . Some classes admit polynomial bounds, but the general bounds appear to be exponential in the dimension.

17 August, 2011 at 7:04 pm

Notes on local groups « What’s new

[…] this first for Parts 1 and 2. Let be a Lie algebra. Applying Ado’s theorem (discussed in this blog post), we can identify with a subalgebra of the Lie algebra for some finite . If is a small symmetric […]

1 September, 2011 at 6:15 pm

254A, Notes 1: Lie groups, Lie algebras, and the Baker-Campbell-Hausdorff formula « What’s new

[…] a global Lie group, requiring the non-trivial algebraic tool of Ado’s theorem (discussed in this previous blog post); see Exercise 20 […]

29 October, 2011 at 11:25 am

Associativity of the Baker-Campbell-Hausdorff formula « What’s new

[…] With the assistance of Ado’s theorem, which places inside the general linear Lie algebra for some , one can deduce both the well-definedness and associativity of (3) from the Baker-Campbell-Hausdorff formula for . However, Ado’s theorem is rather difficult to prove (see for instance this previous blog post for a proof), and it is natural to ask whether there is a way to establish these facts without Ado’s theorem. […]

5 April, 2012 at 4:02 am

Theo

Just above Remark 1; why is the enlarged action still faithful on \mathfrak{a}? Couldn’t an element of \mathfrak{h} sit in the centralizer of \mathfrak{a} in \mathfrak{n}?

thanks

5 April, 2012 at 7:18 am

Terence Tao

The restriction of the action of ${\mathfrak n} = {\mathfrak a} \oplus {\mathfrak h}$ on $U({\mathfrak a})$ to ${\mathfrak a}$ is simply the original action of ${\mathfrak a}$ on $U({\mathfrak a})$ , as can be seen by substituting $H=0$ into (5). The behaviour of ${\mathfrak h}$ becomes irrelevant at that point.

21 September, 2012 at 3:28 am

Ado’s theorem for groups with dilations? « chorasimilarity

[…] proof I am aware of, (see this post for one proof and relevant links), mixes the following […]

27 April, 2013 at 9:25 pm

Notes on the classification of complex Lie algebras | What's new

[…] to be a homomorphism into a concrete Lie algebra . It is a deep theorem of Ado (discussed in this previous post) that every abstract Lie algebra is in fact isomorphic to a concrete one (or equivalently, that […]

25 January, 2014 at 10:18 pm

The Lie Bracket | Wet Savanna Animals

[…] Terence Tao, “Ado’s Theorem”; a proof of Ado’s theorem sketched out on Terre… […]

21 March, 2014 at 5:44 am

The Lie Bracket and the Little Adjoint Representation | Wet Savanna Animals

[…] Terence Tao, “Ado’s Theorem”; a proof of Ado’s theorem sketched out on Terre… […]

15 June, 2014 at 6:48 pm

Chapter 13: Lie Theoretical Systems Theory | Wet Savanna Animals

[…] Terence Tao, “Ado’s Theorem”; a proof of Ado’s theorem sketched out on Terre… […]

17 July, 2014 at 9:48 pm

Lie’s Third Theorem and Differential Equations in a Less-Than-Full-Rank Lie Algebra | Wet Savanna Animals

[…] Terence Tao, “Ado’s Theorem”; a proof of Ado’s theorem sketched out on Terre… […]

17 January, 2015 at 2:54 pm

Geoffrey J.

Dear Terry Tao,

I was wondering why one need to take the ideal A to be of codimension 1? Already in the nilpotent case for example, i don’t see in which step we need A to satisfy that (also do we need H to be abelian?). I get the feeling that it is enough to be able to decompose the lie algebra into A \oplus H with A an ideal of the lie algebra.

Thanks for your answer!

Best regards

17 January, 2015 at 5:42 pm

Terence Tao

If ${\mathfrak a}$ is not of codimension 1, it is not obvious how to locate a complementary space ${\mathfrak h}$ which is itself a Lie algebra (i.e. closed under Lie bracket). In the codimension one case, this is not a difficulty as any one-dimensional subspace is automatically a Lie algebra. (It is automatically also abelian, but this is a bonus which, as you say, is not particularly essential to the rest of the argument.)

18 January, 2015 at 1:10 am

Avi Levy

Ctrl + f: “remins” -> “remains”

[Corrected, thanks – T.]

18 March, 2015 at 9:41 am

Alejandro Ginory

Hello Terence Tao,

This as a very enjoyable post. Thanks for posting it.

Regarding remark 3, I don’t know if this qualifies as a more “direct” proof but corollary 8 can be proved via Lie’s theorem and vice-versa. In one direction, corollary 8 can be proved using Lie’s theorem by first showing that any derivation $D$ maps the radical of $\mathfrak{g}$ into itself and second by using Lie’s theorem on the solvable ideal $R+\mathbb{F}D$ , where $R$ is the image of $rad(\mathfrak{g})$ in $\mathfrak{gl}(\mathfrak{g})$ under the ad map, to show that $[D,rad(\mathfrak{g})]$ is nilpotent.

Conversely, corollary 8 can be used to prove Lie’s theorem inductively by taking a solvable Lie algebra $\mathfrak{g}$ , a codimension 1 ideal $\mathfrak{h}$ , and then showing that the space $E$ generated by 1-dimensional ideals of $\mathfrak{h}$ is both a non-trivial subspace (by induction) and an ideal in $\mathfrak{g}.$ Showing that $E$ is an ideal requires cor. 8 and this produces a common eigenvector of the adjoint representation, showing that every solvable ideal has a complete flag consisting of ideals. From there, we can work out Lie’s theorem for any representation $V$ by considering the direct sum $\mathfrak{g}+V$ , where $V$ is considered as an abelian ideal.

As a bonus, theorem 7 also follows from cor. 8 and Lie’s theorem.

Best Regards,
Alejandro Ginory

9 May, 2016 at 8:14 pm

Math Major

Dear Prof. Tao,

Thanks for writing this illuminating article. Regarding the statement

“One can of course define Lie algebras over other fields than the complex numbers {{\bf C}}, but in order to avoid some technical issues we shall work solely with the complex case in this post.”

made at the beginning of the post, could you please point out which part of the argument depends on this restriction to complex Lie algebras? I’ve not been able to figure out why the argument won’t go through for real Lie algebras, although Fulton and Harris deduce the real case from the complex case.

Thanks ever so much!

9 May, 2016 at 9:39 pm

Terence Tao

It’s important to restrict to the characteristic zero case (Levi’s theorem fails for positive characteristic, and there may be other places in the above post where characteristic zero is implicitly used). But I think most of the discussion on the complex case transfers to the real case without difficulty (Lie’s theorem isn’t true over the reals, but that theorem isn’t really used in the above post except as motivation). And as you say, the complex case of Ado’s theorem implies the real case as a corollary.

9 September, 2016 at 1:55 pm

Vincenzo Zaccaro

How I can cite your work?

Best regards
Vincenzo

[This particular article was reprinted as Section 2.3 of this book: https://terrytao.wordpress.com/books/hilberts-fifth-problem-and-related-topics/ . In general, the web URL should be fairly stable. -T.]

	Anonymous on 275A, Notes 3: The weak and st…
	Terence Tao on 254A, Supplement 4: Probabilis…
	Anonymous on 254A, Supplement 4: Probabilis…
	Terence Tao on Analysis II
	Anonymous on Analysis II
	El problema de Erdős… on Two announcements: AI for Math…
	Anonymous on An airport-inspired puzzle
	oliverknill on Two announcements: AI for Math…
	Anonymous on An airport-inspired puzzle
	Prashant Patil on Two announcements: AI for Math…
	Anonymous on Two announcements: AI for Math…
	Anonymous on Two announcements: AI for Math…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on Two announcements: AI for Math…

Ado’s theorem

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

27 comments

Leave a reply to Associativity of the Baker-Campbell-Hausdorff formula « What’s new Cancel reply

For commenters

Ado’s theorem

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

27 comments

Leave a reply to Associativity of the Baker-Campbell-Hausdorff formula « What’s new Cancel reply

For commenters