You are currently browsing the tag archive for the ‘normal subgroups’ tag.

I’m collecting in this blog post a number of simple group-theoretic lemmas, all of the following flavour: if is a subgroup of some product of groups, then one of three things has to happen:

- ( too small) is contained in some proper subgroup of , or the elements of are constrained to some sort of equation that the full group does not satisfy.
- ( too large) contains some non-trivial normal subgroup of , and as such actually arises by pullback from some subgroup of the quotient group .
- (Structure) There is some useful structural relationship between and the groups .

It is perhaps easiest to explain the flavour of these lemmas with some simple examples, starting with the case where we are just considering subgroups of a single group .

Lemma 1Let be a subgroup of a group . Then exactly one of the following hold:

- (i) ( too small) There exists a non-trivial group homomorphism into a group such that for all .
- (ii) ( normally generates ) is generated as a group by the conjugates of .

*Proof:* Let be the group normally generated by , that is to say the group generated by the conjugates of . This is a normal subgroup of containing (indeed it is the smallest such normal subgroup). If is all of we are in option (ii); otherwise we can take to be the quotient group and to be the quotient map. Finally, if (i) holds, then all of the conjugates of lie in the kernel of , and so (ii) cannot hold.

Here is a “dual” to the above lemma:

Lemma 2Let be a subgroup of a group . Then exactly one of the following hold:

- (i) ( too large) is the pullback of some subgroup of for some non-trivial normal subgroup of , where is the quotient map.
- (ii) ( is core-free) does not contain any non-trivial conjugacy class .

*Proof:* Let be the normal core of , that is to say the intersection of all the conjugates of . This is the largest normal subgroup of that is contained in . If is non-trivial, we can quotient it out and end up with option (i). If instead is trivial, then there is no non-trivial element that lies in the core, hence no non-trivial conjugacy class lies in and we are in option (ii). Finally, if (i) holds, then every conjugacy class of an element of is contained in and hence in , so (ii) cannot hold.

For subgroups of nilpotent groups, we have a nice dichotomy that detects properness of a subgroup through abelian representations:

Lemma 3Let be a subgroup of a nilpotent group . Then exactly one of the following hold:

- (i) ( too small) There exists non-trivial group homomorphism into an abelian group such that for all .
- (ii) .

Informally: if is a variable ranging in a subgroup of a nilpotent group , then either is unconstrained (in the sense that it really ranges in all of ), or it obeys some abelian constraint .

*Proof:* By definition of nilpotency, the lower central series

Since is a normal subgroup of , is also a subgroup of . Suppose first that is a proper subgroup of , then the quotient map is a non-trivial homomorphism to an abelian group that annihilates , and we are in option (i). Thus we may assume that , and thus

Note that modulo the normal group , commutes with , hence and thus We conclude that . One can continue this argument by induction to show that for every ; taking large enough we end up in option (ii). Finally, it is clear that (i) and (ii) cannot both hold.

Remark 4When the group is locally compact and is closed, one can take the homomorphism in Lemma 3 to be continuous, and by using Pontryagin duality one can also take the target group to be the unit circle . Thus is now a character of . Similar considerations hold for some of the later lemmas in this post. Discrete versions of this above lemma, in which the group is replaced by some orbit of a polynomial map on a nilmanifold, were obtained by Leibman and are important in the equidistribution theory of nilmanifolds; see this paper of Ben Green and myself for further discussion.

Here is an analogue of Lemma 3 for special linear groups, due to Serre (IV-23):

Lemma 5Let be a prime, and let be a closed subgroup of , where is the ring of -adic integers. Then exactly one of the following hold:

- (i) ( too small) There exists a proper subgroup of such that for all .
- (ii) .

*Proof:* It is a standard fact that the reduction of mod is , hence (i) and (ii) cannot both hold.

Suppose that (i) fails, then for every there exists such that , which we write as

We now claim inductively that for any and , there exists with ; taking limits as using the closed nature of will then place us in option (ii).The case is already handled, so now suppose . If , we see from the case that we can write where and . Thus to establish the claim it suffices to do so under the additional hypothesis that .

First suppose that for some with . By the case, we can find of the form for some . Raising to the power and using and , we note that , giving the claim in this case.

Any matrix of trace zero with coefficients in is a linear combination of , , and is thus a sum of matrices that square to zero. Hence, if is of the form , then for some matrix of trace zero, and thus one can write (up to errors) as the finite product of matrices of the form with . By the previous arguments, such such matrix lies in up to errors, and hence does also. This completes the proof of the case.

Now suppose and the claim has already been proven for . Arguing as before, it suffices to close the induction under the additional hypothesis that , thus we may write . By induction hypothesis, we may find with . But then , and we are done.

We note a generalisation of Lemma 3 that involves two groups rather than just one:

Lemma 6Let be a subgroup of a product of two nilpotent groups . Then exactly one of the following hold:

- (i) ( too small) There exists group homomorphisms , into an abelian group , with non-trivial, such that for all , where is the projection of to .
- (ii) for some subgroup of .

*Proof:* Consider the group . This is a subgroup of . If it is all of , then must be a Cartesian product and option (ii) holds. So suppose that this group is a proper subgroup of . Applying Lemma 3, we obtain a non-trivial group homomorphism into an abelian group such that whenever . For any in the projection of to , there is thus a unique quantity such that whenever . One easily checcks that is a homomorphism, so we are in option (i).

Finally, it is clear that (i) and (ii) cannot both hold, since (i) places a non-trivial constraint on the second component of an element of for any fixed choice of .

We also note a similar variant of Lemma 5, which is Lemme 10 of this paper of Serre:

Lemma 7Let be a prime, and let be a closed subgroup of . Then exactly one of the following hold:

- (i) ( too small) There exists a proper subgroup of such that for all .
- (ii) .

*Proof:* As in the proof of Lemma 5, (i) and (ii) cannot both hold. Suppose that (i) does not hold, then for any there exists such that . Similarly, there exists with . Taking commutators of and , we can find with . Continuing to take commutators with and extracting a limit (using compactness and the closed nature of ), we can find with . Thus, the closed subgroup of does not obey conclusion (i) of Lemma 5, and must therefore obey conclusion (ii); that is to say, contains . Similarly contains ; multiplying, we end up in conclusion (ii).

The most famous result of this type is of course the Goursat lemma, which we phrase here in a somewhat idiosyncratic manner to conform to the pattern of the other lemmas in this post:

Lemma 8 (Goursat lemma)Let be a subgroup of a product of two groups . Then one of the following hold:

- (i) ( too small) is contained in for some subgroups , of respectively, with either or (or both).
- (ii) ( too large) There exist normal subgroups of respectively, not both trivial, such that arises from a subgroup of , where is the quotient map.
- (iii) (Isomorphism) There is a group isomorphism such that is the graph of . In particular, and are isomorphic.

Here we almost have a trichotomy, because option (iii) is incompatible with both option (i) and option (ii). However, it is possible for options (i) and (ii) to simultaneously hold.

*Proof:* If either of the projections , from to the factor groups (thus and fail to be surjective, then we are in option (i). Thus we may assume that these maps are surjective.

Next, if either of the maps , fail to be injective, then at least one of the kernels , is non-trivial. We can then descend down to the quotient and end up in option (ii).

The only remaining case is when the group homomorphisms are both bijections, hence are group isomorphisms. If we set we end up in case (iii).

We can combine the Goursat lemma with Lemma 3 to obtain a variant:

Corollary 9 (Nilpotent Goursat lemma)Let be a subgroup of a product of two nilpotent groups . Then one of the following hold:

- (i) ( too small) There exists and a non-trivial group homomorphism such that for all .
- (ii) ( too large) There exist normal subgroups of respectively, not both trivial, such that arises from a subgroup of .
- (iii) (Isomorphism) There is a group isomorphism such that is the graph of . In particular, and are isomorphic.

*Proof:* If Lemma 8(i) holds, then by applying Lemma 3 we arrive at our current option (i). The other options are unchanged from Lemma 8, giving the claim.

Now we present a lemma involving three groups that is known in ergodic theory contexts as the “Furstenberg-Weiss argument”, as an argument of this type arose in this paper of Furstenberg and Weiss, though perhaps it also implicitly appears in other contexts also. It has the remarkable feature of being able to enforce the abelian nature of one of the groups once the other options of the lemma are excluded.

Lemma 10 (Furstenberg-Weiss lemma)Let be a subgroup of a product of three groups . Then one of the following hold:

- (i) ( too small) There is some proper subgroup of and some such that whenever and .
- (ii) ( too large) There exists a non-trivial normal subgroup of with abelian, such that arises from a subgroup of , where is the quotient map.
- (iii) is abelian.

*Proof:* If the group is a proper subgroup of , then we are in option (i) (with ), so we may assume that

As before, we can combine this with previous lemmas to obtain a variant in the nilpotent case:

Lemma 11 (Nilpotent Furstenberg-Weiss lemma)Let be a subgroup of a product of three nilpotent groups . Then one of the following hold:

- (i) ( too small) There exists and group homomorphisms , for some abelian group , with non-trivial, such that whenever , where is the projection of to .
- (ii) ( too large) There exists a non-trivial normal subgroup of , such that arises from a subgroup of .
- (iii) is abelian.

Informally, this lemma asserts that if is a variable ranging in some subgroup , then either (i) there is a non-trivial abelian equation that constrains in terms of either or ; (ii) is not fully determined by and ; or (iii) is abelian.

*Proof:* Applying Lemma 10, we are already done if conclusions (ii) or (iii) of that lemma hold, so suppose instead that conclusion (i) holds for say . Then the group is not of the form , since it only contains those with . Applying Lemma 6, we obtain group homomorphisms , into an abelian group , with non-trivial, such that whenever , placing us in option (i).

The Furstenberg-Weiss argument is often used (though not precisely in this form) to establish that certain key structure groups arising in ergodic theory are abelian; see for instance Proposition 6.3(1) of this paper of Host and Kra for an example.

One can get more structural control on in the Furstenberg-Weiss lemma in option (iii) if one also broadens options (i) and (ii):

Lemma 12 (Variant of Furstenberg-Weiss lemma)Let be a subgroup of a product of three groups . Then one of the following hold:

- (i) ( too small) There is some proper subgroup of for some such that whenever . (In other words, the projection of to is not surjective.)
- (ii) ( too large) There exists a normal of respectively, not all trivial, such that arises from a subgroup of , where is the quotient map.
- (iii) are abelian and isomorphic. Furthermore, there exist isomorphisms , , to an abelian group such that

The ability to encode an abelian additive relation in terms of group-theoretic properties is vaguely reminiscent of the group configuration theorem.

*Proof:* We apply Lemma 10. Option (i) of that lemma implies option (i) of the current lemma, and similarly for option (ii), so we may assume without loss of generality that is abelian. By permuting we may also assume that are abelian, and will use additive notation for these groups.

We may assume that the projections of to and are surjective, else we are in option (i). The group is then a normal subgroup of ; we may assume it is trivial, otherwise we can quotient it out and be in option (ii). Thus can be expressed as a graph for some map . As is a group, must be a homomorphism, and we can write it as for some homomorphisms , . Thus elements of obey the constraint .

If or fails to be injective, then we can quotient out by their kernels and end up in option (ii). If fails to be surjective, then the projection of to also fails to be surjective (since for , is now constrained to lie in the range of ) and we are in option (i). Simiarly if fails to be surjective. Thus we may assume that the homomorphisms are bijective and thus group isomorphisms. Setting to the identity, we arrive at option (iii).

Combining this lemma with Lemma \reF{nilp}, we obtain a nilpotent version:

Corollary 13 (Variant of nilpotent Furstenberg-Weiss lemma)Let be a subgroup of a product of three groups . Then one of the following hold:

- (i) ( too small) There are homomorphisms , to some abelian group for some , with not both trivial, such that whenever .
- (ii) ( too large) There exists a normal of respectively, not all trivial, such that arises from a subgroup of , where is the quotient map.
- (iii) are abelian and isomorphic. Furthermore, there exist isomorphisms , , to an abelian group such that

Here is another variant of the Furstenberg-Weiss lemma, attributed to Serre by Ribet (see Lemma 3.3):

Lemma 14 (Serre’s lemma)Let be a subgroup of a finite product of groups with . Then one of the following hold:

- (i) ( too small) There is some proper subgroup of for some such that whenever .
- (ii) ( too large) One has .
- (iii) One of the has a non-trivial abelian quotient .

*Proof:* The claim is trivial for (and we don’t need (iii) in this case), so suppose that . We can assume that each is a perfect group, , otherwise we can quotient out by the commutator and arrive in option (iii). Similarly, we may assume that all the projections of to , are surjective, otherwise we are in option (i).

We now claim that for any and any , one can find with for and . For this follows from the surjectivity of the projection of to . Now suppose inductively that and the claim has already been proven for . Since is perfect, it suffices to establish this claim for of the form for some . By induction hypothesis, we can find with for and . By surjectivity of the projection of to , one can find with and . Taking commutators of these two elements, we obtain the claim.

Setting , we conclude that contains . Similarly for permutations. Multiplying these together we see that contains all of , and we are in option (ii).

In most undergraduate courses, groups are first introduced as a primarily *algebraic* concept – a set equipped with a number of algebraic operations (group multiplication, multiplicative inverse, and multiplicative identity) and obeying a number of rules of algebra (most notably the associative law). It is only somewhat later that one learns that groups are not solely an algebraic object, but can also be equipped with the structure of a manifold (giving rise to *Lie groups*) or a topological space (giving rise to *topological groups*). (See also this post for a number of other ways to think about groups.)

Another important way to enrich the structure of a group is to give it some *geometry*. A fundamental way to provide such a geometric structure is to specify a list of generators of the group . Let us call such a pair a *generated group*; in many important cases the set of generators is finite, leading to a *finitely generated group*. A generated group gives rise to the *word metric* on , defined to be the maximal metric for which for all and (or more explicitly, is the least for which for some and ). This metric then generates the balls . In the finitely generated case, the are finite sets, and the rate at which the cardinality of these sets grow in is an important topic in the field of geometric group theory. The idea of studying a finitely generated group via the geometry of its metric goes back at least to the work of Dehn.

One way to visualise the geometry of a generated group is to look at the (labeled) Cayley colour graph of the generated group . This is a directed coloured graph, with edges coloured by the elements of , and vertices labeled by elements of , with a directed edge of colour from to for each and . The word metric then corresponds to the graph metric of the Cayley graph.

For instance, the Cayley graph of the cyclic group with a single generator (which we draw in green) is given as

while the Cayley graph of the same group but with the generators (which we draw in blue and red respectively) is given as

We can thus see that the same group can have somewhat different geometry if one changes the set of generators. For instance, in a large cyclic group , with a single generator the Cayley graph “looks one-dimensional”, and balls grow linearly in until they saturate the entire group, whereas with two generators chosen at random, the Cayley graph “looks two-dimensional”, and the balls typically grow quadratically until they saturate the entire group.

Cayley graphs have three distinguishing properties:

- (Regularity) For each colour , every vertex has a single -edge leading out of , and a single -edge leading into .
- (Connectedness) The graph is connected.
- (Homogeneity) For every pair of vertices , there is a unique coloured graph isomorphism that maps to .

It is easy to verify that a directed coloured graph is a Cayley graph (up to relabeling) if and only if it obeys the above three properties. Indeed, given a graph with the above properties, one sets to equal the (coloured) automorphism group of the graph ; arbitrarily designating one of the vertices of to be the identity element , we can then identify all the other vertices in with a group element. One then identifies each colour with the vertex that one reaches from by an -coloured edge. Conversely, every Cayley graph of a generated group is clearly regular, is connected because generates , and has isomorphisms given by right multiplication for all . (The regularity and connectedness properties already ensure the uniqueness component of the homogeneity property.)

From the above equivalence, we see that we do not really need the vertex labels on the Cayley graph in order to describe a generated group, and so we will now drop these labels and work solely with *unlabeled* Cayley graphs, in which the vertex set is not already identified with the group. As we saw above, one just needs to designate a marked vertex of the graph as the “identity” or “origin” in order to turn an unlabeled Cayley graph into a labeled Cayley graph; but from homogeneity we see that all vertices of an unlabeled Cayley graph “look the same” and there is no canonical preference for choosing one vertex as the identity over another. I prefer here to keep the graphs unlabeled to emphasise the homogeneous nature of the graph.

It is instructive to revisit the basic concepts of group theory using the language of (unlabeled) Cayley graphs, and to see how geometric many of these concepts are. In order to facilitate the drawing of pictures, I work here only with small finite groups (or Cayley graphs), but the discussion certainly is applicable to large or infinite groups (or Cayley graphs) also.

For instance, in this setting, the concept of *abelianness* is analogous to that of a *flat* (zero-curvature) geometry: given any two colours , a directed path with colours (adopting the obvious convention that the reversal of an -coloured directed edge is considered an -coloured directed edge) returns to where it started. (Note that a generated group is abelian if and only if the generators in pairwise commute with each other.) Thus, for instance, the two depictions of above are abelian, whereas the group , which is also the dihedral group of the triangle and thus admits the Cayley graph

is not abelian.

A subgroup of a generated group can be easily described in Cayley graph language if the generators of happen to be a subset of the generators of . In that case, if one begins with the Cayley graph of and erases all colours except for those colours in , then the graph *foliates* into connected components, each of which is isomorphic to the Cayley graph of . For instance, in the above Cayley graph depiction of , erasing the blue colour leads to three copies of the red Cayley graph (which has as its structure group), while erasing the red colour leads to two copies of the blue Cayley graph (which as as its structure group). If is not contained in , then one has to first “change basis” and add or remove some coloured edges to the original Cayley graph before one can obtain this formulation (thus for instance contains two more subgroups of order two that are not immediately apparent with this choice of generators). Nevertheless the geometric intuition that subgroups are analogous to foliations is still quite a good one.

We saw that a subgroup of a generated group with foliates the larger Cayley graph into -connected components, each of which is a copy of the smaller Cayley graph. The remaining colours in then join those -components to each other. In some cases, each colour will connect a -component to exactly one other -component; this is the case for instance when one splits into two blue components. In other cases, a colour can connect a -component to multiple -components; this is the case for instance when one splits into three red components. The former case occurs precisely when the subgroup is *normal*. (Note that a subgroup of a generated group is normal if and only if left-multiplication by a generator of maps right-cosets of to right-cosets of .) We can then *quotient out* the Cayley graph from , leading to a quotient Cayley graph whose vertices are the -connected components of , and the edges are projected from in the obvious manner. We can then view the original graph as a *bundle* of -graphs over a base -graph (or equivalently, an *extension* of the base graph by the fibre graph ); for instance can be viewed as a bundle of the blue graph over the red graph , but not conversely. We thus see that the geometric analogue of the concept of a normal subgroup is that of a bundle. The generators in can be viewed as describing a connection on that bundle.

Note, though, that the structure group of this connection is not simply , unless is a central subgroup; instead, it is the larger group , the semi-direct product of with its automorphism group. This is because a non-central subgroup can be “twisted around” by operations such as conjugation by a generator . So central subgroups are analogous to the geometric notion of a principal bundle. For instance, here is the Heisenberg group

over the field of two elements, which one can view as a central extension of (the blue and green edges, after quotienting) by (the red edges):

Note how close this group is to being abelian; more generally, one can think of nilpotent groups as being a slight perturbation of abelian groups.

In the case of (viewed as a bundle of the blue graph over the red graph ), the base graph is in fact embedded (three times) into the large graph . More generally, the base graph can be lifted back into the extension if and only if the short exact sequence splits, in which case becomes a semidirect product of and a lifted copy of . Not all bundles can be split in this fashion. For instance, consider the group , with the blue generator and the red generator :

This is a -bundle over that does not split; the blue Cayley graph of is not visible in the graph directly, but only after one quotients out the red fibre subgraph. The notion of a splitting in group theory is analogous to the geometric notion of a global gauge. The existence of such a splitting or gauge, and the relationship between two such splittings or gauges, are controlled by the group cohomology of the sequence .

Even when one has a splitting, the bundle need not be completely trivial, because the bundle is not principal, and the connection can still twist the fibres around. For instance, when viewed as a bundle over with fibres splits, but observe that if one uses the red generator of this splitting to move from one copy of the blue graph to the other, that the orientation of the graph changes. The bundle is trivialisable if and only if is a direct summand of , i.e. splits as a direct product of a lifted copy of . Thus we see that the geometric analogue of a direct summand is that of a trivialisable bundle (and that trivial bundles are then the analogue of direct products). Note that there can be more than one way to trivialise a bundle. For instance, with the Klein four-group ,

the red fibre is a direct summand, but one can use either the blue lift of or the green lift of as the complementary factor.

## Recent Comments