You are currently browsing the tag archive for the ‘group extensions’ tag.

Asgar Jamneshan and I have just uploaded to the arXiv our paper “An uncountable Mackey-Zimmer theorem“. This paper is part of our longer term project to develop “uncountable” versions of various theorems in ergodic theory; see this previous paper of Asgar and myself for the first paper in this series (and another paper will appear shortly).

In this case the theorem in question is the Mackey-Zimmer theorem, previously discussed in this blog post. This theorem gives an important classification of group and homogeneous extensions of measure-preserving systems. Let us first work in the (classical) setting of concrete measure-preserving systems. Let ${Y = (Y, \mu_Y, T_Y)}$ be a measure-preserving system for some group ${\Gamma}$, thus ${(Y,\mu_Y)}$ is a (concrete) probability space and ${T_Y : \gamma \rightarrow T_Y^\gamma}$ is a group homomorphism from ${\Gamma}$ to the automorphism group ${\mathrm{Aut}(Y,\mu_Y)}$ of the probability space. (Here we are abusing notation by using ${Y}$ to refer both to the measure-preserving system and to the underlying set. In the notation of the paper we would instead distinguish these two objects as ${Y_{\mathbf{ConcPrb}_\Gamma}}$ and ${Y_{\mathbf{Set}}}$ respectively, reflecting two of the (many) categories one might wish to view ${Y}$ as a member of, but for sake of this informal overview we will not maintain such precise distinctions.) If ${K}$ is a compact group, we define a (concrete) cocycle to be a collection of measurable functions ${\rho_\gamma : Y \rightarrow K}$ for ${\gamma \in \Gamma}$ that obey the cocycle equation

$\displaystyle \rho_{\gamma \gamma'}(y) = \rho_\gamma(T_Y^{\gamma'} y) \rho_{\gamma'}(y) \ \ \ \ \ (1)$

for each ${\gamma,\gamma' \in \Gamma}$ and all ${y \in Y}$. (One could weaken this requirement by only demanding the cocycle equation to hold for almost all ${y}$, rather than all ${y}$; we will effectively do so later in the post, when we move to opposite probability algebra systems.) Any such cocycle generates a group skew-product ${X = Y \rtimes_\rho K}$ of ${Y}$, which is another measure-preserving system ${(X, \mu_X, T_X)}$ where
• ${X = Y \times K}$ is the Cartesian product of ${Y}$ and ${K}$;
• ${\mu_X = \mu_Y \times \mathrm{Haar}_K}$ is the product measure of ${\mu_Y}$ and Haar probability measure on ${K}$; and
• The action ${T_X: \gamma \rightarrow }$ is given by the formula

$\displaystyle T_X^\gamma(y,k) := (T_Y^\gamma y, \rho_\gamma(y) k). \ \ \ \ \ (2)$

The cocycle equation (1) guarantees that ${T_X}$ is a homomorphism, and the (left) invariance of Haar measure and Fubini’s theorem guarantees that the ${T_X^\gamma}$ remain measure preserving. There is also the more general notion of a homogeneous skew-product ${X \times Y \times_\rho K/L}$ in which the group ${K}$ is replaced by the homogeneous space ${K/L}$ for some closed subgroup of ${L}$, noting that ${K/L}$ still comes with a left-action of ${K}$ and a Haar measure. Group skew-products are very “explicit” ways to extend a system ${Y}$, as everything is described by the cocycle ${\rho}$ which is a relatively tractable object to manipulate. (This is not to say that the cohomology of measure-preserving systems is trivial, but at least there are many tools one can use to study them, such as the Moore-Schmidt theorem discussed in this previous post.)

This group skew-product ${X}$ comes with a factor map ${\pi: X \rightarrow Y}$ and a coordinate map ${\theta: X \rightarrow K}$, which by (2) are related to the action via the identities

$\displaystyle \pi \circ T_X^\gamma = T_Y^\gamma \circ \pi \ \ \ \ \ (3)$

and

$\displaystyle \theta \circ T_X^\gamma = (\rho_\gamma \circ \pi) \theta \ \ \ \ \ (4)$

where in (4) we are implicitly working in the group of (concretely) measurable functions from ${Y}$ to ${K}$. Furthermore, the combined map ${(\pi,\theta): X \rightarrow Y \times K}$ is measure-preserving (using the product measure on ${Y \times K}$), indeed the way we have constructed things this map is just the identity map.

We can now generalize the notion of group skew-product by just working with the maps ${\pi, \theta}$, and weakening the requirement that ${(\pi,\theta)}$ be measure-preserving. Namely, define a group extension of ${Y}$ by ${K}$ to be a measure-preserving system ${(X,\mu_X, T_X)}$ equipped with a measure-preserving map ${\pi: X \rightarrow Y}$ obeying (3) and a measurable map ${\theta: X \rightarrow K}$ obeying (4) for some cocycle ${\rho}$, such that the ${\sigma}$-algebra of ${X}$ is generated by ${\pi,\theta}$. There is also a more general notion of a homogeneous extension in which ${\theta}$ takes values in ${K/L}$ rather than ${K}$. Then every group skew-product ${Y \rtimes_\rho K}$ is a group extension of ${Y}$ by ${K}$, but not conversely. Here are some key counterexamples:

• (i) If ${H}$ is a closed subgroup of ${K}$, and ${\rho}$ is a cocycle taking values in ${H}$, then ${Y \rtimes_\rho H}$ can be viewed as a group extension of ${Y}$ by ${K}$, taking ${\theta: Y \rtimes_\rho H \rightarrow K}$ to be the vertical coordinate ${\theta(y,h) = h}$ (viewing ${h}$ now as an element of ${K}$). This will not be a skew-product by ${K}$ because ${(\theta,\pi)}$ pushes forward to the wrong measure on ${Y \times K}$: it pushes forward to ${\mu_Y \times \mathrm{Haar}_H}$ rather than ${\mu_Y \times \mathrm{Haar}_K}$.
• (ii) If one takes the same example as (i), but twists the vertical coordinate ${\theta}$ to another vertical coordinate ${\tilde \theta(y,h) := \Phi(y) \theta(y,h)}$ for some measurable “gauge function” ${\Phi: Y \rightarrow K}$, then ${Y \rtimes_\rho H}$ is still a group extension by ${K}$, but now with the cocycle ${\rho}$ replaced by the cohomologous cocycle

$\displaystyle \tilde \rho_\gamma(y) := \Phi(T_Y^\gamma y) \rho_\gamma \Phi(y)^{-1}.$

Again, this will not be a skew product by ${K}$, because ${(\theta,\pi)}$ pushes forward to a twisted version of ${\mu_Y \times \mathrm{Haar}_H}$ that is supported (at least in the case where ${Y}$ is compact and the cocycle ${\rho}$ is continuous) on the ${H}$-bundle ${\bigcup_{y \in Y} \{y\} \times \Phi(y) H}$.
• (iii) With the situation as in (i), take ${X}$ to be the union ${X = Y \rtimes_\rho H \uplus Y \rtimes_\rho Hk \subset Y \times K}$ for some ${k \in K}$ outside of ${H}$, where we continue to use the action (2) and the standard vertical coordinate ${\theta: (y,k) \mapsto k}$ but now use the measure ${\mu_Y \times (\frac{1}{2} \mathrm{Haar}_H + \frac{1}{2} \mathrm{Haar}_{Hk})}$.

As it turns out, group extensions and homogeneous extensions arise naturally in the Furstenberg-Zimmer structural theory of measure-preserving systems; roughly speaking, every compact extension of ${Y}$ is an inverse limit of group extensions. It is then of interest to classify such extensions.

Examples such as (iii) are annoying, but they can be excluded by imposing the additional condition that the system ${(X,\mu_X,T_X)}$ is ergodic – all invariant (or essentially invariant) sets are of measure zero or measure one. (An essentially invariant set is a measurable subset ${E}$ of ${X}$ such that ${T^\gamma E}$ is equal modulo null sets to ${E}$ for all ${\gamma \in \Gamma}$.) For instance, the system in (iii) is non-ergodic because the set ${Y \times H}$ (or ${Y \times Hk}$) is invariant but has measure ${1/2}$. We then have the following fundamental result of Mackey and Zimmer:

Theorem 1 (Countable Mackey Zimmer theorem) Let ${\Gamma}$ be a group, ${Y}$ be a concrete measure-preserving system, and ${K}$ be a compact Hausdorff group. Assume that ${\Gamma}$ is at most countable, ${Y}$ is a standard Borel space, and ${K}$ is metrizable. Then every (concrete) ergodic group extension of ${Y}$ is abstractly isomorphic to a group skew-product (by some closed subgroup ${H}$ of ${K}$), and every (concrete) ergodic homogeneous extension of ${Y}$ is similarly abstractly isomorphic to a homogeneous skew-product.

We will not define precisely what “abstractly isomorphic” means here, but it roughly speaking means “isomorphic after quotienting out the null sets”. A proof of this theorem can be found for instance in .

The main result of this paper is to remove the “countability” hypotheses from the above theorem, at the cost of working with opposite probability algebra systems rather than concrete systems. (We will discuss opposite probability algebras in a subsequent blog post relating to another paper in this series.)

Theorem 2 (Uncountable Mackey Zimmer theorem) Let ${\Gamma}$ be a group, ${Y}$ be an opposite probability algebra measure-preserving system, and ${K}$ be a compact Hausdorff group. Then every (abstract) ergodic group extension of ${Y}$ is abstractly isomorphic to a group skew-product (by some closed subgroup ${H}$ of ${K}$), and every (abstract) ergodic homogeneous extension of ${Y}$ is similarly abstractly isomorphic to a homogeneous skew-product.

We plan to use this result in future work to obtain uncountable versions of the Furstenberg-Zimmer and Host-Kra structure theorems.

As one might expect, one locates a proof of Theorem 2 by finding a proof of Theorem 1 that does not rely too strongly on “countable” tools, such as disintegration or measurable selection, so that all of those tools can be replaced by “uncountable” counterparts. The proof we use is based on the one given in this previous post, and begins by comparing the system ${X}$ with the group extension ${Y \rtimes_\rho K}$. As the examples (i), (ii) show, these two systems need not be isomorphic even in the ergodic case, due to the different probability measures employed. However one can relate the two after performing an additional averaging in ${K}$. More precisely, there is a canonical factor map ${\Pi: X \rtimes_1 K \rightarrow Y \times_\rho K}$ given by the formula

$\displaystyle \Pi(x, k) := (\pi(x), \theta(x) k).$

This is a factor map not only of ${\Gamma}$-systems, but actually of ${\Gamma \times K^{op}}$-systems, where the opposite group ${K^{op}}$ to ${K}$ acts (on the left) by right-multiplication of the second coordinate (this reversal of order is why we need to use the opposite group here). The key point is that the ergodicity properties of the system ${Y \times_\rho K}$ are closely tied the group ${H}$ that is “secretly” controlling the group extension. Indeed, in example (i), the invariant functions on ${Y \times_\rho K}$ take the form ${(y,k) \mapsto f(Hk)}$ for some measurable ${f: H \backslash K \rightarrow {\bf C}}$, while in example (ii), the invariant functions on ${Y \times_{\tilde \rho} K}$ take the form ${(y,k) \mapsto f(H \Phi(y)^{-1} k)}$. In either case, the invariant factor is isomorphic to ${H \backslash K}$, and can be viewed as a factor of the invariant factor of ${X \rtimes_1 K}$, which is isomorphic to ${K}$. Pursuing this reasoning (using an abstract ergodic theorem of Alaoglu and Birkhoff, as discussed in the previous post) one obtains the Mackey range ${H}$, and also obtains the quotient ${\tilde \Phi: Y \rightarrow K/H}$ of ${\Phi: Y \rightarrow K}$ to ${K/H}$ in this process. The main remaining task is to lift the quotient ${\tilde \Phi}$ back up to a map ${\Phi: Y \rightarrow K}$ that stays measurable, in order to “untwist” a system that looks like (ii) to make it into one that looks like (i). In countable settings this is where a “measurable selection theorem” would ordinarily be invoked, but in the uncountable setting such theorems are not available for concrete maps. However it turns out that they still remain available for abstract maps: any abstractly measurable map ${\tilde \Phi}$ from ${Y}$ to ${K/H}$ has an abstractly measurable lift from ${Y}$ to ${K}$. To prove this we first use a canonical model for opposite probability algebras (which we will discuss in a companion post to this one, to appear shortly) to work with continuous maps (on a Stone space) rather than abstractly measurable maps. The measurable map ${\tilde \Phi}$ then induces a probability measure on ${Y \times K/H}$, formed by pushing forward ${\mu_Y}$ by the graphing map ${y \mapsto (y,\tilde \Phi(y))}$. This measure in turn has several lifts up to a probability measure on ${Y \times K}$; for instance, one can construct such a measure ${\overline{\mu}}$ via the Riesz representation theorem by demanding

$\displaystyle \int_{Y \times K} f(y,k) \overline{\mu}(y,k) := \int_Y (\int_{\tilde \Phi(y) H} f(y,k)\ d\mathrm{Haar}_{\tilde \Phi(y) H})\ d\mu_Y(y)$

for all continuous functions ${f}$. This measure does not come from a graph of any single lift ${\Phi: Y \rightarrow K}$, but is in some sense an “average” of the entire ensemble of these lifts. But it turns out one can invoke the Krein-Milman theorem to pass to an extremal lifting measure which does come from an (abstract) lift ${\Phi}$, and this can be used as a substitute for a measurable selection theorem. A variant of this Krein-Milman argument can also be used to express any homogeneous extension as a quotient of a group extension, giving the second part of the Mackey-Zimmer theorem.

In the last few months, I have been working my way through the theory behind the solution to Hilbert’s fifth problem, as I (together with Emmanuel Breuillard, Ben Green, and Tom Sanders) have found this theory to be useful in obtaining noncommutative inverse sumset theorems in arbitrary groups; I hope to be able to report on this connection at some later point on this blog. Among other things, this theory achieves the remarkable feat of creating a smooth Lie group structure out of what is ostensibly a much weaker structure, namely the structure of a locally compact group. The ability of algebraic structure (in this case, group structure) to upgrade weak regularity (in this case, continuous structure) to strong regularity (in this case, smooth and even analytic structure) seems to be a recurring theme in mathematics, and an important part of what I like to call the “dichotomy between structure and randomness”.

The theory of Hilbert’s fifth problem sprawls across many subfields of mathematics: Lie theory, representation theory, group theory, nonabelian Fourier analysis, point-set topology, and even a little bit of group cohomology. The latter aspect of this theory is what I want to focus on today. The general question that comes into play here is the extension problem: given two (topological or Lie) groups ${H}$ and ${K}$, what is the structure of the possible groups ${G}$ that are formed by extending ${H}$ by ${K}$. In other words, given a short exact sequence

$\displaystyle 0 \rightarrow K \rightarrow G \rightarrow H \rightarrow 0,$

to what extent is the structure of ${G}$ determined by that of ${H}$ and ${K}$?

As an example of why understanding the extension problem would help in structural theory, let us consider the task of classifying the structure of a Lie group ${G}$. Firstly, we factor out the connected component ${G^\circ}$ of the identity as

$\displaystyle 0 \rightarrow G^\circ \rightarrow G \rightarrow G/G^\circ \rightarrow 0;$

as Lie groups are locally connected, ${G/G^\circ}$ is discrete. Thus, to understand general Lie groups, it suffices to understand the extensions of discrete groups by connected Lie groups.

Next, to study a connected Lie group ${G}$, we can consider the conjugation action ${g: X \mapsto gXg^{-1}}$ on the Lie algebra ${{\mathfrak g}}$, which gives the adjoint representation ${\hbox{Ad}: G \rightarrow GL({\mathfrak g})}$. The kernel of this representation consists of all the group elements ${g}$ that commute with all elements of the Lie algebra, and thus (by connectedness) is the center ${Z(G)}$ of ${G}$. The adjoint representation is then faithful on the quotient ${G/Z(G)}$. The short exact sequence

$\displaystyle 0 \rightarrow Z(G) \rightarrow G \rightarrow G/Z(G) \rightarrow 0$

then describes ${G}$ as a central extension (by the abelian Lie group ${Z(G)}$) of ${G/Z(G)}$, which is a connected Lie group with a faithful finite-dimensional linear representation.

This suggests a route to Hilbert’s fifth problem, at least in the case of connected groups ${G}$. Let ${G}$ be a connected locally compact group that we hope to demonstrate is isomorphic to a Lie group. As discussed in a previous post, we first form the space ${L(G)}$ of one-parameter subgroups of ${G}$ (which should, eventually, become the Lie algebra of ${G}$). Hopefully, ${L(G)}$ has the structure of a vector space. The group ${G}$ acts on ${L(G)}$ by conjugation; this action should be both continuous and linear, giving an “adjoint representation” ${\hbox{Ad}: G \rightarrow GL(L(G))}$. The kernel of this representation should then be the center ${Z(G)}$ of ${G}$. The quotient ${G/Z(G)}$ is locally compact and has a faithful linear representation, and is thus a Lie group by von Neumann’s version of Cartan’s theorem (discussed in this previous post). The group ${Z(G)}$ is locally compact abelian, and so it should be a relatively easy task to establish that it is also a Lie group. To finish the job, one needs the following result:

Theorem 1 (Central extensions of Lie are Lie) Let ${G}$ be a locally compact group which is a central extension of a Lie group ${H}$ by an abelian Lie group ${K}$. Then ${G}$ is also isomorphic to a Lie group.

This result can be obtained by combining a result of Kuranishi with a result of Gleason; I am recording this argument below the fold. The point here is that while ${G}$ is initially only a topological group, the smooth structures of ${H}$ and ${K}$ can be combined (after a little bit of cohomology) to create the smooth structure on ${G}$ required to upgrade ${G}$ from a topological group to a Lie group. One of the main ideas here is to improve the behaviour of a cocycle by averaging it; this basic trick is helpful elsewhere in the theory, resolving a number of cohomological issues in topological group theory. The result can be generalised to show in fact that arbitrary (topological) extensions of Lie groups by Lie groups remain Lie; this was shown by Gleason. However, the above special case of this result is already sufficient (in conjunction with the rest of the theory, of course) to resolve Hilbert’s fifth problem.

Remark 1 We have shown in the above discussion that every connected Lie group is a central extension (by an abelian Lie group) of a Lie group with a faithful continuous linear representation. It is natural to ask whether this central extension is necessary. Unfortunately, not every connected Lie group admits a faithful continuous linear representation. An example (due to Birkhoff) is the Heisenberg-Weyl group

$\displaystyle G := \begin{pmatrix} 1 & {\bf R} & {{\bf R}/{\bf Z}} \\ 0 & 1 & {\bf R} \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & {\bf R} & {\bf R} \\ 0 & 1 & {\bf R} \\ 0 & 0 & 1 \end{pmatrix} / \begin{pmatrix} 1 & 0 & {\bf Z} \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}.$

Indeed, if we consider the group elements

$\displaystyle A := \begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$

and

$\displaystyle B := \begin{pmatrix} 1 & 0 & 1/p \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$

for some prime ${p}$, then one easily verifies that ${B}$ has order ${p}$ and is central, and that ${AB}$ is conjugate to ${A}$. If we have a faithful linear representation ${\rho: G \rightarrow GL_n({\bf C})}$ of ${G}$, then ${\rho(B)}$ must have at least one eigenvalue ${\alpha}$ that is a primitive ${p^{th}}$ root of unity. If ${V}$ is the eigenspace associated to ${\alpha}$, then ${\rho(A)}$ must preserve ${V}$, and be conjugate to ${\alpha \rho(A)}$ on this space. This forces ${\rho(A)}$ to have at least ${p}$ distinct eigenvalues on ${V}$, and hence ${V}$ (and thus ${{\bf C}^n}$) must have dimension at least ${p}$. Letting ${p \rightarrow \infty}$ we obtain a contradiction. (On the other hand, ${G}$ is certainly isomorphic to the extension of the linear group ${{\bf R}^2}$ by the abelian group ${{\bf R}/{\bf Z}}$.)

In mathematics, one frequently starts with some space ${X}$ and wishes to extend it to a larger space ${Y}$. Generally speaking, there are two ways in which one can extend a space ${X}$:

• By embedding ${X}$ into a space ${Y}$ that has ${X}$ (or at least an isomorphic copy of ${X}$) as a subspace.
• By covering ${X}$ by a space ${Y}$ that has ${X}$ (or an isomorphic copy thereof) as a quotient.

For many important categories of interest (such as abelian categories), the former type of extension can be represented by the exact sequence,

$\displaystyle 0 \rightarrow X \rightarrow Y$

and the latter type of extension be represented by the exact sequence

$\displaystyle Y \rightarrow X \rightarrow 0.$

In some cases, ${X}$ can be both embedded in, and covered by, ${Y}$, in a consistent fashion; in such cases we sometimes say that the above exact sequences split.

An analogy would be to that of digital images. When a computer represents an image, it is limited both by the scope of the image (what it is picturing), and by the resolution of an image (how much physical space is represented by a given pixel). To make the image “larger”, one could either embed the image in an image of larger scope but equal resolution (e.g. embedding a picture of a ${200 \times 200}$ pixel image of person’s face into a ${800 \times 800}$ pixel image that covers a region of space that is four times larger in both dimensions, e.g. the person’s upper body) or cover the image with an image of higher resolution but of equal scope (e.g. enhancing a ${200 \times 200}$ pixel picture of a face to a ${800 \times 800}$ pixel of the same face). In the former case, the original image is a sub-image (or cropped image) of the extension, but in the latter case the original image is a quotient (or a pixelation) of the extension. In the former case, each pixel in the original image can be identified with a pixel in the extension, but not every pixel in the extension is covered. In the latter case, every pixel in the original image is covered by several pixels in the extension, but the pixel in the original image is not canonically identified with any particular pixel in the extension that covers it; it “loses its identity” by dispersing into higher resolution pixels.

(Note that “zooming in” the visual representation of an image by making each pixel occupy a larger region of the screen neither increases the scope or the resolution; in this language, a zoomed-in version of an image is merely an isomorphic copy of the original image; it carries the same amount of information as the original image, but has been represented in a new coordinate system which may make it easier to view, especially to the visually impaired.)

In the study of a given category of spaces (e.g. topological spaces, manifolds, groups, fields, etc.), embedding and coverings are both important; this is particularly true in the more topological areas of mathematics, such as manifold theory. But typically, the term extension is reserved for just one of these two operations. For instance, in the category of fields, coverings are quite trivial; if one covers a field ${k}$ by a field ${l}$, the kernel of the covering map ${\pi: l \rightarrow k}$ is necessarily trivial and so ${k, l}$ are in fact isomorphic. So in field theory, a field extension refers to an embedding of a field, rather than a covering of a field. Similarly, in the theory of metric spaces, there are no non-trivial isometric coverings of a metric space, and so the only useful notion of an extension of a metric space is the one given by embedding the original space in the extension.

On the other hand, in group theory (and in group-like theories, such as the theory of dynamical systems, which studies group actions), the term “extension” is reserved for coverings, rather than for embeddings. I think one of the main reasons for this is that coverings of groups automatically generate a special type of embedding (a normal embedding), whereas most embeddings don’t generate coverings. More precisely, given a group extension ${G}$ of a base group ${H}$,

$\displaystyle G \rightarrow H \rightarrow 0,$

one can form the kernel ${K = \hbox{ker}(\phi)}$ of the covering map ${\pi: G \rightarrow H}$, which is a normal subgroup of ${G}$, and we thus can extend the above sequence canonically to a short exact sequence

$\displaystyle 0 \rightarrow K \rightarrow G \rightarrow H \rightarrow 0.$

On the other hand, an embedding of ${K}$ into ${G}$,

$\displaystyle 0 \rightarrow K \rightarrow G$

does not similarly extend to a short exact sequence unless the the embedding is normal.

Another reason for the notion of extension varying between embeddings and coverings from subject to subject is that there are various natural duality operations (and more generally, contravariant functors) which turn embeddings into coverings and vice versa. For instance, an embedding of one vector space ${V}$ into another ${W}$ induces a covering of the dual space ${V^*}$ by the dual space ${W^*}$, and conversely; similarly, an embedding of a locally compact abelian group ${H}$ in another ${G}$ induces a covering of the Pontryagin dual ${\hat H}$ by the Pontryagin dual ${\hat G}$. In the language of images, embedding an image in an image of larger scope is largely equivalent to covering the Fourier transform of that image by a transform of higher resolution, and conversely; this is ultimately a manifestation of the basic fact that frequency is inversely proportional to wavelength.

Similarly, a common duality operation arises in many areas of mathematics by starting with a space ${X}$ and then considering a space ${C(X)}$ of functions on that space (e.g. continuous real-valued functions, if ${X}$ was a topological space, or in more algebraic settings one could consider homomorphisms from ${X}$ to some fixed space). Embedding ${X}$ into ${Y}$ then induces a covering of ${C(X)}$ by ${C(Y)}$, and conversely, a covering of ${X}$ by ${Y}$ induces an embedding of ${C(X)}$ into ${C(Y)}$. Returning again to the analogy with images, if one looks at the collection of all images of a fixed scope and resolution, rather than just a single image, then increasing the available resolution causes an embedding of the space of low-resolution images into the space of high-resolution images (since of course every low-resolution image is an example of a high-resolution image), whereas increasing the available scope causes a covering of the space of narrow-scope images by the space of wide-scope images (since every wide-scope image can be cropped into a narrow-scope image). Note in the case of images, that these extensions can be split: not only can a low-resolution image be viewed as a special case of a high-resolution image, but any high-resolution image can be pixelated into a low-resolution one. Similarly, not only can any wide-scope image be cropped into a narrow-scope one, a narrow-scope image can be extended to a wide-scope one simply by filling in all the new areas of scope with black (or by using more advanced image processing tools to create a more visually pleasing extension). (In the category of sets, the statement that every covering can be split is precisely the axiom of choice.)

I’ve recently found myself having to deal quite a bit with group extensions in my research, so I have decided to make some notes on the basic theory of such extensions here. This is utterly elementary material for a group theorist, but I found this task useful for organising my own thoughts on this topic, and also in pinning down some of the jargon in this field.