In this set of notes we will be able to finally prove the Gleason-Yamabe theorem from Notes 0, which we restate here:

Theorem 1 (Gleason-Yamabe theorem) Let ${G}$ be a locally compact group. Then, for any open neighbourhood ${U}$ of the identity, there exists an open subgroup ${G'}$ of ${G}$ and a compact normal subgroup ${K}$ of ${G'}$ in ${U}$ such that ${G'/K}$ is isomorphic to a Lie group.

In the next set of notes, we will combine the Gleason-Yamabe theorem with some topological analysis (and in particular, using the invariance of domain theorem) to establish some further control on locally compact groups, and in particular obtaining a solution to Hilbert’s fifth problem.

To prove the Gleason-Yamabe theorem, we will use three major tools developed in previous notes. The first (from Notes 2) is a criterion for Lie structure in terms of a special type of metric, which we will call a Gleason metric:

Definition 2 Let ${G}$ be a topological group. A Gleason metric on ${G}$ is a left-invariant metric ${d: G \times G \rightarrow {\bf R}^+}$ which generates the topology on ${G}$ and obeys the following properties for some constant ${C>0}$, writing ${\|g\|}$ for ${d(g,\hbox{id})}$:

• (Escape property) If ${g \in G}$ and ${n \geq 1}$ is such that ${n \|g\| \leq \frac{1}{C}}$, then ${\|g^n\| \geq \frac{1}{C} n \|g\|}$.
• (Commutator estimate) If ${g, h \in G}$ are such that ${\|g\|, \|h\| \leq \frac{1}{C}}$, then

$\displaystyle \|[g,h]\| \leq C \|g\| \|h\|, \ \ \ \ \ (1)$

where ${[g,h] := g^{-1}h^{-1}gh}$ is the commutator of ${g}$ and ${h}$.

Theorem 3 (Building Lie structure from Gleason metrics) Let ${G}$ be a locally compact group that has a Gleason metric. Then ${G}$ is isomorphic to a Lie group.

The second tool is the existence of a left-invariant Haar measure on any locally compact group; see Theorem 3 from Notes 3. Finally, we will also need the compact case of the Gleason-Yamabe theorem (Theorem 8 from Notes 3), which was proven via the Peter-Weyl theorem:

Theorem 4 (Gleason-Yamabe theorem for compact groups) Let ${G}$ be a compact Hausdorff group, and let ${U}$ be a neighbourhood of the identity. Then there exists a compact normal subgroup ${H}$ of ${G}$ contained in ${U}$ such that ${G/H}$ is isomorphic to a linear group (i.e. a closed subgroup of a general linear group ${GL_n({\bf C})}$).

To finish the proof of the Gleason-Yamabe theorem, we have to somehow use the available structures on locally compact groups (such as Haar measure) to build good metrics on those groups (or on suitable subgroups or quotient groups). The basic construction is as follows:

Definition 5 (Building metrics out of test functions) Let ${G}$ be a topological group, and let ${\psi: G \rightarrow {\bf R}^+}$ be a bounded non-negative function. Then we define the pseudometric ${d_\psi: G \times G \rightarrow {\bf R}^+}$ by the formula

$\displaystyle d_\psi(g,h) := \sup_{x \in G} |\tau(g) \psi(x) - \tau(h) \psi(x)|$

$\displaystyle = \sup_{x \in G} |\psi(g^{-1} x ) - \psi(h^{-1} x)|$

and the semi-norm ${\| \|_\psi: G \rightarrow {\bf R}^+}$ by the formula

$\displaystyle \|g\|_\psi := d_\psi(g, \hbox{id}).$

Note that one can also write

$\displaystyle \|g\|_\psi = \sup_{x \in G} |\partial_g \psi(x)|$

where ${\partial_g \psi(x) := \psi(x) - \psi(g^{-1} x)}$ is the “derivative” of ${\psi}$ in the direction ${g}$.

Exercise 6 Let the notation and assumptions be as in the above definition. For any ${g,h,k \in G}$, establish the metric-like properties

1. (Identity) ${d_\psi(g,h) \geq 0}$, with equality when ${g=h}$.
2. (Symmetry) ${d_\psi(g,h) = d_\psi(h,g)}$.
3. (Triangle inequality) ${d_\psi(g,k) \leq d_\psi(g,h) + d_\psi(h,k)}$.
4. (Continuity) If ${\psi \in C_c(G)}$, then the map ${d_\psi: G \times G \rightarrow {\bf R}^+}$ is continuous.
5. (Boundedness) One has ${d_\psi(g,h) \leq \sup_{x \in G} |\psi(x)|}$. If ${\psi \in C_c(G)}$ is supported in a set ${K}$, then equality occurs unless ${g^{-1} h \in K K^{-1}}$.
6. (Left-invariance) ${d_\psi(g,h) = d_\psi(kg,kh)}$. In particular, ${d_\psi(g,h) = \| h^{-1} g \|_\psi = \| g^{-1} h \|_\psi}$.

In particular, we have the norm-like properties

1. (Identity) ${\|g\|_\psi \geq 0}$, with equality when ${g=\hbox{id}}$.
2. (Symmetry) ${\|g\|_\psi = \|g^{-1}\|_\psi}$.
3. (Triangle inequality) ${\|gh\|_\psi \leq \|g\|_\psi + \|h\|_\psi}$.
4. (Continuity) If ${\psi \in C_c(G)}$, then the map ${\|\|_\psi: G \rightarrow {\bf R}^+}$ is continuous.
5. (Boundedness) One has ${\|g\|_\psi \leq \sup_{x \in G} |\psi(x)|}$. If ${\psi \in C_c(G)}$ is supported in a set ${K}$, then equality occurs unless ${g \in K K^{-1}}$.

We remark that the first three properties of ${d_\psi}$ in the above exercise ensure that ${d_\psi}$ is indeed a pseudometric.

To get good metrics (such as Gleason metrics) on groups ${G}$, it thus suffices to obtain test functions ${\psi}$ that obey suitably good “regularity” properties. We will achieve this primarily by means of two tricks. The first trick is to obtain high-regularity test functions by convolving together two low-regularity test functions, taking advantage of the existence of a left-invariant Haar measure ${\mu}$ on ${G}$. The second trick is to obtain low-regularity test functions by means of a metric-like object on ${G}$. This latter trick may seem circular, as our whole objective is to get a metric on ${G}$ in the first place, but the key point is that the metric one starts with does not need to have as many “good properties” as the metric one ends up with, thanks to the regularity-improving properties of convolution. As such, one can use a “bootstrap argument” (or induction argument) to create a good metric out of almost nothing. It is this bootstrap miracle which is at the heart of the proof of the Gleason-Yamabe theorem (and hence to the solution of Hilbert’s fifth problem).

The arguments here are based on the nonstandard analysis arguments used to establish Hilbert’s fifth problem by Hirschfeld and by Goldbring (and also some unpublished lecture notes of Goldbring and van den Dries). However, we will not explicitly use any nonstandard analysis in this post.

— 1. Warmup: the Birkhoff-Kakutani theorem —

To illustrate the basic idea of using test functions to build metrics, let us first establish a classical theorem on topological groups, which gives a necessary and sufficient condition for metrisability. Recall that a topological space is metrisable if there is a metric on that space that generates the topology.

Theorem 7 (Birkhoff-Kakutani theorem) A topology group is metrisable if and only if it is Hausdorff and first countable.

Remark 8 The group structure is crucial; for instance, the long line is Hausdorff and first countable, but not metrisable.

We now prove this theorem (following the arguments in this book of Montgomery and Zippin). The “only if” direction is easy, so it suffices to establish the “if” direction. The key lemma is

Lemma 9 (Urysohn-type lemma) Let ${G}$ be a Hausdorff first countable group. Then there exists a bounded continuous function ${\psi: G \rightarrow [0,1]}$ with the following properties:

• (Unique maximum) ${\psi(\hbox{id}) = 1}$, and ${\psi(x) < 1}$ for all ${x \neq \hbox{id}}$.
• (Neighbourhood base) The sets ${\{ x \in G: \psi(x) > 1-1/n \}}$ for ${n=1,2,\ldots}$ form a neighbourhood base at the identity.
• (Uniform continuity) For every ${\varepsilon > 0}$, there exists an open neighbourhood ${U}$ of the identity such that ${|\psi(gx)-\psi(x)| \leq \varepsilon}$ for all ${g \in U}$ and ${x \in G}$.

Note that if ${G}$ had a left-invariant metric, then the function ${\psi(x) := \max( 1 - d(x,\hbox{id}), 0)}$ would suffice for this lemma, which already gives some indication as to why this lemma is relevant to the Birkhoff-Kakutani theorem.

Exercise 10 Let ${G}$ be a Hausdorff first countable group, and let ${\psi}$ be as in Lemma 9. Show that ${d_\psi}$ is a metric on ${G}$ (so in particular, ${d_\psi(g,h)}$ only vanishes when ${g=h}$) and that ${d_\psi}$ generates the topology of ${G}$ (thus every set which is open with respect to ${d_\psi}$ is open in ${G}$, and vice versa).

In view of the above exercise, we see that to prove the Birkhoff-Kakutani theorem, it suffices to prove Lemma 9, which we now do. By first countability, we can find a countable neighbourhood base

$\displaystyle V_1 \supset V_2 \supset \ldots \supset \{\hbox{id}\}$

of the identity. As ${G}$ is Hausdorff, we must have

$\displaystyle \bigcap_{n=1}^\infty V_n = \{\hbox{id}\}.$

Using the continuity of the group operations, we can recursively find a sequence of nested open neighbourhoods of the identity

$\displaystyle U_1 \supset U_{1/2} \supset U_{1/4} \supset \ldots \supset \{\hbox{id}\} \ \ \ \ \ (2)$

such that each ${U_{1/2^n}}$ is symmetric (i.e. ${g \in U_{1/2^n}}$ if and only if ${g^{-1} \in U_{1/2^n}}$), is contained in ${V_n}$, and is such that ${U_{1/2^{n+1}} \cdot U_{1/2^{n+1}} \subset U_{1/2^n}}$ for each ${n \geq 0}$. In particular the ${U_{1/2^n}}$ are also a neighbourhood base of the identity with

$\displaystyle \bigcap_{n=1}^\infty U_{1/2^n} = \{\hbox{id}\}. \ \ \ \ \ (3)$

For every dyadic rational ${a/2^n}$ in ${(0,1)}$, we can now define the open sets ${U_{a/2^n}}$ by setting

$\displaystyle U_{a/2^n} := U_{1/2^{n_k}} \cdot \ldots \cdot U_{1/2^{n_1}}$

where ${a/2^n = 2^{-n_1} + \ldots + 2^{-n_k}}$ is the binary expansion of ${a/2^n}$ with ${1 \leq n_1 < \ldots < n_k}$. By repeated use of the hypothesis ${U_{1/2^{n+1}} \cdot U_{1/2^{n+1}} \subset U_{1/2^n}}$ we see that the ${U_{a/2^n}}$ are increasing in ${a/2^n}$; indeed, we have the inclusion

$\displaystyle U_{1/2^n} \cdot U_{a/2^n} \subset U_{(a+1)/2^n} \ \ \ \ \ (4)$

for all ${n \geq 1}$ and ${1 \leq a < 2^n}$.

We now set

$\displaystyle \psi(x) := \sup \{ 1 - \frac{a}{2^n}: n \geq 1; 1 \leq a < 2^n; x \in U_{a/2^n} \}$

with the understanding that ${\psi(x)=0}$ if the supremum is over the empty set. One easily verifies using (4) that ${\psi}$ is continuous, and furthermore obeys the uniform continuity property. The neighbourhood base property follows since the ${U_{1/2^n}}$ are a neighbourhood base of the identity, and the unique maximum property follows from (3). This proves Lemma 9, and the Birkhoff-Kakutani theorem follows.

Exercise 11 Let ${G}$ be a topological group. Show that ${G}$ is completely regular, that is to say for every closed subset ${F}$ in ${G}$ and every ${x \in G \backslash F}$, there exists a continuous function ${f: G \rightarrow {\bf R}}$ that equals ${1}$ on ${F}$ and vanishes on ${x}$.

Exercise 12 (Reduction to the metrisable case) Let ${G}$ be a locally compact group, let ${U}$ be an open neighbourhood of the identity, and let ${G'}$ be the group generated by ${U}$.

• (i) Construct a sequence of open neighbourhoods of the identity

$\displaystyle U \supset U_1 \supset U_2 \supset \ldots$

with the property that ${U_{n+1}^2 \subset U_n}$ and ${U_{n+1}^U \subset U_n}$ for all ${n \geq 1}$, where ${A^B := \{ a^b: a \in A, b \in B \}}$ and ${a^b := b^{-1} a b}$.

• (ii) If we set ${H := \bigcap_{n=1}^\infty U_n}$, show that ${H}$ is a closed normal subgroup ${G'}$ in ${U}$, and the quotient group ${G'/H}$ is Hausdorff and first countable (and thus metrisable, by the Birkhoff-Kakutani theorem).
• (iii) Conclude that to prove the Gleason-Yamabe theorem (Theorem 1), it suffices to do so under the assumption that ${G}$ is metrisable.

The above arguments are essentially in this paper of Gleason.

Exercise 13 (Birkhoff-Kakutani theorem for local groups) Let ${G}$ be a local group which is Hausdorff and first countable. Show that there exists an open neighbourhood ${V_0}$ of the identity which is metrisable.

— 2. Obtaining the commutator estimate via convolution —

We now return to the main task of constructing Gleason metrics. The first thing we will do is dispense with the commutator property (1). Thus, define a weak Gleason metric on a topological group ${G}$ to be a left-invariant metric ${d: G \times G \rightarrow {\bf R}^+}$ which generates the topology on ${G}$ and obeys the escape property for some constant ${C>0}$, thus one has

$\displaystyle \|g^n\| \geq \frac{1}{C} n \|g\| \hbox{ whenever } g \in G, n \geq 1, \hbox{ and } n \|g\| \leq \frac{1}{C}. \ \ \ \ \ (5)$

In this section we will show

Theorem 14 Every weak Gleason metric is a Gleason metric (possibly after adjusting the constant ${C}$).

We now prove this theorem. The key idea here is to involve a bump function ${\phi}$ formed by convolving together two Lipschitz functions. The escape property (5) will be crucial in obtaining quantitative control of the metric geometry at very small scales, as one can study the size of a group element ${g}$ very close to the origin through its powers ${g^n}$, which are further away from the origin.

Specifically, let ${\varepsilon > 0}$ be a small quantity to be chosen later, and let ${\psi \in C_c(G)}$ be a non-negative Lipschitz function supported on the ball ${B(0,\varepsilon)}$ which is not identically zero. For instance, one could use the explicit function

$\displaystyle \psi(x) := (1 - \frac{\|x\|}{\varepsilon})_+$

where ${y_+ := \max(y,0)}$, although the exact form of ${\psi}$ will not be important for our argument. Being Lipschitz, we see that

$\displaystyle \| \partial_g \psi \|_{C_c(G)} \ll \|g\| \ \ \ \ \ (6)$

for all ${g \in G}$ (where we allow implied constants to depend on ${G}$, ${\varepsilon}$, and ${\psi}$), where ${\|\|_{C_c(G)}}$ denotes the sup norm.

Let ${\mu}$ be a left-invariant Haar measure on ${G}$, the existence of which was established in Theorem 3 from Notes 3. We then form the convolution ${\phi := \psi * \psi}$, with convolution defined using the formula

$\displaystyle f*g(x) := \int_G f(y) g(y^{-1} x)\ d\mu(y). \ \ \ \ \ (7)$

This is a continuous function supported in ${B(0,2\varepsilon)}$, and gives a metric ${d_\phi}$ and a norm ${\| \|_\phi}$ as usual.

We now prove a variant of the commutator estimate (1), namely that

$\displaystyle \| \partial_g \partial_h \phi \|_{C_c(G)} \ll \|g\| \| h \| \ \ \ \ \ (8)$

whenever ${g, h \in B(0,\varepsilon)}$. To see this, we first use the left-invariance of Haar measure to write

$\displaystyle \partial_h \phi = (\partial_h \psi) * \psi, \ \ \ \ \ (9)$

thus

$\displaystyle \partial_h \phi(x) = \int_G (\partial_h \psi)(y) \psi(y^{-1} x)\ d\mu(y).$

We would like to similarly move the ${\partial_g}$ operator over to the second factor, but we run into a difficulty due to the non-abelian nature of ${G}$. Nevertheless, we can still do this provided that we twist that operator by a conjugation. More precisely, we have

$\displaystyle \partial_g \partial_h \phi(x) = \int_G (\partial_h \psi)(y) (\partial_{g^y} \psi)(y^{-1} x)\ d\mu(y) \ \ \ \ \ (10)$

where ${g^y := y^{-1} g y}$ is ${g}$ conjugated by ${y}$. If ${h \in B(0,\varepsilon)}$, the integrand is only non-zero when ${y \in B(0,2\varepsilon)}$. Applying (6), we obtain the bound

$\displaystyle \| \partial_g \partial_h \phi \|_{C_c(g)} \ll \|h\| \sup_{y \in B(0,2\varepsilon)} \|g^y\|.$

To finish the proof of (8), it suffices to show that

$\displaystyle \|g^y\| \ll \|g\|$

whenever ${g \in B(0,\varepsilon)}$ and ${y \in B(0,2\varepsilon)}$.

We can achieve this by the escape property (5). Let ${n}$ be a natural number such that ${n \|g\| \leq \varepsilon}$, then for all ${m \leq n}$ we have ${\|g^m\| \leq \varepsilon}$ and so ${g^m \in B(0,\varepsilon)}$. Conjugating by ${y}$, this implies that ${(g^y)^m \in B(0,5\varepsilon)}$ for all ${m \leq n}$, and so by (5), we have ${\|g^y\| \ll \frac{1}{n}}$ (if ${\varepsilon}$ is small enough), and the claim follows.

Next, we claim that the norm ${\| \|_\phi}$ is locally comparable to the original norm ${\| \|}$. More precisely, we claim:

1. If ${g \in G}$ with ${\| g \|_\phi}$ sufficiently small, then ${\| g \| \ll \| g\|_\phi}$.
2. If ${g \in G}$ with ${\| g \|}$ sufficiently small, then ${\|g\|_\phi \ll \|g\|}$.

Claim 2 follows easily from (9) and (6), so we turn to Claim 1. Let ${g \in G}$, and let ${n}$ be a natural number such that

$\displaystyle n \|g\|_\phi < \| \phi \|_{C_c(G)}.$

Then by the triangle inequality

$\displaystyle \|g^n \|_\phi < \|\phi \|_{C_c(G)}.$

This implies that ${\phi}$ and ${\tau_{g^n} \phi}$ have overlapping support, and hence ${g^n}$ lies in ${B(0,4\varepsilon)}$. By the escape property (5), this implies (if ${\varepsilon}$ is small enough) that ${\|g\| \ll \frac{1}{n}}$, and the claim follows.

Combining Claim 2 with (8) we see that

$\displaystyle \| \partial_g \partial_h \phi \|_{C_c(G)} \ll \|g\|_\phi \| h \|_\phi$

whenever ${\|g\|_\phi, \|h\|_\phi}$ are small enough. Now we use the identity

$\displaystyle \| [g,h]\|_\phi = \| \tau([g,h]) \phi - \phi \|_{C_c(G)}$

$\displaystyle = \| \tau(g) \tau(h) \phi - \tau(h) \tau(g) \phi \|_{C_c(G)}$

$\displaystyle = \| \partial_g \partial_h \phi - \partial_h \partial_g \phi \|_{C_c(G)}$

and the triangle inequality to conclude that

$\displaystyle \| [g,h] \|_\phi \ll \|g\|_\phi \|h\|_\phi$

whenever ${\|g\|_\phi, \|h\|_\phi}$ are small enough. Theorem 14 then follows from Claim 1 and Claim 2.

— 3. Building metrics on NSS groups —

We will now be able to build metrics on groups using a set of hypotheses that do not explicitly involve any metric at all. The key hypothesis will be the no small subgroups (NSS) property:

Definition 15 (No small subgroups) A topological group ${G}$ has the no small subgroups (or NSS) property if there exists an open neighbourhood ${U}$ of the identity which does not contain any subgroup of ${G}$ other than the trivial group.

Exercise 16 Show that any Lie group is NSS.

Exercise 17 Show that any group with a weak Gleason metric is NSS.

For an example of a group which is not NSS, consider the infinite-dimensional torus ${({\bf R}/{\bf Z})^{\bf N}}$. From the definition of the product topology, we see that any neighbourhood of the identity in this torus contains an infinite-dimensional subtorus, and so this group is not NSS.

Exercise 18 Show that for any prime ${p}$, the ${p}$-adic groups ${{\bf Z}_p}$ and ${{\bf Q}_p}$ are not NSS. What about the solenoid group ${{\bf R} \times {\bf Z}_p / {\bf Z}^\Delta}$?

Exercise 19 Show that an NSS group is automatically Hausdorff. (Hint: use Exercise 3 from Notes 3.)

Exercise 20 Show that an NSS locally compact group is automatically metrisable. (Hint: use Exercise 12.)

Exercise 21 (NSS implies escape property) Let ${G}$ be a locally compact NSS group. Show that if ${U}$ is a sufficiently small neighbourhood of the identity, then for every ${g \in G \backslash \{\hbox{id}\}}$, there exists a positive integer ${n}$ such that ${g^n \not \in U}$. Furthermore, for any other neighbourhood ${V}$ of the identity, there exists a positive integer ${N}$ such that if ${g,\ldots,g^N \in U}$, then ${g \in V}$.

We can now prove the following theorem (first proven in full generality by Yamabe), which is a key component in the proof of the Gleason-Yamabe theorem and in the wider theory of Hilbert’s fifth problem.

Theorem 22 Every NSS locally compact group admits a weak Gleason metric. In particular, by Theorem 14 and Theorem 3, every NSS locally compact group is isomorphic to a Lie group.

In view of this theorem and Exercise 16, we see that for locally compact groups, the property of being a Lie group is equivalent to the property of being an NSS group. This is a major advance towards both the Gleason-Yamabe theorem and Hilbert’s fifth problem, as it has reduced the property of being a Lie group into a condition that is almost purely algebraic in nature.

We now prove Theorem 22. An important concept will be that of an escape norm associated to an open neighbourhood ${U}$ of a group ${G}$, defined by the formula

$\displaystyle \|g\|_{e,U} := \inf \{ \frac{1}{n+1}: g, g^2, \ldots, g^n \in U \} \ \ \ \ \ (11)$

for any ${g \in G}$, where ${n}$ ranges over the natural numbers (thus, for instance ${\|g\|_{e,U} \leq 1}$, with equality iff ${g \not \in U}$). Thus, the longer it takes for the orbit ${g, g^2, \ldots}$ to escape ${U}$, the smaller the escape norm.

Strictly speaking, the escape norm is not necessarily a norm, as it need not obey the symmetry, non-degeneracy, or triangle inequalities; however, we shall see that in many situations, the escape norm behaves similarly to a norm, even if it does not exactly obey the norm axioms. Also, as the name suggests, the escape norm will be well suited for establishing the escape property (5).

It is possible for the escape norm ${\|g\|_{e,U}}$ of a non-identity element ${g \in G}$ to be zero, if ${U}$ contains the group ${\langle g \rangle}$ generated by ${U}$. But if the group ${G}$ has the NSS property, then we see that this cannot occur for all sufficiently small ${U}$ (where “sufficiently small” means “contained in a suitably chosen open neighbourhood ${U_0}$ of the identity”). In fact, more is true: if ${U, U'}$ are two sufficiently small open neighbourhoods of the identity in a locally compact NSS group ${G}$, then the two escape norms are comparable, thus we have

$\displaystyle \|g \|_{e,U} \ll \|g\|_{e,U'} \ll \|g\|_{e,U} \ \ \ \ \ (12)$

for all ${g \in G}$ (where the implied constants can depend on ${U, U'}$).

By symmetry, it suffices to prove the second inequality in (12). By (11), it suffices to find an integer ${m}$ such that whenever ${g \in G}$ is such that ${g, g^2, \ldots, g^m \in U}$, then ${g \in U'}$. But this follows from Exercise 21. This concludes the proof of (12).

Exercise 23 Let ${G}$ be a locally compact group. Show that if ${d}$ is a left-invariant metric on ${G}$ obeying the escape property (5) that generates the topology, then ${G}$ is NSS, and ${\| g\|}$ is comparable to ${\|g\|_{e,U}}$ for all sufficiently small ${U}$ and for all sufficiently small ${g}$. (In particular, any two left-invariant metrics obeying the escape property and generating the topology are locally comparable to each other.)

Henceforth ${G}$ is a locally compact NSS group. We now establish a metric-like property on the escape norm ${\|\|_{e,U_0}}$.

Proposition 24 (Approximate triangle inequality) Let ${U_0}$ be a sufficiently small open neighbourhood of the identity. Then for any ${n}$ and any ${g_1,\ldots,g_n \in G}$, one has

$\displaystyle \| g_1 \ldots g_n \|_{e,U_0} \ll \sum_{i=1}^n \|g_i\|_{e,U_0}$

(where the implied constant can depend on ${U_0}$).

Of course, in view of (12), the exact choice of ${U_0}$ is irrelevant, so long as it is small. It is slightly convenient to take ${U_0}$ to be symmetric (thus ${U_0 = U_0^{-1}}$), so that ${\|g\|_{e,U_0} = \|g^{-1}\|_{e,U_0}}$ for all ${g}$.

Proof: We will use a bootstrap argument. Assume to start with that we somehow already have a weaker form of the conclusion, namely

$\displaystyle \| g_1 \ldots g_n \|_{e,U_0} \leq M \sum_{i=1}^n \|g_i\|_{e,U_0} \ \ \ \ \ (13)$

for all ${n,g_1,\ldots,g_n}$ and some huge constant ${M}$; we will then deduce the same estimate with a smaller value of ${M}$. Afterwards we will show how to remove the hypothesis (13).

Now suppose we have (13) for some ${M}$. Motivated by the argument in the previous section, we now try to convolve together two “Lipschitz” functions. For this, we will need some metric-like functions. Define the modified escape norm ${\|g\|_{*,U_0}}$ by the formula

$\displaystyle \|g\|_{*,U_0} := \inf \{ \sum_{i=1}^n \|g_i\|_{e,U_0}: g = g_1 \ldots g_n \}$

where the infimum is over all possible ways to split ${g}$ as a finite product of group elements. From (13), we have

$\displaystyle \frac{1}{M}\|g\|_{e,U_0} \leq \|g\|_{*,U_0} \leq \|g\|_{e,U_0} \ \ \ \ \ (14)$

and we have the triangle inequality

$\displaystyle \|gh\|_{*,U_0} \leq \|g\|_{*,U_0} + \|h\|_{*,U_0}$

for any ${g,h \in G}$. We also have the symmetry property ${\|g\|_{*,U_0} = \|g^{-1} \|_{*,U_0}}$. Thus ${\| \|_{*,U_0}}$ gives a left-invariant semi-metric on ${G}$ by defining

$\displaystyle \hbox{dist}_{*,U_0}(g,h) := \|g^{-1} h \|_{*,U_0}.$

We can now define a “Lipschitz” function ${\psi: G \rightarrow {\bf R}}$ by setting

$\displaystyle \psi(x) := (1 - M \hbox{dist}_{*,U_0}(x, U_0))_+.$

On the one hand, we see from (14) that this function takes values in ${[0,1]}$ and obeys the Lipschitz bound

$\displaystyle |\partial_g \psi(x)| \leq M \|g\|_{e,U_0} \ \ \ \ \ (15)$

for any ${g, x \in G}$. On the other hand, it is supported in the region where ${\hbox{dist}_{*,U_0}(x,U_0) \leq 1/M}$, which by (14) (and (11)) is contained in ${U_0^2}$.

We could convolve ${\psi}$ with itself in analogy to the preceding section, but in doing so, we will eventually end up establishing a much worse estimate than (13) (in which the constant ${M}$ is replaced with something like ${M^2}$). Instead, we will need to convolve ${\psi}$ with another function ${\eta}$, that we define as follows. We will need a large natural number ${L}$ (independent of ${M}$) to be chosen later, then a small open neighbourhood ${U_1 \subset U_0}$ of the identity (depending on ${L, U_0}$) to be chosen later. We then let ${\eta: G \rightarrow {\bf R}}$ be the function

$\displaystyle \eta(x) := \sup \{ 1 - \frac{j}{L}: x \in U_1^j U_0; j = 0,\ldots,L \} \cup \{0\}.$

Similarly to ${\psi}$, we see that ${\eta}$ takes values in ${[0,1]}$ and obeys the Lipschitz-type bound

$\displaystyle |\partial_g \eta(x)| \leq \frac{1}{L} \ \ \ \ \ (16)$

for all ${g \in U_1}$ and ${x \in G}$. Also, ${\eta}$ is supported in ${U_1^L U_0}$, and hence (if ${U_1}$ is sufficiently small depending on ${L,U_0}$) is supported in ${U_0^2}$, just as ${\psi}$ is.

The functions ${\psi, \eta}$ need not be continuous, but they are compactly supported, bounded, and Borel measurable, and so one can still form their convolution ${\phi := \psi * \eta}$, which will then be continuous and compactly supported; indeed, ${\phi}$ is supported in ${U_0^4}$.

We have a lower bound on how big ${\phi}$ is, since

$\displaystyle \phi(0) \geq \mu(U_0) \gg 1$

(where we allow implied constants to depend on ${\mu, U_0}$, but remain independent of ${L}$, ${U_1}$, or ${M}$). This gives us a way to compare ${\| \|_{\phi}}$ with ${\| \|_{e,U_0}}$. Indeed, if ${n \|g\|_{\phi} < \phi(0)}$, then (as in the proof of Claim 1 in the previous section) we have ${g^n \in U_0^8}$; this implies that

$\displaystyle \| g \|_{e,U_0^8} \ll \| g \|_{\phi}$

for all ${g \in G}$, and hence by (12) we have

$\displaystyle \| g \|_{e,U_0} \ll \| g \|_{\phi} \ \ \ \ \ (17)$

also. In the converse direction, we have

$\displaystyle \|g\|_\phi = \| \partial_g (\psi * \eta) \|_{C_c(G)}$

$\displaystyle = \| (\partial_g \psi) * \eta \|_{C_c(G)}$

$\displaystyle \ll M \|g\|_{e,U_0} \ \ \ \ \ (18)$

thanks to (15). But we can do better than this, as follows. For any ${g, h \in G}$, we have the analogue of (10), namely

$\displaystyle \partial_g \partial_h \phi(x) = \int_G (\partial_h \psi)(y) (\partial_{g^y} \eta)(y^{-1} x)\ d\mu(y)$

If ${h \in U_0}$, then the integrand vanishes unless ${y \in U_0^3}$. By continuity, we can find a small open neighbourhood ${U_2 \subset U_1}$ of the identity such that ${g^y \in U_1}$ for all ${g \in U_2}$ and ${y \in U_0^3}$; we conclude from (15), (16) that

$\displaystyle |\partial_g \partial_h \phi(x)| \ll \frac{M}{L} \|h\|_{e,U_0}.$

whenever ${h \in U_0}$ and ${g \in U_2}$. To use this, we observe the telescoping identity

$\displaystyle \partial_{g^n} = n \partial_g + \sum_{i=0}^{n-1} \partial_g \partial_{g^i}$

for any ${g \in G}$ and natural number ${n}$, and thus by the triangle inequality

$\displaystyle \| g^n \|_\phi = n \| g \|_\phi + O( \sum_{i=0}^{n-1} \| \partial_g \partial_{g^i} \phi \|_{C_c(G)} ). \ \ \ \ \ (19)$

We conclude that

$\displaystyle \|g^n\|_\phi = n \|g\|_\phi + O( n \frac{M}{L} \|g\|_{e,U_0} )$

whenever ${n \geq 1}$ and ${g,\ldots,g^n \in U_2}$. Using the trivial bound ${\|g^n\|_\phi = O(1)}$, we then have

$\displaystyle \|g\|_\phi \ll \frac{1}{n} + \frac{M}{L} \|g\|_{e,U_0};$

optimising in ${n}$ we obtain

$\displaystyle \|g\|_\phi \ll \|g\|_{e,U_2} + \frac{M}{L} \|g\|_{e,U_0}$

and hence by (12)

$\displaystyle \|g\|_\phi \ll (\frac{M}{L} + O_{U_2}(1)) \|g\|_{e,U_0}$

where the implied constant in ${O_{U_2}(1)}$ can depend on ${U_0,U_1,U_2, L}$, but is crucially independent of ${M}$. Note the essential gain of ${\frac{1}{L}}$ here compared with (18). We also have the norm inequality

$\displaystyle \|g_1 \ldots g_n \|_\phi \leq \sum_{i=1}^n \|g_i\|_\phi.$

Combining these inequalities with (17) we see that

$\displaystyle \| g_1 \ldots g_n \|_{e,U_0} \ll (\frac{1}{L} M + O_{U_2}(1)) \sum_{i=1}^n \|g_i\|_{e,U_0}.$

Thus we have improved the constant ${M}$ in the hypothesis (13) to ${O( \frac{1}{L} M ) + O_{U_2}(1)}$. Choosing ${L}$ large enough and iterating, we conclude that we can bootstrap any finite constant ${M}$ in (13) to ${O(1)}$.

Of course, there is no reason why there has to be a finite ${M}$ for which (13) holds in the first place. However, one can rectify this by the usual trick of creating an epsilon of room. Namely, one replaces the escape norm ${\| g \|_{e,U_0}}$ by, say, ${\|g\|_{e,U_0}+\varepsilon}$ for some small ${\varepsilon > 0}$ in the definition of ${\| \|_{*,U_0}}$ and in the hypothesis (13). Then the bound (13) will be automatic with a finite ${M}$ (of size about ${O(1/\varepsilon)}$). One can then run the above argument with the requisite changes and conclude a bound of the form

$\displaystyle \| g_1 \ldots g_n \|_{e,U_0} \ll \sum_{i=1}^n (\|g_i\|_{e,U_0}+\varepsilon)$

uniformly in ${\varepsilon}$; we omit the details. Sending ${\varepsilon \rightarrow 0}$, we have thus shown Proposition 24. $\Box$

Now we can finish the proof of Theorem 22. Let ${G}$ be a locally compact NSS group, and let ${U_0}$ be a sufficiently small neighbourhood of the identity. From Proposition 24, we see that the escape norm ${\| \|_{e,U_0}}$ and the modified escape norm ${\| \|_{*,U_0}}$ are comparable. We have seen ${d_{*,U_0}}$ is a left-invariant pseudometric. As ${G}$ is NSS and ${U_0}$ is small, there are no non-identity elements with zero escape norm, and hence no non-identity elements with zero modified escape norm either; thus ${d_{*,U_0}}$ is a genuine metric.

We now claim that ${d_{*,U_0}}$ generates the topology of ${G}$. Given the left-invariance of ${d_{*,U_0}}$, it suffices to establish two things: firstly, that any open neighbourhood of the identity contains a ball around the identity in the ${d_{*,U_0}}$ metric; and conversely, any such ball contains an open neighbourhood around the identity.

To prove the first claim, let ${U}$ be an open neighbourhood around the identity, and let ${U' \subset U}$ be a smaller neighbourhood of the identity. From (12) we see (if ${U'}$ is small enough) that ${\| \|_{*,U_0}}$ is comparable to ${\| \|_{e,U'}}$, and ${U'}$ contains a small ball around the origin in the ${d_{*,U_0}}$ metric, giving the claim. To prove the second claim, consider a ball ${B(0,r)}$ in the ${d_{*,U_0}}$ metric. For any positive integer ${m}$, we can find an open neighbourhood ${U_m}$ of the identity such that ${U_m^m \subset U_0}$, and hence ${\|g\|_{e,U_0} \leq \frac{1}{m}}$ for all ${g \in U_m}$. For ${m}$ large enough, this implies that ${U_m \subset B(0,r)}$, and the claim follows.

To finish the proof of Theorem 22, we need to verify the escape property (5). Thus, we need to show that if ${g \in G}$, ${n \geq 1}$ are such that ${n \|g\|_{*,U_0}}$ is sufficiently small, then we have ${\|g^n\|_{*,U_0} \gg n \|g\|_{*,U_0}}$. We may of course assume that ${g}$ is not the identity, as the claim is trivial otherwise. As ${\|\|_{*,U_0}}$ is comparable to ${\| \|_{e,U_0}}$, we know that there exists a natural number ${m \ll 1 / \| g \|_{*,U_0}}$ such that ${g^m \not \in U_0}$. Let ${U_1}$ be a neighbourhood of the identity small enough that ${U_1^2 \subset U_0}$. We have ${\|g^i\|_{*,U_0} \leq n \|g\|_{*,U_0}}$ for all ${i=1,\ldots,n}$, so ${g^i \in U_1}$ and hence ${m > n}$. Let ${m+i}$ be the first multiple of ${n}$ larger than ${n}$, then ${i \leq n}$ and so ${g^i \in U_1}$. Since ${g^m \not \in U_0}$, this implies ${g^{m+i} \not \in U_1}$. Since ${m+i}$ is divisible by ${n}$, we conclude that ${\| g^n \|_{e,U_1} \geq \frac{n}{m+i} \gg n \| g \|_{*,U_0}}$, and the claim follows from (12).

— 4. NSS from subgroup trapping —

In view of Theorem 22, the only remaining task in the proof of the Gleason-Yamabe theorem is to locate “big” subquotients ${G'/H}$ of a locally compact group ${G}$ with the NSS property. We will need some further notation. Given a neighbourhood ${V}$ of the identity in a topological group ${G}$, let ${Q[V]}$ denote the union of all the subgroups of ${G}$ that are contained in ${V}$. Thus, a group is NSS if ${Q[V]}$ is trivial for all sufficiently small ${V}$.

We will need a property that is weaker than NSS:

Definition 25 (Subgroup trapping) A topological group has the subgroup trapping property if, for every open neighbourhood ${U}$ of the identity, there exists another open neighbourhood ${V}$ of the identity such that ${Q[V]}$ generates a subgroup ${\langle Q[V] \rangle}$ contained in ${U}$.

Clearly, every NSS group has the subgroup trapping property. Informally, groups with the latter property do have small subgroups, but one cannot get very far away from the origin just by combining together such subgroups.

Example 26 The infinite-dimensional torus ${({\bf R}/{\bf Z})^{\bf N}}$ does not have the NSS property, but it does have the subgroup trapping property.

It is difficult to produce an example of a group that does not have the subgroup trapping property; the reason for this will be made clear in the next section. For now, we establish the following key result.

Proposition 27 (From subgroup trapping to NSS) Let ${G}$ be a locally compact group with the subgroup trapping property, and let ${U}$ be an open neighbourhood of the identity in ${G}$. Then there exists an open subgroup ${G'}$ of ${G}$, and a compact subgroup ${N}$ of ${G'}$ contained in ${U}$, such that ${G'/N}$ is locally compact and NSS. In particular, by Theorem 22, ${G'/N}$ is isomorphic to a Lie group.

Intuitively, the idea is to use the subgroup trapping property to find a small compact normal subgroup ${N}$ that contains ${Q[V]}$ for some small ${V}$, and then quotient this group out to get an NSS group. Unfortunately, because ${N}$ is not necessarily contained in ${V}$, this quotienting operation may create some additional small subgroups. To fix this, we need to pass from the compact subgroup ${N}$ to a smaller one. In order to understand the subgroups of compact groups, the main tool will be Gleason-Yamabe theorem for compact groups (Theorem 4).

For us, the main reason why we need the compact case of the Gleason-Yamabe theorem is that Lie groups automatically have the NSS property, even though ${G}$ need not. Thus, one can view Theorem 4 as giving the compact case of Proposition 27.

We now prove Proposition 27, using an argument of Yamabe. Let ${G}$ be a locally compact group with the subgroup trapping property, and let ${U}$ be an open neighbourhood of the identity. We may find a smaller neighbourhood ${U_1}$ of the identity with ${U_1^2 \subset U}$, which in particular implies that ${\overline{U_1} \subset U}$; by shrinking ${U_1}$ if necessary, we may assume that ${\overline{U_1}}$ is compact. By the subgroup trapping property, one can find an open neighbourhood ${U_2}$ of the identity such that ${\langle Q(U_2) \rangle}$ is contained in ${U_1}$, and thus ${H := \overline{\langle Q(U_2) \rangle}}$ is a compact subgroup of ${G}$ contained in ${U}$. By shrinking ${U_2}$ if necessary we may assume ${U_2 \subset U_1}$.

Ideally, if ${H}$ were normal and contained in ${U_2}$, then the quotient group ${G/H}$ would have the NSS property. Unfortunately ${H}$ need not be normal, and need not be contained in ${U_2}$, but we can fix this as follows. Applying Theorem 4, we can find a compact normal subgroup ${N}$ of ${H}$ contained in ${U_2 \cap H}$ such that ${H/N}$ is isomorphic to a Lie group, and in particular is NSS. In particular, we can find an open symmetric neighbourhood ${U_3}$ of the identity in ${G}$ such that ${U_3 N U_3 \subset U_2}$ and that the quotient space ${\pi(U_3 N U_3 \cap H)}$ has no non-trivial subgroups in ${H/N}$, where ${\pi: H \rightarrow H/N}$ is the quotient map.

We now claim that ${N}$ is normalised by ${U_3}$. Indeed, if ${g \in U_3}$, then the conjugate ${N^g := g^{-1} N g}$ of ${N}$ is contained in ${U_3 N U_3}$ and hence in ${U_2}$. As ${N^g}$ is a group, it must thus be contained in ${Q(U_2)}$ and hence in ${H}$. But then ${\pi(N^g)}$ is a subgroup of ${H/N}$ that is contained in ${\pi(U_3 N U_3 \cap H)}$, and is hence trivial by construction. Thus ${N^g \subset N}$, and so ${N}$ is normalised by ${U_3}$. If we then let ${G'}$ be the subgroup of ${G}$ generated by ${N}$ and ${U_3}$, we see that ${G'}$ is an open subgroup of ${G}$, with ${N}$ a compact normal subgroup of ${G'}$.

To finish the job, we need to show that ${G'/N}$ has the NSS property. It suffices to show that ${U_3 N U_3 / N}$ has no nontrivial subgroups. But any subgroup in ${U_3 N U_3 / N}$ pulls back to a subgroup in ${U_3 N U_3}$, hence in ${U_2}$, hence in ${Q(U_2)}$, hence in ${H}$; since ${(U_3 N U_3 \cap H)/N}$ has no nontrivial subgroups, the claim follows. This concludes the proof of Proposition 27.

— 5. The subgroup trapping property —

In view of Theorem 22, Proposition 27, and Exercise 12, we see that the Gleason-Yamabe theorem (Theorem 1) now reduces to the following claim.

Proposition 28 Every locally compact metrisable group has the subgroup trapping property.

We now prove this proposition, which is the hardest step of the entire proof and uses almost all the tools already developed. In particular, it requires both Theorem 4 and Gleason’s convolution trick, as well as some of the basic theory of Hausdorff distance; as such, this is perhaps the most “infinitary” of all the steps in the argument.

The Gleason-type arguments can be encapsulated in the following proposition, which is a weak version of the subgroup trapping property:

Proposition 29 (Finite trapping) Let ${G}$ be a locally compact group, let ${U}$ be an open precompact neighbourhood of the identity, and let ${m \geq 1}$ be an integer. Then there exists an open neighbourhood ${V}$ of the identity with the following property: if ${Q \subset Q[V]}$ is a symmetric set containing the identity, and ${n \geq 1}$ is such that ${Q^n \subset U}$, then ${Q^{mn} \subset U^8}$.

Informally, Proposition 29 asserts that subsets of ${Q[V]}$ grow much more slowly than “large” sets such as ${U}$. We remark that if one could replace ${U^8}$ in the conclusion here by ${U}$, then a simple induction on ${n}$ (after first shrinking ${V}$ to lie in ${U}$) would give Proposition 28. It is the loss of ${8}$ in the exponent that necessitates some non-trivial additional arguments.

Proof: } Let ${V}$ be small enough to be chosen later, and let ${Q, n}$ be as in the proposition. Once again we will convolve together two “Lipschitz” functions ${\psi, \eta}$ to obtain a good bump function ${\phi = \psi*\eta}$ which generates a useful metric for analysing the situation. The first bump function ${\psi: G \rightarrow {\bf R}}$ will be defined by the formula

$\displaystyle \psi(x) := \sup \{ 1 - \frac{j}{n}: x \in Q^j U; j = 0,\ldots,n \} \cup \{0\}.$

Then ${\psi}$ takes values in ${[0,1]}$, equals ${1}$ on ${U}$, is supported in ${U^2}$, and obeys the Lipschitz type property

$\displaystyle |\partial_q \psi(x)| \leq \frac{1}{n} \ \ \ \ \ (20)$

for all ${q \in Q}$. The second bump function ${\eta: G \rightarrow {\bf R}}$ is similarly defined by the formula

$\displaystyle \eta(x) := \sup \{ 1 - \frac{j}{M}: x \in (V^{U^4})^j U; j = 0,\ldots,M \} \cup \{0\},$

where ${V^{U^4} := \{ g^{-1} x g: x \in V, g \in U^4 \}}$, where ${M}$ is a quantity depending on ${m}$ and ${U}$ to be chosen later. If ${V}$ is small enough depending on ${U}$ and ${m}$, then ${(V^{U^4})^M \subset U}$, and so ${\eta}$ also takes values in ${[0,1]}$, equals ${1}$ on ${U}$, is supported in ${U^2}$, and obeys the Lipschitz type property

$\displaystyle |\partial_g \psi(x)| \leq \frac{1}{M} \ \ \ \ \ (21)$

for all ${g \in V^{U^4}}$.

Now let ${\phi := \psi * \eta}$. Then ${\phi}$ is supported on ${U^4}$ and ${\| \phi \|_{C_c(G)} \gg 1}$ (where implied constants can depend on ${U}$, ${\mu}$). As before, we conclude that ${g \in U^8}$ whenever ${\|g\|_\phi}$ is sufficiently small.

Now suppose that ${q \in Q[V]}$; we will estimate ${\|q\|_\phi}$. From (19) one has

$\displaystyle \|q\|_\phi \ll \frac{1}{n} \| q^n \|_\phi + \sup_{0 \leq i \leq n} \| \partial_{q^i} \partial_{q} \phi \|_{C_c(G)}$

(note that ${\partial_{q^i}}$ and ${\partial_q}$ commute). For the first term, we can compute

$\displaystyle \| q^n \|_\phi = \sup_x |\partial_{q^n} (\psi * \eta)(x)|$

and

$\displaystyle \partial_{q^n} (\psi * \eta)(x) = \int_G \psi(y) \partial_{(q^n)^y}(y^{-1} x) d\mu(y).$

Since ${q \in Q[V]}$, ${q^n \in V}$, so by (21) we conclude that

$\displaystyle \| q^n \|_\phi \ll \frac{1}{M}.$

For the second term, we similarly expand

$\displaystyle \partial_{q^i} \partial_{q^i} \phi(x) = \int_G (\partial_q \psi)(y) \partial_{(q^n)^y}(y^{-1} x) d\mu(y).$

Using (21), (20) we conclude that

$\displaystyle |\partial_{q^i} \partial_{q^i} \phi(x)| \ll \frac{1}{Mn}.$

Putting this together we see that

$\displaystyle \|q\|_\phi \ll \frac{1}{Mn}$

for all ${q \in Q[V]}$, which in particular implies that

$\displaystyle \| g \|_\phi \ll \frac{m}{M}$

for all ${g \in Q^{mn}}$. For ${M}$ sufficiently large, this gives ${Q^{mn} \subset U^8}$ as required. $\Box$

We will also need the following compactness result in the Hausdorff distance

$\displaystyle d_H( E, F ) := \max( \sup_{x \in E} \hbox{dist}(x,F), \sup_{y \in F} \hbox{dist}(E, y) )$

between two non-empty closed subsets ${E, F}$ of a metric space ${(X,d)}$.

Example 30 In ${{\bf R}}$ with the usual metric, the finite sets ${\{ \frac{i}{n}: i=1,\ldots,n\}}$ converge in Hausdorff distance to the closed interval ${[0,1]}$.

Exercise 31 Show that the space ${K(X)}$ of non-empty closed subsets of a compact metric space ${X}$ is itself a compact metric space (with the Hausdorff distance as the metric). (Hint: use the Heine-Borel theorem.)

Now we can prove Proposition 28. Let ${G}$ be a locally compact group endowed with some metric ${d}$, and let ${U}$ be an open neighbourhood of the identity; by shrinking ${U}$ we may assume that ${U}$ is precompact. Let ${V_i}$ be a sequence of balls around the identity with radius going to zero, then ${Q[V_i]}$ is a symmetric set in ${V_i}$ that contains the identity. If, for some ${i}$, ${Q[V_i]^n \subset U}$ for every ${n}$, then ${\langle Q[V_i] \rangle \subset U}$ and we are done. Thus, we may assume for sake of contradiction that there exists ${n_i}$ such that ${Q[V_i]^{n_i} \subset U}$ and ${Q[V_i]^{n_i + 1} \not \subset U}$; since the ${V_i}$ go to zero, we have ${n_i \rightarrow \infty}$. By Proposition 29, we can also find ${m_i \rightarrow \infty}$ such that ${Q[V_i]^{m_i n_i} \subset U^8}$.

The sets ${\overline{Q[V_i]}^{n_i}}$ are closed subsets of ${\overline{U}}$; by Exercise 31, we may pass to a subsequence and assume that they converge to some closed subset ${E}$ of ${\overline{U}}$. Since the ${Q[V_i]}$ are symmetric and contain the identity, ${E}$ is also symmetric and contains the identity. For any fixed ${m}$, we have ${Q[V_i]^{m n_i} \subset U^8}$ for all sufficiently large ${i}$, which on taking Hausdorff limits implies that ${E^m \subset \overline{U^8}}$. In particular, the group ${H := \overline{\langle E \rangle}}$ is a compact subgroup of ${G}$ contained in ${\overline{U^8}}$.

Let ${U_1}$ be a small neighbourhood of the identity in ${G}$ to be chosen later. By Theorem 4, we can find a normal subgroup ${N}$ of ${H}$ contained in ${U_1 \cap H}$ such that ${H/N}$ is NSS. Let ${B}$ be a neigbourhood of the identity in ${H/N}$ so small that ${B^{10}}$ has no small subgroups. A compactness argument then shows that there exists a natural number ${k}$ such that for any ${g \in H/N}$ that is not in ${B}$, at least one of ${g, \ldots,g^k}$ must lie outside of ${B^{10}}$.

Now let ${\varepsilon > 0}$ be a small parameter. Since ${Q[V_i]^{n_i+1} \not \subset U}$, we see that ${Q[V_i]^{n_i+1}}$ does not lie in the ${\varepsilon}$-neighbourhood ${\pi^{-1}(B)_\varepsilon}$ of ${\pi^{-1}(B)}$ if ${\varepsilon, U_1, B}$ are small enough, where ${\pi: H \rightarrow H/N}$ is the projection map. Let ${n'_i}$ be the first integer for which ${Q[V_i]^{n'_i}}$ does not lie in ${\pi^{-1}(B)_\varepsilon}$, then ${n'_i \leq n_i+1}$ and ${n'_i \rightarrow \infty}$ as ${i \rightarrow \infty}$ (for fixed ${\varepsilon}$). On the other hand, as ${Q[V_i]^{n'_i-1} \subset \pi^{-1}(B)_\varepsilon}$, we see from another application of Proposition 29 that ${Q[V_i]^{kn'_i} \subset (\pi^{-1}(B)_\varepsilon)^8}$ if ${i}$ is sufficiently large depending on ${\varepsilon}$.

On the other hand, since ${Q[V_i]^{n_i}}$ converges to a subset of ${H}$ in the Hausdorff distance, we know that for ${i}$ large enough, ${Q[V_i]^{2n_i}}$ and hence ${Q[V_i]^{n'_i}}$ is contained in the ${\varepsilon}$-neighbourhood of ${H}$. Thus we can find an element ${g_i}$ of ${Q[V_i]^{n'_i}}$ that lies within ${\varepsilon}$ of a group element ${h_i}$ of ${H}$, but does not lie in ${\pi^{-1}(B)_\varepsilon}$; thus ${h_i}$ lies inside ${H \backslash \pi^{-1}(B)}$. By construction of ${B}$, we can find ${1 \leq j_i \leq k}$ such that ${h^{j_i}_i}$ lies in ${H \backslash \pi^{-1}(B^{10})}$. But ${h_i^{j_i}}$ also lies within ${o(1)}$ of ${g_i^{j_i}}$, which lies in ${Q[V_i]^{kn'_i}}$ and hence in ${(\pi^{-1}(B)_\varepsilon)^8}$, where ${o(1)}$ denotes a quantity depending on ${\varepsilon}$ that goes to zero as ${\varepsilon \rightarrow 0}$. We conclude that ${H \backslash \pi^{-1}(B^{10})}$ and ${\pi^{-1}(B^8)}$ are separated by ${o(1)}$, which leads to a contradiction if ${\varepsilon}$ is sufficiently small (note that ${\overline{\pi^{-1}(B^8)}}$ and ${H \backslash \pi^{-1}(B^{10})}$ are compact and disjoint, and hence separated by a positive distance), and the claim follows.

Exercise 32 Let ${X}$ be a compact metric space, ${K_c(X)}$ denote the space of non-empty closed and connected subsets of ${X}$. Show that ${K_c(X)}$ with the Hausdorff metric is also a compact metric space.

Exercise 33 Show that the metrisability condition in Proposition 28 can be dropped; in other words, show that every locally compact group has the subgroup trapping property.

— 6. The local group case —

In the thesis of Goldbring (and also the later paper of Goldbring and van den Dries), the above theory was extended to the setting of local groups. In fact, there is relatively little difficulty (other than some notational difficulties) in doing so, because the analysis in the previous sections can be made to take place on a small neighbourhood of the origin. This extension to local groups is not simply a generalisation for its own sake; it will turn out that it will be natural to work with local groups when we classify approximate groups in later notes.

One technical issue that comes up in the theory of local groups is that basic cancellation laws such as ${gh=gk \implies h=k}$, which are easily verified for groups, are not always true for local groups. However, this is a minor issue as one can always recover the cancellation laws by passing to a slightly smaller local group, as follows.

Definition 34 (Cancellative local group) A local group ${G}$ is said to be symmetric if the inverse operation is always well-defined. It is said to be cancellative if it is symmetric, and the following axioms hold:

• (i) Whenever ${g,h,k \in G}$ are such that ${gh}$ and ${gk}$ are well-defined and equal to each other, then ${h=k}$. (Note that this implies in particular that ${(g^{-1})^{-1} = g}$.)
• (ii) Whenever ${g,h,k \in G}$ are such that ${hg}$ and ${kg}$ are well-defined and equal to each other, then ${h=k}$.
• (iii) Whenever ${g,h \in G}$ are such that ${gh}$ and ${h^{-1}g^{-1}}$ are well-defined, then ${(gh)^{-1} = h^{-1}g^{-1}}$. (In particular, if ${U \subset G}$ is symmetric and ${U^m}$ is well-defined in ${G}$ for some ${m \geq 1}$, then ${U^m}$ is also symmetric.)

Clearly, all global groups are cancellative, and more generally the restriction of a global group to a symmetric neighbourhood of the identity s cancellative. While not all local groups are cancellative, we have the following substitute:

Exercise 35 Let ${G}$ be a local group. Show that there is a neighbourhood ${U}$ of the identity which is cancellative (thus, the restriction ${G\downharpoonright_U}$ of ${G}$ to ${U}$ is cancellative).

Note that any symmetric neighbourhood of the identity in a cancellative local group is again a cancellative local group. Because of this, it turns out in practice that we may restrict to the cancellative setting without much loss of generality.

Next, we need to localise the notion of a quotient ${G/H}$ of a global group ${G}$ by a normal subgroup ${H}$. Recall that in order for a subset ${H}$ og a global group ${G}$ to be a normal subgroup, it has to be symmetric, contain the identity, be closed under multiplication (thus ${h_1 h_2 \in H}$ whenever ${h_1,h_2 \in H}$, and closed under conjugation (thus ${h^g := g^{-1} hg \in H}$ whenever ${h \in H}$ and ${g \in G}$). We now localise this concept as follows:

Definition 36 (Normal sublocal group) Let ${G}$ be a cancellative local group. A subset ${H}$ of ${G}$ is said to be a normal sublocal group if there is an open neighbourhood ${V}$ of ${H}$ (called a normalising neighbourhood of ${H}$) obeying the following axioms:

1. (Identity and inverse) ${H}$ is symmetric and contains the identity.
2. (Local closure) If ${g, h \in H}$ and ${gh}$ is well-defined in ${V}$, then ${gh \in H}$.
3. (Normality) If ${g \in V, h \in H}$ are such that ${h^g = g^{-1} h g}$ is well-defined in ${V}$, then ${h^g \in H}$.

(Strictly speaking, one should refer to the pair ${(H,V)}$ as the normal sublocal group, rather than just ${H}$, but by abuse of notation we shall omit the normalising neighbourhood ${V}$ when referring to the normal sublocal group.)

It is easy to see that if ${H}$ is a normal sublocal group of ${G}$, then ${H}$ is itself a cancellative local group, using the topology and group structure formed by restriction from ${G}$. (Note how the open neighbourhood ${V}$ is needed to ensure that the domain of the multiplication map in ${H}$ remains open.)

Example 37 In the global group ${G = {\bf R}^2 = ({\bf R}^2,+)}$, the open interval ${H := (-1,1) \times \{0\}}$ is a normal sub-local subgroup if one takes (say) ${V := (-1,1) \times (-1,1)}$ as the normalising neighbourhood.

Example 38 Let ${T: ({\bf R}/{\bf Z})^{\bf Z} \rightarrow ({\bf R}/{\bf Z})^{\bf Z}}$ be the shift map ${T (a_n)_{n \in {\bf Z}} := (a_{n-1})_{n\in {\bf Z}}}$, and let ${{\bf Z} \ltimes_T ({\bf R}/{\bf Z})^{\bf Z}}$ be the semidirect product of ${{\bf Z}}$ and ${({\bf R}/{\bf Z})^{\bf Z}}$. Then if ${H}$ is any (global) subgroup of ${({\bf R}/{\bf Z})^{\bf Z}}$, the set ${\{0\} \times H}$ is a normal sub-local subgroup of ${{\bf Z} \ltimes_T ({\bf R}/{\bf Z})^{\bf Z}}$ (with normalising neighbourhood ${\{0\} \times ({\bf R}/{\bf Z})^{\bf Z}}$). This is despite the fact that ${H}$ will, in general, not be normal in ${{\bf Z} \ltimes_T ({\bf R}/{\bf Z})^{\bf Z}}$ in the classical (global) sense.

As observed by Goldbring, one can define the operation of quotienting a local group by a normal sub-local group, provided that one restricts to a sufficiently small neighbourhood of the origin:

Exercise 39 (Quotient spaces) Let ${G}$ be a cancellative local group, and let ${H}$ be a normal sub-local group with normalising neighbourhood ${V}$. Let ${W}$ be a symmetric open neighbourhood of the identity such that ${W^6}$ is well-defined and contained in ${V}$. Show that there exists a cancellative local group ${W/H}$ and a surjective continuous homomorphism ${\phi: W \rightarrow W/H}$ such that, for any ${g, h \in W}$, one has ${\phi(g)=\phi(h)}$ if and only if ${gh^{-1} \in H}$, and for any ${E \subset W/H}$, one has ${E}$ open if and only if ${\phi^{-1}(E)}$ is open.

It is not difficult to show that the quotient ${W/H}$ defined by the above exercise is unique up to local isomorphism, so we will abuse notation and talk about “the” quotient space ${W/H}$ given by the above construction.

We can now state the local version of the Gleason-Yamabe theorem, first proven by Goldbring in his thesis, and later reproven by Goldbring and van den Dries by a slightly different method:

Theorem 40 (Local Gleason-Yamabe theorem) Let ${G}$ be a locally compact local group. Then there exists an open symmetric neighbourhood ${G'}$ of the identity, and a compact global group ${H}$ in ${G'}$ that is normalised by ${G'}$, such that ${G'/H}$ is well-defined and isomorphic to a local Lie group.

The proofs of this theorem by Goldbring and Goldbring-van den Dries were phrased in the language of nonstandard analysis. However, it is possible to translate those arguments to standard analysis arguments, which closely follow the arguments given in previous sections and notes. (Actually, our arguments are not a verbatim translation of those in Goldbring and Goldbring-van den Dries, as we have made a few simplifications in which the role of Gleason metrics is much more strongly emphasised.) We briefly sketch the main points here.

As in the global case, the route to obtaining (local) Lie structure is via Gleason metrics. On a local group ${G}$, we define a local Gleason metric to be a metric ${d: U \times U \rightarrow {\bf R}^+}$ defined on some symmetric open neighbourhood ${U}$ of the identity with (say) ${U^{100}}$ well-defined (to avoid technical issues), which generates the topology of ${U}$, and which obeys the following version of the left-invariance, escape and commutator properties:

• (Left-invariance) If ${g,h, k \in U}$ are such that ${gh, gk \in U}$, then ${d(h,k) = d(gh,gk)}$.
• (Escape property) If ${g \in U}$ and ${n \|g\| \leq \frac{1}{C}}$, then ${g,\ldots,g^n}$ are well-defined in ${U}$ and ${\|g^n\| \geq \frac{1}{C} n \|g\|}$.
• (Commutator estimate) If ${g, h \in U}$ are such that ${\|g\|, \|h\| \leq \frac{1}{C}}$, then ${[g,h]}$ is well-defined in ${U}$ and (1) holds.

One can then verify (by localisation of the arguments in Notes 2) that any locally compact local Lie group with a local Gleason metric is locally Lie (i.e. some neighbourhood of the identity is isomorphic to a local Lie group); see Exercise 10 from Notes 2. Next, one can define the notion of a weak local Gleason metric by dropping the commutator estimate, and one can verify an analogue of Theorem 14, namely that any weak local Gleason metric is automatically a local Gleason metric, after possibly shrinking the neighbourhood ${U}$ and adjusting the constant ${C}$ as necessary. The proof of this statement is essentially the same as that in Theorem 14 (which is already localised to small neighbourhoods of the identity), but uses a local Haar measure instead of a global Haar measure, and requires some preliminary shrinking of the neighbourhood ${U}$ to ensure that all group-theoretic operations (and convolutions) are well-defined. We omit the (rather tedious) details.

Now we define the concept of an NSS local group as a local group which has an open neighbourhood of the identity that contains no non-trivial global subgroups. The proof of Theorem 22 is already localised to small neighbourhoods of the identity, and it is possible (after being sufficiently careful with the notation) to translate that argument to the local setting, and conclude that any NSS local group admits a weak Gleason metric on some open neighbourhood of the identity, and is hence locally Lie. (A typical example of being “sufficiently careful with the notation”: to define the escape norm (11), one adopts the convention that a statement such as ${g,\ldots,g^n \in U}$ is automatically false if ${g,\ldots,g^n}$ are not all well-defined. The induction hypothesis (13) will play a key role in ensuring that all expressions involved are well-defined and localised to a suitably small neighbourhood of the identity.) Again, we omit the details.

The next step is to obtain a local version of Proposition 27. Here we encounter a slight difficulty because in a general local group ${G}$, we do not have a good notion of the group ${\langle A \rangle}$ generated by a set ${A}$ of generators in ${G}$. As such, the subgroup trapping property does not automatically translate to the local group setting as defined in Definition 41. However, this difficulty can be easily avoided by rewording the definition:

Definition 41 (Subgroup trapping) A local group has the subgroup trapping property if, for every open neighbourhood ${U}$ of the identity, there exists another open neighbourhood ${V}$ of the identity such that ${Q[V]}$ is contained in a global subgroup ${H}$ that is in turn contained in ${U}$. (Here, ${Q[V]}$ is, as before, the union of all the global subgroups contained in ${V}$.)

Because ${Q[V]}$ is now contained in a global group ${H}$, the group ${\langle Q[V] \rangle}$ generated by ${H}$ is well-defined. As ${H}$ is in the open neighbourhood ${U}$, one can then also form the closure ${\overline{\langle Q[V] \rangle}}$; if we choose ${U}$ small enough to be precompact, then this is a compact global group (and thus describable by the Gleason-Yamabe theorem for such groups, Theorem 4). Because of this, it is possible to adapt Proposition 27 without much difficulty to the local setting to conclude that given any locally compact local group ${G}$ with the subgroup trapping property, there exists an open symmetric neighbourhood ${G'}$ of the identity, and a compact global group ${H}$ in ${G'}$ that is normalised by ${G'}$, such that ${G'/H}$ is well-defined and NSS (and thus locally isomorphic to a local Lie group).

Finally, to finish the proof of Theorem 40, one has to establish the analogue of Proposition 28, namely that one has to show that every locally compact metrisable local group has the subgroup trapping property. (It is not difficult to adapt Exercise 12 to the local group setting to reduce to the metrisable case.) The first step is to prove the local group analogue of Proposition 29 (again adopting the obvious convention that a statement such as ${Q^n \subset U}$ is only considered true if ${Q^n}$ is well-defined, and adding the additional hypothesis that ${U}$ is sufficiently small in order to ensure that all manipulations are justified). This can be done by a routine modification of the proof. But then one can modify the rest of the argument in Proposition 28 to hold in the local setting as well (note, as in the proof of Proposition 27, that the compact set ${H}$ generated in the course of this argument remains a global group rather than a local one, and so one can again use Theorem 4 without difficulty). Again, we omit the details.