Hilbert’s fifth problem asks to clarify the extent that the assumption on a differentiable or smooth structure is actually needed in the theory of Lie groups and their actions. While this question is not precisely formulated and is thus open to some interpretation, the following result of Gleason and Montgomery-Zippin answers at least one aspect of this question:

Theorem 1 (Hilbert’s fifth problem) Let {G} be a topological group which is locally Euclidean (i.e. it is a topological manifold). Then {G} is isomorphic to a Lie group.

Theorem 1 can be viewed as an application of the more general structural theory of locally compact groups. In particular, Theorem 1 can be deduced from the following structural theorem of Gleason and Yamabe:

Theorem 2 (Gleason-Yamabe theorem) Let {G} be a locally compact group, and let {U} be an open neighbourhood of the identity in {G}. Then there exists an open subgroup {G'} of {G}, and a compact subgroup {N} of {G'} contained in {U}, such that {G'/N} is isomorphic to a Lie group.

The deduction of Theorem 1 from Theorem 2 proceeds using the Brouwer invariance of domain theorem and is discussed in this previous post. In this post, I would like to discuss the proof of Theorem 2. We can split this proof into three parts, by introducing two additional concepts. The first is the property of having no small subgroups:

Definition 3 (NSS) A topological group {G} is said to have no small subgroups, or is NSS for short, if there is an open neighbourhood {U} of the identity in {G} that contains no subgroups of {G} other than the trivial subgroup {\{ \hbox{id}\}}.

An equivalent definition of an NSS group is one which has an open neighbourhood {U} of the identity that every non-identity element {g \in G \backslash \{\hbox{id}\}} escapes in finite time, in the sense that {g^n \not \in U} for some positive integer {n}. It is easy to see that all Lie groups are NSS; we shall shortly see that the converse statement (in the locally compact case) is also true, though significantly harder to prove.

Another useful property is that of having what I will call a Gleason metric:

Definition 4 Let {G} be a topological group. A Gleason metric on {G} is a left-invariant metric {d: G \times G \rightarrow {\bf R}^+} which generates the topology on {G} and obeys the following properties for some constant {C>0}, writing {\|g\|} for {d(g,\hbox{id})}:

  • (Escape property) If {g \in G} and {n \geq 1} is such that {n \|g\| \leq \frac{1}{C}}, then {\|g^n\| \geq \frac{1}{C} n \|g\|}.
  • (Commutator estimate) If {g, h \in G} are such that {\|g\|, \|h\| \leq \frac{1}{C}}, then

    \displaystyle  \|[g,h]\| \leq C \|g\| \|h\|, \ \ \ \ \ (1)

    where {[g,h] := g^{-1}h^{-1}gh} is the commutator of {g} and {h}.

For instance, the unitary group {U(n)} with the operator norm metric {d(g,h) := \|g-h\|_{op}} can easily verified to be a Gleason metric, with the commutator estimate (1) coming from the inequality

\displaystyle  \| [g,h] - 1 \|_{op} = \| gh - hg \|_{op}

\displaystyle  = \| (g-1) (h-1) - (h-1) (g-1) \|_{op}

\displaystyle  \leq 2 \|g-1\|_{op} \|g-1\|_{op}.

Similarly, any left-invariant Riemannian metric on a (connected) Lie group can be verified to be a Gleason metric. From the escape property one easily sees that all groups with Gleason metrics are NSS; again, we shall see that there is a partial converse.

Remark 1 The escape and commutator properties are meant to capture “Euclidean-like” structure of the group. Other metrics, such as Carnot-Carathéodory metrics on Carnot Lie groups such as the Heisenberg group, usually fail one or both of these properties.

The proof of Theorem 2 can then be split into three subtheorems:

Theorem 5 (Reduction to the NSS case) Let {G} be a locally compact group, and let {U} be an open neighbourhood of the identity in {G}. Then there exists an open subgroup {G'} of {G}, and a compact subgroup {N} of {G'} contained in {U}, such that {G'/N} is NSS, locally compact, and metrisable.

Theorem 6 (Gleason’s lemma) Let {G} be a locally compact metrisable NSS group. Then {G} has a Gleason metric.

Theorem 7 (Building a Lie structure) Let {G} be a locally compact group with a Gleason metric. Then {G} is isomorphic to a Lie group.

Clearly, by combining Theorem 5, Theorem 6, and Theorem 7 one obtains Theorem 2 (and hence Theorem 1).

Theorem 5 and Theorem 6 proceed by some elementary combinatorial analysis, together with the use of Haar measure (to build convolutions, and thence to build “smooth” bump functions with which to create a metric, in a variant of the analysis used to prove the Birkhoff-Kakutani theorem); Theorem 5 also requires Peter-Weyl theorem (to dispose of certain compact subgroups that arise en route to the reduction to the NSS case), which was discussed previously on this blog.

In this post I would like to detail the final component to the proof of Theorem 2, namely Theorem 7. (I plan to discuss the other two steps, Theorem 5 and Theorem 6, in a separate post.) The strategy is similar to that used to prove von Neumann’s theorem, as discussed in this previous post (and von Neumann’s theorem is also used in the proof), but with the Gleason metric serving as a substitute for the faithful linear representation. Namely, one first gives the space {L(G)} of one-parameter subgroups of {G} enough of a structure that it can serve as a proxy for the “Lie algebra” of {G}; specifically, it needs to be a vector space, and the “exponential map” needs to cover an open neighbourhood of the identity. This is enough to set up an “adjoint” representation of {G}, whose image is a Lie group by von Neumann’s theorem; the kernel is essentially the centre of {G}, which is abelian and can also be shown to be a Lie group by a similar analysis. To finish the job one needs to use arguments of Kuranishi and of Gleason, as discussed in this previous post.

The arguments here can be phrased either in the standard analysis setting (using sequences, and passing to subsequences often) or in the nonstandard analysis setting (selecting an ultrafilter, and then working with infinitesimals). In my view, the two approaches have roughly the same level of complexity in this case, and I have elected for the standard analysis approach.

Remark 2 From Theorem 7 we see that a Gleason metric structure is a good enough substitute for smooth structure that it can actually be used to reconstruct the entire smooth structure; roughly speaking, the commutator estimate (1) allows for enough “Taylor expansion” of expressions such as {g^n h^n} that one can simulate the fundamentals of Lie theory (in particular, construction of the Lie algebra and the exponential map, and its basic properties. The advantage of working with a Gleason metric rather than a smoother structure, though, is that it is relatively undemanding with regards to regularity; in particular, the commutator estimate (1) is roughly comparable to the imposition {C^{1,1}} structure on the group {G}, as this is the minimal regularity to get the type of Taylor approximation (with quadratic errors) that would be needed to obtain a bound of the form (1). We will return to this point in a later post.

— 1. Proof of theorem —

We now prove Theorem 7. Henceforth, {G} is a locally compact group with a Gleason metric {d} (and an associated “norm” {\|g\| = d(g, \hbox{id})}). In particular, by the Heine-Borel theorem, {G} is complete with this metric.

We use the asymptotic notation {X \ll Y} in place of {X \leq CY} for some constant {C} that can vary from line to line (in particular, {C} need not be the constant appearing in the definition of a Gleason metric), and write {X \sim Y} for {X \ll Y \ll X}. We also let {\epsilon > 0} be a sufficiently small constant (depending only on the constant in the definition of a Gleason metric) to be chosen later.

Note that the left-invariant metric properties of {d} give the symmetry property

\displaystyle  \|g^{-1} \| = \|g\|

and the triangle inequality

\displaystyle  \|g_1 \ldots g_n \| \leq \sum_{i=1}^n \|g_i\|.

From the commutator estimate (1) and the triangle inequality we also obtain a conjugation estimate

\displaystyle  \| ghg^{-1} \| \sim \|h\|

whenever {\|g\|, \|h\| \leq \epsilon}. Since left-invariance gives

\displaystyle  d(g,h) = \| g^{-1} h \|

we then conclude an approximate right invariance

\displaystyle  d(gk,hk) \sim d(g,h)

whenever {\|g\|, \|h\|, \|k\| \leq \epsilon}. In a similar spirit, the commutator estimate (1) also gives

\displaystyle  d(gh,hg) \ll \|g\| \|h\| \ \ \ \ \ (2)

whenever {\|g\|, \|h\| \leq \epsilon}.

This has the following useful consequence, which asserts that the power maps {g \mapsto g^n} behave like dilations:

Lemma 8 If {n \geq 1} and {\|g\|, \|h\| \leq \epsilon/n}, then

\displaystyle  d(g^n h^n, (gh)^n) \lesssim n^2 \|g\| \|h\|


\displaystyle  d(g^n,h^n) \sim n d(g,h).

Proof: We begin with the first inequality. By the triangle inequality, it suffices to show that

\displaystyle  d( (gh)^i g^{n-i} h^{n-i}, (gh)^{i+1} g^{n-i-1} h^{n-i-1} ) \ll n \|g\| \|h\| \ \ \ \ \ (3)

uniformly for all {0 \leq i < n}. By left-invariance and approximate right-invariance, the left-hand side is comparable to

\displaystyle  d( g^{n-i-1} h, h g^{n-i-1} ),

which by (2) is bounded above by

\displaystyle  \ll \|g^{n-i-1}\| \|h\| \ll n \|g\| \|h\|

as required.

Now we prove the second estimate. Write {g = hk}, then {\|k \| = d(g,h) \leq 2\epsilon/n}. We have

\displaystyle  d(h^n k^n,h^n) = \|k^n\| \sim n \|k\|

thanks to the escape property (shrinking {\epsilon} if necessary). On the other hand, from the first inequality, we have

\displaystyle  d(g^n, h^n k^n) \ll n^2 \|h\| \|k\|.

If {\epsilon} is small enough, the claim now follows from the triangle inequality. \Box

Remark 3 Lemma 8 implies (by a standard covering argument) that the group {G} is locally of bounded doubling, though we will not use this fact here.

Now we introduce the space {L(G)} of one-parameter subgroups, i.e. continuous homomorphisms {\phi: {\bf R} \rightarrow G}. We give this space the compact-open topology, thus the topology is generated by balls of the form

\displaystyle  \{ \phi \in L(G): \sup_{t \in I} d(\phi(t),\phi_0(t)) < r \}

for {\phi_0 \in L(G)}, {r > 0}, and compact {I}. Actually, using the homomorphism property, one can use a single compact interval {I}, such as {[-1,1]}, to generate the topology if desired, thus making {L(G)} a metric space.

Given that {G} is eventually going to be shown to be a Lie group, {L(G)} must be isomorphic to a Euclidean space. We now move towards this goal by establishing various properties of {L(G)} that Euclidean spaces enjoy.

Lemma 9 {L(G)} is locally compact.

Proof: It is easy to see that {L(G)} is complete. Let {\phi_0 \in L(G)}. As {\phi_0} is continuous, we can find an interval {I = [-T,T]} small enough that {\| \phi_0(t) \| \leq \epsilon} for all {t \in [-T,T]}. By the Heine-Borel theorem, it will suffice to show that the set

\displaystyle  B := \{ \phi \in L(G): \sup_{t \in [-T,T]} d(\phi(t),\phi_0(t)) < \epsilon \}

is totally bounded. By the Arzelá-Ascoli theorem, it suffices to show that the family of functions in {B} is equicontinuous.

By construction, we have {\| \phi(t) \| \leq 2\epsilon} whenever {|t| \leq T}. By the escape property, this implies (for {\epsilon} small enough, of course) that {\| \phi(t/n) \| \ll \epsilon/n} for all {|t| \leq T} and {n \geq 1}, thus {\| \phi(t) \| \ll \epsilon |t| / T} whenever {|t| \leq T}. From the homomorphism property, we conclude that {d(\phi(t),\phi(t')) \ll \epsilon |t-t'| / T} whenever {|t|, |t'| \leq T}, which gives uniform Lipschitz control and hence equicontinuity as desired. \Box

We observe for future reference that the proof of the above lemma also shows that all one-parameter subgroups are locally Lipschitz.

Now we put a vector space structure on {L(G)}, which we define by analogy with the Lie group case, in which each tangent vector {X} generates a one-parameter subgroup {t \mapsto \exp(tX)}. From this analogy, the scalar multiplication operation has an obvious definition: if {\phi \in L(G)} and {c \in {\bf R}}, we define {c\phi \in L(G)} to be the one-parameter subgroup

\displaystyle  c \phi(t) := \phi(ct)

which is easily seen to actually be a one-parameter subgroup.

Now we turn to the addition operation. In the Lie group case, one can express the one-parameter subgroup {t \mapsto \exp(t(X+Y))} in terms of the one-parameter subgroups {t \mapsto \exp(tX)}, {t \mapsto \exp(tY)} by the limiting formula

\displaystyle  \exp(t(X+Y)) = \lim_{n \rightarrow \infty} (\exp(tX/n) \exp(tY/n))^n.

In view of this, we would like to define the sum {\phi+\psi} of two one-parameter subgroups {\phi, \psi \in L(G)} by the formula

\displaystyle  (\phi+\psi)(t) := \lim_{n \rightarrow \infty} (\phi(t/n) \psi(t/n))^n.

Lemma 10 If {\phi, \psi \in L(G)}, then {\phi+\psi} is well-defined and also lies in {L(G)}.

Proof: To show well-definedness, it suffices to show that for each {t}, the sequence {(\phi(t/n) \psi(t/n))^n} is a Cauchy sequence. It suffices to show that

\displaystyle  \sup_{m \geq 1} d( (\phi(t/n) \psi(t/n))^n, (\phi(t/nm) \psi(t/nm))^{nm}) \rightarrow 0

as {n \rightarrow \infty}. By the continuity of multiplication, it suffices to show that there is some {\delta > 0} such that

\displaystyle  \sup_{m \geq 1} \sup_{1 \leq n' \leq \delta n} d( (\phi(t/n) \psi(t/n))^{n'}, (\phi(t/nm) \psi(t/nm))^{n'm}) \rightarrow 0

as {n \rightarrow \infty}.

Since {\phi,\psi} are locally Lipschitz, we can find a quantity {A} (depending on {t, \phi, \psi}) such that

\displaystyle  \| \phi(t/n) \|, \| \psi(t/n) \| \ll A / n

for all {n}. From Lemma 8, we conclude that

\displaystyle  d( \phi(t/n) \psi(t/n), (\phi(t/nm) \psi(t/nm)^m) ) \ll m^2 (A/nm) (A/nm) = A^2 / n^2

if {m \geq 1} and {n} is sufficiently large. Another application of Lemma 8 then gives

\displaystyle  d( (\phi(t/n) \psi(t/n))^{n'}, (\phi(t/nm) \psi(t/nm)^{n'm}) ) \ll m^2 (A/nm) (A/nm) = A^2 / n

if {m \geq 1}, {n} is sufficiently large, {1 \leq n' \leq \delta n}, and {\delta} is sufficiently small depending on {A}. The claim follows.

The above argument in fact shows that {(\phi(t/n) \psi(t/n))^n} is uniformly Cauchy for {t} in a compact interval, and so the pointwise limit {\phi+\psi} is in fact a uniform limit of continuous functions and is thus continuous. To prove that {\phi+\psi} is a homomorphism, it suffices by density of the rationals to show that

\displaystyle  (\phi+\psi)( at ) (\phi+\psi)( bt ) = (\phi+\psi)( (a+b)t )


\displaystyle  (\phi+\psi)(-t) = (\phi+\psi)(t)^{-1}

for all {t \in {\bf R}} and all positive integers {a,b}. To prove the first claim, we observe that

\displaystyle  (\phi+\psi)(at) = \lim_{n \rightarrow \infty} (\phi(at/n) \psi(at/n))^n

\displaystyle  = \lim_{n \rightarrow \infty} (\phi(t/n) \psi(t/n))^{an}

and similarly for {(\phi+\psi)(bt)} and {(\phi+\psi)((a+b)t)}, whence the claim. To prove the second claim, we see that

\displaystyle  (\phi+\psi)(-t)^{-1} = \lim_{n \rightarrow \infty} (\phi(-t/n) \psi(-t/n))^{-n}

\displaystyle  = \lim_{n \rightarrow \infty} (\psi(t/n) \phi(t/n))^n,

but {(\psi(t/n) \phi(t/n))^n} is {(\phi(t/n) \psi(t/n))^n} conjugated by {\psi(t/n)}, which goes to the identity; and the claim follows. \Box

{L(G)} also has an obvious zero element, namely the trivial one-parameter subgroup {t \mapsto \hbox{id}}.

Lemma 11 {L(G)} is a topological vector space.

Proof: We first show that {L(G)} is a vector space. It is clear that the zero element {0} of {L(G)} is an additive and scalar multiplication identity, and that scalar multiplication is associative. To show that addition is commutative, we again use the observation that {(\psi(t/n) \phi(t/n))^n} is {(\phi(t/n) \psi(t/n))^n} conjugated by an element that goes to the identity. A similar argument shows that {(-\phi) + (-\psi) = -(\phi+\psi)}, and a change of variables argument shows that {(a\phi) + (a\psi) = a(\phi+\psi)} for all positive integers {a}, hence for all rational {a}, and hence by continuity for all real {a}. The only remaining thing to show is that addition is associative, thus if {\phi, \psi, \eta \in L(G)}, that {((\phi+\psi)+\eta)(t) = (\phi+(\psi+\eta))(t)} for all {t \in {\bf R}}. By the homomorphism property, it suffices to show this for all sufficiently small {t}.

An inspection of the argument used to establish (10) reveals that there is a constant {A} such that

\displaystyle  d( (\phi+\psi)(t), (\phi(t/n) \psi(t/n))^n ) \ll A^2 / n

for all small {t} and all large {n}, and hence also that

\displaystyle  d( (\phi+\psi)(t/n), \phi(t/n) \psi(t/n) ) \ll A^2 / n^2

(thanks to Lemma 8). Similarly we have (after adjusting {A} if necessary)

\displaystyle  d( ((\phi+\psi)+\eta)(t), ((\phi+\psi)(t/n) \eta(t/n))^n ) \ll A^2 / n.

From Lemma 8 we have

\displaystyle  d( ((\phi+\psi)(t/n) \eta(t/n))^n, (\phi(t/n) \psi(t/n)\eta(t/n))^n ) \ll A^2/n

and thus

\displaystyle  d( ((\phi+\psi)+\eta)(t), (\phi(t/n) \psi(t/n) \eta(t/n))^n ) \ll A^2 / n.

Similarly for {\phi+(\psi+\eta)}. By the triangle inequality we conclude that

\displaystyle  d( ((\phi+\psi)+\eta)(t), (\phi+(\psi+\eta))(t)) \ll A^2 / n;

sending {t} to zero, the claim follows.

Finally, we need to show that the vector space operations are continuous. It is easy to see that scalar multiplication is continuous, as are the translation operations; the only remaining thing to verify is that addition is continuous at the origin. Thus, for every {\epsilon > 0} we need to find a {\delta > 0} such that {\sup_{t \in [-1,1]} \| (\phi+\psi)(t) \| \leq \epsilon} whenever {\sup_{t \in [-1,1]} \| \phi(t) \| \leq \delta} and {\sup_{t \in [-1,1]} \| \psi(t) \| \leq \delta}. But if {\phi, \psi} are as above, then by the escape property (assuming {\delta} small enough) we conclude that {\| \phi(t)\|, \|\psi(t)\| \ll \delta |t|} for {t \in [-1,1]}, and then from the triangle inequality we conclude that {\| (\phi+\psi)(t) \| \ll \delta} for {t \in [-1,1]}, giving the claim. \Box

As {L(G)} is both locally compact, metrisable, and a topological vector space, it must be isomorphic to a finite-dimensional vector space {{\bf R}^n} with the usual topology (see this blog post for a proof).

In analogy with the Lie algebra setting, we define the exponential map {\exp: L(G) \rightarrow G} by setting {\exp(\phi) := \phi(1)}. Given the topology on {L(G)}, it is clear that this is a continuous map. Using Lemma 8 one can see that the exponential map is locally injective near the origin, although we will not actually need this fact.

We have proved a number of useful things about {L(G)}, but at present we have not established that {L(G)} is large in any substantial sense; indeed, at present, {L(G)} could be completely trivial even if {G} was large. In particular, the image of the exponential map {\exp} could conceivably be quite small. We now address this issue. As a warmup, we show that {L(G)} is at least non-trivial if {G} is non-trivial:

Proposition 12 Suppose that {G} is not a discrete group. Then {L(G)} is non-trivial.

Of course, the converse is obvious; discrete groups do not admit any non-trivial one-parameter subgroups.

Proof: As {G} is not discrete, there is a sequence {g_n} of non-identity elements of {G} such that {\|g_n\| \rightarrow 0} as {n \rightarrow \infty}. Writing {N_n} for the integer part of {\epsilon / \|g_n\|}, then {N_n \rightarrow \infty} as {n \rightarrow \infty}, and we conclude from the escape property that {\| g_n^{N_n} \| \sim \epsilon} for all {n}.

We define the approximate one-parameter subgroups {\phi_n: [-1,1] \rightarrow G} by setting

\displaystyle  \phi_n(t) := g_n^{\lfloor t N_n \rfloor}.

Then we have {\|\phi_n(t) \| \ll \epsilon |t| + \frac{\epsilon}{N_n}} for {|t| \leq 1}, and we have the approximate homomorphism property

\displaystyle  d( \phi_n(t+s), \phi_n(t) \phi_n(s) ) \rightarrow 0

uniformly whenever {|t|, |s|, |t+s| \leq 1}. As a consequence, {\phi_n} is asymptotically equicontinuous on {[-1,1]}, and so by (a slight generalisation of) the Arzéla-Ascoli theorem, we may pass to a subsequence in which {\phi_n} converges uniformly to a limit {\phi: [-1,1] \rightarrow G}, which is a genuine homomorphism that is genuinely continuous, and is thus can be extended to a one-parameter subgroup. Also, {\|\phi_n(1)\| = \|g_n^{N_n} \| \sim \epsilon} for all {n}, and thus {\|\phi(1)\| \sim \epsilon}; in particular, {\phi} is non-trivial, and the claim follows. \Box

We now generalise the above proposition to a more useful result.

Proposition 13 For any neighbourhood {K} of the origin in {L(G)}, {\exp(K)} is a neighbourhood of the identity in {G}.

Proof: We use an argument of Hirschfeld (communicated to me by van den Dries and Goldbring). By shrinking {K} if necessary, we may assume that {K} is a compact star-shaped neighbourhood, with {\exp(K)} contained in the ball of radius {\epsilon} around the origin. As {K} is compact, {\exp(K)} is compact also.

Suppose for contradiction that {\exp(K)} is not a neighbourhood of the identity, then there is a sequence {g_n} of elements of {G \backslash K} such that {\|g_n\| \rightarrow 0} as {n \rightarrow \infty}. By the compactness of {K}, we can find an element {h_n} of {K} that minimises the distance {d(g_n,h_n)}. If we then write {g_n = h_n k_n}, then

\displaystyle \|k_n\| = d(g_n,h_n) \leq d(g_n,\hbox{id}) = \|g_n\|

and hence {\|h_n\|, \|k_n\| \rightarrow 0} as {n \rightarrow \infty}.

Let {N_n} be the integer part of {\epsilon_n / \|k_n\|}, then {N_n \rightarrow \infty} as {n \rightarrow \infty}, and {\|k_n^{N_n} \| \sim \epsilon} for all {n}.

Let {\phi_n: [-1,1] \rightarrow G} be the approximate one-parameter subgroups defined as

\displaystyle  \phi_n(t) := k_n^{\lfloor t N_n \rfloor}.

As before, we may pass to a subsequence such that {\phi_n} converges uniformly to a limit {\phi: [-1,1] \rightarrow G}, which extends to a one-parameter subgroup {\phi \in L(G)}.

In a similar vein, since {h_n \in \exp(K)}, we can find {\psi_n \in K} such that {\psi_n(1) = h_n}, which by the escape property (and the smallness of {K} implies that {\| \psi_n(t) \| \ll t \| h_n\|} for {|t| \leq 1}. In particular, {\psi_n} goes to zero in {L(G)}.

We now claim that {\exp( \psi_n + \frac{1}{N_n} \phi )} is close to {g_n}. Indeed, from Lemma 8 we see that

\displaystyle  d( \exp( \psi_n + \frac{1}{N_n} \phi ), \exp( \psi_n ) \exp( \frac{1}{N_n} \phi ) ) \ll \frac{1}{N_n} \| h_n \|.

Since {\exp(\psi_n) = h_n}, we conclude from the triangle inequality and left-invariance that

\displaystyle  d( \exp( \psi_n + \frac{1}{N_n} \phi ), g_n) \ll \frac{1}{N_n} \| h_n \| + d( k_n, \exp( \frac{1}{N_n} \phi ) ).

But from Lemma 8 again, one has

\displaystyle  d( k_n, \exp( \frac{1}{N_n} \phi ) ) \ll \frac{1}{N_n} d( k_n^{N_n}, \exp( \phi ) ) = o( 1/N_n )

and thus

\displaystyle  d( \exp( \psi_n + \frac{1}{N_n} \phi ), g_n) = o(1/N_n).

But for {n} large enough, {\psi_n + \frac{1}{N_n} \phi} lies in {K}, and so the distance from {g_n} to {K} is {o(1/N_n) = o(d(g_n,h_n))}. But this contradicts the minimality of {h_n} for {n} large enough, and the claim follows. \Box

We have some easy corollaries of this result:

Corollary 14 {G} is locally connected. In particular, the connected component {G^\circ} of the identity is an open subgroup of {G}.

Corollary 15 (Abelian case) If {G} is abelian, then {G} is isomorphic to a Lie group. In particular, in the non-abelian setting, the centre {Z(G)} of {G} is a Lie group.

Proof: In the abelian case one easily sees that {\exp} is a homomorphism. Thus we see from Proposition 13 that {G} has locally the structure of a vector space, and the claim clearly follows in that case. \Box

We are now finally ready to prove Theorem 7. By Corollary 14 we may assume without loss of generality that {G} is connected. (Note that if a topological group {G} is locally connected, and the connected component of the identity {G^\circ} is a Lie group, then the entire group a Lie group, because all outer automorphisms of {G^\circ} are necessarily smooth, as discussed here.)

Now we consider the adjoint action of {G} on {L(G)}. If {g \in G} and {\phi \in L(G)}, we can define another one-parameter subgroup {\hbox{Ad}_g(\phi) \in L(G)} by setting

\displaystyle  \hbox{Ad}_g(\phi)(t) := g\phi(t) g^{-1}.

As conjugation by {g} is an automorphism, one easily verifies that {\hbox{Ad}_g: L(G) \rightarrow L(G)} is linear, thus {\hbox{Ad}} is a map from {G} to the finite-dimensional linear group {GL(L(G))}. One easily verifies that this map is continuous, and so {\hbox{Ad}} is a finite-dimensional linear representation of {G}. If {g} is in the kernel of this representation, then by construction, {g} centralises {\exp(L(G))}, and thus by Proposition 13, centralises an open neighbourhood of the identity in {G}. As we are assuming {G} to be connected, we conclude that {g} is central. Thus we see that the kernel of {\hbox{Ad}} is the center {Z(G)}, thus giving a short exact sequence

\displaystyle  0 \rightarrow Z(G) \rightarrow G \rightarrow G/Z(G) \rightarrow 0.

The adjoint representation {\hbox{Ad}} is a faithful finite-dimensional linear representation of {G/Z(G)}, and so {G/Z(G)} is a Lie group by a theorem of von Neumann (discussed here). By Corollary 15, {Z(G)} is a central Lie group. By a result of Kuranishi and Gleason (discussed here), this implies that {G} is itself a Lie group, as required.

Remark 4 An alternate approach to Theorem 7 would be to construct a Lie bracket on {L(G)}, and then show that the multiplication law on {G} is locally given by the Baker-Campbell-Hausdorff formula; we will discuss this approach in a sequel to this post.