We now set aside our discussion of the finite time extinction results for Ricci flow with surgery (Theorem 4 from Lecture 2), and turn instead to the main portion of Perelman’s argument, which is to establish the global existence result for Ricci flow with surgery (Theorem 2 from Lecture 2), as well as the discreteness of the surgery times (Theorem 3 from Lecture 2).

As mentioned in Lecture 1, local existence of the Ricci flow is a fairly standard application of nonlinear parabolic theory, once one uses de Turck’s trick to transform Ricci flow into an explicitly parabolic equation. The trouble is, of course, that Ricci flow can and does develop singularities (indeed, we have just spent several lectures showing that singularities must inevitably develop when certain topological hypotheses (e.g. simple connectedness) or geometric hypotheses (e.g. positive scalar curvature) occur). In principle, one can use surgery to remove the most singular parts of the manifold at every singularity time and then restart the Ricci flow, but in order to do this one needs some rather precise control on the geometry and topology of these singular regions. (In particular, there are some hypothetical bad singularity scenarios which cannot be easily removed by surgery, due to topological obstructions; a major difficulty in the Perelman program is to show that such scenarios in fact cannot occur in a Ricci flow.)

In order to analyse these singularities, Hamilton and then Perelman employed the standard nonlinear PDE technique of “blowing up” the singularity using the scaling symmetry, and then exploiting as much “compactness” as is available in order to extract an “asymptotic profile” of that singularity from a sequence of such blowups, which had better properties than the original Ricci flow. [The PDE notion of a blowing up a solution around a singularity, by the way, is vaguely analogous to the algebraic geometry notion of blowing up a variety around a singularity, though the two notions are certainly not identical.] A sufficiently good classification of all the possible asymptotic profiles will, in principle, lead to enough structural properties on general singularities to Ricci flow that one can see how to perform surgery in a manner which controls both the geometry and the topology.

However, in order to carry out this program it is necessary to obtain geometric control on the Ricci flow which does not deteriorate when one blows up the solution; in the jargon of nonlinear PDE, we need to obtain bounds on some quantity which is both coercive (it bounds the geometry) and either critical (it is essentially invariant under rescaling) or subcritical (it becomes more powerful when one blows up the solution) with respect to the scaling symmetry. The discovery of controlled quantities for Ricci flow which were simultaneously coercive and critical was Perelman’s first major breakthrough in the subject (previously known controlled quantities were either supercritical or only partially coercive); it made it possible, at least in principle, to analyse general singularities of Ricci flow and thus to begin the surgery program discussed above. (In contrast, the main reason why questions such as Navier-Stokes global regularity are so difficult is that no controlled quantity which is both coercive and critical or subcritical is known.) The mere existence of such a quantity does not by any means establish global existence of Ricci flow with surgery immediately, but it does give one a non-trivial starting point from which one can hope to make progress.

To be a more precise, recall from Lecture 1 that the Ricci flow equation \frac{d}{dt} g = -2 \hbox{Ric}, in any spatial dimension d, has two basic symmetries (besides the geometric symmetry of diffeomorphism invariance); it has the obvious time-translation symmetry g(t) \mapsto g(t-t_0) (keeping the manifold M fixed), but it also has the scaling symmetry

g(t) \mapsto \lambda^2 g( \frac{t}{\lambda^2} ) (1)

for any \lambda > 0 (again keeping M fixed as a topological manifold). When applied with \lambda < 1, this scaling shrinks all lengths on the manifold M by a factor \lambda (recall that the length |v|_g of a tangent vector v is given by the square root of g(v,v)), and also speeds up the flow of time by a factor 1/\lambda^2; conversely, when applied with \lambda > 0, the scaling expands all lengths by a factor \lambda, and slows down the flow of time by 1/\lambda^2.

Suppose now that one has a Ricci flow t \mapsto (M,g(t)) which becomes singular at some time T > 0. To analyse the behaviour of the flow as one approaches the singular time T, one picks a sequence of times t_n \to T^- approaching T from below, a sequence of marked points x_n \in M(t_n) = M on the manifold, and a sequence of length scales L_n > 0 which go to zero as n \to \infty. One then considers the blown up Ricci flows t \mapsto (M^{(n)}, g^{(n)}(t)), where M^{(n)} is equal to M as a topological manifold (with x_n as a marked point or “origin” O), and g^{(n)}(t) is the flow of metrics given by the formula

g^{(n)}(t) := \frac{1}{L_n^2} g( t_n + L_n^2 t ). (2)

Thus the flow t \mapsto (M^{(n)}, g^{(n)}(t)) represents a renormalised flow in which the time t_n has been redesignated as the temporal origin 0, the point x_n has been redesignated as the spatial origin O, and the length scale L_n has been redesignated as the unit length scale (and the time scale L_n^2
has been redesignated as the unit time scale). Thus the behaviour of the rescaled flow t \mapsto (M^{(n)}, g^{(n)}(t))at unit scales of space and time around the spacetime origin (thus t = O(1) and x \in B(O,O(1))) correspond to the behaviour of the original flow t \mapsto (M,g(T)) at spatial scale L_n and time scale L_n^2 around the spacetime point (t_n,x_n), thus t = t_n + O( L_n^2 ) and x \in B(x_n,O(L_n)).

Because the original Ricci flow existed on the time interval 0 \leq t < T, the rescaled Ricci flow will exist on the time interval -\frac{t_n}{L_n^2} \leq t < \frac{T-t_n}{L_n^2}. In particular, in the limit n \to \infty (leaving aside for the moment the question of what “limit” means precisely here), these Ricci flows become increasingly ancient, in that they will have existed on the entire past time interval -\infty < t \leq 0 in the limit.

The strategy is now to show that these renormalised Ricci flows t \mapsto (M^{(n)},g^{(n)}) (with the marked origin O) exhibit enough “compactness” that there exists a subsequence of such flows which converge to some asymptotic limiting profile t \mapsto (M^{(\infty)}, g^{(\infty)}) in some sense. (We will define the precise notion of convergence of such flows later, but pointed Gromov-Hausdorff convergence is a good first approximation of the convergence concept to keep in mind for now.) If the notion of convergence is strong enough, then we will be able to conclude that this limiting profile of Ricci flows is also a Ricci flow. (Actually, due to the parabolic smoothing effects of Ricci flow, we will be able to automatically upgrade weak notions of convergence to strong ones, and so this step is in fact rather easy.) This limiting Ricci flow has better properties than the renormalised flows; for instance, while the renormalised flows are almost ancient, the limiting flow actually is an ancient solution. Also, while the Hamilton-Ivey pinching phenomenon from Lecture 3 suggests that the renormalised flows have mostly non-negative curvature, the limiting flow will have everywhere non-negative curvature (provided that the points (t_n,x_n) and scales L_n are chosen properly; we will return to this “point-picking” issue later in this course).

If one was able to classify all possible asymptotic profiles to Ricci flow, this would yield quite a bit of information on singularities to such flows, by the standard and general nonlinear PDE method of compactness and contradiction. This method, roughly speaking, runs as follows. Suppose we want to claim that whenever one is sufficiently close to a singularity, some scale-invariant property P eventually occurs. (In our specific application, P is roughly speaking going to assert that the geometry and topology of high-curvature regions can be classified as belonging to one of a short list of possible “canonical neighbourhood” types, all of which turn out to be amenable to surgery.) To prove this, we argue by contradiction, assuming we can find a Ricci flow t \mapsto (M,g(t)) in which P fails on a sequence of points in spacetime that approach the singularity, and on some sequence of scales going to zero. We then rescale the flow to create a sequence of rescaled Ricci flows t \mapsto (M^{(n)},g^{(n)}(t)) as discussed above, each of which exhibits failure of P at unit scales near the origin (here we use the hypothesis that P is scale-invariant). Now, we use compactness to find a subsequence of flows converging to an asymptotic profile t \mapsto (M^{(\infty)}, g^{(\infty)}(t)). If the convergence is strong enough, the asymptotic profile will also exhibit failure of P. But now one simply goes through the list of all possible profiles in one’s classification and verifies that each of them obeys P; and one is done.

Unfortunately, just knowing that a Ricci flow is ancient and has everywhere non-negative curvature does not seem enough, by itself, to obtain a full classification of asymptotic profiles (though one can definitely say some non-trivial statements about ancient Ricci flows with non-negative curvature, most notably the Li-Yau-Hamilton inequality, which we will discuss later). To proceed further, one needs further control on asymptotic profiles t \mapsto (M^{(\infty)}, g^{(\infty)}(t)). The only reasonable way to obtain such control is to obtain control on the rescaled flows t \mapsto (M^{(n)}, g^{(n)}(t)) which is uniform in n. While some control of this sort can be established merely by choosing the points (t_n,x_n) and scales L_n in a clever manner, there is a limit as to what one can accomplish just by point-picking alone (especially if one is interested in establishing properties P that apply to quite general regions of spacetime and general scales, rather than specific, hand-picked regions and scales). To really get good control on the rescaled flows t \mapsto (M^{(n)}, g^{(n)}(t)), one needs to obtain control on the original flow t \mapsto (M,g(t)) which does not deteriorate when one passes from the original flow to the rescaled flow.

One can express what “does not deteriorate” means more precisely using the language of dimensional analysis, or more precisely using the concepts of subcriticality, criticality, and supercriticality from nonlinear PDE. Suppose we have some (non-negative) scalar quantity F( M, g(\cdot) ) that measures some aspect of a flow t \mapsto (M, g(t)). [Dimensional analysis becomes trickier when considering tensor-valued quantities, though in practice one can use the magnitude of such quantities as a scalar-valued proxy for these tensor-valued objects; see my paper on Perelman’s argument for some further discussion.] In many situations, this quantity has some specific dimension k, in the sense that one has a scaling relationship

F( M, \lambda^2 g( \frac{\cdot}{\lambda^2} ) ) = \lambda^k F( M, g(\cdot) ) (3)

that measures how that quantity changes under the rescaling (1). In dimensional analysis language, (3) asserts that F has the units \hbox{length}^k.

Assuming that F is also invariant under time translation (and under changes of spatial origin), (3) implies that

F( M^{(n)}, g^{(n)}(\cdot) ) = L_n^{-k} F( M, g(\cdot) ). (4)

Thus, if F is critical or dimensionless (which means that k=0) or subcritical (which means that k < 0), any upper bound on F for the original Ricci flow t \mapsto (M,g(t)) will imply uniform bounds on the rescaled flows t \mapsto (M^{(n)},g^{(n)}(t)), and thus (assuming the convergence is strong enough, and F has some good continuity properties) on the asymptotic profile t \mapsto (M^{(\infty)}, g^{(\infty)}(t)). In the subcritical case, F should in fact now vanish in the limit. On the other hand, if F is supercritical (which means that k > 0) then no information about the asymptotic profile t \mapsto (M^{(\infty)}, g^{(\infty)}(t)) is obtained.

In order for control of F(M^{(\infty)}, g^{(\infty)}(\cdot)) to be truly useful, we would like the quantity F to be coercive. This term is not precisely defined (though it is somewhat analogous to the notion of a proper map), but coercivity basically means that upper bounds on F(M, g(\cdot)) translate to some upper bounds on various norms or similar quantities measuring the “size” of (M, g(\cdot)), and (hopefully) to then obtain useful bounds on the topology and geometry of (M, g(\cdot)).

Let us give some examples of various such quantities F for Ricci flow. We begin with some supercritical quantities:

  1. Any length-type quantity, e.g. the diameter \hbox{diam}(M) of the manifold, or the injectivity radius, has dimension 1 and is thus supercritical.
  2. The various widths W_2(t), W_3(t), \tilde W_3(t) of 3-dimensional Ricci flows from the previous lectures, which were based on areas of minimal surfaces, have dimension 2 and are also supercritical. Thus the various bounds we have on these quantities from Lectures 4, 5, 6 do not directly tell us anything about asymptotic profiles.
  3. The volume \int_M\ d\mu of 3-manifolds has dimension 3 and is thus also supercritical. Thus upper bounds on volume, such as Corollary 2 from Lecture 3, do not directly tell us anything about asymptotic profiles (though they are useful for other tasks, most notably for ensuring that surgery times are discrete, see Theorem 3 from Lecture 2).

As for subcritical quantities, one notable one is the minimal scalar curvature R_{\hbox{min}}. One can check (cf. the dimensional analysis at the end of Lecture 0)
that scalar curvature has dimension -2 and is thus subcritical. The quantity F(M, g(\cdot)) := \sup_t \max( - R_{\hbox{min}}, 0 ), that measures the maximal amount of negative scalar curvature present in a Ricci flow, is then bounded (by the maximum principle, see Proposition 2 of Lecture 3), and so by the previous discussion will vanish for asymptotic profiles; in other words, asymptotic profiles always have non-negative scalar curvature. Unfortunately, this quantity is only partially coercive; it prevents scalar curvature from becoming arbitrarily large and negative, but does not prevent scalar curvature from becoming arbitrarily large and positive. (Also, it is possible for other curvatures, such as Ricci and Riemann curvatures, to be large even while the scalar curvature is small or even zero.) So this quantity does say something non-trivial about asymptotic profiles, but is insufficient by itself to fully control such profiles.

In the next lecture we shall see that the least eigenvalue \lambda_1( -4\Delta + R ) of the modified Laplace-Beltrami operator, which can be viewed as an analytic analogue of the geometric quantity R_{\min} related to Poincaré inequalities, also enjoys a monotonicity property (which is connected to a certain gradient flow interpretation of (modified) Ricci flow); like R_{\min}, the least eigenvalue has dimension -2 and is thus also subcritical, but again it is not fully coercive, as it only prevents scalar curvature from becoming too negative.

So far we have not discussed any critical quantities. (One can create some trivial examples of critical quantities, such as the dimension \hbox{dim}(M) or topological quantities such as \pi_1(M), but these are not obviously coercive (the topological coercivity of the latter quantity being, of course, precisely the Poincaré conjecture that we are trying to prove!).) One way to create critical quantities is to somehow combine subcritical and supercritical examples together. Here is one simple example, due to Hamilton:

Exercise 1. Show that the quantity \max( - R_{\min}(t) V(t)^{2/d}, 0 ) is critical (scale-invariant) and monotone non-increasing in time under d-dimensional Ricci flow, where V = \int_M\ d\mu(t) denotes the volume of (M, g(t)) at time t. (This quantity can be used, for instance, to show that Ricci flow admits no “breather” solutions, i.e. non-constant periodic solutions; see the discussion in Perelman’s paper. Unfortunately, as with previous examples, it is not fully coercive.) \diamond

In the next few lectures, we will see two more advanced versions of critical controlled quantities of an analytic nature, the Perelman entropy (a scale-invariant version of the minimal eigenvalue \lambda_1( -4\Delta + R ), which is to log-Sobolev inequalities as the latter quantity is to Poincaré inequalities) and the Perelman reduced volume (which measures how heat-type kernels on Ricci flows compare against heat kernels on Euclidean space). These quantities were both introduced in Perelman’s first paper. The key feature of these new critical quantities, which distinguishes them from previously known examples, is that they are now coercive: they provide a crucial scale-invariant geometric control on a flow t \mapsto (M, g(t)), which is now known as \kappa-noncollapsing. This control, which describes a relationship between the supercritical quantities of length and volume and the subcritical quantities of curvature, will be discussed next.

— Length, volume, curvature, and collapsing —

Let p be a point in a d-dimensional complete Riemannian manifold (M,g) (we make no assumptions on the dimension d here). We will establish here some basic results in comparison geometry, which seeks to understand the relationship between the Riemann curvature \hbox{Riem} of the manifold M, and various geometric quantities of M such as the volume of balls and the injectivity radius, especially when compared against model geometries such as the sphere and hyperbolic space. (This is only a brief introduction; see e.g. Chapters 6, 9, and 10 of Petersen’s book for a more detailed treatment.)

Of course, in the case of Euclidean space {\Bbb R}^d with the Euclidean metric, the Riemann curvature is identically zero, and the volume of B(p,r) is c_d r^d for some explicit constant c_d := \frac{\pi^{d/2}}{\Gamma(\frac{d}{2}+1)} > 0 depending only on dimension. For Riemannian manifolds, it is easy to see that the volume of B(p,r) is (1+o(1)) c_d r^d in the limit r \to 0; for more precise asymptotics, see Exercises 7 and 8 below.

One of the most effective tools to study these questions comes from normal coordinates, or more precisely from the exponential map \exp_p: T_p M \to M from the tangent space T_p M to M, defined by setting \exp_p(v) to be the value of \gamma(1), where \gamma:[0,1] \to M is the unique constant-speed geodesic with \gamma(0)=p and \gamma'(0)=v. By the Hopf-Rinow theorem, M is complete (in the metric sense) if and only if the exponential map is defined on all of T_p M. Henceforth we will always assume M to be complete. The ball B(p,r) of radius r > 0 in M centred at p is then the image under the exponential map of the ball B_{T_p M}(0,r) of the tangent space of the same radius (using the metric g(p), of course):

B(p,r) = \exp_p( B_{T_p M}(0,r) ). (5)

Thus we can study the balls centred at p by using the exponential map to pull back to the tangent space T_p M and analysing the geometry there. Two radii become relevant for this approach:

  1. The injectivity radius at p is the supremum of all radii r such that \exp_p is injective on B_{T_p M}(0,r).
  2. The conjugate radius at p is the supremum of all radii r such that \exp_p is an immersion on B_{T_p M}(0,r).

In many situations, these two radii are equal, but there are cases in which the injectivity radius is smaller. In fact the injectivity radius is always less than or equal to the conjugate radius; see Exercise 4 below.

Example 1. (Sphere) Let K > 0, and let M = \frac{1}{\sqrt{K}} S^d := \{ (x_1,\ldots,x_{d+1}) \in {\Bbb R}^{d+1}: x_1^2 + \ldots + x_{d+1}^2 = 1/K \} be the sphere of radius 1/\sqrt{K}, with the metric induced from the metric ds^2 = dx_1^2 + \ldots + dx_{d+1}^2 of Euclidean space {\Bbb R}^{d+1}. Then at every point p of M, the injectivity radius and conjugate radius are both equal to \pi/\sqrt{K}, which is also the diameter of the manifold. Note also that this manifold has constant sectional curvature K. \diamond

Example 2. (Hyperbolic space) Let K > 0, and let M = \frac{1}{\sqrt{K}} H^d := \{ (t,x_1,\ldots,x_d) \in {\Bbb R}^{1+d}: x_1^2 + \ldots x_d - t^2 = 1/K ; t > 0 \} \subset {\Bbb R}^{1+d} be hyperbolic space of hyperbolic radius 1/\sqrt{K}, with the metric induced from the metric ds^2 = dx_1^2 + \ldots + dx_d^2 - dt^2 of Minkowski space. Then at any point p in M, e.g. p = (1,0), the injectivity radius, conjugate radius, and diameter are infinite. This manifold has constant sectional curvature -K. \diamond

Example 3. (Torus) Let r > 0, and let M = ({\Bbb R}/r{\Bbb Z})^d be the d-torus which is the product of d circles of length r. Then for any point p in M, the injectivity radius is r/2 and the conjugate radius is infinite. Here the sectional curvature is of course 0 everywhere. \diamond

The metric g on M induces a pullback metric on T_p M, which by abuse of notation we shall also call g. This metric can degenerate once one passes the conjugate radius, but let us ignore this issue for the time being. On T_p M, we have the radial variable r (defined as the magnitude of a tangent vector with respect to g(p)), and the radial vector field \partial_r (defined as the dual vector field to r using polar coordinates), which is smooth away from the origin.

In Euclidean space, the vector field \partial_r is the gradient of r. Happily, the same fact is true for more general Riemannian manifolds:

Lemma 1. (Gauss lemma)

  1. Away from the origin, we have |\partial_r|_g = 1 and \nabla_{\partial_r} \partial_r = 0.
  2. Away from the origin, \partial_r is the gradient \hbox{grad} r of r with respect to the metric g, thus (\partial_r)^\alpha = \nabla^\alpha r.

Exercise 2. Prove Lemma 1. (Hint: part 1 follows from the geodesic flow equation \nabla_{\dot \gamma} \dot \gamma = 0. For part 2, one way to proceed is to establish the ODE

\nabla_{\partial_r} ( \partial_r - \hbox{grad} r )^\alpha = (\nabla^\alpha (\partial_r)_\beta) ( \partial_r - \hbox{grad} r )^\beta (6)

and then apply Gronwall’s inequality. \diamond

Lemma 1 gives some important relationships between the radial vector field \partial_r and the Hessian \hbox{Hess}(r)_{\alpha \beta} := \nabla_\alpha \nabla_\beta r = \nabla_\alpha (\partial_r)_\beta (which can be viewed as the second fundamental form of the spheres centred at p):

Exercise 3. Away from the origin, obtain the deformation formula

{\mathcal L}_{\partial_r} g = 2 \hbox{Hess}(r) (7)

and the Riccati-type equation

\nabla_{\partial_r} \hbox{Hess}_{\alpha \beta} +\hbox{Hess}_{\alpha \beta} \hbox{Hess}^\beta_\gamma = \hbox{Riem}_{\alpha\gamma\beta}^\delta (\partial_r)^\gamma (\partial_r)_\delta. \diamond (8)

Also, show that \hbox{Hess}_{\alpha \beta} has \partial_r as a null eigenvector. \diamond

Exercise 4. Show that the injectivity radius r_i of a point p cannot exceed the conjugacy radius r_c. (Hint: there are several ways to establish this. Here is one: suppose for contradiction that r_i > r_c, thus r_i > (1+\varepsilon) r_c for some small \varepsilon > 0. Let v \in T_p M be a vector of magnitude at most r_c. Observe that the function d(p, x) + d( \exp_p((1+\varepsilon) v), x) achieves a global minimum at \exp_p(v) whenever and so has non-negative Hessian. Use this to obtain a lower bound on \hbox{Hess}(r) on B(p,r_c), and combine this with Exercise 3 to show that the exponential map is in fact immersed on a neighbourhood of B(p,r_c), a contradiction. Another approach is based on Klingenberg’s inequality (see Lemma 2 below), while a third approach is based on the second variation formula for the energy of a geodesic.) \diamond

Let us now impose the bound that all sectional curvatures are bounded by some K > 0 on a ball B(p,r_0), thus

|g( \hbox{Riem}(X,Y) X, Y)| \leq K (9)

for all orthonormal tangent vectors X, Y at any point in B(p,r_0). From Example 1 we know that the exponential map can become singular past the radius \pi/\sqrt{K}, so let us also assume that

r_0 \leq \pi/\sqrt{K}. (10)

Note that the sectional curvature bound also implies a Ricci curvature bound |\hbox{Ric}(X,X)| \leq (d-1)K for all unit tangent vectors based in B(p,r_0).

From (9) and (10) we see that |\hbox{Riem}|_g = O_d( r_0^{-2} ) on the ball B(p,r_0). When this latter property occurs, let us informally say that M has bounded normalised curvature at scale r_0 at p. Our analysis here can thus be interpreted as a study of the volume of balls (and of related quantities, such as the injectivity radius) under assumptions of bounded normalised curvature.

Remark 1. If one wishes, one can rescale to normalise K (or r_0) to equal 1, although this does not significantly simplify the computations that follow below. \diamond

Using equation (8), one can obtain sharp upper and lower bounds for \hbox{Hess}(r):

Exercise 5. (Comparison estimates for \hbox{Hess}(r)) Assume that (9) and (10) hold. At any non-zero point in B_{T_p M}(0,r_0), let \lambda_{\min} \leq \lambda_{\max} be the least and greatest eigenvalues of \hbox{Hess}(r) on the orthogonal complement of \partial_r. Use (8) to establish the differential inequalities

\nabla_{\partial_r} \lambda_{\max} + \lambda_{\max}^2 \leq K (11)


\nabla_{\partial_r} \lambda_{\min} + \lambda_{\min}^2 \geq -K (12)

for 0 < r < r_0 and also establish the infinitesimal bound

\lambda_{\min}, \lambda_{\max} = \frac{1}{r} + O(r) (13)

for all sufficiently small positive r. From (11), (12), (13), conclude the bounds

\sqrt{K} \coth(\sqrt{K} r) \leq \lambda_{\min} \leq \lambda_{\max} \leq \sqrt{K} \cot(\sqrt{K} r) (14)

and in particular that

(d-1) \sqrt{K} \coth(\sqrt{K} r) \leq \Delta r \leq (d-1) \sqrt{K} \cot(\sqrt{K} r). (15)

Using (7) and (14), deduce the bound

dr^2 + \frac{\sin^2(\sqrt{K} r) }{K} d\theta^2 \leq dg^2 \leq dr^2 + \frac{\sinh^2(\sqrt{K} r) }{K} d\theta^2 (16)

where (r,\theta) are the usual Euclidean polar coordinates on T_p M, thus the Euclidean metric (induced by g(p)) is given by ds^2 = dr^2 + r^2 d\theta^2). \diamond

Remark 2. Each of the above bounds are attained by either the sphere of constant sectional curvature +K (Example 1) or the hyperbolic space of constant sectional curvature -K (Example 2). More generally, one should think of these two examples as the two extreme geometries obeying the assumption (9). In the limit K=0 one recovers the formulae for Euclidean space {\Bbb R}^d or for the torus (Example 3). \diamond

Exercise 6. (Bounded curvature implies lower bound on conjugacy radius) Using Exercise 3, show that if (9) and (10) hold, then the conjugacy radius of p is at least r_0. \diamond

Remark 3. A converse of sorts to Exercise 6 is provided by Myers’ theorem, which asserts that if \hbox{Ric} \geq (d-1) K, then the diameter of M is at most \pi/\sqrt{K}. Another result in a somewhat similar spirit is the 1/4-pinched sphere theorem. The Ricatti-type equations and inequalities developed above play a key role in the proof of such theorems. \diamond

Now we relate the Hessian of r to the volume metric d\mu and the Laplacian \Delta r:

Exercise 7. Away from the origin, obtain the deformation formula

{\mathcal L}_{\partial_r} d\mu = (\Delta r) d\mu (17)

and the Riccati-type inequality

\nabla_{\partial_r} \Delta r + \frac{1}{d-1} (\Delta r)^2 \leq \nabla_{\partial_r} \Delta r + |\hbox{Hess}(r)|^2 = - \hbox{Ric}(\partial_r, \partial_r). \diamond (18)

Exercise 8. (Absolute volume comparison) Assume (9) and (10). Using Exercises 5 and 7, show that the volume of B_{T_p M}(0,r_0) is maximised in the case of hyperbolic space (Example 2) and minimised in the case of the sphere (Example 1). In particular, if r_0 \leq O(\sqrt{K}), conclude that the volume of B_{T_p M}(0,r_0) is comparable to r_0^d, with the comparability constants depending only on d and on the implied constant in the O() notation. \diamond

Remark 4. Later on in this course we will need a relative variant of this comparison inequality, known as the Bishop-Gromov comparison inequality, which will assert that certain ratios between volumes of balls are monotone in the radius r_0. \diamond

Exercise 9. Show that the volume of B(p,r) is (c_d - \frac{R(p)}{6(d+2)} r^2 + O(r^4)) r^d for sufficiently small r, where R(p) is the scalar curvature at p. Thus we see that scalar curvature distorts the infinitesimal volume growth of balls. Develop a similar interpretation of the Ricci curvature \hbox{Ric}(p)(v,v) as the volume distortion of infinitesimal sectors with apex p and direction v. \diamond

If r_0 is less than the injectivity radius, we see from (5) that B(p,r_0) has the same volume as B_{T_p M}(0,r_0). From Exercise 8, we thus conclude that

\hbox{vol}(B(p,r_0)) \sim_{d} r_0^d (19)

whenever (9) holds, and r_0 is less than both O(1/\sqrt{K}) and the injectivity radius of p.

What happens if r_0 exceeds the injectivity radius? We still obtain the upper bound in (18), but can lose the lower bound, as can already be seen by considering the torus example (Example 3) with the injectivity radius r small. Thus we see that failure of injectivity can lead to collapse in the volume of balls.

A deep result of Cheeger shows that in fact injectivity failure always collapses the volume of balls (assuming bounded normalised curvature), or equivalently that non-collapsing of volume is equivalent to a lower bound on the injectivity radius:

Theorem 1. (Cheeger’s lemma) Suppose that |\hbox{Riem}|_g \leq C r_0^{-2} on B(p,r_0) and that \hbox{vol}(B(p,r_0)) \geq \delta r_0^d for some \delta > 0. Then the injectivity radius of p is at least c(C,\delta,d) r_0 for some c(C,\delta,d) > 0 depending only on C,\delta,d.

Remark 5. This lemma is closely related to the Cheeger finiteness theorem, which asserts that the number of possible topologies for the ball B(p,r_0) under the assumptions of Theorem 1 is finite, as well as Gromov’s compactness theorem, which essentially asserts that the metrics on these balls form a compact set in a certain topology. \diamond

I may discuss the proof of Cheeger’s lemma at a later point in this course, if time permits. Cheeger’s original proof relies on the following inequality which is also of interest:

Lemma 2. (Klingenberg’s inequality) Assume the conjugacy radius is at least r_0. Then exactly one of the following holds:

  1. The injectivity radius of p is at least r_0.
  2. There exists a non-trivial geodesic starting and ending at p of length less than 2r_0.

Proof. (Sketch) It is clear that 1. and 2. cannot both be true. Now suppose that the injectivity radius r is strictly less than r_0, then there exist two distinct geodesic rays \gamma_1, \gamma_2 from p to another point q, one of length r and the other of length at most r. On the other hand, by hypothesis the exponential map is an immersion on B(x,r_0). From the inverse function theorem (and Lemma 1) we can then perturb the rays \gamma_1, \gamma_2 from p to have lengths slightly less than r but still ending up at the same point (thus contradicting the definition of r), unless \gamma_1, \gamma_2 have length exactly r and have equal and opposite tangent vectors at q. But then we have formed a geodesic path from p to p of length 2r, and the claim follows. \Box

Exercise 10. Show that the injectivity radius of p is equal to the minimum of the conjugacy radius of p, and half the length of the shortest non-trivial geodesic path from p to itself (or +\infty if no such path exists). \diamond

Exercise 11. Let M be a compact manifold whose sectional curvatures are all bounded in magnitude by K. Show that if the injectivity radius r of M (defined as the infimum of the injectivity radii of every point p in M) is less than \pi/\sqrt{K}, then there exists a closed geodesic loop of length exactly 2r. \diamond

Let us informally say that a Riemannian manifold M is non-collapsed at scale r_0 at a point p if B(p,r_0) has volume \gtrsim_d r_0^d. The above discussion then says that, under the assumption of bounded normalised curvature at scale r_0 at p, that non-collapsing is equivalent to a lower bound of \gtrsim_d r_0 on the injectivity radius, which is in turn equivalent to a lower bound of \gtrsim_d r_0 on the length of any non-trivial geodesic paths from p to itself. Thus we see that the non-collapsing property is quite coercive; it implies some important control on the local geometry of the Riemannian manifold.

Example 4. The sphere (example 1) of dimension 2 and higher and hyperbolic space (example 2) are non-collapsed at every point and scale for which one has bounded normalised curvature. (For the sphere, volume collapses at scales bigger than the diameter of the sphere, but one no longer has bounded normalised curvature in this regime.) Similarly for Euclidean space, or for products of any of these three examples. On the other hand, the torus (example 2) (or the sphere of dimension 1) is collapsed at large scales even though one still retains normalised bounded curvature. Similarly for the cylinder S^1 \times {\Bbb R}. \diamond

Now we adapt this concept to Ricci flows. The following definition is fundamental to Perelman’s arguments:

Definition 1. (\kappa-collapsing) Let t \mapsto (M,g(t)) be a d-dimensional Ricci flow, and let \kappa > 0. We say that the Ricci flow is \kappa-collapsed at a point (t_0,x_0) in spacetime at scale r_0 if the following statements hold:

  1. (Bounded normalised curvature) We have |\hbox{Riem}(t,x)|_g \leq r_0^{-2} for all (t,x) the spacetime cylinder {}[t_0 - r_0^2, t_0] \times B_{g(t_0)}(x_0, r_0) (in particular, we assume that the lifespan of the Ricci flow includes the time interval {}[t_0 - r_0^2, t_0]);
  2. (Collapsed volume) At time t_0, the ball B_{g(t_0)}(x_0,r_0) has volume at most \kappa r_0^d.

Otherwise, we say that the Ricci flow is \kappa-noncollapsed at this point and scale.

(One should view \kappa here as a small dimensionless quantity, in order to make the notion of \kappa-noncollapsing scale-invariant.)

It turns out that Perelman’s critical quantities are controlled enough, and coercive enough, to establish \kappa-noncollapsing at non-zero times assuming some noncollapsing at time zero. There are many ways to formulate this important non-collapsing result; here is one typical phrasing.

Theorem 2. (Perelman’s non-collapsing theorem, first version) Let t \mapsto (M,g(t)) be a Ricci flow on compact 3-manifolds on a time interval {}[0,T_0] such that at time zero, we have the normalised non-collapsing hypotheses |\hbox{Riem}(p)|_g \leq 1 and \hbox{Vol}(B(p,1)) \geq \omega for all p \in M, where \omega > 0 is fixed. Then the Ricci flow is \kappa-noncollapsed for all (t_0,x_0) \in [0,T_0] \times M and all scales 0 < r_0 < \sqrt{t_0}, where \kappa > 0 depends only on \omega and T_0.

Note that the conclusion here is scale-invariant and will therefore persist to asymptotic profiles (M^{(\infty)}, g^{(\infty)}) as discussed in the beginning of this lecture.

Remark 6. Actually, to establish the global existence results for Ricci flow with surgery, we will need to extend Definition 1 and Theorem 2 to Ricci flows with surgery; we shall return to this point later in this course. \diamond

Remark 7. This non-collapsing theorem in fact holds in all dimensions, not just 3, but of course many other aspects of our analysis will only work in three dimensions. \diamond

The next few lectures will be devoted to the proof of Theorem 2, and then we will discuss how Theorem 2 can be used to analyse asymptotic profiles near a Ricci flow singularity.

[Update, Apr 21: Some corrections.]