Having completed a heuristic derivation of the monotonicity of Perelman reduced volume (Conjecture 1 from the previous lecture), we now turn to a rigorous proof. Whereas in the previous lecture we derived this monotonicity by converting a parabolic spacetime to a high-dimensional Riemannian manifold, and then formally applying tools such as the Bishop-Gromov inequality to that setting, our approach here shall take the opposite tack, finding parabolic analogues of the *proof* of the elliptic Bishop-Gromov inequality, in particular obtaining analogues of the classical first and second variation formulae for geodesics, in which the notion of length is replaced by the notion of -length introduced in the previous lecture.

The material here is primarily based on Perelman’s first paper and Müller’s book, but detailed treatments also appear in the paper of Ye, the notes of Kleiner-Lott, the book of Morgan-Tian, and the paper of Cao-Zhu.

– Reduction to a pointwise inequality –

Recall that the Bishop-Gromov inequality (Corollary 1 from the previous lecture) states (among other things) that if a d-dimensional complete Riemannian manifold (M,g) is Ricci-flat (or more generally, has non-negative Ricci curvature), and is any point in M, then the Bishop-Gromov reduced volume is a non-increasing function of r. In fact one can obtain the slightly sharper result that is a non-increasing function of r, where is the sphere of radius r centred at .

From the basic formula (equation (1) from the previous lecture) and the Gauss lemma, one readily obtains the identity

(1)

where is the area element. The monotonicity of then follows (formally, at least) from the pointwise inequality

(2)

which we will derive shortly (at least for the portion of the manifold inside the cut locus) as a consequence of the first and second variation formulae for geodesics. (In the previous lecture, the inequality (2) was derived from a transport inequality for , but we will take a slightly different tack here.) Observe that (2) is an equality when (M,g) is a Euclidean space .

It turns out that the monotonicity of Perelman reduced volume for Ricci flows can similarly be reduced to a pointwise inequality, in which the Laplacian is replaced by a heat operator, and the radial variable r is replaced by the Perelman reduced length. More precisely, given an ancient Ricci flow for , a time , and two points , recall that the reduced length is defined as

(3)

where the *-length* of a curve from to is defined as

, (4)

where we adopt the shorthand , and that Conjecture 1 from the previous lecture asserts that the Perelman reduced volume

(5)

is non-increasing in for Ricci flows. If we differentiate (5) in , using the variation formula , we easily verify that the monotonicity of (5) will follow (assuming is sufficiently smooth, and that either M is compact, or grows sufficiently quickly at infinity) from the pointwise inequality

(6)

which should be viewed as a parabolic analogue to (2).

**Exercise 1. **Verify that (6) is an equality in the case of the (trivial) Ricci flow on Euclidean space, using Example 1 from the previous lecture. (This is of course consistent with Example 2 from that lecture.)

**Exercise 2.** Show that (6) is equivalent to the assertion that the function is a subsolution of the adjoint heat equation, or more precisely that . Note that this fact implies the monotonicity of Perelman reduced volume (cf. Exercise 2 from Lecture 8). [It seems that the elliptic analogue of this fact is the assertion that the Newton-type potential is subharmonic away from the origin for Ricci flat manifolds of dimension three or larger , which is a claim which is easily seen to be equivalent to (2) thanks to the Gauss lemma.]

So to prove monotonicity of the Perelman reduced volume, the main task will be to establish the pointwise inequality (6). (There are some additional technical issues, mainly concerning the parabolic counterpart of the cut locus, which we will also have to address, but we will work formally for now, and deal with these analytical matters later.)

We will perform a minor simplification: by using the rescaling symmetry (and noting the unsurprising fact that (6) is dimensionally consistent) we can normalise .

– First and second variation formulae for -geodesics –

To establish (6), we of course need some *variation formulae* that compute the first and second derivatives of the reduced length function . To motivate these formulae, let us first recall the more classical variation formulae that give the first and second derivatives of the metric function on a Riemannian manifold (M,g), which in particular can be used to derive (2) when the Ricci curvature is non-negative.

We recall that the distance can be defined by the energy-minimisation formula

(7)

where ranges over all curves from to x, where the Dirichlet energy of the curve is given by the formula

(8)

where we write . It is known that this infimum is always attained by some geodesic ; we shall assume this implicitly in the computations which follow.

Now suppose that we deform such a curve with respect to a real parameter , thus is now a function on the two-dimensional parameter space . The first variation here can be computed as

(9)

where is the pullback of the Levi-Civita connection on M with respect to applied in the direction ; here we of course use that g is parallel with respect to this connection. The torsion-free nature of this connection gives us the identity

(10)

where is the infinitesimal variation, and is the pullback of the Levi-Civita connection applied in the direction (cf. Exercise 5 from Lecture 6). An integration by parts (again using the parallel nature of g) then gives the first variation formula

. (11)

If we fix the endpoints of to be and , then the first term on the right-hand side of (11) vanishes. If we consider arbitrary infinitesimal variations of with fixed endpoints, we thus conclude that in order to be a minimiser for (7), that must obey the *geodesic flow equation*

. (12)

One consequence of this is that the speed of such a minimiser must be constant, and from (7) we then conclude

. (13)

If we then vary a geodesic with the initial endpoint fixed at and the final endpoint variable, the variation formula (11) gives

(14)

which, if we insert this back into (7) and use (13), gives

(13)

which is a (one-sided) version of the Gauss lemma. If one is inside the cut locus, then the metric function is smooth, and one can then replace the inequality with an equality by considering variations both forwards and backwards in the s variable, recovering the full Gauss lemma. In particular, we conclude in this case that is a unit vector.

Now we consider the second variation of the energy, when is already a geodesic. For simplicity we assume that evolves geodesically in the s direction, thus

. (14)

[Actually, since is already a geodesic and thus is stationary with respect to perturbations that respect the endpoints, the values of away from endpoints - which represents a second-order perturbation respecting the endpoints - will have no ultimate effect on the second variation of . Nevertheless it is convenient to assume (14) to avoid a few routine additional calculations.]

Differentiating (9) once more we obtain

. (15)

Using (10), (14), and the definition of curvature, we have

(16)

and thus (by one further application of (10))

. (17)

Now let us fix the initial endpoint and let the other endpoint vary, thus equals 0 at time t=0 and equals at time t=1. From Cauchy-Schwarz we conclude

. (18)

Actually, we can attain equality here by choosing the vector field appropriately:

**Exercise 3.** If we set , where v is the parallel transport of x’(s) along X, or more precisely the vector field that solves the ODE

(19)

show that all the inequalities in (18) are obeyed with equality.

For such a vector field, we conclude that

. (20)

From this formula (and the first variation formula) we conclude that

. (21)

Now let vary over an orthonormal basis of the tangent space of x(0); by (19) we see that v determines an orthonormal frame for s=0 and . Summing (21) over this basis (and using the formula for the Laplacian in normal coordinates) we conclude that

. (22)

In particular, for manifolds of non-negative Ricci curvature we have

(23)

from which (2) easily follows from the Gauss lemma. (Observe that (23) is obeyed with equality in the Euclidean case.)

Now we develop analogous variational formulae for -length (and reduced length) on a Ricci flow. We shall work formally for now, assuming that all infima are actually attained and that all quantities are as smooth as necessary for the analysis that follows to work; we then discuss later how to justify all of these assumptions. As mentioned earlier, we normalise .

Let us take a path and vary it with respect to some additional parameter s as before. Differentiating (4), we obtain

(24)

where and . On the other hand, if we have a Ricci flow , we see that

; (25)

placing this into (24) and using the fundamental theorem of calculus, we can express the right-hand side of (24) as

(26)

where G(X) is the vector field

. (27)

Here is the vector field , or equivalently it is the vector field Z such that for all vector fields W.

Note that G does not depend on Y. From this we see that in order for to be a minimiser of with the endpoints fixed, we must have G(X)=0, which is the parabolic analogue of the geodesic flow equation (14).

**Example 1. **In the case of the trivial Euclidean flow, the minimal -path from to takes the form where , in which case . It is not hard to verify that G=0 in this case. .

Arguing as in the elliptic case, we conclude (assuming the existence of a unique minimiser, and the local smoothness of reduced length) the first variation formula

(28)

or equivalently

. (29)

**Example 2. **Continuing Example 1, note that and , which is of course consistent with (28).

Having computed the spatial derivative of the reduced length, we turn to the time derivative. The simplest way to compute this is to observe that any partial segment of an -minimising path must again be a -minimising path. From (4) and the fundamental theorem of calculus we have

(30)

where we vary in by truncation; by (3) and the above discussion we conclude

(31)

where varies along (in particular, ). Applying the product and chain rules, we can expand the left-hand side of (31) as

; (32)

using (29), we conclude that

. (33)

Now we turn to the second spatial variation of the reduced length. Let be a -minimiser, so that G=0. Differentiating (24) again, we obtain

. (34)

As in the elliptic case, it is convenient to assume that we have a geodesic variation (14). In that case, we again have (16), and we also have . Using (10), we thus express (34) as

. (35)

As before, we optimise this in Y. Because the metric g now changes in time by Ricci flow, one has to modify the prescription in Exercise 3 slightly. More precisely, we now set , where v solves the following variant of (19),

. (36)

The point of doing this is that the ODE is orthogonal; the length of v is preserved along X, as is the inner product between any two such v’s (cf. equation (15) from Lecture 3). A brief computation then shows that

(37)

and hence

. (38)

Putting all of this into (35), we now see that the second variation (34) is equal to

. (39)

We now let range over an orthonormal basis of , which leads to v being an orthonormal frame at every point (0,t). Summing over (39) and also using (3), we conclude that

. (40)

Now we simplify the right-hand side of (40). The second term is of course elementary:

(41)

and this is consistent with the Euclidean case (in which is exactly when , and all curvature terms vanish). To simplify the remaining terms, we recall the variation formula

(42)

for the scalar curvature (equation (31) of Lecture 1); by the chain rule, we thu have the total derivative formula

. (43)

Inserting (41), (43) into (40) and integrating by parts, we express the right-hand side of (40) as

. (44)

To simplify this further, recall that the quantity G defined in (27) vanishes. This (and the fact that g evolves by Ricci flow ) allows one to compute the variation of :

. (45)

Inserting this into (44) and integrating by parts, one can rewrite (44) as

(46)

and so by (3) we obtain the inequality

. (47)

Combining (29), (33), and (47) we obtain (6) as desired.

– Analytical issues –

We now discuss in broad terms the analytical issues that one must address in order to make the above arguments rigorous. We first review the classical elliptic theory (i.e. the theory of geodesics in a Riemannian manifold) before turning to Perelman’s parabolic theory of -geodesics in a flow of Riemannian metrics.

In a complete Riemannian manifold, a geodesic from a fixed point to some other point has a well-defined initial velocity vector , and conversely each initial velocity vector determines a unique geodesic with an endpoint , thus defining the exponential map based at . One can show (from standard ODE theory) that this exponential map is smooth (with the derivative of this map controlled by Jacobi fields). Also, if M is connected, then any two points can be joined by a geodesic, and the exponential map is onto. However, there can be vectors v for which this map degenerates (i.e. its derivative ceases to be invertible) – these correspond to the conjugate points of in M.

Define the *injectivity region* of to be the set of all for which there is a unique minimising geodesic from to , and that the exponential map is not degenerate along this geodesic (in particular, and are not conjugate points). An analysis of Jacobi fields reveals that the injectivity region is open, that the distance function is smooth in this region (except at the origin), and that all the computations given above for the distance function can be justified. So it remains to understand what happens on the complement of the injectivity region, known as the cut locus. Points on the cut locus are either conjugate points to , or are else places where minimising geodesics are not unique, which (by a variant of the Gauss lemma) forces the distance function to be non-differentiable at these points. The former type of points form a set of measure zero, thanks to Sard’s theorem, whereas the latter set of points also form a set of measure zero, thanks to Radamacher’s differentiation theorem and the Lipschitz nature of the distance function (i.e. the triangle inequality). Thus the injectivity region has full measure. While this does mean that pointwise inequalities such as (2) now hold almost everywhere, this is unfortunately not quite enough to ensure that (2) holds in the sense of distributions, which is what one really needs in order to fully justify results such as the Bishop-Gromov inequality. (Indeed, by considering simple examples such as the unit circle, we see that the distribution can in fact contain some negative singular measures, although one should note that this does not actually contradict (2) due to the favourable sign of these singular components.) Fortunately, one can address this technical issue by constructing barrier functions to the radius function r at every point , i.e. functions for each which upper bound r near (and match r exactly at , and which obeys the inequality (2) at up to a loss of . Such functions can be constructed at any , even those in the cut locus, by perturbing the origin by an epsilon, and then one can use these barrier functions to justify (2) in the sense of distributions. (I believe that these arguments to control the distance function outside of the injectivity region originate with a paper of Calabi.) From this one can rigorously justify the Bishop-Gromov inequality for all radii, even those exceeding the radius of injectivity.

Analogues of the above assertions hold for the monotonicity of Perelman reduced volume on flows on compact Ricci flows (and more generally for Ricci flows of complete manifolds of bounded curvature). For instance, one can show (using compactness arguments in various weighted Sobolev spaces) that, as long as the manifold M is connected, a minimiser to (3) always exists, and is attained by an -geodesic (defined as a curve for which the G quantity defined in (27) vanishes). [One can easily reduce to the connected case, since the reduced length is clearly infinite when and lie on distinct connected components.] Such geodesics turn out to have a well-defined “initial velocity” , as can be seen by working out the ODE for the quantity (it is also convenient to reparameterise in terms of the variable to remove any apparent singularity at ). This leads to an -exponential map for any fixed time , which is smooth. The derivative of this map is controlled by -Jacobi fields, which are close analogues of their elliptic counterparts, and which lead to the notion of a -conjugate point to at the fixed time . One can then define the injectivity domain and cut locus as before (again for a fixed time ), and show as before that the former region has full measure. This lets one rigorously derive (6) almost everywhere (especially after noting that any segment of a minimising -geodesic without conjugate points is again a minimising -geodesic without conjugate points, thus establishing that the injectivity region is in some sense “star-shaped”), but again one needs to justify (6) in the sense of distributions in order to derive the monotonicity of Perelman reduced volume. This can again be done by use of barrier functions, perturbing the base point both spatially and also backwards in time by an epsilon. The details of this become rather technical; see for instance the paper of Ye or the notes of Kleiner-Lott, for details.

Thus far we have only discussed how reduced length and reduced volume behave on smooth Ricci flows of compact manifolds. Of course, to fully establish the global existence of Ricci flow with surgery, one also needs to build an analogous theory for Ricci flows with surgery. Here there turns out to be significant new technical difficulties, basically because one has to restrict attention to paths which avoid all regions in which surgery is taking place. This creates some “holes” in the region of integration for the reduced volume, as in some cases the minimising path between two points in spacetime goes through a surgery region. Fortunately it turns out that (very roughly speaking) these holes only occur when the reduced length (or a somewhat technical modification thereof) is rather large, which means that the holes do not significantly impact lower bounds on this reduced volume, which is what is needed to establish -noncollapsing. If time permits, I will discuss this issue further in later lectures, once we have described surgery in more detail.

In order to control ancient -noncollapsing solutions, which are complete but not necessarily compact, one also needs to extend the above theory to complete non-compact manifolds. It turns out that this can be done as long as one has uniform bounds on curvature; a key task here is to establish that the reduced length behaves roughly like (which is basically what it is in the Euclidean case) as goes to infinity, which allows the integrand in the definition of reduced volume to have enough decay to justify all computations. The technical details here can be found in several places, including the paper of Ye, the notes of Kleiner-Lott, the book of Morgan-Tian, and the paper of Cao-Zhu.

**Remark 1.** A theory analogous to Perelman’s theory above was worked out earlier by Li and Yau, but with the Ricci flow replaced by a static manifold with a lower bound on Ricci curvature, and with a time-dependent potential attached to the Laplacian.

## 12 comments

Comments feed for this article

10 May, 2008 at 9:56 pm

Richard BorcherdsYou say the Bishop-Gromov reduced volume is a non-decreasing function of r. This should either be “non-increasing” or I’ve misunderstood your sign conventions.

11 May, 2008 at 6:53 am

Terence TaoOops! Thanks for the correction.

14 May, 2008 at 7:42 am

285G, Lecture 11: κ-noncollapsing via Perelman reduced volume « What’s new[...] volume, Ricci flow Having established the monotonicity of the Perelman reduced volume in the previous lecture (after first heuristically justifying this monotonicity in Lecture 9), we now show how this can be [...]

18 May, 2008 at 4:01 pm

This past two weeks… « It’s Equal, but It’s Different…[...] 285G, Lecture 10: Variation of L-geodesics, and monotonicity of Perelman reduced volume, 285G, Lecture 11: κ-noncollapsing via Perelman reduced volume, 285G, Lecture 12: High curvature regions of Ricci flow and κ-solutions [...]

19 May, 2008 at 10:40 am

285G, Lecture 13: Li-Yau-Hamilton Harnack inequalities and κ-solutions « What’s new[...] u by the epsilon-regularisation trick. (Observe the similarity here with the -geodesic theory from Lecture 10.) By choosing to be the constant-speed minimising geodesic from to , we thus conclude [...]

21 May, 2008 at 1:02 pm

285G, Lecture 14: Stationary points of Perelman entropy or reduced volume are gradient shrinking solitons « What’s new[...] gave a proof of (13) in Lecture 10 using the second variation [...]

27 May, 2008 at 6:07 pm

285G, Lecture 15: Geometric limits of Ricci flows, and asymptotic gradient shrinking solitons « What’s new[...] the other hand, from the proof of the monotonicity of reduced volume from Lecture 10 we have (formally, at [...]

30 May, 2008 at 10:16 pm

285G, Lecture 16: Classification of asymptotic gradient shrinking solitons « What’s new[...] the other hand, from the second variation formula for distance (or more precisely, equation (21) of Lecture 10) and the non-negative sectional curvature assumption we have . (Actually one has to justify this in [...]

17 July, 2008 at 12:46 pm

kyDear Professor Tao:

Some basic questions about the notions of geodesic and cut locus.

I’ve read that the notion of distance on a manifold is measured in the tangent plane. Hence, does d(x_0, x) measured via (7) imply that measured in the tangent plane?

Call a point x and its cut point Cut(x) on a manifold. Call exp^(-1)(x) a tangent point and exp^(-1)(Cut(x)) a tangent cut point. Is the distance from a point to a cut point same as the distance from the tangent point to the tangent cut point?

In addition to above, denote the midpoint between x and Cut(x) by y. Is the distance between x and y same as the distance between the tangent point and a tangent cut point divided by 2?

Is Exp map restricted to tangent cut points bijective??? It is clearly surjective but how can one determine if it is also injective? In other words, how would one know if four tangent cut points all go to four different cut points or possibly all to a same cut point??? This last one has been in particular mystery to me.

17 July, 2008 at 2:02 pm

kyDear Professor Tao:

More basic questions:

Take a sphere S^2, pick a North Pole and get its tangent plane. Again, I will call the North Pole in the tangent plane the tangent point. The cut locus in the manifold is the antipodal point (South Pole) and the set of midpoints is the equatorial sphere. Now where is the tangent cut point? It’s right “beneath” the tangent point, is it not???

Continuing the case of S^2 above, around the tangent point in the tangent plane, I see a circle of radius 1 which is a union of infinitely many endpoints of vectors emanating from the tangent point. Is this circle the tangent cut locus or the equatorial sphere when taken to the manifold through Exponential map???

Thank you.

17 July, 2008 at 9:10 pm

Terence TaoDear ky,

Given a fixed point on a manifold M, there are two metric structures in play: the metric structure on the tangent space given by the bilinear form (which, when viewed in an orthonormal frame, is isometric to Euclidean space); and the metric structure on the manifold itself, given by the metric d defined in (7). The exponential map preserves distances between pairs of points only along rays from the origin to the cut locus. Thus, when v, w lie along a ray from the origin (and lie inside the cut locus), but in other situations one does not expect this equality to hold (although comparison geometry can yield some weak substitutes for this equality).

In general, I do not think that one can determine when two points on the cut locus map to the same point under the exponential map without using some global information about the manifold; just knowing the metric inside the cut locus is not going to be enough (there may be multiple ways to glue the boundary of this locus to itself). On the other hand, the exponential map is always injective in the interior region bounded by the cut locus.

In the case of a unit sphere, the cut locus bounds a disk of radius in the tangent plane, and the exponential map maps the entire boundary of this disk (i.e. a circle of radius ) to the antipodal point on the sphere. So there is no unique “tangent cut point” in this case – the entire cut locus is being mapped to the antipodal point.

It may be best to work through a textbook in Riemannian geometry (especially one with exercises) in order to have a firm grasp of these concepts. (A good textbook will also have some useful diagrams which should be helpful for visualising what is going on; this is hard to convey in this blog medium.)

18 July, 2008 at 12:54 am

ky2168Dear Professor Tao:

I understood your explanations completely. Thank you so much for the detailed reply.