You are currently browsing Terence Tao's articles.

Having established the monotonicity of the Perelman reduced volume in the previous lecture (after first heuristically justifying this monotonicity in Lecture 9), we now show how this can be used to establish \kappa-noncollapsing of Ricci flows, thus giving a second proof of Theorem 2 from Lecture 7. Of course, we already proved (a stronger version) of this theorem already in Lecture 8, using the Perelman entropy, but this second proof is also important, because the reduced volume is a more localised quantity (due to the weight e^{-l_{(0,x_0)}} in its definition and so one can in fact establish local versions of the non-collapsing theorem which turn out to be important when we study ancient \kappa-noncollapsing solutions later in Perelman’s proof, because such solutions need not be compact and so cannot be controlled by global quantities (such as the Perelman entropy).

The route to \kappa-noncollapsing via reduced volume proceeds by the following scheme:

Non-collapsing at time t=0 (1)

\Downarrow

Large reduced volume at time t=0 (2)

\Downarrow

Large reduced volume at later times t (3)

\Downarrow

Non-collapsing at later times t (4)

The implication (2) \implies (3) is the monotonicity of Perelman reduced volume. In this lecture we discuss the other two implications (1) \implies (2), and (3) \implies (4)).

Our arguments here are based on Perelman’s first paper, Kleiner-Lott’s notes, and Morgan-Tian’s book, though the material in the Morgan-Tian book differs in some key respects from the other two texts. A closely related presentation of these topics also appears in the paper of Cao-Zhu.

Read the rest of this entry »

I’ve just uploaded to the arXiv my paper “A global compact attractor for high-dimensional defocusing non-linear Schrödinger equations with potential“, submitted to Dynamics of PDE. This paper continues some earlier work of myself in an attempt to understand the soliton resolution conjecture for various nonlinear dispersive equations, and in particular, nonlinear Schrödinger equations (NLS). This conjecture (which I also discussed in my third Simons lecture) asserts, roughly speaking, that any reasonable (e.g. bounded energy) solution to such equations eventually resolves into a superposition of a radiation component (which behaves like a solution to the linear Schrödinger equation) plus a finite number of “nonlinear bound states” or “solitons”. This conjecture is known in many perturbative cases (when the solution is close to a special solution, such as the vacuum state or a ground state) as well as in defocusing cases (in which no non-trivial bound states or solitons exist), but is still almost completely open in non-perturbative situations (in which the solution is large and not close to a special solution) which contain at least one bound state. In my earlier papers, I was able to show that for certain NLS models in sufficiently high dimension, one could at least say that such solutions resolved into a radiation term plus a finite number of “weakly bound” states whose evolution was essentially almost periodic (or almost periodic modulo translation symmetries). These bound states also enjoyed various additional decay and regularity properties. As a consequence of this, in five and higher dimensions (and for reasonable nonlinearities), and assuming spherical symmetry, I showed that there was a (local) compact attractor K_E for the flow: any solution with energy bounded by some given level E would eventually decouple into a radiation term, plus a state which converged to this compact attractor K_E. In that result, I did not rule out the possibility that this attractor depended on the energy E. Indeed, it is conceivable for many models that there exist nonlinear bound states of arbitrarily high energy, which would mean that K_E must increase in size as E increases to accommodate these states. (I discuss these results in a recent talk of mine.)

In my new paper, following a suggestion of Michael Weinstein, I consider the NLS equation

i u_t + \Delta u = |u|^{p-1} u + Vu

where u: {\Bbb R} \times {\Bbb R}^d \to {\Bbb C} is the solution, and V \in C^\infty_0({\Bbb R}^d) is a smooth compactly supported real potential. We make the standard assumption 1 + \frac{4}{d} < p < 1 + \frac{4}{d-2} (which is asserting that the nonlinearity is mass-supercritical and energy-subcritical). In the absence of this potential (i.e. when V=0), this is the defocusing nonlinear Schrödinger equation, which is known to have no bound states, and in fact it is known in this case that all finite energy solutions eventually scatter into a radiation state (which asymptotically resembles a solution to the linear Schrödinger equation). However, once one adds a potential (particularly one which is large and negative), both linear bound states (solutions to the linear eigenstate equation (-\Delta + V) Q = -E Q) and nonlinear bound states (solutions to the nonlinear eigenstate equation (-\Delta+V)Q = -EQ - |Q|^{p-1} Q) can appear. Thus in this case the soliton resolution conjecture predicts that solutions should resolve into a scattering state (that behaves as if the potential was not present), plus a finite number of (nonlinear) bound states. There is a fair amount of work towards this conjecture for this model in perturbative cases (when the energy is small), but the case of large energy solutions is still open.

In my new paper, I consider the large energy case, assuming spherical symmetry. For technical reasons, I also need to assume very high dimension d \geq 11. The main result is the existence of a global compact attractor K: every finite energy solution, no matter how large, eventually resolves into a scattering state and a state which converges to K. In particular, since K is bounded, all but a bounded amount of energy will be radiated off to infinity. Another corollary of this result is that the space of all nonlinear bound states for this model is compact. Intuitively, the point is that when the solution gets very large, the defocusing nonlinearity dominates any attractive aspects of the potential V, and so the solution will disperse in this case; thus one expects the only bound states to be bounded. The spherical symmetry assumption also restricts the bound states to lie near the origin, thus yielding the compactness. (It is also conceivable that the localised nature of V also restricts bound states to lie near the origin, even without the help of spherical symmetry, but I was not able to establish this rigorously.)

Read the rest of this entry »

Having completed a heuristic derivation of the monotonicity of Perelman reduced volume (Conjecture 1 from the previous lecture), we now turn to a rigorous proof. Whereas in the previous lecture we derived this monotonicity by converting a parabolic spacetime to a high-dimensional Riemannian manifold, and then formally applying tools such as the Bishop-Gromov inequality to that setting, our approach here shall take the opposite tack, finding parabolic analogues of the proof of the elliptic Bishop-Gromov inequality, in particular obtaining analogues of the classical first and second variation formulae for geodesics, in which the notion of length is replaced by the notion of {\mathcal L}-length introduced in the previous lecture.

The material here is primarily based on Perelman’s first paper and Müller’s book, but detailed treatments also appear in the paper of Ye, the notes of Kleiner-Lott, the book of Morgan-Tian, and the paper of Cao-Zhu.

Read the rest of this entry »

We now turn to Perelman’s second scale-invariant monotone quantity for Ricci flow, now known as the Perelman reduced volume. We saw in the previous lecture that the monotonicity for Perelman entropy was ultimately derived (after some twists and turns) from the monotonicity of a potential under gradient flow. In this lecture, we will show (at a heuristic level only) how the monotonicity of Perelman’s reduced volume can also be “derived”, in a formal sense, from another source of monotonicity, namely the relative Bishop-Gromov inequality in comparison geometry (which has already been alluded to in previous lectures). Interestingly, in order to obtain this connection, one must first reinterpret parabolic flows such as Ricci flow as the limit of a certain high-dimensional Riemannian manifold as the dimension becomes infinite; this is part of a more general philosophy that parabolic theory is in some sense an infinite-dimensional limit of elliptic theory. Our treatment here is a (liberally reinterpreted) version of Section 6 of Perelman’s paper.

In the next few lectures we shall give a rigorous proof of this monotonicity, without using the infinite-dimensional limit and instead using results related to the Li-Yau-Hamilton Harnack inequality. (There are several other approaches to understanding Perelman’s reduced volume, such as Lott’s formulation based on optimal transport, but we will restrict attention in this course to the methods that are in Perelman’s original paper.)

Read the rest of this entry »

Over a year ago, I had a brief post here pointing out Gene Weingarten’s article in the Washington Post entitled “Pearls before breakfast“, in which the Post asked the question of what would happen if a world-class musician (in this case, Joshua Bell), were to perform incognito and out of context, in a Washington subway during the morning rush hour? If you haven’t yet read the article describing the experiment and the outcome, I recommend it to you.

Anyway, a few weeks ago, this article was awarded the 2008 Pulitzer Prize for Feature Writing. Congratulations to Gene, Joshua, and the other Washington Post staff!

[Actually, this article was highly atypical for Gene; he usually sticks to writing a weekly low-brow humour column entitled "Below the Beltway". By a random coincidence, I, together with Curt McMullen, even have a very minor bit part in one of these columns (on page 2), thanks to a brief phone conversation we each had with Gene.]

It is well known that the heat equation

\dot f = \Delta f (1)

on a compact Riemannian manifold (M,g) (with metric g static, i.e. independent of time), where f: [0,T] \times M \to {\Bbb R} is a scalar field, can be interpreted as the gradient flow for the Dirichlet energy functional

\displaystyle E(f) := \frac{1}{2} \int_M |\nabla f|_g^2\ d\mu (2)

using the inner product \langle f_1, f_2 \rangle_\mu := \int_M f_1 f_2\ d\mu associated to the volume measure d\mu. Indeed, if we evolve f in time at some arbitrary rate \dot f, a simple application of integration by parts (equation (29) from Lecture 1) gives

\displaystyle \frac{d}{dt} E(f) = - \int_M (\Delta f) \dot f\ d\mu = \langle -\Delta f, \dot f \rangle_\mu (3)

from which we see that (1) is indeed the gradient flow for (3) with respect to the inner product. In particular, if f solves the heat equation (1), we see that the Dirichlet energy is decreasing in time:

\displaystyle \frac{d}{dt} E(f) = - \int_M |\Delta f|^2\ d\mu. (4)

Thus we see that by representing the PDE (1) as a gradient flow, we automatically gain a controlled quantity of the evolution, namely the energy functional that is generating the gradient flow. This representation also strongly suggests (though does not quite prove) that solutions of (1) should eventually converge to stationary points of the Dirichlet energy (2), which by (3) are just the harmonic functions (i.e. the functions f with \Delta f = 0).

As one very quick application of the gradient flow interpretation, we can assert that the only periodic (or “breather”) solutions to the heat equation (1) are the harmonic functions (which, in fact, must be constant if M is compact, thanks to the maximum principle). Indeed, if a solution f was periodic, then the monotone functional E must be constant, which by (4) implies that f is harmonic as claimed.

It would therefore be desirable to represent Ricci flow as a gradient flow also, in order to gain a new controlled quantity, and also to gain some hints as to what the asymptotic behaviour of Ricci flows should be. It turns out that one cannot quite do this directly (there is an obstruction caused by gradient steady solitons, of which we shall say more later); but Perelman nevertheless observed that one can interpret Ricci flow as gradient flow if one first quotients out the diffeomorphism invariance of the flow. In fact, there are infinitely many such gradient flow interpretations available. This fact already allows one to rule out “breather” solutions to Ricci flow, and also reveals some information about how Poincaré’s inequality deforms under this flow.

The energy functionals associated to the above interpretations are subcritical (in fact, they are much like R_{\min}) but they are not coercive; Poincaré’s inequality holds both in collapsed and non-collapsed geometries, and so these functionals are not excluding the former. However, Perelman discovered a perturbation of these functionals associated to a deeper inequality, the log-Sobolev inequality (first introduced by Gross in Euclidean space). This inequality is sensitive to volume collapsing at a given scale. Furthermore, by optimising over the scale parameter, the controlled quantity (now known as the Perelman entropy) becomes scale-invariant and prevents collapsing at any scale - precisely what is needed to carry out the first phase of the strategy outlined in the previous lecture to establish global existence of Ricci flow with surgery.

The material here is loosely based on Perelman’s paper, Kleiner-Lott’s notes, and Müller’s book.

Read the rest of this entry »

We now set aside our discussion of the finite time extinction results for Ricci flow with surgery (Theorem 4 from Lecture 2), and turn instead to the main portion of Perelman’s argument, which is to establish the global existence result for Ricci flow with surgery (Theorem 2 from Lecture 2), as well as the discreteness of the surgery times (Theorem 3 from Lecture 2).

As mentioned in Lecture 1, local existence of the Ricci flow is a fairly standard application of nonlinear parabolic theory, once one uses de Turck’s trick to transform Ricci flow into an explicitly parabolic equation. The trouble is, of course, that Ricci flow can and does develop singularities (indeed, we have just spent several lectures showing that singularities must inevitably develop when certain topological hypotheses (e.g. simple connectedness) or geometric hypotheses (e.g. positive scalar curvature) occur). In principle, one can use surgery to remove the most singular parts of the manifold at every singularity time and then restart the Ricci flow, but in order to do this one needs some rather precise control on the geometry and topology of these singular regions. (In particular, there are some hypothetical bad singularity scenarios which cannot be easily removed by surgery, due to topological obstructions; a major difficulty in the Perelman program is to show that such scenarios in fact cannot occur in a Ricci flow.)

In order to analyse these singularities, Hamilton and then Perelman employed the standard nonlinear PDE technique of “blowing up” the singularity using the scaling symmetry, and then exploiting as much “compactness” as is available in order to extract an “asymptotic profile” of that singularity from a sequence of such blowups, which had better properties than the original Ricci flow. [The PDE notion of a blowing up a solution around a singularity, by the way, is vaguely analogous to the algebraic geometry notion of blowing up a variety around a singularity, though the two notions are certainly not identical.] A sufficiently good classification of all the possible asymptotic profiles will, in principle, lead to enough structural properties on general singularities to Ricci flow that one can see how to perform surgery in a manner which controls both the geometry and the topology.

However, in order to carry out this program it is necessary to obtain geometric control on the Ricci flow which does not deteriorate when one blows up the solution; in the jargon of nonlinear PDE, we need to obtain bounds on some quantity which is both coercive (it bounds the geometry) and either critical (it is essentially invariant under rescaling) or subcritical (it becomes more powerful when one blows up the solution) with respect to the scaling symmetry. The discovery of controlled quantities for Ricci flow which were simultaneously coercive and critical was Perelman’s first major breakthrough in the subject (previously known controlled quantities were either supercritical or only partially coercive); it made it possible, at least in principle, to analyse general singularities of Ricci flow and thus to begin the surgery program discussed above. (In contrast, the main reason why questions such as Navier-Stokes global regularity are so difficult is that no controlled quantity which is both coercive and critical or subcritical is known.) The mere existence of such a quantity does not by any means establish global existence of Ricci flow with surgery immediately, but it does give one a non-trivial starting point from which one can hope to make progress.

Read the rest of this entry »

A few months ago, I announced that I was going to convert a significant fraction of my 2007 blog posts into a book format. For various reasons, this conversion took a little longer than I had anticipated, but I have finally completed a draft copy of this book, which I have uploaded here; note that this is a moderately large file (1.5MB 1.3MB 1.1MB), as the book is 374 pages 287 pages 270 pages long. There are still several formatting issues to resolve, but the content has all been converted.

It may be a while before I hear back from the editors at the American Mathematical Society as to the status of the book project, but in the meantime any comments on the book, ranging from typos to suggestions as to the format, are of course welcome.

[Update, April 21: New version uploaded, incorporating contributed corrections. The formatting has been changed for the internet version to significantly reduce the number of pages. As a consequence, note that the page numbering for the internet version of the book will differ substantially from that in the print version.]

[Update, April 21: As some readers may have noticed, I have placed paraphrased versions of some of the blog comments in the book, using the handles given in the blog comments to identify the authors. If any such commenters wish to change one's handle (e.g. to one's full name) or to otherwise modify or remove any comments I have placed in the book, you are welcome to contact me by email to do so.]

[Update, April 23: Another new version uploaded, incorporating contributed corrections and shrinking the page size a little further.]

[Update, May 8: A few additional corrections to the book.]

In this lecture we discuss Perelman’s original approach to finite time extinction of the third homotopy group (Theorem 1 from the previous lecture), which, as previously discussed, can be combined with the finite time extinction of the second homotopy group to imply finite time extinction of the entire Ricci flow with surgery for any compact simply connected Riemannian 3-manifold, i.e. Theorem 4 from Lecture 2.

Read the rest of this entry »

Van Vu and I have just uploaded to the arXiv our preprint “On the permanent of random Bernoulli matrices“, submitted to Adv. Math. This paper establishes analogues of some recent results on the determinant of random n \times n Bernoulli matrices (matrices in which all entries are either +1 or -1, with equal probability of each), in which the determinant is replaced by the permanent.

More precisely, let M be a random n \times n Bernoulli matrix, with n large. Since every row of this matrix has magnitude n^{1/2}, it is easy to see (by interpreting the determinant as the signed volume of a parallelopiped) that |\det(M)| is at most n^{n/2}, with equality being satisfied exactly when M is a Hadamard matrix. In fact, it is known that the determinant \det(M) has magnitude n^{(1/2 - o(1)) n} with probability 1-o(1); for a more precise result, see my earlier paper with Van. (There is in fact believed to be a central limit theorem for \log |\det(M)|; see this paper of Girko for details.) These results are based upon the elementary “base times height” formula for the volume of a parallelopiped; the main difficulty is to understand what the distance is from one row of M to a subspace spanned by several other rows of M.

The permanent \hbox{per}(M) looks formally very similar to the determinant, but does not have a geometric interpretation as a signed volume of a parallelopiped and so can only be analysed combinatorially; the main difficulty is to understand the cancellation that can arise from the various signs in the matrix. It can be somewhat larger than the determinant; for instance, the maximum value of \hbox{per}(M) for a Bernoulli matrix M is n! = n^{(1 - o(1)) n}, attaned when M consists entirely of +1’s. Nevertheless, it is not hard to see that \hbox{per}(M) has the same mean and standard deviation as \det(M), namely 0 and \sqrt{n!} respectively, which shows that |\hbox{per}(M)| is at most n^{(1/2-o(1))n} with probability 1-o(1). Our main result is to show that one also has that |\hbox{per}(M)| is at least n^{(1/2-o(1))n} with probability 1-o(1), thus obtaining the analogue of the previously mentioned result for the determinant (though our o(1) bounds are significantly weaker).

In particular, this shows that the probability that the permanent vanishes completely is o(1) (in fact, we get a bound of O(n^{-c}) for some absolute constant c > 0). This result appears to be new (although there is a cute observation of Alon (see e.g. this paper of Wanless for a proof) that if n=2^m-1 is one less than a power of 2, then every Bernoulli matrix has non-zero permanent). In contrast, the probability that the determinant vanishes completely is conjectured to equal (1/2 + o(1))^n (which is easily seen to be a lower bound), but the best known upper bound for this probability is (1/\sqrt{2 }+ o(1))^n, due to Bourgain, Vu, and Wood.

Read the rest of this entry »

In the previous lecture, we saw that Ricci flow with surgery ensures that the second homotopy group \pi_2(M) became extinct in finite time (assuming, as stated in the above erratum, that there is no embedded \Bbb{RP}^2 with trivial normal bundle). It turns out that the same assertion is true for the third homotopy group, at least in the simply connected case:

Theorem 1. (Finite time extinction of \pi_3(M)) Let t \mapsto (M(t),g(t)) be a Ricci flow with surgery on compact 3-manifolds with t \in [0,+\infty), with M(0) simply connected. Then for all sufficiently large t, \pi_3(M(t)) is trivial (or more precisely, every connected component of M(t) has trivial \pi_3).

[Aside: it seems to me that this theorem should also be true if one merely assumes that M(0) contains no embedded copy of \Bbb{RP}^2 with trivial bundle, as opposed to M(0) being simply connected, but I will be conservative and only state Theorem 1 with this stronger hypothesis, as this is all that is necessary for proving the Poincaré conjecture.]

Suppose we apply Ricci flow with surgery to a compact simply connected Riemannian 3-manifold (M,g) (which, by Lemma 1 from Lecture 2, has no embedded \Bbb {RP}^2 with trivial normal bundle). From the above theorem, as well as Theorem 1 from the previous lecture, we know that all components of M(t) eventually have trivial \pi_2 and \pi_3 for all sufficiently large t. Also, since M is initially simply connected, we see from Exercise 2 of Lecture 2, as well as Theorem 2.1 of Lecture 2, that all components of M(t) also have trivial \pi_1. The finite time extinction result (Theorem 4 from Lecture 2) then follows immediately from Theorem 1 and the following topological result, combined with the following topological observation:

Lemma 1. Let M be a compact non-empty connected 3-manifold. Then it is not possible for \pi_1(M), \pi_2(M), and \pi_3(M) to simultaneously be trivial.

This lemma follows immediately from the Hurewicz theorem, but for sake of self-containedness we give a proof of it here.

There are two known approaches to establishing Theorem 1; one due to Colding and Minicozzi, and one due to Perelman. The former is conceptually simpler, but requires a certain technical concentration-compactness type property for a min-max functional which has only been established recently. This approach will be the focus of this lecture, while the latter approach of Perelman, which has also been rigorously shown to imply finite time extinction, will be the focus of the next lecture.

Read the rest of this entry »

Returning (perhaps anticlimactically) to the subject of the Poincaré conjecture, recall from Lecture 2 that one of the key pillars of the proof of that conjecture is the finite time extinction result (see Theorem 4 from that lecture), which asserted that if a compact Riemannian 3-manifold (M,g) was initially simply connected, then after a finite amount of time evolving via Ricci flow with surgery, the manifold will be empty.

In this lecture and the next few, we will describe some of the key ideas used to prove this theorem. We will not be able to completely establish this theorem at present, because we do not have a full definition of “surgery”, but we will be able to establish some partial results, and indicate (in informal terms) how to cope with the additional technicalities caused by the surgery procedure. Hopefully, if time permits later in the class, once we have studied the surgery process, I will be able to revisit this material and flesh out these technicalities a bit more.

The proof of finite time extinction proceeds in several stages. The first stage, which was already accomplished in the previous lecture (in the absence of surgery, at least), is to establish lower bounds on the least scalar curvature R_{\min}. The next stage, which we discuss in this lecture, is to show that the second homotopy group \pi_2(M) of the manifold must become extinct in finite time, thus all immersed copies of the 2-sphere S^2 in M(t) for sufficiently large t must be contractible to a point. The third stage is to show that the third homotopy group \pi_3(M) also becomes extinct so that all immersed copies of the 3-sphere S^3 in M are similarly contractible. The final stage, which uses homology theory, is to show that a non-empty 3-manifold cannot have \pi_1(M), \pi_2(M), \pi_3(M) simultaneously trivial, thus yielding the desired claim (note that a simply connected manifold has trivial \pi_1(M) by definition; also, from Exercise 2 of Lecture 2 we see that all components of M remain simply connected even after surgery).

More precisely, in this lecture we will discuss (most of) the proof of

Theorem 1. (Finite time extinction of \pi_2(M)) Let t \mapsto (M(t),g(t)) be a Ricci flow with surgery on compact 3-manifolds with t \in [0,+\infty), with M(0) containing no embedded copy of \Bbb{RP}^2 with trivial normal bundle. Then for all sufficiently large t, \pi_2(M(t)) is trivial (or more precisely, every connected component of M(t) has trivial \pi_2).

The technical assumption about having no copy of \Bbb{RP}^2 with trivial normal bundle is needed solely in order to apply the known existence theory for Ricci flow with surgery (see Theorem 2 from Lecture 2).

The intuition for this result is as follows. From the Gauss-Bonnet theorem (and the fact that the Euler characteristic \chi(S^2)=V-E+F=2 of the sphere is positive), we know that 2-spheres tend to have positive (Gaussian) curvature on the average, which should make them shrink under Ricci flow. (Here I am conflating Gaussian curvature with Ricci curvature; however, by restricting to a special class of 2-spheres, namely minimal surfaces, one can connect the two notions of curvature to each other (and to scalar curvature) quite nicely.) On the other hand, the presence of negative scalar curvature can counteract this by expanding these spheres. But the lower bounds on scalar curvature tell us that the negativity of scalar curvature becomes weakened over time, and it turns out that the shrinkage caused by the Gauss-Bonnet theorem eventually dominates and sends the area of all minimal immersed 2-spheres into zero, at which point one can conclude the triviality of \pi_2(M) by the Sacks-Uhlenbeck theory of minimal 2-spheres.

The arguments here are drawn from the book of Morgan-Tian and from the paper of Colding-Minicozzi. The idea of using minimal surfaces to force disappearance of various topological structures under Ricci flow originates with Hamilton (who used 2-torii instead of 2-spheres, but the idea is broadly the same).

Read the rest of this entry »

I usually try to keep political issues out of this blog, and I certainly try to avoid asking friends and readers of this blog for favours, but there is an urgent situation developing in mathematics (and related disciplines) in my home country of Australia, and I need to ask all of you for assistance to prevent an impending disaster.

When I was an undergraduate at Flinders University in South Australia from 1989 to 1992, the level of mathematics education in Australia was comparable to that of world-class institutions overseas. Even in a small and little-known university such as Flinders, I received a first-rate honours undergraduate education in mathematics, computer science, and physics which I continue to use daily in my career. (Examples of topics I learned as an undergraduate include wavelets; information theory; Lie algebras; differential geometry; nonlinear PDE; quantum mechanics; statistical mechanics; and harmonic analysis. I rely on my knowledge of all of these topics today, for instance many of them are are helpful for me in teaching my current class on the Poincaré conjecture.) In addition, several of the faculty (including the chair and my undergraduate advisor, Garth Gaudry) had the time to spare an hour a week with me to discuss mathematics, as they were not overloaded with large teaching loads and other duties. I honestly think that I would not be where I am today without the high-quality undergraduate education that I received (in particular, I would definitely have floundered in graduate school at Princeton, if I were admitted at all).

The situation for mathematics education in Australia began however to deteriorate in later years, due to a combination of factors including government neglect (the federal government is the most significant source of funding for most universities in Australia) and the low priority of basic education in mathematics and sciences among university administrators. In particular, at Flinders University, the School of Mathematics suffered severe attrition due to lack of support and was eventually folded into the School of Informatics and Engineering. In fact the number of mathematicians on the faculty at Flinders has dwindled down to just three (in my day it was close to 20).

The decline of mathematics departments across the country, particularly in a time in which mathematics skills are desperately needed in the workforce, has been documented thoroughly in the 2006 national strategic review of the mathematical sciences. In response to that report, the federal government in 2007 announced an increase in the funding allocation to universities based on their student enrollments in key majors including mathematics and the sciences. The newly elected federal government is also likely to continue and extend this support in its upcoming budget in May of this year.

Unfortunately, it appears that at many universities, the additional funding was diverted away from the schools that it was intended to support, for the administrator’s own priorities. (See also the letter by the international authors of the above mentioned review, Jean-Pierre Bourguignon, Brenda Dietrich, and Iain Johnstone, condemning this diversion.) As a consequence, many mathematics departments are in fact in worse shape than before.

There is a particular crisis unfolding at the University of Southern Queensland. On March 17, the university announced a rationalisation and restructuring proposal that would cut the number of mathematics faculty from 14 to 6, eliminate the majors in mathematics, chemistry, physics, and statistics, and phase out all non-service courses (for instance, any of the types of courses I mentioned above at Flinders would be lost). Similar cuts were also proposed in statistics, computer science, and physics, although other schools retained their funding and some even obtained increases. This is despite the increases in funding from the federal government for mathematics and statistics students (enrollments in these areas at USQ has held steady so far, though of course with the proposed cuts this is unlikely to last). Already as a consequence of these proposals, initiatives of the department such as an education program for high school mathematics teachers have had to be scrapped. Somewhat ironically, the Dean of Sciences at USQ, Janet Verbyla, who has been heavily involved in proposing the cuts, had also presided over similar reductions in the school of mathematics at Flinders.

If the proposed cuts at USQ go ahead, it is likely that other small universities in Australia will be tempted to similarly ignore concerns about mathematics and science education and perform similar cuts, even while receiving government support for these disciplines. (The University of New England, which currently shares some statistics courses at USQ, would for instance be particularly vulnerable.) So the crisis here is not purely localised to USQ, but could be very damaging for mathematics and sciences in Australia as a whole.

The consultation period for these cuts ends very soon, on April 14, and the vice-chancellor of USQ, Bill Lovegrove, plans to announce the specific cuts on April 18 at an unspecified future date. While there has been some media attention in Australia given to this issue, it has not yet had much effect in reversing the decisions of these administrators. Because of this, I am reluctantly turning to my friends and readers of this blog to ask for your urgent assistance in saving the school of mathematics and computing at USQ. In collaboration with several good friends and colleagues in Australia, I have begun a web page on this blog,

http://terrytao.wordpress.com/support-usq-maths/

that is documenting the situation and outlining ways to help, including an online petition

http://terrytao.wordpress.com/about/petition-to-support-maths-statistics-and-computing-at-usq/

that you can sign to show support, and people to contact in the university administration and in the Australian government to express your concerns, or to express support for mathematics and its role in the sciences. Please also inform others, especially those in Australia and who may have influence in media, political, or administrative circles, of the current crisis. There is still time, especially in view of the expected increase in support for mathematics and sciences in the upcoming federal budget, to reverse the situation before the damage becomes permanent, and to show that the political support for mathematics education is not so negligible as to be easily ignored.

Thank you all in advance for any help you can give - and I promise that I will keep the remainder of my blog on topic and focus primarily on mathematics. :-)

[Update, April 9: See my editorial at the Funneled Web, "Mathematics in Today's world", for a more detailed discussion of the USQ crisis, and also the broader context of the importance of higher mathematics education, and the pivotal role universities have to play in providing it.]

[Update, April 12: The Toowoomba Chronicle has a two-page article by Merryl Miller focusing on the crisis, and in particular focusing on its impact on a 10-year old child prodigy, Adam Walsh, currently taking maths classes at USQ. (Reprinted with permission.)]

[Update, April 14: The petition has been formally sent to the USQ administration. Apparently, the previously planned announcement of the cuts on April
18 has been delayed to some unspecified later date, but no further details are currently available.]

[Update, April 17: In response to the concerns of constituents, Hon. Mike Horan MP, the state member for Toowoomba South, spoke in the Queensland parliament urging the University of Southern Queensland to reconsider its cutbacks to mathematics and statistics. (The full and official transcript of the day's session in Parliament can be found here; the speech above is on page 1198.)]

[Update, April 29: The USQ administration released a revised draft proposal on April 22, but the details are largely unchanged (e.g. 11 staff cuts to the department of mathematics and computing instead of 12, and a "review" of the maths major and its courses rather than automatic elimination). The revised plan has already attracted criticism from the National Tertiary Education Union, and we are continuing to organise further opposition to the proposals. (For instance, László Lovász, President of the International Mathematical Union, wrote a letter of support on April 25.]

[Update, May 1.  A second revised draft proposal has been released, which uses some new (but possibly non-permanent) sources of funding to add some specialised positions to partially offset the cuts (e.g. there will be 2-3 such positions in mathematics and statistics, although the 11 staff cuts are still in effect).  The USQ administration has apparently also agreed to recheck the student load and financial data that is being used to underlie these proposals, as there appears to be some irregularities with this data in previous rationales.]

We now begin the study of (smooth) solutions t \mapsto (M(t),g(t)) to the Ricci flow equation

\frac{d}{dt} g_{\alpha \beta} = - 2 \hbox{Ric}_{\alpha \beta}, (1)

particularly for compact manifolds in three dimensions. Our first basic tool will be the maximum principle for parabolic equations, which we will use to bound (sub-)solutions to nonlinear parabolic PDE by (super-)solutions, and vice versa. Because the various curvatures \hbox{Riem}_{\alpha \beta \gamma}^\delta, \hbox{Ric}_{\alpha \beta}, R of a manifold undergoing Ricci flow do indeed obey nonlinear parabolic PDE (see equations (31) from Lecture 1), we will be able to obtain some important lower bounds on curvature, and in particular establishes that the curvature is either bounded, or else that the positive components of the curvature dominate the negative components. This latter phenomenon, known as the Hamilton-Ivey pinching phenomenon, is particularly important when studying singularities of Ricci flow, as it means that the geometry of such singularities is almost completely dominated by regions of non-negative (and often quite high) curvature.

Read the rest of this entry »

In order to motivate the lengthy and detailed analysis of Ricci flow that will occupy the rest of this course, I will spend this lecture giving a high-level overview of Perelman’s Ricci flow-based proof of the Poincaré conjecture, and in particular how that conjecture is reduced to verifying a number of (highly non-trivial) facts about Ricci flow.

At the risk of belaboring the obvious, here is the statement of that conjecture:

Theorem 1. (Poincaré conjecture) Let M be a compact 3-manifold which is simply connected (i.e. it is connected, and every loop is contractible to a point). Then M is homeomorphic to a 3-sphere S^3.

[Unless otherwise stated, all manifolds are assumed to be without boundary.]

I will take it for granted that this result is of interest, but you can read the Notices article of Milnor, the Bulletin article of Morgan, or the Clay Mathematical Institute description of the problem (also by Milnor) for background and motivation for this conjecture. Perelman’s methods also extend to establish further generalisations of the Poincaré conjecture, most notably Thurston’s geometrisation conjecture, but I will focus this course just on the Poincaré conjecture. (On the other hand, the geometrisation conjecture will be rather visibly lurking beneath the surface in the discussion of this lecture.)

Read the rest of this entry »

In the first lecture, we introduce flows t \mapsto (M(t), g(t)) on Riemannian manifolds (M,g), which are recipes for describing smooth deformations of such manifolds over time, and derive the basic first variation formulae for how various structures on such manifolds (e.g. curvature, length, volume) change by such flows. (One can view these formulae as describing the relationship between two “infinitesimally close” Riemannian manifolds.) We then specialise to the case of Ricci flow (together with some close relatives of this flow, such as renormalised Ricci flow, or Ricci flow composed with a diffeomorphism flow). We also discuss the “de Turck trick” that modifies the Ricci flow into a nonlinear parabolic equation, for the purposes of establishing local existence and uniqueness of that flow.

Read the rest of this entry »

Next week (starting on Wednesday, to be more precise), I will begin my class on Perelman’s proof of the Poincaré conjecture. As I only have ten weeks in which to give this proof, I will have to move rapidly through some of the more basic aspects of Riemannian geometry which will be needed throughout the course. In particular, in this preliminary lecture, I will quickly review the basic notions of infinitesimal (or microlocal) Riemannian geometry, and in particular defining the Riemann, Ricci, and scalar curvatures of a Riemannian manifold. (The more “global” aspects of Riemannian geometry, for instance concerning the relationship between distance, curvature, injectivity radius, and volume, will be discussed later in this course.) This is a review only, in particular omitting any leisurely discussion of examples or motivation for Riemannian geometry; it is impossible to compress this subject into a single lecture, and I will have to refer you to a textbook on the subject for a more complete treatment (I myself am using the text “Riemannian geometry” by my colleague here at UCLA, Peter Petersen).

Read the rest of this entry »

One of my favourite unsolved problems in mathematics is the Kakeya conjecture in geometric measure theory. This conjecture is descended from the

Kakeya needle problem. (1917) What is the least area in the plane required to continuously rotate a needle of unit length and zero thickness around completely (i.e. by 360^\circ)?

For instance, one can rotate a unit needle inside a unit disk, which has area \pi/4. By using a deltoid one requires only \pi/8 area.

In 1928, Besicovitch showed that that in fact one could rotate a unit needle using an arbitrarily small amount of positive area. This unintuitive fact was a corollary of two observations. The first, which is easy, is that one can translate a needle using arbitrarily small area, by sliding the needle along the direction it points in for a long distance (which costs zero area), turning it slightly (costing a small amount of area), sliding back, and then undoing the turn. The second fact, which is less obvious, can be phrased as follows. Define a Kakeya set in {\Bbb R}^2 to be any set which contains a unit line segment in each direction. (See this Java applet of mine, or the wikipedia page, for some pictures of such sets.)

Theorem. (Besicovitch, 1919) There exists Kakeya sets {\Bbb R}^2 of arbitrarily small area (or more precisely, Lebesgue measure).

In fact, one can construct such sets with zero Lebesgue measure. On the other hand, it was shown by Davies that even though these sets had zero area, they were still necessarily two-dimensional (in the sense of either Hausdorff dimension or Minkowski dimension). This led to an analogous conjecture in higher dimensions:

Kakeya conjecture. A Besicovitch set in {\Bbb R}^n (i.e. a subset of {\Bbb R}^n that contains a unit line segment in every direction) has Minkowski and Hausdorff dimension equal to n.

This conjecture remains open in dimensions three and higher (and gets more difficult as the dimension increases), although many partial results are known. For instance, when n=3, it is known that Besicovitch sets have Hausdorff dimension at least 5/2 and (upper) Minkowski dimension at least 5/2 + 10^{-10}. See my Notices article for a general survey of this problem (and its connections with Fourier analysis, additive combinatorics, and PDE), my paper with Katz for a more technical survey, and Wolff’s survey for a systematic treatment of the field (up to about 1998 or so).

In 1999, Wolff proposed a simpler finite field analogue of the Kakeya conjecture as a model problem that avoided all the technical issues involving Minkowski and Hausdorff dimension. If F^n is a vector space over a finite field F, define a Kakeya set to be a subset of F^n which contains a line in every direction.

Finite field Kakeya conjecture. Let E \subset F^n be a Kakeya set. Then E has cardinality at least c_n |F|^n, where c_n > 0 depends only on n.

This conjecture has had a significant influence in the subject, in particular inspiring work on the sum-product phenomenon in finite fields, which has since proven to have many applications in number theory and computer science. Modulo minor technicalities, the progress on the finite field Kakeya conjecture was, until very recently, essentially the same as that of the original “Euclidean” Kakeya conjecture.

Last week, the finite field Kakeya conjecture was proven using a beautifully simple argument by Zeev Dvir, using the polynomial method in algebraic extremal combinatorics. The proof is so short that I can present it in full here.

Read the rest of this entry »

Over two years ago, Emmanuel Candés and I submitted the paper “The Dantzig selector: Statistical estimation when p is much
larger than n
” to the Annals of Statistics. This paper, which appeared last year, proposed a new type of selector (which we called the Dantzig selector, due to its reliance on the linear programming methods to which George Dantzig, who had died as we were finishing our paper, had contributed so much to) for statistical estimation, in the case when the number p of unknown parameters is much larger than the number n of observations. More precisely, we considered the problem of obtaining a reasonable estimate \beta^* for an unknown vector \beta \in {\Bbb R}^p of parameters given a vector y = X \beta + z \in {\Bbb R}^n of measurements, where X is a known n \times p predictor matrix and z is a (Gaussian) noise error with some variance \sigma^2. We assumed that the predictor matrix X obeyed the restricted isometry property (RIP, also known as UUP), which roughly speaking asserts that X\beta has norm comparable to \beta whenever the vector \beta is sparse. This RIP property is known to hold for various ensembles of random matrices of interest; see my earlier blog post on this topic.

Our selection algorithm, inspired by our previous work on compressed sensing, chooses the estimated parameters \beta^* to have minimal l^1 norm amongst all vectors which are consistent with the data in the sense that the residual vector r := y - X \beta^* obeys the condition

\| X^* r \|_\infty \leq \lambda, where \lambda := C \sqrt{\log p} \sigma (1)

(one can check that such a condition is obeyed with high probability in the case that \beta^* = \beta, thus the true vector of parameters is feasible for this selection algorithm). This selector is similar, though not identical, to the more well-studied lasso selector in the literature, which minimises the l^1 norm of \beta^* penalised by the l^2 norm of the residual.

A simple model case arises when n=p and X is the identity matrix, thus the observations are given by a simple additive noise model y_i = \beta_i + z_i. In this case, the Dantzig selector \beta^* is given by the hard soft thresholding formula

\beta^*_i = \max(|y_i| - \lambda, 0 )  \hbox{sgn}(y_i).

The mean square error {\Bbb E}( \| \beta - \beta^* \|^2 ) for this selector can be computed to be roughly

\lambda^2 + \sum_{i=1}^n  \min( |y_i|^2, \lambda^2) (2)

and one can show that this is basically best possible (except for constants and logarithmic factors) amongst all selectors in this model. More generally, the main result of our paper was that under the assumption that the predictor matrix obeys the RIP, the mean square error of the Dantzig selector is essentially equal to (2) and thus close to best possible.

After accepting our paper, the Annals of Statistics took the (somewhat uncommon) step of soliciting responses to the paper from various experts in the field, and then soliciting a rejoinder to these responses from Emmanuel and I. Recently, the Annals posted these responses and rejoinder on the arXiv:

Read the rest of this entry »

This week I am at Rutgers University, giving the Lewis Memorial Lectures for this year, which are also concurrently part of a workshop in random matrices. I gave four lectures, three of which were on random matrices, and one of which was on the Szemerédi regularity lemma.

The titles, abstracts, and slides of these talks are as follows.

  1. Szemerédi’s lemma revisited. In this general-audience talk, I discuss the Szemerédi regularity lemma (which, roughly speaking, shows that an arbitrary large dense graph can always be viewed as the disjoint union of a bounded number of pseudorandom components), and how it has recently been reinterpreted in a more analytical (and infinitary) language using the theory of graph limits or of exchangeable measures. I also discuss arithmetic analogues of this lemma, including one which (implicitly) underlies my result with Ben Green that the primes contain arbitrarily long arithmetic progressions.
  2. Singularity and determinant of random matrices. Here, I present recent progress in understanding the question of how likely a random matrix (e.g. one whose entries are all +1 or -1 with equal probability) is to be invertible, as well as the related question of how large the determinant should be. The case of continuous matrix ensembles (such as the Gaussian ensemble) is well understood, but the discrete case contains some combinatorial difficulties and took longer to understand properly. In particular I present the results of Kahn-Komlós-Szemerédi and later authors showing that discrete random matrices are invertible with exponentially high probability, and also give some results for the distribution of the determinant.
  3. The least singular value of random matrices. A more quantitative version of the question “when is a matrix invertible?” is “what is the least singular value of that matrix”? I present here the recent results of Litvak-Pajor-Rudelson-Tomczak-Jaegermann, Rudelson, myself and Vu, and Rudelson-Vershynin on addressing this question in the discrete case. A central role is played by the inverse Littlewood-Offord theorems of additive combinatorics, which give reasonably sharp necessary conditions for a discrete random walk to concentrate in a small ball.
  4. The circular law. One interesting application of the above theory is to extend the circular law for the spectrum of random matrices from the continuous case to the discrete case. Previous arguments of Girko and Bai for the continuous case can be transplanted to the discrete case, but the key new ingredient needed is a least singular value bound for shifted matrices M-\lambda I in order to avoid the spectrum being overwhelmed by pseudospectrum. It turns out that the results of the preceding lecture are almost precisely what are needed to accomplish this.

[Update, Mar 31: first lecture slides corrected.  Thanks to Yoshiyasu Ishigami for pointing out a slight inaccuracy in the text.]

In this final lecture, we establish a Ratner-type theorem for actions of the special linear group SL_2({\Bbb R}) on homogeneous spaces. More precisely, we show:

Theorem 1. Let G be a Lie group, let \Gamma < G be a discrete subgroup, and let H \leq G be a subgroup isomorphic to SL_2({\Bbb R}). Let \mu be an H-invariant probability measure on G/\Gamma which is ergodic with respect to H (i.e. all H-invariant sets either have full measure or zero measure). Then \mu is homogeneous in the sense that there exists a closed connected subgroup H \leq L \leq G and a closed orbit Lx \subset G/\Gamma such that \mu is L-invariant and supported on Lx.

This result is a special case of a more general theorem of Ratner, which addresses the case when H is generated by elements which act unipotently on the Lie algebra {\mathfrak g} by conjugation, and when G/\Gamma has finite volume. To prove this theorem we shall follow an argument of Einsiedler, which uses many of the same ingredients used in Ratner’s arguments but in a simplified setting (in particular, taking advantage of the fact that H is semisimple with no non-trivial compact factors). These arguments have since been extended and made quantitative by Einsiedler, Margulis, and Venkatesh.
Read the rest of this entry »

<