While working on my recent paper with Ben Green, I was introduced to the beautiful theorems of Marina Ratner on unipotent flows on homogeneous spaces, and their application to questions in number theory, such as the Oppenheim conjecture (first solved by Margulis, by establishing what can retrospectively be viewed as a special case of Ratner’s theorems). This is a subject that I am still only just beginning to learn, but hope to understand better in the future, especially given that quantitative analogues of Ratner’s theorems should exist, and should have even more applications to number theory (see for instance this recent paper of Einsiedler, Margulis, and Venkatesh). In this post, I will try to describe some of the background for this theorem and its connection with the Oppenheim conjecture; I will not discuss the proof at all, largely because I have not fully understood it myself yet. For a nice introduction to these issues, I recommend Dave Morris’ recent book on the subject (and this post here is drawn in large part from that book).

Ratner’s theorem takes place on a homogeneous space. Informally, a homogeneous space is a space X which looks “the same” when viewed from any point on that space. For instance, a sphere is a homogeneous space, but the surface of a cube is not (the cube looks different when viewed from a corner than from a point on an edge or on a face). More formally, a homogeneous space is a space X equipped with an action of a group G of symmetries which is transitive: given any two points x, y on the space, there is at least one symmetry g that moves x to y, thus y=gx. (For instance the cube has several symmetries, but not enough to be transitive; in contrast, the sphere has the transitive action of the special orthogonal group SO(3) as its symmetry group.) It is not hard to see that a homogeneous space X can always be identified (as a set with an action of G) with a quotient , where is a subgroup of G; indeed, one can take to be the stabiliser of an arbitrarily chosen point x in X, and then identify with . For instance, the sphere has an obvious action of the special orthogonal group SO(3), and the stabiliser of (say) the north pole can be identified with SO(2), so that the sphere can be identified with SO(3)/SO(2). More generally, any Riemannian manifold of constant curvature is a homogeneous space; for instance, an m-dimensional torus can be identified with , while a surface X of constant negative curvature can be identified with for some subgroup of (e.g. the hyperbolic plane is isomorphic to ). Furthermore, the *cosphere bundle* of X – the space of unit (co)tangent vectors on X – is also a homogeneous space with structure group . (For instance, the cosphere bundle of the hyperbolic plane is isomorphic to .)

For the purposes of Ratner’s theorem, we only consider homogeneous spaces X in which the symmetry group G is a connected finite-dimensional Lie group, and X is *finite volume* (or more precisely, it has a finite non-trivial G-invariant measure). Every compact homogeneous space is finite volume, but not conversely; for instance the modular curve is finite volume but not compact (it has a cusp). (The modular curve has two real dimensions, but just one complex dimension, hence the term “curve”; rather confusingly, it is also referred to as the “modular surface”. As for the term “modular”, observe that the moduli space of unimodular lattices in has an obvious action of , with the stabiliser of being , and so this moduli space can be identified with the modular curve.)

Let U be a subgroup of G. The group U then acts on X, creating an orbit inside X for every point x in X. Even though X “looks the same” from every point, the orbits of U need not all look alike, basically because we are not assuming U to be a normal subgroup (i.e. in general). For instance on the surface of the earth, which we model as a sphere , if we let be the group of rotations around the Earth’s axis, then the orbits Ux are nothing more than the circles of latitude, together with the north and south poles as singleton orbits.

In the above example, the orbits were closed subsets of the space X. But this is not always the case. Consider for instance the 2-torus , and let be a line . Then if the slope of this line is irrational, the orbit Ux of a point x in the torus will be a dense one-dimensional subset of that two-dimensional torus, and thus definitely not closed. More generally, when considering the orbit of a subspace on a torus , the orbit Ux of a point x will always be a dense subset of some subtorus of (this is essentially Kronecker’s theorem).

From these examples we see that even if an orbit Ux is not closed, its closure is fairly “nice” – indeed, in all of the above cases, the closure can be written as a closed orbit Hx of some other group intermediate between U and G.

Unfortunately, this nice state of affairs is not true for arbitrary flows on homogeneous spaces. A classic example is geodesic flow on surfaces M of constant negative curvature (such as the modular curve mentioned earlier). This flow can be viewed as an action of (representing time) on the cosphere bundle (which represents the state space of a particle on M moving at unit speed), which is a homogeneous space with symmetry group . In this example, the subgroup is given as

. (*)

For certain surfaces, this flow is quite chaotic, for instance Morse produced an example of a geodesic flow on a constant negative curvature surface whose closed orbit had cross-sections that were homeomorphic to a Cantor set. (For the modular curve, there is an old result of Artin that exhibits an orbit which is dense in the whole curve, but I don’t know if one can obtain Cantor-like behaviour in this curve. There also seems to be some connection between geodesic flow on this curve and continued fractions which I don’t really understand.)

The reason for the “badness” of the above examples stems from the exponential instabilities present in the action of U, which can already be suspected from the presence of the exponential in (*). (Exponential instability is not a sufficient condition for chaos, but is often a necessary one.) Ratner’s theorems assert, very roughly speaking, that if one eliminates all exponential behaviour from the group U, then the orbits Ux become nicely behaved again; they are either closed, or are dense in larger closed orbits Hx.

What does it mean to eliminate “all exponential behaviour”? Consider a one-dimensional matrix group

where A is a matrix with some designated logarithm , and . Generally, we expect the coefficients of to contain exponentials (as is the case in (*)), or sines and cosines (which are basically just a complex version of exponentials). However, if A is a unipotent matrix (the only eigenvalue is 1, or equivalently that for some nilpotent matrix N), then is a polynomial in t, rather than an exponential or sinusoidal function of t. More generally, we say that an element g of a Lie group G is unipotent if its adjoint action on the Lie algebra is unipotent. Thus for instance any element in the centraliser of G is unipotent, and every element of a nilpotent group is unipotent.

We can now state one of Ratner’s theorems.

Ratner’s orbit closure theorem. Let be a homogeneous space of finite volume with a connected finite-dimensional Lie group G as symmetry group, and let U be a connected subgroup of G generated by unipotent elements. Let Ux be an orbit in of U in X. Then the closure is itself a homogeneous space of finite volume; in particular, there exists a closed subgroup such that .

This theorem (first conjectured by Raghanuthan, I believe) asserts that the orbit of any unipotent flow is dense in some homogeneous space of finite volume. In the case of algebraic groups, it has a nice corollary: any unipotent orbit in an algebraic homogeneous space which is Zariski dense, is topologically dense as well.

In some applications, density is not enough; we also want *equidistribution*. Happily, we have this also:

Ratner’s equidistribution theorem. Let X, G, U, x, H be as in the orbit closure theorem. Assume also that U is a one-parameter group, thus for some homomorphism . Then is equidistributed in ; thus for any continuous function we have

where represents integration on the normalised Haar measure on Hx.

One can also formulate this theorem (first conjectured by Dani, I believe) for groups U that have more than one parameter, but it is a bit technical to do so and we shall omit it. My paper with Ben Green concerns a quantitative version of this theorem in the special case when X is a nilmanifold, and where the continuous orbit Ux is replaced by a discrete polynomial sequence. (There is an extensive literature on generalising Ratner’s theorems from continuous U to discrete U, which I will not discuss here.)

From the equidistribution theorem and a little bit of ergodic theory one has a measure-theoretic corollary, which describes ergodic measures of a group generated by unipotent elements:

Ratner’s measure classification theorem. Let X be a finite volume homogeneous space for a connected Lie group G, and let U be a connected subgroup of G generated by unipotent elements. Let be a probability measure on X which is ergodic under the action of U. Then is the Haar measure of some closed finite volume orbit Hx for some .

– The Oppenheim conjecture –

To illustrate the power of Ratner’s orbit closure theorem, we discuss the first major application of this theorem, namely to solve the Oppenheim conjecture. (Margulis’ solution of the Oppenheim conjecture predates Ratner’s papers by a year or two, but Margulis solved the conjecture by establishing a special case of the orbit closure theorem.) I will not discuss applications of the other two theorems of Ratner here.

The Oppenheim conjecture concerns the possible value of quadratic forms in more than one variable, when all the variables are restricted to be integer. For instance, the famous four squares theorem of Lagrange asserts that the set of possible values of the quadratic form

where range over the integers, are precisely the natural numbers . More generally, if Q is a positive definite quadratic form in m variables, possibly with irrational coefficients, then the set of possible values of Q can be easily seen to be a discrete subset of the positive real axis. I can’t resist mentioning here a beautiful theorem of Jon Hanke and my friend Manjul Bhargava: if a positive-definite quadratic form with integer coefficients represents all positive integers up to 290, then it in fact represents all positive integers. If the off-diagonal coefficients are even, one only needs to represent the integers up to 15; this was done by John Conway and my classmate from Princeton, Will Schneeberger.

What about if Q is indefinite? Then a number of things can happen. If Q has integer coefficients, then clearly must take integer values, and can take arbitrarily large positive or negative such values, but can have interesting gaps in the representation. For instance, the question of which integers are represented by for some integer d already involves a little bit of class field theory of , and was first worked out by Gauss.

Similar things can happen of course if Q has commensurate coefficients, i.e. Q has integer coefficients after dividing out by a constant. What if Q has incommensurate coefficients? In the two-variable case, we can still have some discreteness in the representation. For instance, if is the golden ratio, then the quadratic form

cannot get arbitrarily close to 0, basically because the golden ratio is very hard to approximate by a rational a/b (the best approximants being given, of course, by the Fibonacci numbers).

However, for indefinite quadratic forms Q of three or more variables with incommensurate coefficients, Oppenheim conjectured in 1929 that there was no discreteness whatsoever – the set was dense in . There was much partial progress on this problem in the case of many variables (in large part due to the power of the Hardy-Littlewood circle method in this setting), but the hardest case of just three variables was only solved by Margulis in 1989.

Nowadays we can obtain Margulis’ result rather easily from Ratner’s theorem as follows. It is straightforward to reduce to the most difficult case, namely when m=3. We need to show that the image of under the quadratic form is dense in . Now, every quadratic form comes with a special orthogonal group SO(Q), defined as the orientation-preserving linear transformations that preserve Q; for instance, the Euclidean form in has the rotation group SO(3), the Minkowski form has the Lorentz group SO(3,1), and so forth. The image of under Q is the same as that of the larger set . [We may as well make our domain as large as possible, as this can only make our job easier, in principle at least.] Since Q is indefinite, , and so it will suffice to show that is dense in . Actually, for minor technical reasons it is convenient to just work with the identity component of SO(Q) (which has two connected components).

[An analogy with the Euclidean case might be enlightening here. If one spins around the lattice by the Euclidean orthogonal group SO(Q)=SO(3), one traces out a union of spheres around the origin, where the radii of the spheres are precisely those numbers whose square can be expressed as the sum of three squares. In this case, is not dense, and this is reflected in the fact that not every number is the sum of three perfect squares. The Oppenheim conjecture asserts instead that if you spin a lattice by an irrational Lorentz group, one traces out a dense set.]

In order to apply Ratner’s theorem, we will view as an orbit Ux in a symmetric space . Clearly, U should be the group , but what to do about the set ? We have to turn it somehow into a point in a symmetric space. The obvious thing to do is to view as the zero coset (i.e. the origin) in the torus , but this doesn’t work, because does not act on this torus (it is not a subgroup of ). So we need to lift up to a larger symmetric space , with a symmetry group G which is large enough to accommodate .

The problem is that the torus is the moduli space for translations of the lattice , but is not a group of translations; it is instead a group of unimodular linear transformations, i.e. a subgroup of the special linear group . This group acts on lattices, and the stabiliser of is . Thus the right homogeneous space to use here is , which has a geometric interpretation as the moduli space of unimodular lattices in (i.e. a higher-dimensional version of the modular curve); X is not compact, but one can verify that X has finite volume, which is good enough for Ratner’s theorem to apply. Since the group contains , U acts on X. Let be the origin in X (under the moduli space interpretation, x is just the standard lattice ). If Ux is dense in X, this implies that the set of matrices is dense in ; applying this to, say, the unit vector (1,0,0), we conclude that is dense in as required. (These reductions are due to Raghunathan. Note that the new claim is actually a bit stronger than the original Oppenheim conjecture; not only are we asserting now that applied to the standard lattice sweeps out a dense subset of Euclidean space, we are saying the stronger statement that one can use to bring the standard lattice “arbitrarily close” to any given unimodular lattice one pleases, using the topology induced from .)

How do we show that Ux is dense in X? We use Ratner’s orbit closure theorem! This theorem tells us that if Ux is *not* dense in X, it must be much smaller – it must be contained in a closed finite volume orbit Hx for some proper closed connected subgroup H of which still contains . [To apply this theorem, we need to check that U is generated by unipotent elements, which can be done by hand; here is where we need to assume .] An inspection of the Lie algebras of and shows in fact that the only such candidate for H is itself (here is where we really use the hypothesis m=3!). Thus is closed and finite volume in X, which implies that is a lattice in . Some algebraic group theory (specifically, the Borel density theorem) then shows that lies in the Zariski closure of , and in particular is definable over . It is then not difficult to see that the only way this can happen is if Q has rational coefficients (up to scalar multiplication), and the Oppenheim conjecture follows.

[*Update*, September 29: Link to Dave Morris’ book repaired slightly.]

## 25 comments

Comments feed for this article

29 September, 2007 at 12:30 pm

Johan RichterThere is a slight problem with the link to Morris’ book. Arxiv will take you to the right place but not without complaining first.

And feel free to delete ths comment.

29 September, 2007 at 12:33 pm

Terence TaoThanks for the correction!

29 September, 2007 at 11:36 pm

Prashant VDear Terry,

Are there any analogues to Ratner’s Theorem that can be extended to non-homogeneous spaces (i.e. the surface of a cube?

29 September, 2007 at 11:42 pm

MatheusDear Terry,

concerning your comment “There also seems to be some connection between geodesic flow on this curve and continued fractions which I don’t really understand.”, it seems that the article

“Symbolic dynamics for the modular surface and beyond” by Svetlana Katok and Ilie Ugarcovici (published on Bulletin of AMS, 44, n.1, (2007) p. 87-132)

gives a good explanation of the connection cited in your comment. More precisely, Katok et al. survey shows how one can undertsand the dynamics of this geodesic flow from the construction of suitable Markov partitions provided by continued fractions algorithms.

1 October, 2007 at 10:37 am

Terence TaoDear Prashant: In some cases one can foliate a non-homogeneous space into homogeneous spaces, if all the orbits of the ambient group are closed, and then Ratner’s theorem can apply to each space separately. For instance, the surface of a cylinder is not a homogeneous space for the action of the rotation group SO(2) around its axis, because this action is not transitive; but one can foliate that surface into circles, each of which is a homogeneous space for SO(2). Of course, orbits are not always closed (this is why we need Ratner’s theorem in the first place!)… My impression is that once one works with more general group actions on spaces, orbits can in fact get quite chaotic and not pleasantly classifiable.

Dear Matheus: thank you for the reference! It occurred to me afterwards that geodesic flow on the modular curve is also connected to the value of quadratic forms in two variables, by the exact same reductions used in the discussion of the Oppenheim conjecture, and that problem is already known to be connected with continued fractions (cf. my discussion above on the golden ratio). Of course, in two dimensions, the orthogonal group is not generated by unipotents, which can be viewed as an explanation as to why the Oppenheim conjecture is restricted to three and higher dimensions.

5 October, 2007 at 7:24 pm

Unipotent elements of the Lorentz group, and conic sections « What’s new[…] rotations, polynomial growth, unipotent matrices In my discussion of the Oppenheim conjecture in my recent post on Ratner’s theorems, I mentioned in passing the simple but crucial fact that the (orthochronous) special orthogonal […]

9 January, 2008 at 10:14 am

254A, Lecture 1: Overview « What’s new[…] actions arising from a nilpotent Lie group with discrete stabiliser. One of the key results here is Ratner’s theorem, which describes the distribution of orbits in nilsystems, and also in a more general class of […]

13 January, 2008 at 4:22 pm

254A, Lecture 3: Minimal dynamical systems, recurrence, and the Stone-Cech compactification « What’s new[…] on the Lie algebra is unipotent rather than trivial, then Theorem 2 still holds; this follows from Ratner’s theorem, of which we will discuss much later in this course. But the claim is not true for all group […]

24 January, 2008 at 4:02 pm

254A, Lecture 6: Isometric systems and isometric extensions « What’s new[…] we characterise the distribution of polynomial sequences in torii (a baby case of a variant of Ratner’s theorem due to (Leon) Green, which we will cover later in this […]

9 March, 2008 at 8:33 pm

254A, Lecture 16: A Ratner-type theorem for nilmanifolds « What’s new[…] nilpotent groups, Ratner’s theorem, structure The last two lectures of this course will be on Ratner’s theorems on equidistribution of orbits on homogeneous spaces. Due to lack of time, I will not be able to […]

15 March, 2008 at 3:07 pm

254A, Lecture 17: A Ratner-type theorem for SL_2(R) orbits « What’s new[…] groups, Mautner phenomenon, Ratner’s theorem, unipotence In this final lecture, we establish a Ratner-type theorem for actions of the special linear group on homogeneous spaces. More precisely, we […]

14 June, 2008 at 1:20 pm

The van der Corput’s trick, and equidistribution on nilmanifolds « What’s new[…] character mentioned above; this is an extremely simple case of a much more general result known as Ratner’s theorem, which I will not talk further about […]

22 November, 2008 at 11:11 am

Pete L. ClarkDear Prof. Tao,

The 290 theorem was conjectured by John Conway around the same time as the 15 theorem. The proof of the 290 theorem is joint work of Manjul Bhargava and (my colleague) Jonathan Hanke.

22 November, 2008 at 1:36 pm

Terence TaoDear Pete: thanks for the correction! I think I had gotten confused because the Bhargava-Hanke paper does not yet seem to be published.

13 January, 2009 at 7:05 pm

Distinguished Lecture Series I: Gregory Margulis, “Homogeneous dynamics and number theory I.” « What’s new[…] in homogeneous spaces. One famous example is the Oppenheim conjecture (which I also blogged about here), first proven in full generality by Margulis. It concerns the possible values of a real […]

16 January, 2009 at 9:45 am

Distinguished Lecture Series III: Gregory Margulis, “Homogeneous dynamics and number theory III.” « What’s new[…] some of the ideas around the theory of unipotent flows on homogeneous spaces, culminating in the orbit closure, equidsitribution, and measure classification theorems of Ratner in the subject. Margulis also discussed the application to metric theory of Diophantine […]

18 February, 2009 at 5:57 pm

Furstenberg’s 2x, 3x (mod 1) problem « Disquisitiones Mathematicae[…] problems, one finds Margulis’ solution to Oppenheim conjecture (see e.g. this blog post of Terence Tao). Here, the basic idea is to convert the study of the values of indefinite quadratic […]

26 July, 2009 at 7:02 am

“The remarkable effectiveness of Ergodic Theory in Number Theory” « Disquisitiones Mathematicae[…] the several successful applications of Ergodic Theory in old problems of Number Theory (such as Margulis solution of Oppenheim conjecture, Elkies-McMullen theorem on the gap distribution of mod 1, Green-Tao theorem about long arithmetic […]

2 December, 2009 at 2:30 pm

An inverse theorem for the Gowers U^4 norm « What’s new[…] reason why this Lemma implies Claim 1 is because the joint orbit of , , can be controlled, using Ratner’ s theorem (or more precisely, a quantitative version of this theorem that Ben and I worked out a few years […]

19 August, 2010 at 1:56 pm

Lindenstrauss, Ngo, Smirnov, Villani « What’s new[…] theorems; this was discussed earlier on this blog at this writeup of a lecture of Margulis, or at this discussion of Ratner’s theorems. Indeed, even though the Oppenheim conjecture is a purely number-theoretic statement, the only […]

20 August, 2010 at 10:54 am

ICM2010 — Lindenstrauss laudatio « Gowers's Weblog[…] of a very general and important story. I am not competent to tell that story, but fortunately Terence Tao has blogged on it. (That is not the only post of Terry’s on that topic: another I would recommend is this one. […]

13 July, 2011 at 6:44 am

Conférence internationale Géométrie Ergodique (Orsay 2011) I « Disquisitiones Mathematicae[…] Closing the “historical introduction” part of the talk, Lindenstrauss remarked that Dani-Margulis theorem is a special case of Raghunathan’s conjecture which in particular classifies orbit closures in whenever is generated by unipotent one-parameter subgroups. Furthermore, Raghunathan’s conjecture was proved in full generality by Marina Ratner. Remark 4 Of course, this is a very long history and due to the usual space-time limitations, Lindenstrauss was obliged to stop here the introductory considerations. For the curious reader wishing to learn more about the topics mentioned above, I strongly recommend reading Terence Tao’s posts on this subject, specially these ones here. […]

16 August, 2011 at 9:34 pm

Ratner’s Theorems | YAMB[…] subgroups, i.e., the image of by a homomorphism. However, according to Terrence Tao’s blog post on this subject, it can be formulated for more general amenable groups . In the case when is a […]

18 November, 2011 at 2:38 am

Diffusion in Ehrenfest wind-tree model « Disquisitiones Mathematicae[…] a celebrated Ratner-like classification theorem of closures of -orbits of genus 2 translation surfaces by K. Calta and […]

12 August, 2014 at 8:05 pm

Avila, Bhargava, Hairer, Mirzakhani | What's new[…] of Ratner’s celebrated rigidity theorems for unipotently generated groups (discussed in this previous blog post). Ratner’s theorems are already notoriously difficult to prove, and rely very much on the […]