Distinguished Lecture Series II: Gregory Margulis, “Homogeneous dynamics and number theory II.”

14 January, 2009 in DLS, math.DS, math.NT | Tags: equidistribution, Grigory Margulis, lattice points, Oppenheim conjecture | by Terence Tao

Today, Prof. Margulis continued his lecture series, focusing on two specific examples of homogeneous dynamics applications to number theory, namely counting lattice points on algebraic varieties, and quantitative versions of the Oppenheim conjecture. (Due to lack of time, the third application mentioned in the previous lecture, namely metric theory of Diophantine approximation, was not covered.)

— Counting lattice points in algebraic varieties —

Let $V \subset {\Bbb R}^n$ be an algebraic variety defined over ${\Bbb Q}$ . In general, the question of counting the lattice points $V \cap {\Bbb Z}^n$ is pretty much intractible (even determining whether $V \cap {\Bbb Z}^n$ is non-empty is essentially Hilbert’s tenth problem, known to be undecidable by Matiyasevich’s theorem). However, the problem looks much more tractable if V is homogeneous, in the sense that there exists a reductive subgroup G of $GL_n({\Bbb R})$ , defined over ${\Bbb Q}$ , which preserves V and acts transitively on V (thus $V = Gv$ for some $v \in {\Bbb R}^n$ ). A general question here would be to determine the asymptotics of the quantity $N(T,V)$ as $T \to \infty$ , where $N(T,V) := |V \cap TB \cap {\Bbb Z}^n|$ is the number of lattice points in V in the ball TB of radius T.

Thanks to a classical theorem of Borel and Harish-Chandra, it is known in the above setting that the integer points $V({\Bbb Z})$ of V split as the finite union of orbits of the discrete group $\Gamma := \{ g \in G: g {\Bbb Z}^n = {\Bbb Z}^n \}$ . So, modulo the problem of effectively computing these orbts (which is admittedly a non-trivial task), the question boils down to obtaining asymptotics for $N( T, {\mathcal O}) := |{\mathcal O} \cap B_T|$ as $T \to \infty$ for some orbit ${\mathcal O} := \Gamma v_0 \subset V$ for some $v_0 \in V$ .

Naively, one expects a discrete count such as $|{\mathcal O} \cap B_T|$ to asymptotically resemble its continuous counterpart (much as, say, the number of lattice points in a ball of radius R is known by elementary volume packing arguments going back to Gauss to asymptotically be equivalent to the volume of that ball). In this setting, the intuition would be formalised as follows. We can express the homogeneous space V as $V = G/H$ , where H is the stabiliser of $v_0$ . Then we can pull back $B_T$ to $G/H$ to create the ball-like region $R_T := \{ gH \in G/H: g v_0 \in B_T \}$ . Also, making the mild assumption that the connected component $G^\circ, H^\circ$ of G, H (where connectedness is in the algebraic geometry sense) have no non-trivial characters over ${\Bbb Q}$ (this hypothesis is automatic when G is semisiple), it follows from the work of Borel and Harish-Chandra that the homogeneous space $G/\Gamma$ , and the subspace $H / (\Gamma \cap H)$ both support invariant probability measures $\mu_G, \mu_H$ , which in turn naturally define an invariant measure $\lambda_{G/H}$ on G/H. The natural “discrete count is asymptotically equivalent to continuous count” conjecture would then be the assertion that

$N(T,{\mathcal O}) \sim \lambda_{G/H}(R_T)$ (1)

where $f(T) \sim g(T)$ means that $f(T)/g(T) \to 1$ as $T \to \infty$ .

In principle, the computation of the continuous volume $\lambda_{G/H}(R_T)$ is “just” a computation of a several variable calculus integral, and so (1) provides an asymptotic for the growth of lattice points in the orbit ${\mathcal O}$ .

A significant result in this subject is the work of Eskin, Mozes, and Shah, who showed that the asymptotic (1) held under the assumption that $H^\circ$ is a maximal proper connected ${\Bbb Q}$ -subgroup of G. The key step in their argument is to show that for any sequence $g_i \to G$ going to infinity, that the translated measures $g_i \mu_H$ converge weakly to $\mu_G$ (i.e. become asymptotically equidistributed).

As a typical illustration of their results, consider the variety

$V_p := \{ A \in M_n({\Bbb Z}): \det( \lambda I - A ) = p \}$

of integer matrices with a fixed characteristic polynomial p, which should of course be monic of degree n and with integer coefficients. We will also take $n \geq 2$ and assume p irreducible. Then as a corollary of the general theorem of Eskin, Mozes, and Shah, $N(T,V_p)$ is asymptotically $c_p T^{n(n-1)/2}$ , where $c_p$ is explicitly computable in terms of various algebraic number theory data arising from p. For instance, if p splits over ${\Bbb R}$ and has a root $\alpha$ such that ${\Bbb Z}[\alpha]$ is the ring of integers in ${\Bbb Q}(\alpha)$ , then

$\displaystyle c_p = \frac{2^{n-1} h R \omega_{n(n-1)/2}}{\sqrt{D} \prod_{k=2}^n \Lambda(k/2)}$

where D is the discriminant of p, R is the regulator of $Q(\alpha)$ , $\omega_d$ is the volume of the d-dimensional unit ball, h is the class number of $Z[\alpha]$ , and $\Lambda(s) = \pi^{-s} \Gamma(s) \zeta(2s)$ is a variant of the Riemann Xi function.

— Quantitative Oppenheim conjecture —

For the setting of the quantitative Oppenheim conjecture, one considers an indefinite quadratic form $Q: {\Bbb R}^n \to {\Bbb R}$ with some signature $(p,q)$ for some p+q=n; we normalise $p \geq q$ , and also normalise Q to have discriminant 1. We also fix a star-shaped region $\Omega$ around the origin (one can just take $\Omega$ to be the unit ball for sake of concreteness) and consider for fixed $a < b$ , the discrete quantity

$N_{Q,\Omega}(a,b,T) = | \{ x \in T \Omega \cap {\Bbb Z}^n: Q(x) \in (a,b) \} |$

and the continuous quantity

$V_{Q,\Omega}(a,b,T) = \hbox{mes}( \{ x \in T \Omega: Q(x) \in (a,b) \} | )$ .

Again, $V_{Q,\Omega}(a,b,T)$ can be computed asymptotically, indeed it is not hard (basically just several variable calculus) to show that

$V_{Q,\Omega}(a,b,T) \sim \lambda_{Q,\Omega} (b-a) T^{n-2}$

where $\lambda_{Q,\Omega}$ is the explicit quantity

$\displaystyle \lambda_{Q,\Omega} = \int_{L \cap \Omega} \frac{dA}{|\nabla Q|}$

where L is the light cone of Q, and A is the area element.

In analogy with (1) and with the Gauss circle problem, we would expect

$N_{Q,\Omega}(a,b,T) \sim V_{Q,\Omega}(a,b,T)$ (2)

for each fixed Q, $\Omega$ , a, b. The usual volume packing argument does not work in the indefinite case because the set $\{ x: a < Q(x) < b \}$ is very “narrow”. Nevertheless, from the work of Dani and Margulis we have some results. Firstly, when Q is irrational and $p \geq 2, q \geq 1$ , we have the lower bound

$\liminf_{T \to \infty} N_{Q,\Omega}(a,b,T)/V_{Q,\Omega}(a,b,T) \geq 1$ , (3)

thus there are asymptotically at least as many lattice points as predicted by (2). This bound is uniform over any compact set of irrational forms Q. In the case $p \geq 0, q \geq 0, p+q=n \geq 5$ , we also have the bound

$N_{Q,\Omega}(-\varepsilon,\varepsilon,T) / V_{Q,\Omega}(-\varepsilon,\varepsilon,T) > c > 0$

uniformly in $\varepsilon$ and for Q in a compact set (and not necessarily irrational), where c depends only on this compact set and on $\Omega$ . (The condition $n \geq 5$ is necessary here, since otherwise the counterexamples to Meyer’s theorem in lower dimensions give examples when $N_{Q,\Omega}$ stays bounded while $V_{Q,\Omega}$ goes to infinity.)

A more recent result of Eskin, Margulis, and Mozes improves the lower bound (3) to the asymptotic (2) in the case when $p \geq 3, q \geq 1$ and Q is irrational. This leaves out the “exceptional” cases $(p,q) = (2,1), (2,2)$ . Perhaps surprisingly, the asymptotic (2) fails in such cases, in fact there are examples of forms Q in which $N_{Q,\Omega}/V_{Q,\Omega}$ grows close to logarithmically in T. (More precisely, given any function $f(T) = o( \log T)$ , there exists a form Q and a sequence of times $T_n \to \infty$ such that $N_{Q,\Omega}(T_n)/V_{Q,\Omega}(T_n) \geq f(T_n)$ .) The forms are actually quite explicit; in the $(p,q)=(2,1)$ case they are given by $Q(x_1,x_2,x_3) = x_1^2+x_2^2 - \alpha x_3^2$ where $\sqrt{\alpha}$ is very well approximated by rationals, and in the case $(p,q)=(2,2)$ the are of the form $Q(x_1,x_2,x_3) = x_1^2+x_2^2 - \alpha (x_3^2+x_4^2)$ for similar $\alpha$ . (These sorts of examples originate with an observation of Sarnak.) In the above paper it was shown that the (2,2) counterexamples given here are “essentially” the “only” counterexamples of this signature, although the precise formal statement of this type is technical. The situation for (2,1) signature remains somewhat unclear. On the other hand, it is not too difficult to show that for generic Q (in particular, for almost every Q), one recovers the asymptotic (2). And for every Q in a given compact set K, there is a universal upper bound $N_{Q,\Omega}(T) = O( T^{n-2} )$ in the non-exceptional cases $p \geq 3, q \geq 1$ , with a logarithmic correction $N_{Q,\Omega}(T) = O( T^{n-2} \log T )$ in the exceptional cases $(p,q) = (2,1), (2,2)$ .

One reason for the failure of the asymptotic in these exceptional cases can apparently be traced back to the refusal of a certain integral to decay in the limit $g \to \infty$ . Specifically, if one lets $S^1 := \{ (x,y,1): x^2+y^2=1\}$ be the unit circle in the standard light cone $\{ (x,y,z): x^2+y^2=z^2\}$ , then it is a pleasant geometric exercise to observe that the integral $\int_{S^1} \frac{dv}{|gv|}$ for $g \in SO(2,1)$ is in fact independent of g, and in particular does not go to zero as $g \to \infty$ , whereas the higher-dimensional analogues of this integral do decay.

One of the basic tools in proving these estimates is the Siegel transform, which maps absolutely integrable functions $f \in L^1({\Bbb R}^n)$ to absolutely integrable functions $\tilde f \in L^1({\Omega}_n)$ by the formula

$\displaystyle \tilde f(\Delta) := \sum_{v \in \Delta \backslash \{0\}} f(v)$ .

A classical observation of Siegel is that this transform is mass-preserving:

$\displaystyle \int_{{\Bbb R}^n} f = \int_{{\Omega}_n} \tilde f$ .

As a quick corollary, one recovers a classical theorem of Minkowski that any measurable subset A of ${\Bbb R}^n \backslash \{0\}$ of measure less than 1 is avoided by at least one unimodular lattice (just apply the above identity with $f := 1_A$ ). It turns out that one needs a variant of this statement, namely that the proportion of lattices which avoid A has measure $O( 1 / \hbox{mes}(A) )$ . For this one needs some second moment estimates on $\tilde f$ , which turn out to be classical (essentially going back to C. C. Rogers) for $n \geq 3$ ) but are quite delicate for n=2, requiring in particular some facts about Eisenstein series and which were first worked out by Athreya and Margulis.

Using Siegel’s identity, one can reduce matters to understanding how the transform $\tilde f$ of a function f equidistributes over a shifted orbit $u_t K \Delta$ in $\Omega_n = G/\Gamma$ as $t \to \infty$ , where $\{u_t\}$ is a one-parameter subgroup of $G = SL_n({\Bbb R})$ fixing Q, $\Gamma = SL_n({\Bbb Z})$ , $K$ is a compact subgroup of G, and the lattice $\Delta \in \Omega_n$ is fixed. (Concretely, one can take $Q(x_1,\ldots,x_n) = 2x_1 x_n + \sum_{i=2}^{p} x_i^2 - \sum_{i=p+1}^n x_i^2$ , $u_t$ to be the Lorentz boost that maps $e_1$ to $e^t e_1$ and $e_n$ to $e^{-t} e_n$ , but leaves the other basis vectors $e_2,\ldots,e_{n-1}$ unchanged, and $K = SO(Q) \cap SO(n)$ is the stabiliser of $\hbox{span}(e_1+e_n,e_2,\ldots,e_p)$ .

If $\tilde f$ was continuous and bounded, then the question is purely a dynamical one, involving how the orbit $u_t K g \Gamma$ equidistributes in $G/\Gamma$ . Unfortunately $\tilde f(\Delta)$ blows up when the lattice $\Delta$ approaches degeneracy, in the sense that there exists some intermediate dimension parallelopiped in the lattice of small measure. (Thus, for instance, one way one can degenerate is if one of the vectors of $\Delta$ approaches the origin.) This is formalised by a classical “Lipschitz principle”, bounding $\tilde f$ by $\alpha$ for some geometric function $\alpha$ on $\Omega_n$ (essentially the reciprocal of the least measure of an parallelopiped in the lattice). To deal with this blowup, one basically needs some moment estimates on $\alpha$ in the cusp of $\Omega_n$ . In the non-exceptional case $p \geq 3, q \geq 1$ , it turns out that one can get bounds on the moments $\sup_{t > 0} \int_K \alpha^s(u_t k \Delta)\ dk$ for any $0 < s < 2$ , and in particular for some exponent s larger than 1, and this is enough to ignore the effect of the cusp. But in the exceptional case $p=2$ one can only get moment estimates for $0 < s < 1$ , which is not sufficient; but one has a substitute bound on $\sup_{t > 0} \frac{1}{t} \int_K \alpha(u_t k \Delta)\ dk$ in this case which is enough to give control up to a $\log T$ factor.

5 comments

Comments feed for this article

13 July, 2011 at 6:44 am

Conférence internationale Géométrie Ergodique (Orsay 2011) I « Disquisitiones Mathematicae

[…] mentioned above, I strongly recommend reading Terence Tao’s posts on this subject, specially these ones […]

2 July, 2012 at 8:40 am

Rex

How do we prove that the Siegel transform is mass-preserving?

Also, do you know of any standard references about this operator?

2 July, 2012 at 9:17 am

Terence Tao

See K.L. Siegel, Lectures on the Geometry of Numbers, Springer-Verlag, New York, 1989. One can, if one wishes, interpret Siegel’s formula as a definition of Haar measure on the quotient space $\Omega_n = SL_n({\bf R})/SL_n({\bf Z})$ , at which point the only issue is to verify that the measure is invariant and has total mass 1.

3 July, 2012 at 6:52 am

Rex

Why does the sum over lattice points in the definition of $\widetilde{f}(\Lambda)$ exclude the origin?

Is this just some kind of convention, or is there an important reason why we cannot add $f(0)$ into the sum?

3 July, 2012 at 7:42 am

Terence Tao

Well, the mass formula would fail then, since $\{0\}$ has measure zero in ${\bf R}^n$ but would give a non-zero contribution on the $\Omega_n$ side.

Note that the action of $SL_n({\bf R})$ on ${\bf R}^n$ has two orbits. One is $\{0\}$ , and the other is ${\bf R}^n \backslash \{0\}$ . Generally speaking, formulae in dynamics are cleaner if restricted to one orbit (or one orbit closure, or an ergodic measure) rather than a combination of such orbits.

	Anonymous on On product representations of…
	Alex Gunning on A symmetric formulation of the…
	Terence Tao on On product representations of…
	domotorp on On product representations of…
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on A symmetric formulation of the…
	Anonymous on On product representations of…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on 275A, Notes 3: The weak and st…
	Alex Gunning on A symmetric formulation of the…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on It ought to be common knowledg…

Distinguished Lecture Series II: Gregory Margulis, “Homogeneous dynamics and number theory II.”

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

5 comments

Leave a comment Cancel reply

For commenters

Distinguished Lecture Series II: Gregory Margulis, “Homogeneous dynamics and number theory II.”

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

5 comments

Leave a comment Cancel reply

For commenters