On Wednesday, Shou-wu Zhang continued his lecture series. Whereas the first lecture was a general overview of the rational points on curves problem, the second talk focused entirely on the genus 1 case – i.e. the problem of finding rational points on elliptic curves. This is already a very deep and important problem in number theory – for instance, this theory is decisive in Wiles’ proof of Fermat’s last theorem. It was also somewhat more technical than the previous talk, and I had more difficulty following all the details, but in any case here is my attempt to reconstruct the talk from my notes. Once again, the inevitable inaccuracies here are my fault and not Shou-wu’s, and corrections or comments are greatly appreciated.

NB: the talk here seems to be loosely based in part on Shou-wu’s “Current developments in Mathematics” article from 2001.

The problem of finding rational (or more generally, algebraic) points on a curve is clearly an algebraic problem, but a remarkable feature of the subject is that it can be profitable (in the right circumstances) to seek such points via using transcendental functions, which at first glance are much more closely related to analysis and geometry than to algebra. Shou-wu gave the analogy of the unit circle \{(x_1,x_2): x_1^2+x_2^2 - 1 \} as sort of a simplified toy model of an elliptic curve. One way to construct algebraic points on this circle is to use the transcendental functions \sin: {\Bbb R}/2\pi {\Bbb Z} \to {\Bbb R} and \sin' = \cos: {\Bbb R}/2\pi {\Bbb Z} \to {\Bbb R}; indeed, given any rational number a/q, the point (\sin 2\pi \frac{a}{q}, \sin' 2\pi \frac{a}{q}) gives an algebraic point on the unit circle. The sine function is not an algebraic function, but can be defined instead via an integral of an algebraic function, since (modulo some small lies) we have

x = \int_0^{\sin(x)} \frac{dt}{\sqrt{1-t^2}}.

The sine function also satisfies a differential equation, which almost defines this function uniquely:

\sin'(x)^2 = 1 - \sin(x)^2.

Finally, of course, we all know that the sine function has a very geometric interpretation.

Now for constructing rational points on the unit circle, such as (\frac{3}{5}, \frac{4}{5}), it is not so profitable to use these transcendental functions, and it is better to use the purely algebraic formula (\frac{m^2-n^2}{m^2+n^2}, \frac{2mn}{m^2+n^2}) to generate the points. It seems that when we move to elliptic curves, something similar happens: the analogue of the transcendental sine function is the Weierstrass elliptic function {\mathcal P}, while the analogue of the purely algebraic formula is more subtle (and becomes transcendental also); once again, the latter is more useful than the former for constructing rational points (as opposed to just algebraic points).

In the previous lecture, Shou-wu defined an elliptic curve E as a genus 1 curve with a marked point P (which will eventually become the group identity element). It is traditional to move this point P to the (vertical) point at infinity 1/y = x/y = 0 and view the curve affinely, in which case the curve can be placed via an affine transformation in the normal form

E = \{ (x,y): y^2 = x^3 + ax + b \}

(ignoring some minor difficulties arising from characteristic 2 or 3, and assuming that the discriminant \Delta := -16(4a^3+27b^2) is non-zero). As before, we can view this curve over any field k, for instance giving a Riemann surface E({\Bbb C}), the rational points E({\Bbb Q}), the finite field points E({\Bbb F}_q), etc. (There are some minor issues regarding whether one should view E affinely or projectively here; I don’t fully understand this issue and will therefore ignore it.)

The Riemann surface E({\Bbb C}) was described completely by Weierstrass. Indeed, for any complex constants g_2, g_3 one can define the Weierstrass elliptic function {\mathcal P}(u) by the contour integration formula

u = \int_{{\mathcal P}(u)}^\infty \frac{dt}{\sqrt{4t^3 - g_2 t - g_3}};

note that the RHS is only defined up to a lattice \Lambda \subset {\Bbb C} arising from the periods of the multi-valued function \frac{1}{\sqrt{4t^3 - g_2 t - g_3}}, and so the Weierstrass function should really be defined on the torus {\Bbb C}/\Lambda rather than the complex plane {\Bbb C}. As with the sine function, it obeys a differential equation

{\mathcal P}'(u) = 4 {\mathcal P}(u)^3 - g_2 {\mathcal P}(u) - g_3

and so the map u \mapsto ({\mathcal P}(u), {\mathcal P}'(u)) sends the torus {\Bbb C}/\Lambda holomorphically to the Riemann surface of an elliptic curve, and in fact every elliptic curve can be described in this manner. Thus, from the complex (and hence topological) perspective, an elliptic curve is nothing more than a torus of one complex dimension (or two real dimensions).

A complex torus {\Bbb C}/\Lambda is an abelian Lie group, and hence there should also be an abelian Lie group structure on the original elliptic curve E. This group structure is in fact very easy to describe geometrically in the original affine curve picture: three points P,Q,R on the curve will sum to zero if and only if they are collinear (or more precisely, they are the intersection of the projective curve of E with a line, counting multiplicity, and letting the point at infinity play the role of the origin). This group structure can be described algebraically, and so makes sense over any field, not just the complex numbers.

One consequence of Weierstrass’ description is that elliptic curves (over the complex numbers) up to isomorphism can be parameterised by the moduli space of all lattices, which is given by X_0(1) = SL_2({\Bbb Z})\backslash {\Bbb H}; in particular this moduli space has one complex dimension and is thus just a modular curve. This is the first hint that modular functions and modular forms are likely to play a key role in the subject.

For some special lattices \Lambda, the complex torus {\Bbb C}/\Lambda also enjoys some non-trivial complex endomorphisms; for instance if \Lambda is the Gaussian integers {\Bbb Z}[\sqrt{-1}] then we have a multiplication by i, or more generally by any Gaussian integer. This gives rise to the theory of complex multiplication of elliptic curves; it turns out that for many number fields k (and an associated order \theta_k, which should be thought of as just the ring of integers in k) one can isolate the special elliptic curves E which enjoy a complex multiplication structure of \theta_k, or more precisely their endomorphism ring End(E) is isomorphic to \theta_k. This should be thought of as a zero-dimensional subset (or better yet, as a divisor) of the one-dimensional space of all elliptic curves. These divisors turn out to ultimately be very useful for constructing rational points on elliptic curves.

It turns out that in order to do all this, one needs to transform the original elliptic curve E into another object, a modular curve X_0(N), the points of which can themselves be thought of, more or less, as elliptic curves (this appearance of elliptic curves on two different “levels” seems rather confusing to me!). There is in fact now known to be a correspondence in both directions between elliptic curves and modular curves, but Shou-wu mostly focused on the conversion of elliptic curves to modular curves (the converse direction was worked out by Eichler and Shimura).

Firstly, there are a finite number of primes p where the reduction of E to p is bad for one reason or another; we multiply all these primes p together to form the conductor N, which then gets carried along throughout the rest of the construction. (Incidentally, the abc conjecture is equivalent to the assertion that N \ll_\epsilon |\Delta|^{6+\epsilon} for all \epsilon > 0.) Next, we form the Hasse-Weil L-function L(s,E), defined as the product over good primes p of the local L-functions (1 - a_p p^{-s} + p^{1-2s})^{-1}, where a_p = p+1 - \# E( {\Bbb F}_p ) was as in the previous lecture, times some other (not terribly important) factors coming from bad primes. It is a result of Faltings that the L-function basically captures the elliptic curve E up to isogeny, so in principle nothing significant is lost by working just with the L-function.

This L-function can then be expanded as a Dirichlet series \sum_n \frac{a_n}{n^s}; there are some important issues regarding convergence, analytic continuation, etc. which will be glossed over.
Next, we apply the Mellin transform, which in this context basically converts the Dirichlet series \sum_n \frac{a_n}{n^s} to the Fourier series f_E(z) := \sum_n a_n e^{2\pi i n z}. This function f_E(z) is defined on the upper half-plane {\Bbb H} and is obviously invariant under shifts z \mapsto z+1. The amazing and nontrivial fact, conjectured by Shimura-Taniyama, proved in large part by Wiles and Taylor (en route to proving Fermat’s last Theorem) and then completed by Breuil, Conrad, Diamond and Taylor, gives an additional invariance, namely that f_E is a modular form of weight 2 with respect to the principal congruence subgroup

\Gamma_0(N) := \{ \left( \begin{array}{cc} a & b \\ c & d \end{array}\right) \in SL_2({\Bbb Z}): c = 0 \hbox{ mod } N \}

 

 

Equivalently, f_E(z)\ dz is a complex 1-form on the modular curve X_0(N) := \Gamma_0(N) \backslash {\Bbb H}. (This modularity, incidentally, is equivalent after inverting the Mellin transform to a functional equation for the L-function, and can also be used to show that the L-function is entire, which is necessary to even just state the BSD conjecture properly.) One can therefore integrate this 1-form to create a complex map w \mapsto \int_w^\infty f_E(z)\ dz which maps X_0(N) to the torus {\Bbb C}/\Lambda, where \Lambda is the lattice of periods arising from integrating f_E(z) dz on closed loops in the modular curve. From Weierstrass, we know that the torus is equivalent to an elliptic curve E_f; from the work of Faltings, this new curve is isogenous to the original curve E.

To summarise, and ignoring some technical details (which I frankly didn’t fully understand), the deep fact that elliptic curves are modular gives us a new object, the modular curve X_0(N), which has an explicit (but somewhat transcendental) map back into the elliptic curve E (or something very much like E). This gives us a strategy to find rational (or algebraic) points on E by first finding special points (or things like points, e.g. divisors) on X_0(N), and mapping them into E.

[Actually, one does not need the full result of Wiles et al. for many applications; in particular, the elliptic curves with complex multiplication were known to be modular by significantly simpler arguments.]

There are some advantages in working in the modular curve rather than the elliptic curve, one of which is that the points on the modular curve themselves have interesting
structure. We already noted that when N=1, that the modular curve X_0(1) parameterises all elliptic curves. For larger N, there is a slight modification; it turns out that X_0(N) parameterises pairs (A,C) where A is an elliptic curve and C is a finite subgroup of A isomorphic to {\Bbb Z}/N{\Bbb Z}. Because both of these objects can be defined over specific fields (such as {\Bbb C} or {\Bbb Q}), this gives us a way to define X_0(N) concretely over such fields, for instance we can view it as an algebraic curve in {\Bbb C}^2 which is also definable over {\Bbb Q}. Actually, the complex description of X_0(N) can be made extremely concrete by means of the j-invariant, which is a modular function on X_0(1) which has an expansion on the upper half-plane as

j(\tau) = e^{-2\pi i \tau} + 744 + 19688 e^{2\pi i \tau} + \ldots

and the map \tau \mapsto (j(\tau), j(N\tau)) then identifies X_0(N) with an algebraic curve in {\Bbb C}^2. (This is apparently quite remarkable; most moduli spaces do not have concrete representations as algebraic varieties. I gather from Shou-wu that this phenomenon is ultimately due to the fact that elliptic curves are an abelian variety.) Now it becomes meaningful to talk about rational or other algebraic points in X_0(N).

In particular, given a number field k (e.g. a quadratic field k = {\Bbb Q}(\sqrt{-d})) with the associated ring of integers (or order) \theta_k we can define a sub-moduli space of X_0(N) by restricting to those pairs (A,C) for which A has a complex multiplication associated to \theta_k, thus End(A) \equiv \theta_k. After orienting this set appropriately (this is a little tricky, and may require all bad primes to split in k) we create a divisor in the modular curve X_0(N), which one can think of as a signed combination of points; the theory of complex multiplication can be used to show that the degree of this divisor is equal to the class number of the ring of integers. Since each point in the modular curve generates a point in the elliptic curve E, which is a group, we see that this divisor also generates a point in E, which is called a Heegner point; some Galois theory then shows that this is in fact a point over k, basically because everything in sight is \hbox{Gal}(\overline{k}/k) invariant. [As a side note, Heegner, who was a high-school maths teacher, managed to create this technique without any of the modern machinery of elliptic curves, while working on the very classical problem of determining which integers are congruent numbers; this problem turns out to be equivalent to determining which members of a family of elliptic curves admit rational points.]

The theory is particularly simple when the field k = {\Bbb Q}(\sqrt{-d}) has unique factorisation, so that the class number is one and the divisor is just a single point. For instance, when d=163 (this is the largest discriminant for which a imaginary quadratic field has unique factorisation), the divisor consists of a single point at \frac{1 + \sqrt{-163}}{2} (viewed as an element of the half-space, and thence as an element of X_0(1)); this leads to an integer point, and in particular j( \frac{1 + \sqrt{-163}}{2}) is an integer. Comparing this with the asymptotic expansion of j, we thus conclude the famous fact that e^{\pi \sqrt{163}} is extremely close to an integer. [I believe there is a common philosophy in this area, articulated for instance by Ellenberg, that asserts that behind every numerical “miracle” there is almost always some interesting arithmetic geometry object that can “explain” the miracle.]

The method of Heegner points is a powerful way to construct rational or other algebraic points on an elliptic curve, but there is of course the danger that the point constructed is trivial. However, the remarkable Gross-Zagier theorem describes when this occurs. It is formalised as follows: if k is an imaginary quadratic field with discriminant d, and L(E/K,s) is defined as the product L(E,s) L(E_D,s), where E_D is the curve E twisted by D (so that y^2 = x^3 + a x+ b becomes Dy^2 = x^3 + ax + b), then one has a functional equation L(E/K,s) = - L(E/K,2-s). The crucial point here is the minus sign; it implies in particular that L(E/K,s) vanishes at s=1, and with a bit more work one can show that the derivative at s=1 is equal to an absolute constant times the square of the height of the Heegner point. As a consequence, the Heegner point is non-trivial precisely when the order of vanishing of the L-function is exactly 1, which is consistent with the BSD conjecture. This is remarkable because it connects an analytic fact (a non-vanishing of a derivative of an L-function) with an algebraic fact (the triviality of the Heegner point).

In fact, Kolyvagin used ideas similar to this to show that the (weak) BSD conjecture is true for rank 1 elliptic curves. There is in fact a conjecture of Goldfeld that asserts that if one takes a fixed elliptic curve and twists it by various discriminants, then the rank of the resulting twisted curve should be 0 with density 50%, 1 with density 50%, and 2 with density 0% – thus “most” twists should have rank 1, and so the BSD conjecture has already been “mostly” proved. (This is a bit misleading though; if instead one orders elliptic curves more naively, e.g. via the discriminant, then it is suspected now that a positive proportion of curves have rank greater than 2.) In terms of the classic problem of congruent numbers, there is a closely related conjecture that asserts that a square free number will always be a congruent number if it is equal to 5,6,7 mod 8, but almost never be congruent if equal to 1,2,3 mod 8; this conjecture is still unsolved, though there are several partial results.

[Update, May 5: Brian Conrad pointed out to me that it does not actually make sense to talk about points in a complex torus over a field k, as this concept cannot be defined intrinsically within the complex structure, so I edited the text appropriately.]