You are currently browsing the tag archive for the ‘elliptic curves’ tag.

Previous set of notes: Notes 2. Next set of notes: Notes 4.

On the real line, the quintessential examples of a periodic function are the (normalised) sine and cosine functions {\sin(2\pi x)}, {\cos(2\pi x)}, which are {1}-periodic in the sense that

\displaystyle  \sin(2\pi(x+1)) = \sin(2\pi x); \quad \cos(2\pi (x+1)) = \cos(2\pi x).

By taking various polynomial combinations of {\sin(2\pi x)} and {\cos(2\pi x)} we obtain more general trigonometric polynomials that are {1}-periodic; and the theory of Fourier series tells us that all other {1}-periodic functions (with reasonable integrability conditions) can be approximated in various senses by such polynomial combinations. Using Euler’s identity, one can use {e^{2\pi ix}} and {e^{-2\pi ix}} in place of {\sin(2\pi x)} and {\cos(2\pi x)} as the basic generating functions here, provided of course one is willing to use complex coefficients instead of real ones. Of course, by rescaling one can also make similar statements for other periods than {1}. {1}-periodic functions {f: {\bf R} \rightarrow {\bf C}} can also be identified (by abuse of notation) with functions {f: {\bf R}/{\bf Z} \rightarrow {\bf C}} on the quotient space {{\bf R}/{\bf Z}} (known as the additive {1}-torus or additive unit circle), or with functions {f: [0,1] \rightarrow {\bf C}} on the fundamental domain (up to boundary) {[0,1]} of that quotient space with the periodic boundary condition {f(0)=f(1)}. The map {x \mapsto (\cos(2\pi x), \sin(2\pi x))} also identifies the additive unit circle {{\bf R}/{\bf Z}} with the geometric unit circle {S^1 = \{ (x,y) \in {\bf R}^2: x^2+y^2=1\} \subset {\bf R}^2}, thanks in large part to the fundamental trigonometric identity {\cos^2 x + \sin^2 x = 1}; this can also be identified with the multiplicative unit circle {S^1 = \{ z \in {\bf C}: |z|=1 \}}. (Usually by abuse of notation we refer to all of these three sets simultaneously as the “unit circle”.) Trigonometric polynomials on the additive unit circle then correspond to ordinary polynomials of the real coefficients {x,y} of the geometric unit circle, or Laurent polynomials of the complex variable {z}.

What about periodic functions on the complex plane? We can start with singly periodic functions {f: {\bf C} \rightarrow {\bf C}} which obey a periodicity relationship {f(z+\omega)=f(z)} for all {z} in the domain and some period {\omega \in {\bf C} \backslash \{0\}}; such functions can also be viewed as functions on the “additive cylinder” {\omega {\bf Z} \backslash {\bf C}} (or equivalently {{\bf C} / \omega {\bf Z}}). We can rescale {\omega=1} as before. For holomorphic functions, we have the following characterisations:

Proposition 1 (Description of singly periodic holomorphic functions)
  • (i) Every {1}-periodic entire function {f: {\bf C} \rightarrow {\bf C}} has an absolutely convergent expansion

    \displaystyle  f(z) = \sum_{n=-\infty}^\infty a_n e^{2\pi i nz} = \sum_{n=-\infty}^\infty a_n q^n \ \ \ \ \ (1)

    where {q} is the nome {q := e^{2\pi i z}}, and the {a_n} are complex coefficients such that

    \displaystyle  \limsup_{n \rightarrow +\infty} |a_n|^{1/n} = \limsup_{n \rightarrow +\infty} |a_{-n}|^{1/n} = 0. \ \ \ \ \ (2)

    Conversely, every doubly infinite sequence {(a_n)_{n \in {\bf Z}}} of coefficients obeying (2) gives rise to a {1}-periodic entire function {f: {\bf C} \rightarrow {\bf C}} via the formula (1).
  • (ii) Every bounded {1}-periodic holomorphic function {f: {\bf H} \rightarrow {\bf C}} on the upper half-plane {\{ z: \mathrm{Im}(z) > 0\}} has an expansion

    \displaystyle  f(z) = \sum_{n=0}^\infty a_n e^{2\pi i nz} = \sum_{n=0}^\infty a_n q^n \ \ \ \ \ (3)

    where the {a_n} are complex coefficients such that

    \displaystyle  \limsup_{n \rightarrow +\infty} |a_n|^{1/n} \leq 1. \ \ \ \ \ (4)

    Conversely, every infinite sequence {(a_n)_{n \in {\bf Z}}} obeying (4) gives rise to a {1}-periodic holomorphic function {f: {\bf H} \rightarrow {\bf C}} which is bounded away from the real axis (i.e., bounded on {\{ z: \mathrm{Im}(z) \geq \varepsilon\}} for every {\varepsilon > 0}).
In both cases, the coefficients {a_n} can be recovered from {f} by the Fourier inversion formula

\displaystyle  a_n = \int_{\gamma_{z_0 \rightarrow z_0+1}} f(z) e^{-2\pi i nz}\ dz \ \ \ \ \ (5)

for any {z_0} in {{\bf C}} (in case (i)) or {{\bf H}} (in case (ii)).

Proof: If {f: {\bf C} \rightarrow {\bf C}} is {1}-periodic, then it can be expressed as {f(z) = F(q) = F(e^{2\pi i z})} for some function {F: {\bf C} \backslash \{0\} \rightarrow {\bf C}} on the “multiplicative cylinder” {{\bf C} \backslash \{0\}}, since the fibres of the map {z \mapsto e^{2\pi i z}} are cosets of the integers {{\bf Z}}, on which {f} is constant by hypothesis. As the map {z \mapsto e^{2\pi i z}} is a covering map from {{\bf C}} to {{\bf C} \backslash \{0\}}, we see that {F} will be holomorphic if and only if {f} is. Thus {F} must have a Laurent series expansion {F(q) = \sum_{n=-\infty}^\infty a_n q^n} with coefficients {a_n} obeying (2), which gives (1), and the inversion formula (5) follows from the usual contour integration formula for Laurent series coefficients. The converse direction to (i) also follows by reversing the above arguments.

For part (ii), we observe that the map {z \mapsto e^{2\pi i z}} is also a covering map from {{\bf H}} to the punctured disk {D(0,1) \backslash \{0\}}, so we can argue as before except that now {F} is a bounded holomorphic function on the punctured disk. By the Riemann singularity removal theorem (Exercise 35 of 246A Notes 3) {F} extends to be holomorphic on all of {D(0,1)}, and thus has a Taylor expansion {F(q) = \sum_{n=0}^\infty a_n q^n} for some coefficients {a_n} obeying (4). The argument now proceeds as with part (i). \Box

The additive cylinder {{\bf Z} \backslash {\bf C}} and the multiplicative cylinder {{\bf C} \backslash \{0\}} can both be identified (on the level of smooth manifolds, at least) with the geometric cylinder {\{ (x,y,z) \in {\bf R}^3: x^2+y^2=1\}}, but we will not use this identification here.

Now let us turn attention to doubly periodic functions of a complex variable {z}, that is to say functions {f} that obey two periodicity relations

\displaystyle  f(z+\omega_1) = f(z); \quad f(z+\omega_2) = f(z)

for all {z \in {\bf C}} and some periods {\omega_1,\omega_2 \in {\bf C}}, which to avoid degeneracies we will assume to be linearly independent over the reals (thus {\omega_1,\omega_2} are non-zero and the ratio {\omega_2/\omega_1} is not real). One can rescale {\omega_1,\omega_2} by a common scaling factor {\lambda \in {\bf C} \backslash \{0\}} to normalise either {\omega_1=1} or {\omega_2=1}, but one of course cannot simultaneously normalise both parameters in this fashion. As in the singly periodic case, such functions can also be identified with functions on the additive {2}-torus {\Lambda \backslash {\bf C}}, where {\Lambda} is the lattice {\Lambda := \omega_1 {\bf Z} + \omega_2 {\bf Z}}, or with functions {f} on the solid parallelogram bounded by the contour {\gamma_{0 \rightarrow \omega_1 \rightarrow \omega_1+\omega_2 \rightarrow \omega_2 \rightarrow 0}} (a fundamental domain up to boundary for that torus), obeying the boundary periodicity conditions

\displaystyle  f(z+\omega_1) = f(z)

for {z} in the edge {\gamma_{\omega_2 \rightarrow 0}}, and

\displaystyle  f(z+\omega_2) = f(z)

for {z} in the edge {\gamma_{\omega_0 \rightarrow 1}}.

Within the world of holomorphic functions, the collection of doubly periodic functions is boring:

Proposition 2 Let {f: {\bf C} \rightarrow {\bf C}} be an entire doubly periodic function (with periods {\omega_1,\omega_2} linearly independent over {{\bf R}}). Then {f} is constant.

In the language of Riemann surfaces, this proposition asserts that the torus {\Lambda \backslash {\bf C}} is a non-hyperbolic Riemann surface; it cannot be holomorphically mapped non-trivially into a bounded subset of the complex plane.

Proof: The fundamental domain (up to boundary) enclosed by {\gamma_{0 \rightarrow \omega_1 \rightarrow \omega_1+\omega_2 \rightarrow \omega_2 \rightarrow 0}} is compact, hence {f} is bounded on this domain, hence bounded on all of {{\bf C}} by double periodicity. The claim now follows from Liouville’s theorem. (One could alternatively have argued here using the compactness of the torus {(\omega_1 {\bf Z} + \omega_2 {\bf Z}) \backslash {\bf C}}. \Box

To obtain more interesting examples of doubly periodic functions, one must therefore turn to the world of meromorphic functions – or equivalently, holomorphic functions into the Riemann sphere {{\bf C} \cup \{\infty\}}. As it turns out, a particularly fundamental example of such a function is the Weierstrass elliptic function

\displaystyle  \wp(z) := \frac{1}{z^2} + \sum_{z_0 \in \Lambda \backslash 0} \frac{1}{(z-z_0)^2} - \frac{1}{z_0^2} \ \ \ \ \ (6)

which plays a role in doubly periodic functions analogous to the role of {x \mapsto \cos(2\pi x)} for {1}-periodic real functions. This function will have a double pole at the origin {0}, and more generally at all other points on the lattice {\Lambda}, but no other poles. The derivative

\displaystyle  \wp'(z) = -2 \sum_{z_0 \in \Lambda} \frac{1}{(z-z_0)^3} \ \ \ \ \ (7)

of the Weierstrass function is another doubly periodic meromorphic function, now with a triple pole at every point of {\Lambda}, and plays a role analogous to {x \mapsto \sin(2\pi x)}. Remarkably, all the other doubly periodic meromorphic functions with these periods will turn out to be rational combinations of {\wp} and {\wp'}; furthermore, in analogy with the identity {\cos^2 x+ \sin^2 x = 1}, one has an identity of the form

\displaystyle  \wp'(z)^2 = 4 \wp(z)^3 - g_2 \wp(z) - g_3 \ \ \ \ \ (8)

for all {z \in {\bf C}} (avoiding poles) and some complex numbers {g_2,g_3} that depend on the lattice {\Lambda}. Indeed, much as the map {x \mapsto (\cos 2\pi x, \sin 2\pi x)} creates a diffeomorphism between the additive unit circle {{\bf R}/{\bf Z}} to the geometric unit circle {\{ (x,y) \in{\bf R}^2: x^2+y^2=1\}}, the map {z \mapsto (\wp(z), \wp'(z))} turns out to be a complex diffeomorphism between the torus {(\omega_1 {\bf Z} + \omega_2 {\bf Z}) \backslash {\bf C}} and the elliptic curve

\displaystyle  \{ (z, w) \in {\bf C}^2: z^2 = 4w^3 - g_2 w - g_3 \} \cup \{\infty\}

with the convention that {(\wp,\wp')} maps the origin {\omega_1 {\bf Z} + \omega_2 {\bf Z}} of the torus to the point {\infty} at infinity. (Indeed, one can view elliptic curves as “multiplicative tori”, and both the additive and multiplicative tori can be identified as smooth manifolds with the more familiar geometric torus, but we will not use such an identification here.) This fundamental identification with elliptic curves and tori motivates many of the further remarkable properties of elliptic curves; for instance, the fact that tori are obviously an abelian group gives rise to an abelian group law on elliptic curves (and this law can be interpreted as an analogue of the trigonometric sum identities for {\wp, \wp'}). The description of the various meromorphic functions on the torus also helps motivate the more general Riemann-Roch theorem that is a fundamental law governing meromorphic functions on other compact Riemann surfaces (and is discussed further in these 246C notes). So far we have focused on studying a single torus {\Lambda \backslash {\bf C}}. However, another important mathematical object of study is the space of all such tori, modulo isomorphism; this is a basic example of a moduli space, known as the (classical, level one) modular curve {X_0(1)}. This curve can be described in a number of ways. On the one hand, it can be viewed as the upper half-plane {{\bf H} = \{ z: \mathrm{Im}(z) > 0 \}} quotiented out by the discrete group {SL_2({\bf Z})}; on the other hand, by using the {j}-invariant, it can be identified with the complex plane {{\bf C}}; alternatively, one can compactify the modular curve and identify this compactification with the Riemann sphere {{\bf C} \cup \{\infty\}}. (This identification, by the way, produces a very short proof of the little and great Picard theorems, which we proved in 246A Notes 4.) Functions on the modular curve (such as the {j}-invariant) can be viewed as {SL_2({\bf Z})}-invariant functions on {{\bf H}}, and include the important class of modular functions; they naturally generalise to the larger class of (weakly) modular forms, which are functions on {{\bf H}} which transform in a very specific way under {SL_2({\bf Z})}-action, and which are ubiquitous throughout mathematics, and particularly in number theory. Basic examples of modular forms include the Eisenstein series, which are also the Laurent coefficients of the Weierstrass elliptic functions {\wp}. More number theoretic examples of modular forms include (suitable powers of) theta functions {\theta}, and the modular discriminant {\Delta}. Modular forms are {1}-periodic functions on the half-plane, and hence by Proposition 1 come with Fourier coefficients {a_n}; these coefficients often turn out to encode a surprising amount of number-theoretic information; a dramatic example of this is the famous modularity theorem, (a special case of which was) used amongst other things to establish Fermat’s last theorem. Modular forms can be generalised to other discrete groups than {SL_2({\bf Z})} (such as congruence groups) and to other domains than the half-plane {{\bf H}}, leading to the important larger class of automorphic forms, which are of major importance in number theory and representation theory, but which are well outside the scope of this course to discuss.

Read the rest of this entry »

An algebraic (affine) plane curve of degree {d} over some field {k} is a curve {\gamma} of the form

\displaystyle  \gamma = \{ (x,y) \in k^2: P(x,y) = 0 \}

where {P} is some non-constant polynomial of degree {d}. Examples of low-degree plane curves include

  • Degree {1} (linear) curves {\{ (x,y) \in k^2: ax+by=c\}}, which are simply the lines;
  • Degree {2} (quadric) curves {\{ (x,y) \in k^2: ax^2+bxy+cy^2+dx+ey+f=0\}}, which (when {k={\bf R}}) include the classical conic sections (i.e. ellipses, hyperbolae, and parabolae), but also include the reducible example of the union of two lines; and
  • Degree {3} (cubic) curves {\{ (x,y) \in k^2: ax^3+bx^2y+cxy^2+dy^3+ex^2+fxy+gy^2+hx+iy+j=0\}}, which include the elliptic curves {\{ (x,y) \in k^2: y^2=x^3+ax+b\}} (with non-zero discriminant {\Delta := -16(4a^3+27b^2)}, so that the curve is smooth) as examples (ignoring some technicalities when {k} has characteristic two or three), but also include the reducible examples of the union of a line and a conic section, or the union of three lines.
  • etc.

Algebraic affine plane curves can also be extended to the projective plane {{\Bbb P} k^2 = \{ [x,y,z]: (x,y,z) \in k^3 \backslash 0 \}} by homogenising the polynomial. For instance, the affine quadric curve {\{ (x,y) \in k^2: ax^2+bxy+cy^2+dx+ey+f=0\}} would become {\{ [x,y,z] \in {\Bbb P} k^2: ax^2+bxy+cy^2+dxz+eyz+fz^2=0\}}.

One of the fundamental theorems about algebraic plane curves is Bézout’s theorem, which asserts that if a degree {d} curve {\gamma} and a degree {d'} curve {\gamma'} have no common component, then they intersect in at most {dd'} points (and if the underlying field {k} is algebraically closed, one works projectively, and one counts intersections with multiplicity, they intersect in exactly {dd'} points). Thus, for instance, two distinct lines intersect in at most one point; a line and a conic section intersect in at most two points; two distinct conic sections intersect in at most four points; a line and an elliptic curve intersect in at most three points; two distinct elliptic curves intersect in at most nine points; and so forth. Bézout’s theorem is discussed in this previous post.

From linear algebra we also have the fundamental fact that one can build algebraic curves through various specified points. For instance, for any two points {A_1,A_2} one can find a line {\{ (x,y): ax+by=c\}} passing through the points {A_1,A_2}, because this imposes two linear constraints on three unknowns {a,b,c} and is thus guaranteed to have at least one solution. Similarly, given any five points {A_1,\ldots,A_5}, one can find a quadric curve passing through these five points (though note that if three of these points are collinear, then this curve cannot be a conic thanks to Bézout’s theorem, and is thus necessarily reducible to the union of two lines); given any nine points {A_1,\ldots,A_9}, one can find a cubic curve going through these nine points; and so forth. This simple observation is one of the foundational building blocks of the polynomial method in combinatorial incidence geometry, discussed in these blog posts.

In the degree {1} case, it is always true that two distinct points {A, B} determine exactly one line {\overleftrightarrow{AB}}. In higher degree, the situation is a bit more complicated. For instance, five collinear points determine more than one quadric curve, as one can simply take the union of the line containing those five points, together with an arbitrary additional line. Similarly, eight points on a conic section plus one additional point determine more than one cubic curve, as one can take that conic section plus an arbitrary line going through the additional point. However, if one places some “general position” hypotheses on these points, then one can recover uniqueness. For instance, given five points, no three of which are collinear, there can be at most one quadric curve that passes through these points (because these five points cannot lie on the union of two lines, and by Bézout’s theorem they cannot simultaneously lie on two distinct conic sections).

For cubic curves, the situation is more complicated still. Consider for instance two distinct cubic curves {\gamma_0 = \{ P_0(x,y)=0\}} and {\gamma_\infty = \{P_\infty(x,y)=0\}} that intersect in precisely nine points {A_1,\ldots,A_9} (note from Bézout’s theorem that this is an entirely typical situation). Then there is in fact an entire one-parameter family of cubic curves that pass through these points, namely the curves {\gamma_t = \{ P_0(x,y) + t P_\infty(x,y) = 0\}} for any {t \in k \cup \{\infty\}} (with the convention that the constraint {P_0+tP_\infty=0} is interpreted as {P_\infty=0} when {t=\infty}).

In fact, these are the only cubics that pass through these nine points, or even through eight of the nine points. More precisely, we have the following useful fact, known as the Cayley-Bacharach theorem:

Proposition 1 (Cayley-Bacharach theorem) Let {\gamma_0 = \{ P_0(x,y)=0\}} and {\gamma_\infty = \{P_\infty(x,y)=0\}} be two cubic curves that intersect (over some algebraically closed field {k}) in precisely nine distinct points {A_1,\ldots,A_9 \in k^2}. Let {P} be a cubic polynomial that vanishes on eight of these points (say {A_1,\ldots,A_8}). Then {P} is a linear combination of {P_0,P_\infty}, and in particular vanishes on the ninth point {A_9}.

Proof: (This proof is based off of a text of Husemöller.) We assume for contradiction that there is a cubic polynomial {P} that vanishes on {A_1,\ldots,A_8}, but is not a linear combination of {P_0} and {P_\infty}.

We first make some observations on the points {A_1,\ldots,A_9}. No four of these points can be collinear, because then by Bézout’s theorem, {P_0} and {P_\infty} would both have to vanish on this line, contradicting the fact that {\gamma_0, \gamma_\infty} meet in at most nine points. For similar reasons, no seven of these points can lie on a quadric curve.

One consequence of this is that any five of the {A_1,\ldots,A_9} determine a unique quadric curve {\sigma}. The existence of the curve follows from linear algebra as discussed previously. If five of the points lie on two different quadric curves {\sigma,\sigma'}, then by Bezout’s theorem, they must share a common line; but this line can contain at most three of the five points, and the other two points determine uniquely the other line that is the component of both {\sigma} and {\sigma'}, and the claim follows.

Now suppose that three of the first eight points, say {A_1,A_2,A_3}, are collinear, lying on a line {\ell}. The remaining five points {A_4,\ldots,A_8} do not lie on {\ell}, and determine a unique quadric curve {\sigma} by the previous discussion. Let {B} be another point on {\ell}, and let {C} be a point that does not lie on either {\ell} or {\sigma}. By linear algebra, one can find a non-trivial linear combination {Q = aP + bP_0 + cP_\infty} of {P,P_0,P_\infty} that vanishes at both {B} and {C}. Then {Q} is a cubic polynomial that vanishes on the four collinear points {A_1,A_2,A_3,B} and thus vanishes on {\ell}, thus the cubic curve defined by {Q} consists of {\ell} and a quadric curve. This curve passes through {A_4,\ldots,A_8} and thus equals {\sigma}. But then {C} does not lie on either {\ell} or {\sigma} despite being a vanishing point of {Q}, a contradiction. Thus, no three points from {A_1,\ldots,A_8} are collinear.

In a similar vein, suppose next that six of the first eight points, say {A_1,\ldots,A_6}, lie on a quadric curve {\sigma}; as no three points are collinear, this quadric curve cannot be the union of two lines, and is thus a conic section. The remaining two points {A_7, A_8} determine a unique line {\ell = \overleftrightarrow{A_7A_8}}. Let {B} be another point on {\sigma}, and let {C} be another point that does not lie on either {\ell} and {\sigma}. As before, we can find a non-trivial cubic {Q = aP + bP_0+cP_\infty} that vanishes at both {B, C}. As {Q} vanishes at seven points of a conic section {\sigma}, it must vanish on all of {\sigma}, and so the cubic curve defined by {Q} is the union of {\sigma} and a line that passes through {A_7} and {A_8}, which must necessarily be {\ell}. But then this curve does not pass through {C}, a contradiction. Thus no six points in {A_1,\ldots,A_8} lie on a quadric curve.

Finally, let {\ell} be the line through the two points {A_1,A_2}, and {\sigma} the quadric curve through the five points {A_3,\ldots,A_7}; as before, {\sigma} must be a conic section, and by the preceding paragraphs we see that {A_8} does not lie on either {\sigma} or {\ell}. We pick two more points {B, C} lying on {\ell} but not on {\sigma}. As before, we can find a non-trivial cubic {Q = aP + bP_0+cP_\infty} that vanishes on {B, C}; it vanishes on four points on {\ell} and thus {Q} defines a cubic curve that consists of {\ell} and a quadric curve. The quadric curve passes through {A_3,\ldots,A_7} and is thus {\sigma}; but then the curve does not pass through {A_8}, a contradiction. This contradiction finishes the proof of the proposition. \Box

I recently learned of this proposition and its role in unifying many incidence geometry facts concerning lines, quadric curves, and cubic curves. For instance, we can recover the proof of the classical theorem of Pappus:

Theorem 2 (Pappus’ theorem) Let {\ell, \ell'} be two distinct lines, let {A_1,A_2,A_3} be distinct points on {\ell} that do not lie on {\ell'}, and let {B_1,B_2,B_3} be distinct points on {\ell'} that do not lie on {\ell}. Suppose that for {ij=12,23,31}, the lines {\overleftrightarrow{A_i B_j}} and {\overleftrightarrow{A_j B_i}} meet at a point {C_{ij}}. Then the points {C_{12}, C_{23}, C_{31}} are collinear.

Proof: We may assume that {C_{12}, C_{23}} are distinct, since the claim is trivial otherwise.

Let {\gamma_0} be the union of the three lines {\overleftrightarrow{A_1 B_2}}, {\overleftrightarrow{A_2 B_3}}, and {\overleftrightarrow{A_3 B_1}} (the purple lines in the first figure), let {\gamma_1} be the union of the three lines {\overleftrightarrow{A_2 B_1}}, {\overleftrightarrow{A_3 B_2}}, and {\overleftrightarrow{A_1 B_3}} (the dark blue lines), and let {\gamma} be the union of the three lines {\ell}, {\ell'}, and {\overleftrightarrow{C_{12} C_{23}}} (the other three lines). By construction, {\gamma_0} and {\gamma_1} are cubic curves with no common component that meet at the nine points {A_1,A_2,A_3,B_1,B_2,B_3,C_{12},C_{23},C_{31}}. Also, {\gamma} is a cubic curve that passes through the first eight of these points, and thus also passes through the ninth point {C_{31}}, by the Cayley-Bacharach theorem. The claim follows (note that {C_{31}} cannot lie on {\ell} or {\ell'}). \Box

The same argument gives the closely related theorem of Pascal:

Theorem 3 (Pascal’s theorem) Let {A_1,A_2,A_3,B_1,B_2,B_3} be distinct points on a conic section {\sigma}. Suppose that for {ij=12,23,31}, the lines {\overleftrightarrow{A_i B_j}} and {\overleftrightarrow{A_j B_i}} meet at a point {C_{ij}}. Then the points {C_{12}, C_{23}, C_{31}} are collinear.

Proof: Repeat the proof of Pappus’ theorem, with {\sigma} taking the place of {\ell \cup \ell'}. (Note that as any line meets {\sigma} in at most two points, the {C_{ij}} cannot lie on {\sigma}.) \Box

One can view Pappus’s theorem as the degenerate case of Pascal’s theorem, when the conic section degenerates to the union of two lines.

Finally, Proposition 1 gives the associativity of the elliptic curve group law:

Theorem 4 (Associativity of the elliptic curve law) Let {\gamma := \{ (x,y) \in k^2: y^2 = x^3+ax+b \} \cup \{O\}} be a (projective) elliptic curve, where {O := [0,1,0]} is the point at infinity on the {y}-axis, and the discriminant {\Delta := -16(4a^3+27b^2)} is non-zero. Define an addition law {+} on {\gamma} by defining {A+B} to equal {-C}, where {C} is the unique point on {\gamma} collinear with {A} and {B} (if {A,B} are disjoint) or tangent to {A} (if {A=B}), and {-C} is the reflection of {C} through the {x}-axis (thus {C, -C, O} are collinear), with the convention {-O=O}. Then {+} gives {\gamma} the structure of an abelian group with identity {O} and inverse {-}.

Proof: It is clear that {O} is the identity for {+}, {-} is an inverse, and {+} is abelian. The only non-trivial assertion is associativity: {(A+B)+C = A+(B+C)}. By a perturbation (or Zariski closure) argument, we may assume that we are in the generic case when {O,A,B,C,A+B,B+C,-(A+B), -(B+C)} are all distinct from each other and from {-((A+B)+C), -(A+(B+C))}. (Here we are implicitly using the smoothness of the elliptic curve, which is guaranteed by the hypothesis that the discriminant is non-zero.)

Let {\gamma'} be the union of the three lines {\overleftrightarrow{AB}}, {\overleftrightarrow{C(A+B)}}, and {\overleftarrow{O(B+C)}} (the purple lines), and let {\gamma''} be the union of the three lines {\overleftrightarrow{O(A+B)}}, {\overleftrightarrow{BC}}, and {\overleftrightarrow{A(B+C)}} (the green lines). Observe that {\gamma'} and {\gamma} are cubic curves with no common component that meet at the nine distinct points {O, A, B, C, A+B, -(A+B), B+C, -(B+C), -((A+B)+C)}. The cubic curve {\gamma''} goes through the first eight of these points, and thus (by Proposition 1) also goes through the ninth point {-((A+B)+C)}. This implies that the line through {A} and {B+C} meets {\gamma} in both {-(A+(B+C))} and {-((A+B)+C)}, and so these two points must be equal, and so {(A+B)+C=A+(B+C)} as required. \Box

One can view Pappus’s theorem and Pascal’s theorem as a degeneration of the associativity of the elliptic curve law, when the elliptic curve degenerates to three lines (in the case of Pappus) or the union of one line and one conic section (in the case of Pascal’s theorem).

[This post is authored by Emmanuel Kowalski.]

This post may be seen as complementary to the post “The parity problem in sieve theory“. In addition to a survey of another important sieve technique, it might be interesting as a discussion of some of the foundational issues which were discussed in the comments to that post.

Many readers will certainly have heard already of one form or another of the “large sieve inequality”. The name itself is misleading however, and what is meant by this may be something having very little, if anything, to do with sieves. What I will discuss are genuine sieve situations.

The framework I will describe is explained in the preprint arXiv:math.NT/0610021, and in a forthcoming Cambridge Tract. I started looking at this first to have a common setting for the usual large sieve and a “sieve for Frobenius” I had devised earlier to study some arithmetic properties of families of zeta functions over finite fields. Another version of such a sieve was described by Zywina (“The large sieve and Galois representations”, preprint), and his approach was quite helpful in suggesting more general settings than I had considered at first. The latest generalizations more or less took life naturally when looking at new applications, such as discrete groups.

Unfortunately (maybe), there will be quite a bit of notation involved; hopefully, the illustrations related to the classical case of sieving integers to obtain the primes (or other subsets of integers with special multiplicative features) will clarify the general case, and the “new” examples will motivate readers to find yet more interesting applications of sieves.

Read the rest of this entry »

On Wednesday, Shou-wu Zhang continued his lecture series. Whereas the first lecture was a general overview of the rational points on curves problem, the second talk focused entirely on the genus 1 case – i.e. the problem of finding rational points on elliptic curves. This is already a very deep and important problem in number theory – for instance, this theory is decisive in Wiles’ proof of Fermat’s last theorem. It was also somewhat more technical than the previous talk, and I had more difficulty following all the details, but in any case here is my attempt to reconstruct the talk from my notes. Once again, the inevitable inaccuracies here are my fault and not Shou-wu’s, and corrections or comments are greatly appreciated.

NB: the talk here seems to be loosely based in part on Shou-wu’s “Current developments in Mathematics” article from 2001.

Read the rest of this entry »