You are currently browsing the tag archive for the ‘monodromy’ tag.
One of the most well known problems from ancient Greek mathematics was that of trisecting an angle by straightedge and compass, which was eventually proven impossible in 1837 by Pierre Wantzel, using methods from Galois theory.
Formally, one can set up the problem as follows. Define a configuration to be a finite collection of points, lines, and circles in the Euclidean plane. Define a construction step to be one of the following operations to enlarge the collection
:
- (Straightedge) Given two distinct points
in
, form the line
that connects
and
, and add it to
.
- (Compass) Given two distinct points
in
, and given a third point
in
(which may or may not equal
or
), form the circle with centre
and radius equal to the length
of the line segment joining
and
, and add it to
.
- (Intersection) Given two distinct curves
in
(thus
is either a line or a circle in
, and similarly for
), select a point
that is common to both
and
(there are at most two such points), and add it to
.
We say that a point, line, or circle is constructible by straightedge and compass from a configuration if it can be obtained from
after applying a finite number of construction steps.
Problem 1 (Angle trisection) Let
be distinct points in the plane. Is it always possible to construct by straightedge and compass from
a line
through
that trisects the angle
, in the sense that the angle between
and
is one third of the angle of
?
Thanks to Wantzel’s result, the answer to this problem is known to be “no” in general; a generic angle cannot be trisected by straightedge and compass. (On the other hand, some special angles can certainly be trisected by straightedge and compass, such as a right angle. Also, one can certainly trisect generic angles using other methods than straightedge and compass; see the Wikipedia page on angle trisection for some examples of this.)
The impossibility of angle trisection stands in sharp contrast to the easy construction of angle bisection via straightedge and compass, which we briefly review as follows:
- Start with three points
.
- Form the circle
with centre
and radius
, and intersect it with the line
. Let
be the point in this intersection that lies on the same side of
as
. (
may well be equal to
).
- Form the circle
with centre
and radius
, and the circle
with centre
and radius
. Let
be the point of intersection of
and
that is not
.
- The line
will then bisect the angle
.
The key difference between angle trisection and angle bisection ultimately boils down to the following trivial number-theoretic fact:
Proof: Obvious by modular arithmetic, by induction, or by the fundamental theorem of arithmetic.
In contrast, there are of course plenty of powers of that are evenly divisible by
, and this is ultimately why angle bisection is easy while angle trisection is hard.
The standard way in which Lemma 2 is used to demonstrate the impossibility of angle trisection is via Galois theory. The implication is quite short if one knows this theory, but quite opaque otherwise. We briefly sketch the proof of this implication here, though we will not need it in the rest of the discussion. Firstly, Lemma 2 implies the following fact about field extensions.
Corollary 3 Let
be a field, and let
be an extension of
that can be constructed out of
by a finite sequence of quadratic extensions. Then
does not contain any cubic extensions
of
.
Proof: If contained a cubic extension
of
, then the dimension of
over
would be a multiple of three. On the other hand, if
is obtained from
by a tower of quadratic extensions, then the dimension of
over
is a power of two. The claim then follows from Lemma 2.
To conclude the proof, one then notes that any point, line, or circle that can be constructed from a configuration is definable in a field obtained from the coefficients of all the objects in
after taking a finite number of quadratic extensions, whereas a trisection of an angle
will generically only be definable in a cubic extension of the field generated by the coordinates of
.
The Galois theory method also allows one to obtain many other impossibility results of this type, most famously the Abel-Ruffini theorem on the insolvability of the quintic equation by radicals. For this reason (and also because of the many applications of Galois theory to number theory and other branches of mathematics), the Galois theory argument is the “right” way to prove the impossibility of angle trisection within the broader framework of modern mathematics. However, this argument has the drawback that it requires one to first understand Galois theory (or at least field theory), which is usually not presented until an advanced undergraduate algebra or number theory course, whilst the angle trisection problem requires only high-school level mathematics to formulate. Even if one is allowed to “cheat” and sweep several technicalities under the rug, one still needs to possess a fair amount of solid intuition about advanced algebra in order to appreciate the proof. (This was undoubtedly be one reason why, even after Wantzel’s impossibility result was published, a large amount of effort was still expended by amateur mathematicians to try to trisect a general angle.)
In this post I would therefore like to present a different proof (or perhaps more accurately, a disguised version of the standard proof) of the impossibility of angle trisection by straightedge and compass, that avoids explicit mention of Galois theory (though it is never far beneath the surface). With “cheats”, the proof is actually quite simple and geometric (except for Lemma 2, which is still used at a crucial juncture), based on the basic geometric concept of monodromy; unfortunately, some technical work is needed however to remove these cheats.
To describe the intuitive idea of the proof, let us return to the angle bisection construction, that takes a triple of points as input and returns a bisecting line
as output. We iterate the construction to create a quadrisecting line
, via the following sequence of steps that extend the original bisection construction:
- Start with three points
.
- Form the circle
with centre
and radius
, and intersect it with the line
. Let
be the point in this intersection that lies on the same side of
as
. (
may well be equal to
).
- Form the circle
with centre
and radius
, and the circle
with centre
and radius
. Let
be the point of intersection of
and
that is not
.
- Let
be the point on the line
which lies on
, and is on the same side of
as
.
- Form the circle
with centre
and radius
. Let
be the point of intersection of
and
that is not
.
- The line
will then quadrisect the angle
.
Let us fix the points and
, but not
, and view
(as well as intermediate objects such as
,
,
,
,
,
,
) as a function of
.
Let us now do the following: we begin rotating counterclockwise around
, which drags around the other objects
,
,
,
,
,
,
that were constructed by
accordingly. For instance, here is an early stage of this rotation process, when the angle
has become obtuse:
Now for the slightly tricky bit. We are going to keep rotating beyond a half-rotation of
, so that
now becomes a reflex angle. At this point, a singularity occurs; the point
collides into
, and so there is an instant in which the line
is not well-defined. However, this turns out to be a removable singularity (and the easiest way to demonstrate this will be to tap the power of complex analysis, as complex numbers can easily route around such a singularity), and we can blast through it to the other side, giving a picture like this:
Note that we have now deviated from the original construction in that and
are no longer on the same side of
; we are thus now working in a continuation of that construction rather than with the construction itself. Nevertheless, we can still work with this continuation (much as, say, one works with analytic continuations of infinite series such as
beyond their original domain of definition).
We now keep rotating around
. Here,
is approaching a full rotation of
:
When reaches a full rotation, a different singularity occurs:
and
coincide. Nevertheless, this is also a removable singularity, and we blast through to beyond a full rotation:
And now is back where it started, as are
,
,
, and
… but the point
has moved, from one intersection point of
to the other. As a consequence,
,
, and
have also changed, with
being at right angles to where it was before. (In the jargon of modern mathematics, the quadrisection construction has a non-trivial monodromy.)
But nothing stops us from rotating some more. If we continue this procedure, we see that after two full rotations of
around
, all points, lines, and circles constructed from
have returned to their original positions. Because of this, we shall say that the quadrisection construction described above is periodic with period
.
Similarly, if one performs an octisection of the angle by bisecting the quadrisection, one can verify that this octisection is periodic with period
; it takes four full rotations of
around
before the configuration returns to where it started. More generally, one can show
Proposition 4 Any construction of straightedge and compass from the points
is periodic with period equal to a power of
.
The reason for this, ultimately, is because any two circles or lines will intersect each other in at most two points, and so at each step of a straightedge-and-compass construction there is an ambiguity of at most . Each rotation of
around
can potentially flip one of these points to the other, but then if one rotates again, the point returns to its original position, and then one can analyse the next point in the construction in the same fashion until one obtains the proposition.
But now consider a putative trisection operation, that starts with an arbitrary angle and somehow uses some sequence of straightedge and compass constructions to end up with a trisecting line
:
What is the period of this construction? If we continuously rotate around
, we observe that a full rotations of
only causes the trisecting line
to rotate by a third of a full rotation (i.e. by
):
Because of this, we see that the period of any construction that contains must be a multiple of
. But this contradicts Proposition 4 and Lemma 2.
Below the fold, I will make the above proof rigorous. Unfortunately, in doing so, I had to again leave the world of high-school mathematics, as one needs a little bit of algebraic geometry and complex analysis to resolve the issues with singularities that we saw in the above sketch. Still, I feel that at an intuitive level at least, this argument is more geometric and accessible than the Galois-theoretic argument (though anyone familiar with Galois theory will note that there is really not that much difference between the proofs, ultimately, as one has simply replaced the Galois group with a closely related monodromy group instead).











Recent Comments