You are currently browsing the category archive for the ‘question’ category.
I’ve just opened the research thread for the mini-polymath4 project over at the polymath blog to collaboratively solve one of the six questions from this year’s IMO. This year I have selected Q3, which is a somewhat intricate game-theoretic question. (The full list of questions this year may be found here.)
This post will serve as the discussion thread of the project, intended to focus all the non-research aspects of the project such as organisational matters or commentary on the progress of the project. The third component of the project is the wiki page, which is intended to summarise the progress made so far on the problem.
As with the previous mini-polymath projects, I myself will be serving primarily as a moderator, and hope other participants will take the lead in the research and in keeping the wiki up-to-date.
Just a reminder that the mini-polymath4 project will begin in three hours at Thu July 12 2012 UTC 22:00.
Two quick updates with regards to polymath projects. Firstly, given the poll on starting the mini-polymath4 project, I will start the project at Thu July 12 2012 UTC 22:00. As usual, the main research thread on this project will be held at the polymath blog, with the discussion thread hosted separately on this blog.
Second, the Polymath7 project, which seeks to establish the “hot spots conjecture” for acute-angled triangles, has made a fair amount of progress so far; for instance, the first part of the conjecture (asserting that the second Neumann eigenfunction of an acute non-equilateral triangle is simple) is now solved, and the second part (asserting that the “hot spots” (i.e. extrema) of that second eigenfunction lie on the boundary of the triangle) has been solved in a number of special cases (such as the isosceles case). It’s been quite an active discussion in the last week or so, with almost 200 comments across two threads (and a third thread freshly opened up just now). While the problem is still not completely solved, I feel optimistic that it should fall within the next few weeks (if nothing else, it seems that the problem is now at least amenable to a brute force numerical attack, though personally I would prefer to see a more conceptual solution).
Two polymath related items for this post. Firstly, there is a new polymath proposal over at the polymath blog, proposing to attack the “hot spots conjecture” (concerning a maximum principle for a heat equation) in the case when the domain is an acute-angled triangle (the case of the right and obtuse-angled triangles already being solved). Please feel free to comment on the proposal blog post if you are interested in participating.
Secondly, it is once again time to set up the annual “mini-polymath” project to collaboratively solve one of this year’s International Mathematical Olympiad problems. This year, the Olympiad is being held in Argentina, with the problems given out on July 10-11. As usual, there will be a wiki page, discussion thread, and research thread for the project. As in previous years, the first thing to resolve is the starting date and time, so I am setting up a poll here to fix a time (and also to get a preliminary indication of interest in the project). (I am using 24-hour Coordinated Universal Time (UTC) for these times. Here is a link that converts the first time given in the poll (Thu Jul 12 2012 UTC 6:00) into other time zones.) Given that the last three mini-polymaths were reasonably successful, I am not planning any changes to the format, but of course if there are any suggestions for changes, I’d be happy to hear them in the comments.
High school algebra marks a key transition point in one’s early mathematical education, and is a common point at which students feel that mathematics becomes really difficult. One of the reasons for this is that the problem solving process for a high school algebra question is significantly more free-form than the mechanical algorithms one is taught for elementary arithmetic, and a certain amount of planning and strategy now comes into play. For instance, if one wants to, say, write as a mixed fraction, there is a clear (albeit lengthy) algorithm to do this: one simply sets up the long division problem, extracts the quotient and remainder, and organises these numbers into the desired mixed fraction. After a suitable amount of drill, this is a task that can be accomplished by a large fraction of students at the middle school level. But if, for instance, one has to solve a system of equations such as
for , there is no similarly mechanical procedure that can be easily taught to a high school student in order to solve such a problem “mindlessly”. (I doubt, for instance, that any attempt to teach Buchberger’s algorithm to such students will be all that successful.) Instead, one is taught the basic “moves” (e.g. multiplying both sides of an equation by some factor, subtracting one equation from another, substituting an expression for a variable into another equation, and so forth), and some basic principles (e.g. simplifying an expression whenever possible, for instance by gathering terms, or solving for one variable in terms of others in order to eliminate it from the system). It is then up to the student to find a suitable combination of moves that isolates each of the variables in turn, to reveal their identities.
Once one is sufficiently expert in algebraic manipulation, this is not all that difficult to do, but when one is just starting to learn algebra, this type of problem can be quite daunting, in part because of an “embarrasment of riches”; there are several possible “moves” one could try to apply to the equations given, and to the novice it is not always clear in advance which moves will simplify the problem and which ones will make it more complicated. Often, such a student may simply try moves at random, which can lead to a dishearteningly large amount of effort expended without getting any closer to a solution. What is worse, each move introduces the possibility of an arithmetic error (such as a sign error), the effect of which is usually not apparent until the student finally arrives at his or her solution and either checks it against the original equation, or submits the answer to be graded. (My own son can get quite frustrated after performing a lengthy series of computations to solve an algebra problem, only to be told that the answer was wrong due to an arithmetic error; I am sure this experience is common to many other schoolchildren.)
It occurred to me recently, though, that the set of problem-solving skills needed to solve algebra problems (and, to some extent, calculus problems also) is somewhat similar to the set of skills needed to solve puzzle-type computer games, in which a certain limited set of moves must be applied in a certain order to achieve a desired result. (There are countless games of this general type; a typical example is “Factory balls“.) Given that the computer game format is already quite familiar to many schoolchildren, one could then try to teach the strategy component of algebraic problem-solving via such a game, which could automate mechanical tasks such as gathering terms and performing arithmetic in order to reduce some of the more frustrating aspects of algebra. (Note that this is distinct from the type of maths games one often sees on educational web sites, which are usually based on asking the player to correctly answer some maths problem in order to advance within the game, making the game essentially a disguised version of a maths quiz. Here, the focus is not so much on being able to supply the correct answer, but on being able to select an effective problem-solving strategy.)
It is difficult to explain in words exactly what type of game I have in mind, so I decided to create a quick mockup of what a sample “level” would look like here (note: requires Java). I didn’t want to spend too much time to make this mockup, so I wrote it in Scratch, which is a somewhat limited programming language intended for use by children, but which has the benefit of being able to churn out simple but functional apps very quickly (the mockup took less than half an hour to write). (I would certainly not attempt to write a full game in this language, though.) In this mockup level, the objective is to solve a single linear equation in one variable, such as , with only two “moves” available: the ability to subtract
from both sides of the equation, and the ability to divide both sides of the equation by
, which one performs by clicking on an appropriate icon.
It seems to me that one could actually teach a fair amount of algebra through a game such as this, with a progressively difficult sequence of levels that gradually introduce more and more types of “moves” that can handle increasingly difficult problems (e.g. simultaneous linear equations in several unknowns, quadratic equations in one or more variables, inequalities, etc.). Even within a single class of problem, such as solving linear equations, one could create additional challenge by placing some restriction on the available moves, for instance by limiting the number of available moves (as was done in the mockup), or requiring that each move cost some amount of game currency (which might possibly be waived if one is willing to perform the move “by hand”, i.e. by entering the transformed equation manually). And of course one could also make the graphics, sound, and gameplay fancier (e.g. by allowing for various competitive gameplay modes). One could also imagine some other types of high-school and lower-division undergraduate mathematics being amenable to this sort of gamification (calculus and ODE comes to mind, and maybe propositional logic), though I doubt that one could use it effectively for advanced undergraduate or graduate topics. (Though I have sometimes wished for an “integrate by parts” or “use Sobolev embedding” button when trying to control solutions to a PDE…)
This would however be a fair amount of work to actually implement, and is not something I could do by myself with the time I have available these days. But perhaps it may be possible to develop such a game (or platform for such a game) collaboratively, somewhat in the spirit of the polymath projects? I have almost no experience in modern software development (other than a summer programming job I had as a teenager, which hardly counts), so I would be curious to know how projects such as this actually get started in practice.
Let be an element of the unit circle, let
, and let
. We define the (rank one) Bohr set
to be the set
where is the distance to the origin in the unit circle (or equivalently, the distance to the nearest integer, after lifting up to
). These sets play an important role in additive combinatorics and in additive number theory. For instance, they arise naturally when applying the circle method, because Bohr sets describe the oscillation of exponential phases such as
.
Observe that Bohr sets enjoy the doubling property
thus doubling the Bohr set doubles both the length parameter and the radius parameter
. As such, these Bohr sets resemble two-dimensional balls (or boxes). Indeed, one can view
as the preimage of the two-dimensional box
under the homomorphism
.
Another class of finite set with two-dimensional behaviour is the class of (rank two) generalised arithmetic progressions
with and
Indeed, we have
and so we see, as with the Bohr set, that doubling the generalised arithmetic progressions doubles the two defining parameters of that progression.
More generally, there is an analogy between rank Bohr sets
and the rank generalised arithmetic progressions
One of the aims of additive combinatorics is to formalise analogies such as the one given above. By using some arguments from the geometry of numbers, for instance, one can show that for any rank Bohr set
, there is a rank
generalised arithmetic progression
for which one has the containments
for some explicit depending only on
(in fact one can take
); this is (a slight modification of) Lemma 4.22 of my book with Van Vu.
In the special case when , one can make a significantly more detailed description of the link between rank one Bohr sets and rank two generalised arithmetic progressions, by using the classical theory of continued fractions, which among other things gives a fairly precise formula for the generators
and lengths
of the generalised arithmetic progression associated to a rank one Bohr set
. While this connection is already implicit in the continued fraction literature (for instance, in the classic text of Hardy and Wright), I thought it would be a good exercise to work it out explicitly and write it up, which I will do below the fold.
It is unfortunate that the theory of continued fractions is restricted to the rank one setting (it relies very heavily on the total ordering of one-dimensional sets such as or
). A higher rank version of the theory could potentially help with questions such as the Littlewood conjecture, which remains open despite a substantial amount of effort and partial progress on the problem. At the end of this post I discuss how one can use the rank one theory to rephrase the Littlewood conjecture as a conjecture about a doubly indexed family of rank four progressions, which can be used to heuristically justify why this conjecture should be true, but does not otherwise seem to shed much light on the problem.
Let be a finite additive group. A tiling pair is a pair of non-empty subsets
such that every element of
can be written in exactly one way as a sum of an element
of
and an element
of
, in which case we write
. The sets
are then called tiles, with
being a complementary tile to
and vice versa. For instance, every subgroup
of
is a tile, as one can pick one representative from each coset of
to form the complementary tile. Conversely, any set formed by taking one representative from each coset of
is also a tile.
Tiles can be quite complicated, particularly when the group is “high-dimensional”. We will therefore restrict to the simple case of a cyclic group
, and restrict even further to the special case when the modulus
is square-free. Here, the situation should be much simpler. In particular, we have the following conjecture of Coven and Meyerowitz, which asserts that the previous construction of a tile is, in fact, the only such construction:
Conjecture 1 (Coven-Meyerowitz conjecture, square-free case) Let
be square-free, and let
be a tile of
. Then there exists a subgroup
of
such that
consists of a single representative from each coset of
.
Note that in the square-free case, every subgroup of
has a complementary subgroup
(thus
). In particular,
consists of a single representative from each coset of
, and so the examples of subgroups of
are covered by the above conjecture in the square-free case.
In the non-square free case, the above assertion is not true; for instance, if is a prime, then the multiples of
in
are a tile, but cannot be formed from taking a single representative from all the cosets of a given subgroup. There is a more general conjecture of Coven and Meyerowitz to handle this more general case, although it is more difficult to state:
Conjecture 2 (Coven-Meyerowitz conjecture, general case) Let
be a natural number, and let
be a tile of
. Then there exists a set
of prime powers with
such that the Fourier transform
vanishes whenever
is a non-zero element of
whose order is the product of elements of
that are powers of distinct primes. Equivalently, the generating polynomial
is divisible by the cyclotomic polynomials
whenever
is the product of elements of
that are powers of distinct primes.
It can be shown (with a modest amount of effort) that Conjecture 2 implies Conjecture 1, but we will not do so here, focusing instead exclusively on the square-free case for simplicity.
It was observed by Laba that Conjecture 2 is connected to the following well-known conjecture of Fuglede:
Conjecture 3 (One-dimensional Fuglede conjecture, tiling to spectral direction) Let
be a compact subset of
of positive measure which is a tile (thus
for some set
). Then
(with Lebesgue measure) has a spectrum, that is to say an orthogonal set of plane waves
.
Indeed, it was shown by Laba that Conjecture 2 implies Conjecture 3 in the case when is the finite union of unit intervals. Actually, thanks to the more recent work of Farkas, Matolcsi, and Mora we know that Conjecture 2 in fact implies the universal spectrum conjecture of Lagarias and Wang, which in turn was known to imply Conjecture 3 in full generality. (On the other hand, the conjecture fails in four and higher dimensions; see the papers of Kolountzakis-Matolcsi and of Farkas-Revesz.)
Given the simple statement of Conjecture 1, it is perhaps somewhat surprising that it remains open, even in simple cases such as when is the product of just four primes. One reason for this is that some plausible strengthenings of this conjecture (such as the Tijdeman-Sands conjecture) are known to be false, as we will review below. On the other hand, as we shall see, tiling sets have a lot of combinatorial structure, and in principle one should be able to work out a lot of special cases of the conjecture. Given the combinatorial nature of this problem, it may well be quite suitable for a polymath project in fact, if there is sufficient interest.
One of the most notorious problems in elementary mathematics that remains unsolved is the Collatz conjecture, concerning the function defined by setting
when
is odd, and
when
is even. (Here,
is understood to be the positive natural numbers
.)
Conjecture 1 (Collatz conjecture) For any given natural number
, the orbit
passes through
(i.e.
for some
).
Open questions with this level of notoriety can lead to what Richard Lipton calls “mathematical diseases” (and what I termed an unhealthy amount of obsession on a single famous problem). (See also this xkcd comic regarding the Collatz conjecture.) As such, most practicing mathematicians tend to spend the majority of their time on more productive research areas that are only just beyond the range of current techniques. Nevertheless, it can still be diverting to spend a day or two each year on these sorts of questions, before returning to other matters; so I recently had a go at the problem. Needless to say, I didn’t solve the problem, but I have a better appreciation of why the conjecture is (a) plausible, and (b) unlikely be proven by current technology, and I thought I would share what I had found out here on this blog.
Let me begin with some very well known facts. If is odd, then
is even, and so
. Because of this, one could replace
by the function
, defined by
when
is odd, and
when
is even, and obtain an equivalent conjecture. Now we see that if one chooses
“at random”, in the sense that it is odd with probability
and even with probability
, then
increases
by a factor of roughly
half the time, and decreases it by a factor of
half the time. Furthermore, if
is uniformly distributed modulo
, one easily verifies that
is uniformly distributed modulo
, and so
should be roughly
times as large as
half the time, and roughly
times as large as
the other half of the time. Continuing this at a heuristic level, we expect generically that
half the time, and
the other half of the time. The logarithm
of this orbit can then be modeled heuristically by a random walk with steps
and
occuring with equal probability. The expectation
is negative, and so (by the classic gambler’s ruin) we expect the orbit to decrease over the long term. This can be viewed as heuristic justification of the Collatz conjecture, at least in the “average case” scenario in which is chosen uniform at random (e.g. in some large interval
). (It also suggests that if one modifies the problem, e.g. by replacing
to
, then one can obtain orbits that tend to increase over time, and indeed numerically for this variant one sees orbits that appear to escape to infinity.) Unfortunately, one can only rigorously keep the orbit uniformly distributed modulo
for time about
or so; after that, the system is too complicated for naive methods to control at anything other than a heuristic level.
Remark 1 One can obtain a rigorous analogue of the above arguments by extending
from the integers
to the
-adics
. This compact abelian group comes with a Haar probability measure, and one can verify that this measure is invariant with respect to
; with a bit more effort one can verify that it is ergodic. This suggests the introduction of ergodic theory methods. For instance, using the pointwise ergodic theorem, we see that if
is a random
-adic integer, then almost surely the orbit
will be even half the time and odd half the time asymptotically, thus supporting the above heuristics. Unfortunately, this does not directly tell us much about the dynamics on
, as this is a measure zero subset of
. More generally, unless a dynamical system is somehow “polynomial”, “nilpotent”, or “unipotent” in nature, the current state of ergodic theory is usually only able to say something meaningful about generic orbits, but not about all orbits. For instance, the very simple system
on the unit circle
is well understood from ergodic theory (in particular, almost all orbits will be uniformly distributed), but the orbit of a specific point, e.g.
, is still nearly impossible to understand (this particular problem being equivalent to the notorious unsolved question of whether the digits of
are uniformly distributed).
The above heuristic argument only suggests decreasing orbits for almost all (though even this remains unproven, the state of the art is that the number of
in
that eventually go to
is
, a result of Krasikov and Lagarias). It leaves open the possibility of some very rare exceptional
for which the orbit goes to infinity, or gets trapped in a periodic loop. Since the only loop that
lies in is
(for
) or
(for
), we thus may isolate a weaker consequence of the Collatz conjecture:
Conjecture 2 (Weak Collatz conjecture) Suppose that
is a natural number such that
for some
. Then
is equal to
,
, or
.
Of course, we may replace with
(and delete “
“) and obtain an equivalent conjecture.
This weaker version of the Collatz conjecture is also unproven. However, it was observed by Bohm and Sontacchi that this weak conjecture is equivalent to a divisibility problem involving powers of and
:
Conjecture 3 (Reformulated weak Collatz conjecture) There does not exist
and integers
such that
is a positive integer that is a proper divisor of
Proof: To see this, it is convenient to reformulate Conjecture 2 slightly. Define an equivalence relation on
by declaring
if
for some integer
, thus giving rise to the quotient space
of equivalence classes
(which can be placed, if one wishes, in one-to-one correspondence with the odd natural numbers). We can then define a function
by declaring
, where
is the largest power of
that divides
. It is easy to see that
is well-defined (it is essentially the Syracuse function, after identifying
with the odd natural numbers), and that periodic orbits of
correspond to periodic orbits of
or
. Thus, Conjecture 2 is equivalent to the conjecture that
is the only periodic orbit of
.
Now suppose that Conjecture 2 failed, thus there exists such that
for some
. Without loss of generality we may take
to be odd, then
. It is easy to see that
is the only fixed point of
, and so
. An easy induction using (2) shows that
where, for each ,
is the largest power of
that divides
is odd,
. Using the recursion
divides
, and thus
:
Since , we have
for some integer . Since
is divisible by
, and
is odd, we conclude
; if we rearrange the above equation as (1), then we obtain a counterexample to Conjecture 3.
Conversely, suppose that Conjecture 3 failed. Then we have , integers
and a natural number such that (1) holds. As
, we see that the right-hand side of (1) is odd, so
is odd also. If we then introduce the natural numbers
by the formula (3), then an easy induction using (4) shows that
for
. As the
are increasing in
(even for
), we see that
is the largest power of
that divides the right-hand side of (5); as
is odd, we conclude that
is also the largest power of
that divides
. We conclude that
and thus is a periodic orbit of
. Since
is an odd number larger than
, this contradicts Conjecture 3.
Call a counterexample a tuple that contradicts Conjecture 3, i.e. an integer
and an increasing set of integers
such that (1) holds for some . We record a simple bound on such counterexamples, due to Terras and to Garner :
Lemma 5 (Exponent bounds) Let
, and suppose that the Collatz conjecture is true for all
. Let
be a counterexample. Then
Proof: The first bound is immediate from the positivity of . To prove the second bound, observe from the proof of Proposition 4 that the counterexample
will generate a counterexample to Conjecture 2, i.e. a non-trivial periodic orbit
. As the conjecture is true for all
, all terms in this orbit must be at least
. An inspection of the proof of Proposition 4 reveals that this orbit consists of
steps of the form
, and
steps of the form
. As all terms are at least
, the former steps can increase magnitude by a multiplicative factor of at most
. As the orbit returns to where it started, we conclude that
whence the claim.
The Collatz conjecture has already been verified for many values of (up to at least
, according to this web site). Inserting this into the above lemma, one can get lower bounds on
. For instance, by methods such as this, it is known that any non-trivial periodic orbit has length at least
, as shown in Garner’s paper (and this bound, which uses the much smaller value
that was available in 1981, can surely be improved using the most recent computational bounds).
Now we can perform a heuristic count on the number of counterexamples. If we fix and
, then
, and from basic combinatorics we see that there are
different ways to choose the remaining integers
to form a potential counterexample . As a crude heuristic, one expects that for a “random” such choice of integers, the expression (1) has a probability
of holding for some integer
. (Note that
is not divisible by
or
, and so one does not expect the special structure of the right-hand side of (1) with respect to those moduli to be relevant. There will be some choices of
where the right-hand side in (1) is too small to be divisible by
, but using the estimates in Lemma 5, one expects this to occur very infrequently.) Thus, the total expected number of solutions for this choice of
is
The heuristic number of solutions overall is then expected to be
, with the approximation here accurate to many decimal places.
We need a lower bound on . Here, we will use Baker’s theorem (as discussed in this previous post), which among other things gives the lower bound
. Meanwhile, Stirling’s formula (as discussed in this previous post) combined with the approximation
gives
where is the entropy function
A brief computation shows that
and so (ignoring all subexponential terms)
which makes the series (6) convergent. (Actually, one does not need the full strength of Lemma 5 here; anything that kept well away from
would suffice. In particular, one does not need an enormous value of
; even
(say) would be more than sufficient to obtain the heuristic that there are finitely many counterexamples.) Heuristically applying the Borel-Cantelli lemma, we thus expect that there are only a finite number of counterexamples to the weak Collatz conjecture (and inserting a bound such as
, one in fact expects it to be extremely likely that there are no counterexamples at all).
This, of course, is far short of any rigorous proof of Conjecture 2. In order to make rigorous progress on this conjecture, it seems that one would need to somehow exploit the structural properties of numbers of the form
with at most one exception (this is essentially what is called a
-cycle by Steiner). Then (8) simplifies via the geometric series formula to a combination of just a bounded number of powers of
and
, rather than an unbounded number. In that case, one can start using tools from transcendence theory such as Baker’s theorem to obtain good results; for instance, in the above-referenced paper of Steiner, it was shown that
-cycles cannot actually occur, and similar methods have been used to show that
-cycles (in which there are at most
exceptions to
) do not occur for any
, as was shown by Simons and de Weger. However, for general increasing tuples of integers
, there is no such representation by bounded numbers of powers, and it does not seem that methods from transcendence theory will be sufficient to control the expressions (8) to the extent that one can understand their divisibility properties by quantities such as
.
Amusingly, there is a slight connection to Littlewood-Offord theory in additive combinatorics – the study of the random sums
generated by some elements of an additive group
, or equivalently, the vertices of an
-dimensional parallelepiped inside
. Here, the relevant group is
. The point is that if one fixes
and
(and hence
), and lets
vary inside the simplex
then the set of all sums of the form (8) (viewed as an element of
) contains many large parallelepipeds. (Note, incidentally, that once one fixes
, all the sums of the form (8) are distinct; because given (8) and
, one can read off
as the largest power of
that divides (8), and then subtracting off
one can then read off
, and so forth.) This is because the simplex
contains many large cubes. Indeed, if one picks a typical element
of
, then one expects (thanks to Lemma 5) that there there will be
indices
such that
for
, which allows one to adjust each of the
independently by
if desired and still remain inside
. This gives a cube in
of dimension
, which then induces a parallelepiped of the same dimension in
. A short computation shows that the generators of this parallelepiped consist of products of a power of
and a power of
, and in particular will be coprime to
.
If the weak Collatz conjecture is true, then the set must avoid the residue class
in
. Let us suppose temporarily that we did not know about Baker’s theorem (and the associated bound (7)), so that
could potentially be quite small. Then we would have a large parallelepiped inside a small cyclic group
that did not cover all of
, which would not be possible for
small enough. Indeed, an easy induction shows that a
-dimensional parallelepiped in
, with all generators coprime to
, has cardinality at least
. This argument already shows the lower bound
. In other words, we have
Proposition 6 Suppose the weak Collatz conjecture is true. Then for any natural numbers
with
, one has
.
This bound is very weak when compared against the unconditional bound (7). However, I know of no way to get a nontrivial separation property between powers of and powers of
other than via transcendence theory methods. Thus, this result strongly suggests that any proof of the Collatz conjecture must either use existing results in transcendence theory, or else must contribute a new method to give non-trivial results in transcendence theory. (This already rules out a lot of possible approaches to solve the Collatz conjecture.)
By using more sophisticated tools in additive combinatorics, one can improve the above proposition (though it is still well short of the transcendence theory bound (7)):
Proposition 7 Suppose the weak Collatz conjecture is true. Then for any natural numbers
with
, one has
for some absolute constant
.
Proof: (Informal sketch only) Suppose not, then we can find with
of size
. We form the set
as before, which contains parallelepipeds in
of large dimension
that avoid
. We can count the number of times
occurs in one of these parallelepipeds by a standard Fourier-analytic computation involving Riesz products (see Chapter 7 of my book with Van Vu, or this recent preprint of Maples). Using this Fourier representation, the fact that this parallelepiped avoids
(and the fact that
) forces the generators
to be concentrated in a Bohr set, in that one can find a non-zero frequency
such that
of the
generators lie in the set
. However, one can choose the generators to essentially have the structure of a (generalised) geometric progression (up to scaling, it resembles something like
for
ranging over a generalised arithmetic progression, and
a fixed irrational), and one can show that such progressions cannot be concentrated in Bohr sets (this is similar in spirit to the exponential sum estimates of Bourgain on approximate multiplicative subgroups of
, though one can use more elementary methods here due to the very strong nature of the Bohr set concentration (being of the “
concentration” variety rather than the “
concentration”).). This furnishes the required contradiction.
Thus we see that any proposed proof of the Collatz conjecture must either use transcendence theory, or introduce new techniques that are powerful enough to create exponential separation between powers of and powers of
.
Unfortunately, once one uses the transcendence theory bound (7), the size of the cyclic group
becomes larger than the volume of any cube in
, and Littlewood-Offord techniques are no longer of much use (they can be used to show that
is highly equidistributed in
, but this does not directly give any way to prevent
from containing
).
One possible toy model problem for the (weak) Collatz conjecture is a conjecture of Erdos asserting that for , the base
representation of
contains at least one
. (See this paper of Lagarias for some work on this conjecture and on related problems.) To put it another way, the conjecture asserts that there are no integer solutions to
with and
. (When
, of course, one has
.) In this form we see a resemblance to Conjecture 3, but it looks like a simpler problem to attack (though one which is still a fair distance beyond what one can do with current technology). Note that one has a similar heuristic support for this conjecture as one does for Proposition 3; a number of magnitude
has about
base
digits, so the heuristic probability that none of these digits are equal to
is
, which is absolutely summable.
The classical formulation of Hilbert’s fifth problem asks whether topological groups that have the topological structure of a manifold, are necessarily Lie groups. This is indeed, the case, thanks to following theorem of Gleason and Montgomery-Zippin:
Theorem 1 (Hilbert’s fifth problem) Let
be a topological group which is locally Euclidean. Then
is isomorphic to a Lie group.
We have discussed the proof of this result, and of related results, in previous posts. There is however a generalisation of Hilbert’s fifth problem which remains open, namely the Hilbert-Smith conjecture, in which it is a space acted on by the group which has the manifold structure, rather than the group itself:
Conjecture 2 (Hilbert-Smith conjecture) Let
be a locally compact topological group which acts continuously and faithfully (or effectively) on a connected finite-dimensional manifold
. Then
is isomorphic to a Lie group.
Note that Conjecture 2 easily implies Theorem 1 as one can pass to the connected component of a locally Euclidean group (which is clearly locally compact), and then look at the action of
on itself by left-multiplication.
The hypothesis that the action is faithful (i.e. each non-identity group element acts non-trivially on
) cannot be completely eliminated, as any group
will have a trivial action on any space
. The requirement that
be locally compact is similarly necessary: consider for instance the diffeomorphism group
of, say, the unit circle
, which acts on
but is infinite dimensional and is not locally compact (with, say, the uniform topology). Finally, the connectedness of
is also important: the infinite torus
(with the product topology) acts faithfully on the disconnected manifold
by the action
The conjecture in full generality remains open. However, there are a number of partial results. For instance, it was observed by Montgomery and Zippin that the conjecture is true for transitive actions, by a modification of the argument used to establish Theorem 1. This special case of the Hilbert-Smith conjecture (or more precisely, a generalisation thereof in which “finite-dimensional manifold” was replaced by “locally connected locally compact finite-dimensional”) was used in Gromov’s proof of his famous theorem on groups of polynomial growth. I record the argument of Montgomery and Zippin below the fold.
Another partial result is the reduction of the Hilbert-Smith conjecture to the -adic case. Indeed, it is known that Conjecture 2 is equivalent to
Conjecture 3 (Hilbert-Smith conjecture for
-adic actions) It is not possible for a
-adic group
to act continuously and effectively on a connected finite-dimensional manifold
.
The reduction to the -adic case follows from the structural theory of locally compact groups (specifically, the Gleason-Yamabe theorem discussed in previous posts) and some results of Newman that sharply restrict the ability of periodic actions on a manifold
to be close to the identity. I record this argument (which appears for instance in this paper of Lee) below the fold also.
I’ve just opened the research thread for the mini-polymath3 project over at the polymath blog. I decided to use Q2 of this year’s IMO, in part to see how the polymath format copes with a geometric problem. (The full list of questions for this year is available here.)
This post will serve as the discussion thread of the project, intended to focus all the non-research aspects of the project such as organisational matters or commentary on the progress of the project. The third component of the project is the wiki page, which is intended to summarise the progress made so far on the problem.
As with mini-polymath1 and mini-polymath2, I myself will be serving primarily as a moderator, and hope other participants will take the lead in the research and in keeping the wiki up-to-date.

Recent Comments