I have just uploaded to the arXiv my paper “On the number of solutions to {\frac{4}{p}=\frac{1}{n_1}+\frac{1}{n_2}+\frac{1}{n_3}}“, submitted to the Journal of the Australian Mathematical Society.

For any positive integer {n}, let {f(n)} denote the number of solutions to the Diophantine equation

\displaystyle  \frac{4}{n} = \frac{1}{n_1} + \frac{1}{n_2} + \frac{1}{n_3}

where {n_1,n_2,n_3} are positive integers (we allow repetitions, and do not require the {n_i} to be increasing). The Erdös-Straus conjecture asserts that {f(n)>0} for all {n \geq 2}. By dividing through by any positive integer {m} we see that {f(nm) \geq f(n)}, so it suffices to verify this conjecture for primes {n=p}, i.e. to solve the Diophantine equation

\displaystyle  \frac{4}{p} = \frac{1}{n_1} + \frac{1}{n_2} + \frac{1}{n_3} \ \ \ \ \ (1)

for each prime {p}. As the case {p=2} is easily solved, we may of course restrict attention to odd primes.

This conjecture remains open, although there is a reasonable amount of evidence towards its truth. For instance, it was shown by Vaughan that for any large {x}, the number of exceptions {f(n)=0} to the Erdös-Straus conjecture with {n<x} is at most {x \exp( - c \log^{2/3} x)} for some absolute constant {c>0}. The Erdös-Straus conjecture is also verified in several congruence classes of primes; for instance, from the identity

\displaystyle  \frac{4}{p} = \frac{1}{\frac{p+1}{4}} + \frac{1}{\frac{p+1}{2}p} + \frac{1}{\frac{p+1}{2}p}

we see that the conjecture holds when {p = 3 \hbox{ mod } 4}. Further identities of this type can be used to resolve the conjecture unless {p} is a quadratic residue mod {840}, which leaves only six residue classes in that modulus to check (namely, {1, 121, 169, 289, 361}, and {529}). However, there is a significant obstruction to eliminating the quadratic residue classes, as we will discuss later.

By combining these reductions with extensive numerical calculations, the Erdös-Straus conjecture was verified for all {n \leq 10^{14}} by Swett.

One approach to solving Diophantine equations such as (1) is to use methods of analytic number theory, such as the circle method, to obtain asymptotics (or at least lower bounds) for the number {f(p)} of solutions (or some proxy for this number); if one obtains a lower bound which is nontrivial for every {p}, one has solved the problem. (One can alternatively view such methods as a variant of the probabilistic method; in this interpretation, one chooses the unknowns {n_1,n_2,n_3} according to some suitable probability distribution and then tries to show that the probability of solving the equation is positive.) Such techniques can be effective for instance for certain instances of the Waring problem. However, as a general rule, these methods only work when there are a lot of solutions, and specifically when the number {f(p)} of solutions grows at a polynomial rate with the parameter {p}.

The first main result of my paper is that the number of solutions {f(p)} is essentially logarithmic in nature, thus providing a serious obstruction to solution by analytic methods, as there is almost no margin of error available. More precisely, I show that for almost all primes {p} (i.e. in a subset of primes of relative density {1}), one has {f(p) \leq p^{o(1)}}, or more precisely that

\displaystyle  f(p) \leq \exp( O( \frac{\log p}{\log\log p} ) ).

Readers familiar with analytic number theory will recognise the right-hand side from the divisor bound, which indeed plays a role in the proof.

Actually, there are more precise estimates which suggest (though do not quite prove) that the mean value of {f(p)} is comparable to {\log^3 p}. To state these results properly, I need some more notation. It is not difficult to show that if {p} is an odd prime and {n_1,n_2,n_3} obey (1), then at least one, but not all, of the {n_1,n_2,n_3} must be a multiple of {p}. Thus we can split {f(p) = f_1(p) + f_2(p)}, where {f_i(p)} is the number of solutions to (1) where exactly {i} of the {n_1,n_2,n_3} are divisible by {p}. The sharpest estimates I can prove are then

Theorem 1 For all sufficiently large {x}, one has

\displaystyle  x \log^2 x \ll \sum_{p < x} f_1(p) \ll x \exp( O( \frac{\log x}{\log \log x} ) )

and

\displaystyle  x \log^2 x \ll \sum_{p < x} f_2(p) \ll x \log^2 x \log\log x.

Since the number of primes less than {x} is comparable to {x/\log x} by the prime number theorem, this shows that the mean value of {f_2(p)} in this range is between {\log^3 x} and {\log^3 \log\log x}, and the mean value of {f_1(p)} is between {\log^3 x} and {\exp( O( \frac{\log x}{\log \log x}) )}, which gives the previous result as a corollary (thanks to a Markov inequality argument). A naive Poisson process heuristic then suggests that each prime {p} has a “probability” {1 - O( \exp( - c \log^3 p ) )} of having a solution to (1), which by the Borel-Cantelli lemma heuristically suggests that there are only finitely many {p} for which (1) fails. Of course, this is far from a rigorous proof (though the result of Vaughan mentioned earlier, which is based on the large sieve, can be viewed as a partial formalisation of the argument).

The first step in obtaining these results is an elementary description of {f_1(p), f_2(p)} in terms of some divisibility constraints. More precisely, we have

Lemma 2 Let {p} be an odd prime. Then {f_1(p)} is equal to three times the number of triples {(a,b,c)} of positive integers, with {a, b} coprime, {c} dividing {a+b}, and {4ab} dividing {cp+1}. Similarly, {f_2(p)} is equal to three times the number of triples {(a,b,c)} of positive integers, with {a, b} coprime, {c} dividing {a+b}, and {4ab} dividing {p+c}.

One direction of this lemma is very easy: if {c} divides {a+b} and {4ab} divides {cp+1} then the identity

\displaystyle  \frac{4}{p} = \frac{1}{\frac{pc+1}{4}p} + \frac{1}{\frac{pc+1}{4a} \frac{a+b}{c}} + \frac{1}{\frac{pc+1}{4b} \frac{a+b}{c}}

and its cyclic permutations give three contributions to {f_1(p)}, and similarly if {4ab} divides {p+c} instead then the identity

\displaystyle  \frac{4}{p} = \frac{1}{\frac{p+c}{4}} + \frac{1}{\frac{p+c}{4a} \frac{a+b}{c} p} + \frac{1}{\frac{p+c}{4b} \frac{a+b}{c} p}

and its cyclic permutations give three contributions to {f_2(p)}. Conversely, some elementary number theory can be used to show that all solutions contributing to either {f_1(p)} or {f_2(p)} are of these forms. (Similar criteria have been used in prior literature, for instance in the previously mentioned paper of Vaughan.)

This lemma, incidentally, provides a way to quickly generate a large number of congruences of primes {p} for which the Erdös-Straus conjecture can be verified. Indeed, from the above lemma we see that if {a, b} are coprime and {c} is any odd factor of {a+b} (and hence coprime to {4ab}), then the Erdös-Straus conjecture is true whenever {p=-c \hbox{ mod } 4ab} or {p = -c^{-1} \hbox{ mod } 4ab}. For instance,

  • Taking {(a,b,c)=(1,1,1)}, we see that the conjecture holds whenever {p=3 \hbox{ mod } 4} (leaving only those primes {p=1 \hbox{ mod } 4});
  • Taking {(a,b,c)=(1,2,3)} we see that the conjecture holds whenever {p=5 \hbox{ mod } 8} (leaving only those primes {p=1 \hbox{ mod } 8});
  • Taking {(a,b,c)=(3,4,7)}, we see that the conjecture holds whenever {p=5 \hbox{ mod } 12} ((leaving only those primes {p=1 \hbox{ mod } 24});
  • Taking {(a,b,c)=(1,5,3)}, we see that the conjecture holds whenever {p=17, 13 \hbox{ mod } 20} (leaving only those primes {p=1,49 \hbox{ mod } 120});
  • Taking {(a,b,c)=(1,14,15)}, we see that the conjecture holds whenever {p= 41 \hbox{ mod } 56}; taking instead {(a,b,c)=(2,21,23)}, we see that the conjecture holds whenever {p = 73, 145 \hbox{ mod } 168}. (This leaves only the primes {p} that are equal to one of the six quadratic residues {1, 121, 169, 289, 361, 529 \hbox{ mod } 840}, an observation first made by Mordell.)
  • etc.

If we let {g_1(n)} (resp. {g_2(n)}) be the number of triples {(a,b,c)} with {a,b} coprime, {c} dividing {a+b}, and {n} congruent to {-c^{-1}} (resp. {-c}) mod {4ab}, the above lemma tells us that {f_i(p) = 3g_i(p)} for all odd primes {p}, so it suffices to show that {g_1(p), g_2(p)} are non-zero for all {p}. One might hope that there are enough congruence relations provided by the previous observation to obtain a covering system of congruences which would resolve the conjecture. Unfortunately, as was also observed by Mordell, an application of the quadratic reciprocity law shows that these congruence relations cannot eliminate quadratic residues, only quadratic non-residues. More precisely, one can show that if {a,b,c,p} are as above (restricting to the case p=1 \hbox{ mod } 8amp;fg000000, which contains all the unsolved cases of the conjecture) and {q} is the largest odd factor of {4ab}, then the Jacobi symbols {\left(\frac{-c}{q}\right)} and {\left(\frac{-c^{-1}}{q}\right)} are always equal to {-1}. One consequence of this is that {g_1(n)=g_2(n)=0} whenever {n} is an odd perfect square. This makes it quite hard to resolve the conjecture for odd primes {p} which resemble a perfect square in that they lie in quadratic residue classes to small moduli (and, from Dirichlet’s theorem on primes in arithmetic progressions, one can find many primes of this type). The same argument also shows that for an odd prime {p}, there are no solutions to

\displaystyle  \frac{4}{p^2} = \frac{1}{n_1} + \frac{1}{n_2} + \frac{1}{n_3}

in which one or two of the {n_1,n_2,n_3} are divisible by {p^2}, with the other denominators being coprime to {p}. (Of course, one can still get representations of {\frac{4}{p^2}} by starting with a representation of {\frac{4}{p}} and dividing by {p}, but such representations are is not of the above form.) This shows that any attempt to establish the Erdös-Straus conjecture by manually constructing {n_1,n_2,n_3} as a function of {p} must involve a technique which breaks down if {p} is replaced by {p^2} (for instance, this rules out any approach based on using polynomial combinations of {p} and dividing into cases based on residue classes of {p} in small moduli). Part of the problem here is that we do not have good bounds that prevent a prime {p} from “spoofing” a perfect square to all small moduli (say, to all moduli less than a small power of {p}); this problem is closely related (via quadratic reciprocity) to Vinogradov’s conjecture on the least quadratic nonresidue, discussed in this previous blog post.

It remains to control the sums {\sum_{p<x} g_i(p)} for {i=1,2}. The lower bounds {\sum_{p<x} g_i(p) \gg x \log^2 x} are relatively routine to establish, arising from counting the contribution of those {a,b,c} that are somewhat small (e.g. less than {x^{0.2}}), and use only standard asymptotics of arithmetic functions and the Bombieri-Vinogradov inequality (to handle the restriction of the summation to primes). To obtain (nearly) matching upper bounds, one needs to prevent {a,b,c} from getting too large. In the case of {g_2(p)}, the fact that {c} divides {a+b} and {4ab} divides {c+p} soon leads one to the bounds {ab \ll x}, at which point one can count primes in residue classes modulo {4ab} with reasonable efficiency via the Brun-Titchmarsh inequality. This inequality unfortunately introduces a factor of {\frac{4ab}{\phi(4ab)}}, which we were only able to handle by bounding it crudely by {O(\log \log x)}, leading to the double logarithmic loss in the {f_2} sum.

For {g_1(p)}, these arguments do not give good bounds, because it is possible for {ab} to be much larger than {x}, and one can no longer easily count solutions to {p = -c^{-1} \hbox{ mod } 4ab}. However, an elementary change of variables resolves this problem. It turns out that if one lets {l,m} be the positive integers

\displaystyle  l := \frac{pc+1}{4ab}

\displaystyle  m := \frac{4la^2+1}{c}

then one can convert the triple {(a,b,c)} to a triple {(a,l,m)} obeying the constraints

\displaystyle  al \leq p

\displaystyle  m | 4la^2+1

\displaystyle  p = -m \hbox{ mod } 4al.

Conversely, every such triple {(a,l,m)} determines {(a,b,c)} by the formulae

\displaystyle  c = \frac{4la^2+1}{m}; b = c \frac{p+m}{4al} - a = \frac{4la^2 p + m + p}{4alm}. \ \ \ \ \ (2)

The point is that this transformation now places {p} in a residue class modulo {4al} rather than {4ab}, and with {al = O(x)}, this allows one to count the number of solutions reasonably efficiently. The main loss now comes from counting the number of divisors {m} of {4la^2+1}; in particular, one becomes interested in estimating sums such as

\displaystyle  \sum_{a \sim A} \sum_{l \sim L} d( 4la^2+1 )

where {A, L} are less than {x} and {d(n)} is the number of divisors of {n}. In principle, because {d(n)} behaves like {\log n} on the average (as can be seen from the Dirichlet hyperbola method), we expect this sum to be of order {AL \log x}, but I was unable to show this, instead using the far cruder divisor bound

\displaystyle  d(n) \ll \exp( O(\frac{\log n}{\log\log n}) )

to obtain an upper bound of {AL \exp( O(\frac{\log x}{\log\log x}) )}. Any improvement upon this bound would lead to a corresponding improvement in the upper bound for {\sum_{p<x} f_1(p)}. While improvement is possible in some ranges (particularly when {L} is large) using various bounds on Kloosterman-type exponential sums, I was not able to adequately control these sums when {L} was fairly small (e.g. polylogarithmic size in {x}), as one could no longer extract much of an averaging effect from the {l} summation in that case. Part of the difficulty is that in that case one must somehow exploit the fact that {4la^2+1} is irreducible as a polynomial in {a} for any fixed {l}, otherwise there will be too many divisors.

— 1. Solvability criteria —

In the rest of this post, we record some sufficient conditions for an odd prime {p} to have a solution to (1). From Lemma 2 we have already seen that we have a solution whenever we can find positive integers {(a,b,c)} with {a,b} coprime, {c|a+b}, and {4ab} dividing either {pc+1} or {c+p}.

The {(a,l,m)} representation discussed earlier gives a slightly different criterion. Indeed, we see that if {(a,l,m)} are positive integers such that {m|4la^2+1}, and {p = -m \hbox{ mod } 4al}, then the quantities {b,c} defined by (2) are positive integers which clearly obey {c|a+b}; furthermore, some algebra reveals that {4abl = pc+1} and so {4ab|pc+1}, and so by Lemma 2 we have a solution to (1); explicitly, we have

\displaystyle  \frac{4}{p} = \frac{1}{pabl} + \frac{1}{kbl} + \frac{1}{kal}

where {k := \frac{p+m}{4al}}. This lets us eliminate some further residue classes (restricting, as we may, to the case {p=1 \hbox{ mod } 24}):

  • Taking {(a,l,m)=(1,11,3)}, we can eliminate {p= 41 \hbox{ mod } 44} (or equivalently, {p = 8 \hbox{ mod } 11});
  • Taking {(a,l,m)=(1,11,15)}, we can eliminate {p= 29 \hbox{ mod } 44} (or equivalently, {p = 7 \hbox{ mod } 11});
  • Taking {(a,l,m)=(2,33,23)}, we can eliminate {p = 241 \hbox{ mod } 264} (or equivalently, {p = 10 \hbox{ mod } 11}).

Unfortunately this only eliminates three of the five quadratic non-residues mod {11}; I do not know how to eliminate the {2,6 \hbox{ mod } 11} classes.

Here is another criterion for eliminating large numbers of primes:

Lemma 3 Let {c,q} be coprime positive integers with {c} odd. Suppose {p} is an odd prime such that

\displaystyle  4q | p+c

and

\displaystyle  c | p + 4r

for some {r} dividing {q^2}. Then there is a positive integer solution to (1).

Proof: Write {r = a^2 t} where {t} is square-free, then {a^2 t | q^2} and hence {at | q}. In particular, {4at} divides {p+c}. As {c} is coprime to {q} and odd, it is also coprime to {4at}.

If we then set {b := \frac{p+c}{4at}}, then we have {4ab | p+c}; also, since

\displaystyle a+b = \frac{p+c+4a^2t}{4at} = \frac{p+4r+c}{4at}

and {c} divides {p+4r}, we see that {c} divides {a+b}, and so from Lemma 2 we have a solution to (1), more explicitly, one has

\displaystyle  \frac{4}{p} = \frac{1}{\frac{p+c}{4}} + \frac{1}{\frac{p+c}{4a} k p} + \frac{1}{atkp}

where {k} is the positive integer

\displaystyle  k := \frac{a+b}{c} = \frac{p+4r+c}{4atc}.

\Box

Here are some typical applications of this lemma:

Corollary 4 Let {p} be an odd prime. Then we have a positive integer solution to (1) whenever one of the following happens:

  1. {\frac{p+3}{4}} is divisible by a factor {q = 2 \hbox{ mod } 3}.
  2. {p+4} is divisible by a factor {c = 3 \hbox{ mod } 4}. (Equivalently, {p+4} is not the sum of two squares.)
  3. One of {p+4}, {p+8}, or {p+16} is divisible by a factor {c = 7 \hbox{ mod } 8}.
  4. One of {p+4}, {p+12}, or {p+36} is divisible by a factor {c = 11 \hbox{ mod } 12}.
  5. One of {p+4}, {p+8}, {p+12}, {p+16}, {p+24}, {p+36}, {p+48}, {p+72}, or {p+144} is divisible by a factor {c=23 \hbox{ mod } 24}.
  6. One of {p+4}, {p+20}, or {p+100} is divisible by a factor {c = -p \hbox{ mod } 20}.

Proof: By previous considerations we may assume {p=1 \hbox{ mod } 24}. The first claim arises from Lemma 3 with {c=3} and {r=q}. The second claim comes from taking {q=r=1}. The third comes from taking {q=2} and {r=1,2,4}, the fourth from {q=3} and {r=1,3,9}, and the fifth from {q=6} and {r=1,2,3,6,9,12,18,36}. The final claim comes from taking {q=5} and {r=1,5,25}. \Box

These criteria already recover all the congruence relation conditions in small moduli mentioned previously, and generate infinitely many more (though again, these criteria will consistently miss all perfect squares). For instance, we may eliminate the nine residue classes

\displaystyle  p = 7, 10, 11, 15, 17, 19, 20, 21, 22 \hbox{ mod } 23

by this corollary, which is most (though not all) of the {11} quadratic non-residue classes modulo {23} (omitting {5} and {14}), and similarly eliminate nine residue classes mod {47}, mod {71}, etc. (actually from {71} onwards we eliminate ten classes, due to the first part of the corollary).

Here is a variant of Lemma 3, adapted to the {f_1} solutions rather than the {f_2} solutions:

Lemma 5 Let {a,k,m} be positive integers with {m,k} coprime. Suppose {p} is an odd prime such that

\displaystyle  m | ap+k

and

\displaystyle  4ak | p + m.

Then there is a positive integer solution to (1).

Proof: Write {l := \frac{p+m}{4ak}}, then {l} is a positive integer and {p = -m \hbox{ mod } 4al}. Also, {m} divides both {ap+k} and {4akl-p = 4akm}, so on eliminating {k} we see that {m} divides {p (4la^2+1)}. If {m} was divisible by {p}, then as {m|ap+k}, {k} would also be divisible by {p}, contradicting coprimality; thus {m} divides {4la^2+1}, and the claim follows from the {(a,l,m)} criterion. More explicitly, one has

\displaystyle  \frac{4}{p} = \frac{1}{p \frac{ap+k}{m} \frac{p+m}{4k}} + \frac{1}{\frac{ap+k}{m} \frac{p+m}{4a}} + \frac{1}{\frac{p+m}{4}}.

\Box

Thus, for instance, restricting as before to the case {p=1 \hbox{ mod } 24}, we can obtain a solution whenever one of

\displaystyle  p+1, p+2, p+3, p+6, 2p+1, 2p+3, 3p+1, 3p+2, 6p+1

is divisible by some {m = 23 \hbox{ mod } 24}. Combining this with Corollary 4, we conclude that in order not to have a solution, we must have

\displaystyle  p \neq -1, -2, -3, -4, -6, -8, -12, -16, -24, -36, -48, -72, -144,

\displaystyle  -\frac{1}{2}, -\frac{3}{2}, -\frac{1}{3}, -\frac{2}{3}, -\frac{1}{6} \hbox{ mod } q

for any {q = 23 \hbox{ mod } 24}. This unfortunately does not eliminate any more residue classes modulo {23} than we already had (and, of course, quadratic residues remain untouched), but eventually eliminates {18} classes for each sufficiently large choice of the modulus {q = 23 \hbox{ mod } 24}.

Finally, we recall a criterion of Vaughan, which asserts that if there are positive integers {r,s,t} such that {4rst-1|rp+s}, then there is a solution to (1), namely

\displaystyle  \frac{4}{p} = \frac{1}{stq} + \frac{1}{prtq} + \frac{1}{prst}

where {q := \frac{rp+s}{4rst-1}}. In our notation, and assuming {s,t} are coprime, this corresponds to the {f_2} representation with

\displaystyle  (a,b,c) = ( s, q, 4stq-p )

(note that {4stq-p} is also equal to {\frac{s+q}{r}}), or in the notation of Lemma 3,

\displaystyle  (\tilde q,\tilde r,\tilde c) = (st, s^2 t, 4stq-p)

(where we place tildes on the quantities in Lemma 3 to reduce confusion). Thus we see that Vaughan’s criterion is essentially the {f_2} case of Lemma 2, and is basically equivalent to Lemma 3 also.

One can use these criteria to test the Erdos-Straus conjecture for all primes in a certain range, by using these sorts of tests to sieve out all but a small exceptional set of primes, which one can perhaps eliminate using a secondary, larger sieve based on the same criteria. This was done up to {10^{14}} by Swett in 1999, using Vaughan’s criterion, which only covers the {f_2} type solutions; it is likely that with modern computers and by using criteria for both the {f_1} and {f_2} solutions, one could expand this range significantly.