Large prime gaps and probabilistic models

26 August, 2019 in math.NT, math.PR, paper | Tags: Cramer's random model, Kevin Ford, prime gaps, William Banks | by Terence Tao

William Banks, Kevin Ford, and I have just uploaded to the arXiv our paper “Large prime gaps and probabilistic models“. In this paper we introduce a random model to help understand the connection between two well known conjectures regarding the primes ${{\mathcal P} := \{2,3,5,\dots\}}$ , the Cramér conjecture and the Hardy-Littlewood conjecture:

Conjecture 1 (Cramér conjecture) If ${x}$ is a large number, then the largest prime gap ${G_{\mathcal P}(x) := \sup_{p_n, p_{n+1} \leq x} p_{n+1}-p_n}$ in ${[1,x]}$ is of size ${\asymp \log^2 x}$ . (Granville refines this conjecture to ${\gtrsim \xi \log^2 x}$ , where ${\xi := 2e^{-\gamma} = 1.1229\dots}$ . Here we use the asymptotic notation ${X \gtrsim Y}$ for ${X \geq (1-o(1)) Y}$ , ${X \sim Y}$ for ${X \gtrsim Y \gtrsim X}$ , ${X \gg Y}$ for ${X \geq C^{-1} Y}$ , and ${X \asymp Y}$ for ${X \gg Y \gg X}$ .)

Conjecture 2 (Hardy-Littlewood conjecture) If ${\mathcal{H} := \{h_1,\dots,h_k\}}$ are fixed distinct integers, then the number of numbers ${n \in [1,x]}$ with ${n+h_1,\dots,n+h_k}$ all prime is ${({\mathfrak S}(\mathcal{H}) +o(1)) \int_2^x \frac{dt}{\log^k t}}$ as ${x \rightarrow \infty}$ , where the singular series ${{\mathfrak S}(\mathcal{H})}$ is defined by the formula

$\displaystyle {\mathfrak S}(\mathcal{H}) := \prod_p \left( 1 - \frac{|{\mathcal H} \hbox{ mod } p|}{p}\right) (1-\frac{1}{p})^{-k}.$

(One can view these conjectures as modern versions of two of the classical Landau problems, namely Legendre’s conjecture and the twin prime conjecture respectively.)

A well known connection between the Hardy-Littlewood conjecture and prime gaps was made by Gallagher. Among other things, Gallagher showed that if the Hardy-Littlewood conjecture was true, then the prime gaps ${p_{n+1}-p_n}$ with ${n \leq x}$ were asymptotically distributed according to an exponential distribution of mean ${\log x}$ , in the sense that

$\displaystyle | \{ n: p_n \leq x, p_{n+1}-p_n \geq \lambda \log x \}| = (e^{-\lambda}+o(1)) \frac{x}{\log x} \ \ \ \ \ (1)$

as ${x \rightarrow \infty}$ for any fixed ${\lambda \geq 0}$ . Roughly speaking, the way this is established is by using the Hardy-Littlewood conjecture to control the mean values of ${\binom{|{\mathcal P} \cap (p_n, p_n + \lambda \log x)|}{k}}$ for fixed ${k,\lambda}$ , where ${p_n}$ ranges over the primes in ${[1,x]}$ . The relevance of these quantities arises from the Bonferroni inequalities (or “Brun pure sieve“), which can be formulated as the assertion that

$\displaystyle 1_{N=0} \leq \sum_{k=0}^K (-1)^k \binom{N}{k}$

when ${K}$ is even and

$\displaystyle 1_{N=0} \geq \sum_{k=0}^K (-1)^k \binom{N}{k}$

when ${K}$ is odd, for any natural number ${N}$ ; setting ${N := |{\mathcal P} \cap (p_n, p_n + \lambda \log x)|}$ and taking means, one then gets upper and lower bounds for the probability that the interval ${(p_n, p_n + \lambda \log x)}$ is free of primes. The most difficult step is to control the mean values of the singular series ${{\mathfrak S}(\mathcal{H})}$ as ${{\mathcal H}}$ ranges over ${k}$ -tuples in a fixed interval such as ${[0, \lambda \log x]}$ .

Heuristically, if one extrapolates the asymptotic (1) to the regime ${\lambda \asymp \log x}$ , one is then led to Cramér’s conjecture, since the right-hand side of (1) falls below ${1}$ when ${\lambda}$ is significantly larger than ${\log x}$ . However, this is not a rigorous derivation of Cramér’s conjecture from the Hardy-Littlewood conjecture, since Gallagher’s computations only establish (1) for fixed choices of ${\lambda}$ , which is only enough to establish the far weaker bound ${G_{\mathcal P}(x) / \log x \rightarrow \infty}$ , which was already known (see this previous paper for a discussion of the best known unconditional lower bounds on ${G_{\mathcal P}(x)}$ ). An inspection of the argument shows that if one wished to extend (1) to parameter choices ${\lambda}$ that were allowed to grow with ${x}$ , then one would need as input a stronger version of the Hardy-Littlewood conjecture in which the length ${k}$ of the tuple ${{\mathcal H} = (h_1,\dots,h_k)}$ , as well as the magnitudes of the shifts ${h_1,\dots,h_k}$ , were also allowed to grow with ${x}$ . Our initial objective in this project was then to quantify exactly what strengthening of the Hardy-Littlewood conjecture would be needed to rigorously imply Cramer’s conjecture. The precise results are technical, but roughly we show results of the following form:

Theorem 3 (Large gaps from Hardy-Littlewood, rough statement)

If the Hardy-Littlewood conjecture is uniformly true for ${k}$ -tuples of length ${k \ll \frac{\log x}{\log\log x}}$ , and with shifts ${h_1,\dots,h_k}$ of size ${O( \log^2 x )}$ , with a power savings in the error term, then ${G_{\mathcal P}(x) \gg \frac{\log^2 x}{\log\log x}}$ .

If the Hardy-Littlewood conjecture is “true on average” for ${k}$ -tuples of length ${k \ll \frac{y}{\log x}}$ and shifts ${h_1,\dots,h_k}$ of size ${y}$ for all ${\log x \leq y \leq \log^2 x \log\log x}$ , with a power savings in the error term, then ${G_{\mathcal P}(x) \gg \log^2 x}$ .

In particular, we can recover Cramer’s conjecture given a sufficiently powerful version of the Hardy-Littlewood conjecture “on the average”.

Our proof of this theorem proceeds more or less along the same lines as Gallagher’s calculation, but now with ${k}$ allowed to grow slowly with ${x}$ . Again, the main difficulty is to accurately estimate average values of the singular series ${{\mathfrak S}({\mathfrak H})}$ . Here we found it useful to switch to a probabilistic interpretation of this series. For technical reasons it is convenient to work with a truncated, unnormalised version

$\displaystyle V_{\mathcal H}(z) := \prod_{p \leq z} \left( 1 - \frac{|{\mathcal H} \hbox{ mod } p|}{p} \right)$

of the singular series, for a suitable cutoff ${z}$ ; it turns out that when studying prime tuples of size ${t}$ , the most convenient cutoff ${z(t)}$ is the “Pólya magic cutoff“, defined as the largest prime for which

$\displaystyle \prod_{p \leq z(t)}(1-\frac{1}{p}) \geq \frac{1}{\log t} \ \ \ \ \ (2)$

(this is well defined for ${t \geq e^2}$ ); by Mertens’ theorem, we have ${z(t) \sim t^{1/e^\gamma}}$ . One can interpret ${V_{\mathcal Z}(z)}$ probabilistically as

$\displaystyle V_{\mathcal Z}(z) = \mathbf{P}( {\mathcal H} \subset \mathcal{S}_z )$

where ${\mathcal{S}_z \subset {\bf Z}}$ is the randomly sifted set of integers formed by removing one residue class ${a_p \hbox{ mod } p}$ uniformly at random for each prime ${p \leq z}$ . The Hardy-Littlewood conjecture can be viewed as an assertion that the primes ${{\mathcal P}}$ behave in some approximate statistical sense like the random sifted set ${\mathcal{S}_z}$ , and one can prove the above theorem by using the Bonferroni inequalities both for the primes ${{\mathcal P}}$ and for the random sifted set, and comparing the two (using an even ${K}$ for the sifted set and an odd ${K}$ for the primes in order to be able to combine the two together to get a useful bound).

The proof of Theorem 3 ended up not using any properties of the set of primes ${{\mathcal P}}$ other than that this set obeyed some form of the Hardy-Littlewood conjectures; the theorem remains true (with suitable notational changes) if this set were replaced by any other set. In order to convince ourselves that our theorem was not vacuous due to our version of the Hardy-Littlewood conjecture being too strong to be true, we then started exploring the question of coming up with random models of ${{\mathcal P}}$ which obeyed various versions of the Hardy-Littlewood and Cramér conjectures.

This line of inquiry was started by Cramér, who introduced what we now call the Cramér random model ${{\mathcal C}}$ of the primes, in which each natural number ${n \geq 3}$ is selected for membership in ${{\mathcal C}}$ with an independent probability of ${1/\log n}$ . This model matches the primes well in some respects; for instance, it almost surely obeys the “Riemann hypothesis”

$\displaystyle | {\mathcal C} \cap [1,x] | = \int_2^x \frac{dt}{\log t} + O( x^{1/2+o(1)})$

and Cramér also showed that the largest gap ${G_{\mathcal C}(x)}$ was almost surely ${\sim \log^2 x}$ . On the other hand, it does not obey the Hardy-Littlewood conjecture; more precisely, it obeys a simplified variant of that conjecture in which the singular series ${{\mathfrak S}({\mathcal H})}$ is absent.

Granville proposed a refinement ${{\mathcal G}}$ to Cramér’s random model ${{\mathcal C}}$ in which one first sieves out (in each dyadic interval ${[x,2x]}$ ) all residue classes ${0 \hbox{ mod } p}$ for ${p \leq A}$ for a certain threshold ${A = \log^{1-o(1)} x = o(\log x)}$ , and then places each surviving natural number ${n}$ in ${{\mathcal G}}$ with an independent probability ${\frac{1}{\log n} \prod_{p \leq A} (1-\frac{1}{p})^{-1}}$ . One can verify that this model obeys the Hardy-Littlewood conjectures, and Granville showed that the largest gap ${G_{\mathcal G}(x)}$ in this model was almost surely ${\gtrsim \xi \log^2 x}$ , leading to his conjecture that this bound also was true for the primes. (Interestingly, this conjecture is not yet borne out by numerics; calculations of prime gaps up to ${10^{18}}$ , for instance, have shown that ${G_{\mathcal P}(x)}$ never exceeds ${0.9206 \log^2 x}$ in this range. This is not necessarily a conflict, however; Granville’s analysis relies on inspecting gaps in an extremely sparse region of natural numbers that are more devoid of primes than average, and this region is not well explored by existing numerics. See this previous blog post for more discussion of Granville’s argument.)

However, Granville’s model does not produce a power savings in the error term of the Hardy-Littlewood conjectures, mostly due to the need to truncate the singular series at the logarithmic cutoff ${A}$ . After some experimentation, we were able to produce a tractable random model ${{\mathcal R}}$ for the primes which obeyed the Hardy-Littlewood conjectures with power savings, and which reproduced Granville’s gap prediction of ${\gtrsim \xi \log^2 x}$ (we also get an upper bound of ${\lesssim \xi \log^2 x \frac{\log\log x}{2 \log\log\log x}}$ for both models, though we expect the lower bound to be closer to the truth); to us, this strengthens the case for Granville’s version of Cramér’s conjecture. The model can be described as follows. We select one residue class ${a_p \hbox{ mod } p}$ uniformly at random for each prime ${p}$ , and as before we let ${S_z}$ be the sifted set of integers formed by deleting the residue classes ${a_p \hbox{ mod } p}$ with ${p \leq z}$ . We then set

$\displaystyle {\mathcal R} := \{ n \geq e^2: n \in S_{z(t)}\}$

with ${z(t)}$ Pólya’s magic cutoff (this is the cutoff that gives ${{\mathcal R}}$ a density consistent with the prime number theorem or the Riemann hypothesis). As stated above, we are able to show that almost surely one has

$\displaystyle \xi \log^2 x \lesssim {\mathcal G}_{\mathcal R}(x) \lesssim \xi \log^2 x \frac{\log\log x}{2 \log\log\log x} \ \ \ \ \ (3)$

and that the Hardy-Littlewood conjectures hold with power savings for ${k}$ up to ${\log^c x}$ for any fixed ${c < 1}$ and for shifts ${h_1,\dots,h_k}$ of size ${O(\log^c x)}$ . This is unfortunately a tiny bit weaker than what Theorem 3 requires (which more or less corresponds to the endpoint ${c=1}$ ), although there is a variant of Theorem 3 that can use this input to produce a lower bound on gaps in the model ${{\mathcal R}}$ (but it is weaker than the one in (3)). In fact we prove a more precise almost sure asymptotic formula for ${{\mathcal G}_{\mathcal R}(x) }$ that involves the optimal bounds for the linear sieve (or interval sieve), in which one deletes one residue class modulo ${p}$ from an interval ${[0,y]}$ for all primes ${p}$ up to a given threshold. The lower bound in (3) relates to the case of deleting the ${0 \hbox{ mod } p}$ residue classes from ${[0,y]}$ ; the upper bound comes from the delicate analysis of the linear sieve by Iwaniec. Improving on either of the two bounds looks to be quite a difficult problem.

The probabilistic analysis of ${{\mathcal R}}$ is somewhat more complicated than of ${{\mathcal C}}$ or ${{\mathcal G}}$ as there is now non-trivial coupling between the events ${n \in {\mathcal R}}$ as ${n}$ varies, although moment methods such as the second moment method are still viable and allow one to verify the Hardy-Littlewood conjectures by a lengthy but fairly straightforward calculation. To analyse large gaps, one has to understand the statistical behaviour of a random linear sieve in which one starts with an interval ${[0,y]}$ and randomly deletes a residue class ${a_p \hbox{ mod } p}$ for each prime ${p}$ up to a given threshold. For very small ${p}$ this is handled by the deterministic theory of the linear sieve as discussed above. For medium sized ${p}$ , it turns out that there is good concentration of measure thanks to tools such as Bennett’s inequality or Azuma’s inequality, as one can view the sieving process as a martingale or (approximately) as a sum of independent random variables. For larger primes ${p}$ , in which only a small number of survivors are expected to be sieved out by each residue class, a direct combinatorial calculation of all possible outcomes (involving the random graph that connects interval elements ${n \in [0,y]}$ to primes ${p}$ if ${n}$ falls in the random residue class ${a_p \hbox{ mod } p}$ ) turns out to give the best results.

37 comments

Comments feed for this article

26 August, 2019 at 11:17 am

thebirdreader

Why do you reveal the journal you submitted it to? More generally, what’s the policy around this?

26 August, 2019 at 11:33 am

domotorp

It’s nice of William Banks and Kevin Ford that they’ve helped you in uploading your own paper ;)

[Corrected, thanks – T.]

26 August, 2019 at 11:49 am

Anonymous

In the very first sentence, “my paper” should be “our paper”.

26 August, 2019 at 1:05 pm

Anonymous

Is it possible to improve the upper bound in (3) by optimizing “Polya magic cutoff” (perhaps by a slight modification of (2))?

26 August, 2019 at 2:12 pm

Terence Tao

Varying the cutoff $z$ significantly would vary the density of the set ${\mathcal R}$ , which would then also vary the lower and upper bounds we obtain for the largest gap (but the ratio between the upper and lower bounds would be essentially unchanged). The more precise version of (3) given in the paper involves the function $g(u)$ , defined as the largest $y$ for which it is possible to delete one residue class mod $p$ from $[0,y]$ for each prime $p \leq (y/\log y)^{1/2}$ and end up with at most $u/\log y$ survivors. We basically show that the largest gap ${\mathcal G}_{\mathcal R}(x)$ is almost surely asymptotic to $g(\xi \log^2 x)$ . On the other hand, the best known bounds on $g(u)$ are

$\displaystyle u \lesssim g(u) \lesssim \frac{u \log u}{4 \log\log u}$

which is where the bounds (3) are coming from. We believe the lower bound here is closer to the truth, but improving the upper bound requires improving Iwaniec’s bound on the linear sieve, which is likely to be a very difficult problem (relating to breaking the notorious “parity barrier”). In an appendix to our paper we elaborate on the link with the parity problem by recording the folklore observation that a sufficiently strong Siegel zero can lead to a significant improvement in the lower bound for $g(u)$ and hence for ${\mathcal G}_{\mathcal R}(x)$ . So showing that the lower bound is sharp is at least as hard as ruling out Siegel zeroes.

28 August, 2019 at 5:27 am

Anonymous

Is it necessary to use the same cutoff for both (upper and lower) bounds? Is it possible to optimize the cutoff for the lower bound and use another optimized cutoff for the upper bound?

26 August, 2019 at 3:03 pm

Two conjectures in one – The nth Root

[…] William Banks, Kevin Ford, and I have just uploaded to the arXiv my paper “Large prime gaps and probabilistic models“, submitted to Inventiones. In this paper we introduce a random model to help understand the connection between two well known conjectures regarding the primes {{mathcal P} := {2,3,5,dots}}, the Cramér conjecture and the Hardy-Littlewood conjecture: … (Terence Tao) […]

26 August, 2019 at 6:23 pm

Zachary Kyle Knutson

Research gate my sample 2 that was my idea…

27 August, 2019 at 6:19 am

Zachary Kyle Knutson

https://www.researchgate.net/publication/327477654_Sample_2_Update_J_Equation

27 August, 2019 at 6:28 am

Zachary Kyle Knutson

yall misunderstand that was to comment not lol to Mr.Terrance…

27 August, 2019 at 3:33 pm

Some Math News | Not Even Wrong

[…] too bad Pat didn’t live to see the latest from Terry Tao, who describes recent results which are related to old work of Gallagher’s by “Our […]

28 August, 2019 at 1:34 am

Vincent

Sorry can I ask a very basic question about the constant S(H) in the Hardy-Littlewood conjecture? I would have looked it up on Wikipedia but curiously the first HL-conjecture doesn’t have a page (while the probably-false second HL-conjecture does). As written here it seems to me that the constant equals 0 when all $h_i$ lie in the same congruence class mod $p$ for some $p$, e.g. when they are all even. That seems a bit strange in light of the twin prime conjecture. Conversely I would expect the constant to become 0 when for some $p$ we have$|H mod p| = p$, i.e. the $h_i$ together cover all congruence classes mod $p$, but in this case the contribution of the prime $p$ to the constant seems to be non-zero. Am I missing something obvious here? What is the rationale behind the product making up $S(H)$?

[Oops, there was a typo in the definition of the singular series in the blog post, which has now been fixed. -T]

28 August, 2019 at 8:26 pm

年轻的数学家

Dear Prof. Tao,

first of all, there seems to be a typo in the conclusion of thm. 1.5 (or do they also use the lower index for iteration?). Then, I wonder whether even stronger Hardy‒Littlewood-type estimates would imply even stronger bounds on $G_{\mathcal P}(x)$ . Also, what about upper bounds? Without having read the paper, the Bonferroni inequalities as stated above seem pretty symmetrical.

30 August, 2019 at 7:52 am

Terence Tao

The lower index is indeed used for iteration (see the top of page 2).

Unfortunately, it appears that Hardy-Littlewood type conjectures can only be used to impose lower bounds on gaps, not upper bounds. To show that there is a prime gap of size at least H, it suffices to show that the number of $n$ in a given range such that the tuple $\{n,n+1,\dots,n+H\}$ is free of primes is positive; and this can in principle be done by the lower bound Bonferroni inequality and Hardy-Littlewood. But to show that all prime gaps have size at most H, one has to show that the number of $n$ for which the tuple $\{n,n+1,\dots,n+H\}$ is zero. One could only achieve this from an upper bound Bonferroni inequality if the upper bound was absolutely tight (no error term allowed at all), which is basically impossible from analytic number theory techniques (for instance, the Hardy-Littlewood conjectures contain an error term and are thus not usable for such a strategy).

One can also see this using the “redacted primes” test. One can “plant” a large gap in the primes by deleting an interval $\{n,n+1,\dots,n+H\}$ from the set of primes. For $H$ much smaller than the range $x$ (e.g. $H = \log^{10} x$ ), such a deletion will hardly make an impact on any of the Hardy-Littlewood conjectures (it will easily get absorbed into the error term). Thus these conjectures cannot prevent the creation of an extremely large gap.

28 August, 2019 at 8:38 pm

年轻的数学家

I’d also be interested in estimates for $g$ that would prove stronger bounds on $G_{\mathcal P}(x)$ .

28 August, 2019 at 8:51 pm

年轻的数学家

I realise that according to the above, one would need to disprove the existence of Siegel zeroes. So I’m in fact asking for upper bounds in the event that both the Riemann hypothesis and strong Hardy‒Littlewood-type estimates are true.

30 August, 2019 at 5:54 am

Mefes

Good connection, but seems that landau’s problems are not provable though

30 August, 2019 at 2:03 pm

Anonymous

Dr.Tao, we can deduce the Hardy-Littlewood conjectures without using probabilistic models. You can see my article: https://lagrida.com/conjecture_k_uple.html
Where i prove $I_{\mathcal{H}_k}(x) = \mathcal{G}_k \, e^{-\gamma k} \, \dfrac{x}{\log(\log(x))^k} \, (1+o(1))$

With : $I_{\mathcal{H}_k}(x) = \#\{(b,b+h_1,\cdots,b+h_{k-1})\in \mathcal{B}_{q(x)}^k \, | \, b+h_{k-1} \leq x\}$ and $\mathcal{B}_q = \{b \in \mathbb{N}^{*} \, | \, \gcd(b, \displaystyle{\small \prod_{\substack{p \leq q \\ \text{p premier}}} {\normalsize p}})=1 \}$ and $q(x)$ is the largest prime verify $\displaystyle{\small \left(\prod_{\substack{p \leq q(x) \\ \text{p premier}}} {\normalsize p}\right)} \leq x$

30 August, 2019 at 2:06 pm

Anonymous

And f course $\displaystyle \mathcal{G}_k = \prod_{\text{p premier}}\frac{1-\frac{w(\mathcal{H}_k, p)}{p}}{(1-\frac1p)^{k}}$

With $w(\mathcal{H}_k, p)$ is the number of distinct residues $\pmod p$ in $\mathcal{H}_k$.

30 August, 2019 at 3:25 pm

Anonymous

Hmm. On the largest gap or space, G(X), between consecutive primes less than or equal to large real X, I believe G(x) ≤ (log X)^2 is true according to an analysis involving the Prime Number Theorem and the average spacing between consecutive primes less than or equal to large real X.

Relevant Reference Link:

“LARGE GAPS BETWEEN CONSECUTIVE PRIME NUMBERS”,

https://www.math10.com/forum/viewtopic.php?f=63&t=8263

David Cole.

30 August, 2019 at 3:33 pm

Anonymous

Of course, the calculations are estimates/approximations within the known bounds of error associated with PNT/Riemann Prime-Counting Function for all large X… Our model/proof appears to be sound. David Cole.

30 August, 2019 at 3:38 pm

Anonymous

Remark: Riemann Hypothesis is true! David Cole. :-)

31 August, 2019 at 1:36 pm

Anonymous

Now I wonder why I even bother to visit this website/webpage… I always receive no positive feedback or worse. I will stop visiting this site in the future and this is my last comment and visit too. However, I do appreciate the information I gain here in the past. Thank you and goodbye. David Cole.

30 August, 2019 at 6:06 pm

anonymous

A possibly naive question: do you know if the constant 3/2 is optimal your Theorem 1.4, for the random model?

Also, does this model make predictions for the number of primes in short intervals much larger than log(x), along the lines of this paper of Montgomery and Soundararajan https://arxiv.org/pdf/math/0409258.pdf (e.g. Corollary 1 there)? I guess Theorem 1.3 of your paper allows one to directly reproduce Montgomery and Soundararajan’s computation for intervals of length up to exp(log(x)^{1/2}/loglog(x)), but for this model is there a more direct route that doesn’t pass through the computation of that paper, and is it feasible to consider short intervals larger than this?

30 August, 2019 at 6:32 pm

Terence Tao

We didn’t seriously try to optimise the exponent in Theorem 1.4, which arises basically from the second moment method and the Borel-Cantelli lemma. One could perhaps do a bit better by exploiting higher moments, but they become more difficult to compute.

I was planning to ask a graduate student to look into how what this model predicts for primes in short intervals; as we remark in Section 2.5, one does have the analogue of the Maier fluctuations in this model, but we didn’t quantify the size of these fluctuations precisely.

2 September, 2019 at 7:07 am

Mark

If you assume a Heath-Brown-type “conjecture” that the Hardy-Littlewood asymptotic holds with O(1) error term, can you deduce anything more about gaps?

2 September, 2019 at 7:42 am

Mark

I just noticed that the Heath-Brown’s “conjecture” already precludes the qualitative Hardy-Littlewood conjecture, as well as the Granville-Cramer gap prediction, so I’m not sure if there is much sense to be made from my question.

8 September, 2019 at 10:58 pm

Buzzman

Let $d_{n}=p_{n+1}-p_n$ , then there exists $x_0 \in \mathbb{N}$ such that for $p_n > x_0$ one obtains $d_{n+1} \ll O(p_{n+1}/\log p_n)$ , with the implied constant depending on $x_0$ .

11 September, 2019 at 7:34 am

humble suggestion

I wonder if it might be worthwhile for Terry to hire an undergrad to periodically delete the nonsense in the comment sections. The posts about high profile results (and to a lesser extent, many other posts) tend to attract a lot of personal or crackpottish comments that are either off topic or nonsensical.

11 September, 2019 at 9:49 am

Kipperock

I think that this type of moderation would be welcome, both to the writer and the serious visitors.
This blog’s comment section is quite important, since, for example, in the comments about Prof. Tao’s books there are both errata and many super important explanations – I’m working through his book on measure theory rn and I just feel how invaluable a neat comment section is.
When reading other posts, expository or research – it would be very cool to see a neat comment section where serious discussions take place and are not spoiled by – sometimes dozens or more! – comments that are clearly.. not in the right neighborhood.
I guess I understand why the author refrains from flat out full time moderation – he’s got important research to do, teaching duties, two kids and a wife [this list is not written by order of importance :)))], and there probably are things I simply didn’t think of.
From my experience(see above, and take my word for it that it was me), it is also very, very tiring – since I would like to see certain comments completely gone, but others deserve at least an attempt at an answer.
I don’t even read the comments on other posts, in order to not get dragged down into the same mess of time and nerve consuming keyboard kung-fu..
Honestly I think this comment was just a very long way of saying “I agree”, since this comment section already had it’s way with me. I hope Prof. Tao would eventually accept your suggestion.

23 September, 2019 at 5:23 pm

Anonymous

Is it really a good idea to publish this where anyone can read it when it could help hackers break cryptography or evil governments conduct unjust wars. Even if new cryptographic software could be written people in third world countries could be greatly harmed and would not be able to update in time.

20 March, 2020 at 3:57 am

Zhang Tan

Proof of the Twin Prime Conjecture

Let $p_n$ be the nth prime number
$p_1=2,p_2=3,p_3=5$

Let $P_n$ be the first n prime numbers multiplied together
$P_1=p_1,P_2=p_1 \times p_2,P_3=p_1 \times p_2 \times p_3$

Arithmetic Progression

$\{mP_n+a\}$
where $a$ in $A_n$ where $a$ is relatively prime to $P_n$ and less than $P_n$ and $0 \leq m < p_{n+1}$

There always exist numbers $a_1$ and $a_2$ in $A_s$ succh that $a_1+2=a_2$ where $s \geq 3$

Base Case $11,13$ in $A_3$

Induction Case

Let $a_1$ and $a_2$ in $A_n$ such that $a_1+d=a_2$ will propagate at least $p_{n+1}-2$ pairs of numbers which differs by $d$ in $A_{n+1}$

There are a total of $p_{n+1}$ elements generated by arithmetic progression $\{mP_n+a_1\}$ and out of all of the generated elements there is unique element $m_1P_n+a_1$ divisible by $p_{n+1}$

There are a total of $p_{n+1}$ elements generated by arithmetic progression $\{mP_n+a_2\}$ and out of all of the generated elements there is unique element $m_2P_n+a_2$ divisible by $p_{n+1}$

When $m_1 \neq m_2$ there are $p_{n+1}-2$ pairs of numbers $(\{mP_n+a_1\},\{mP_n+a_2\})$ differs by $d, a_1+d=a_2$ in $A_{n+1}$

When $m_1 = m_2$ there are $p_{n+1}-1$ pairs of numbers $(\{mP_n+a_1\},\{mP_n+a_2\})$ differs by $d, a_1+d=a_2$ in $A_{n+1}$

Arithmetic Progression

$\{mP_n+a\}$ where $a$ in $A_n$ where $a$ is relatively prime to $P_n$ and less than $P_n$ and $0 \leq m < P_n$

If there exist an element in $\{mP_n+a_1\}$ divisible by $f$ than in $f$ consecutive elements $x \leq m < x+f$ generated by arithmetic progression $\{mP_n+a_1\}$ there exist unique element $m_1P_n+a_1$ divisble by $f$

Proof of twin prime conjecture by contradiction

For there to not exist two prime numbers which differs by $d, a_1+d=a_2$
There must exist a non-prime number for every value of $m, 0 \leq m <P_n$ in either $\{mP_n+a_1\}$ or $\{mP_n+a_2\}$

All non-prime numbers greater than 1 in $\{mP_n+a\}$ where $a$ in $A_n$ where $a$ in relatively prime to $P_n$ and less than $P_n$ and $0 \leq m < P_n$ must be divisible by an odd number $f$ where $3 \leq f \leq P_n-1$

Removing pairs of numbers from $(\{mP_n+a_1\},\{mP_n+a_2\})$ where either $\{mP_n+a_1\}$ or $\{mP_n+a_2\}$ divisible by $f=P_n-1-2o$ where $3 \leq f \leq P_n-1$

Consider $f=P_n-1$ consective elements $x \leq m < x+f$ generated by arithmetic progression $\{mP_n+a_1\}$ Assume there exist $m_1P_n+a_1$ divisible by $f$ it is unique in these consecutive elements.

Consider $f=P_n-1$ consective elements $x \leq m < x+f$ generated by arithmetic progression $\{mP_n+a_2\}$ Assume there exist $m_2P_n+a_2$ divisible by $f$ it is unique in these consecutive elements.

Assume $m_1 \neq m_2$

Assume the remaining $f-2$ pairs not divisible by $f$ are consective.

Taking the remaining consecutive $f-2$ pairs not divisible by $f$ remove pairs divisible by $f-2$

Consider $f=P_n-1-2$ consective elements $x \leq m < x+f$ generated by arithmetic progression $\{mP_n+a_1\}$ Assume there exist $m_1P_n+a_1$ divisible by $f$ it is unique in these consecutive elements.

Consider $f=P_n-1-2$ consective elements $x \leq m < x+f$ generated by arithmetic progression $\{mP_n+a_2\}$ Assume there exist $m_2P_n+a_2$ divisible by $f$ it is unique in these consecutive elements.

Assume $m_1 \neq m_2$

Assume the remaining $f-2$ pairs not divisible by $f$ are consective.

Continue repeating until with all smaller odd numbers $f=P_n-1-2o$ where $o=0,1,2,3,\ldots$ until $f=3$

There must exist a prime number in $\{m_1P_n+a_1\}$ and $\{m_2P_n+a_2\}$ where $m_1=m_2$ and $0 \leq m < P_n$

Therefore there are infinite number of prime numbers which differ by 2.

7 November, 2020 at 12:21 am

Thomas Pickett

My paper titled:

“Structure is found in Prime Numbers and Gaps are Unbounded as the Quantity of Twin and Cousin Primes Run Neck and Neck Through Infinity”

Addresses “Large Prime Gaps”.
It can be found at:

http://www.primealignmentmatrix.com

26 January, 2022 at 6:22 pm

Leigh

How does the GUE hypothesis ‘know’ that the prime counting function (or related counting functions) are piecewise constant? When you say “…we have the GUE hypothesis which appears to accurately model the zeta function, but does not capture such basic properties of the primes as the fact that the primes are all natural numbers,” it sounds like you are taking for granted that other distributions of the zeros will still lead to a sensible counting function. Or maybe I am reading too much into this statement.

27 January, 2022 at 8:27 am

Terence Tao

In fact none of the plausible models for the distribution for the zeta functions are able to detect the piecewise constancy of the prime counting function. This can be explained in terms of the uncertainty principle (see e.g., the discussion at the end of Section 2 of this blog post of mine): when trying to use the zeta function $\zeta(1/2+it)$ to understand sums like $\sum_{n \leq x} \Lambda(n)$ , the uncertainty principle tells us that

$\Delta t \times \Delta \log x \gg 1$

or by the chain rule

$\Delta t \times \Delta x \gg x$

To see the piecewise constancy of the prime counting function, one has to resolve the spatial uncertainty down to unit scales, so we need $\Delta x \ll 1$ , which by the uncertainty principle forces $\Delta t \gg x$ . That is to say, we need information on the zeroes on an interval of length $\gg x$ before we could even hope to detect this piecewise constancy. On the other hand, GUE and related models only cover intervals in $t$ -space of length $O(1)$ (or even $O(1/\log T)$ ) at best, and so have nowhere near the resolution to see these effects; as far as the GUE model (or any other local model for the zeroes) is concerned, the integers and primes may well be continuously distributed.

It would be a major breakthrough if there was some new way to exploit the discrete nature of the integers and primes that would be visible on the zeta function side, thus circumventing this uncertainty principle barrier. The most obvious instance of this is the functional equation, which ultimately derives from the Poisson summation formula applied to the integers which one can view as a discrete subgroup of the reals. Some of the more advanced bounds on exponential sums related to the zeta function also rely more heavily on the arithmetic structure of the integers, but again not at anywhere near the resolution needed to say much about zeroes on the critical line (though they can help for instance with zero free regions and some zero density estimates, as well as subconvexity bounds).

27 January, 2022 at 7:25 pm

Leigh

OK that makes sense, thanks very much. I posted the above question on Math Stack Exchange (https://math.stackexchange.com/questions/4321777/riemanns-explicit-prime-counting-formula-how-is-it-piecewise-constant) a while ago, before posting it here. Do you mind if I copy your answer over there, with proper attribution? Or would you prefer to do so yourself if you do?

[Sure, any comment here is public and can be posted with attribution elsewhere. -T]

14 August, 2023 at 8:29 pm

The convergence of an alternating series of Erdos, assuming the Hardy–Littlewood prime tuples conjecture | What's new

[…] this obstacle, we take advantage of the random sifted model of the primes that was introduced in a paper of Banks, Ford, and myself. To model the primes in an interval such as with drawn randomly from say , we remove one random […]

	Anonymous on Infinite partial sumsets in th…
	Anonymous on A Banach algebra proof of the…
	Anonymous on A Banach algebra proof of the…
	Aleksandar on 245C, Notes 4: Sobolev sp…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Terence Tao on 245C, Notes 4: Sobolev sp…
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on What is a gauge?
	Terence Tao on Erratum for “An inverse…
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on An epsilon of room: pages from…
	Aleksandar on 245C, Notes 4: Sobolev sp…

Large prime gaps and probabilistic models

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

37 comments

Leave a comment Cancel reply

For commenters

Large prime gaps and probabilistic models

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

37 comments

Leave a comment Cancel reply

For commenters