New bounds for Szemeredi’s theorem, Ia: Progressions of length 4 in finite field geometries revisited

Ben Green and I have just uploaded to the arXiv our paper “New bounds for Szemeredi’s theorem, Ia: Progressions of length 4 in finite field geometries revisited“, submitted to Proc. Lond. Math. Soc.. This is both an erratum to, and a replacement for, our previous paper “New bounds for Szemeredi’s theorem. I. Progressions of length 4 in finite field geometries“. The main objective in both papers is to bound the quantity ${r_4(F^n)}$ for a vector space ${F^n}$ over a finite field ${F}$ of characteristic greater than ${4}$ , where ${r_4(F^n)}$ is defined as the cardinality of the largest subset of ${F^n}$ that does not contain an arithmetic progression of length ${4}$ . In our earlier paper, we gave two arguments that bounded ${r_4(F^n)}$ in the regime when the field ${F}$ was fixed and ${n}$ was large. The first “cheap” argument gave the bound

$\displaystyle r_4(F^n) \ll |F|^n \exp( - c \sqrt{\log n} )$

and the more complicated “expensive” argument gave the improvement

$\displaystyle r_4(F^n) \ll |F|^n n^{-c} \ \ \ \ \ (1)$

for some constant ${c>0}$ depending only on ${F}$ .

Unfortunately, while the cheap argument is correct, we discovered a subtle but serious gap in our expensive argument in the original paper. Roughly speaking, the strategy in that argument is to employ the density increment method: one begins with a large subset ${A}$ of ${F^n}$ that has no arithmetic progressions of length ${4}$ , and seeks to locate a subspace on which ${A}$ has a significantly increased density. Then, by using a “Koopman-von Neumann theorem”, ultimately based on an iteration of the inverse ${U^3}$ theorem of Ben and myself (and also independently by Samorodnitsky), one approximates ${A}$ by a “quadratically structured” function ${f}$ , which is (locally) a combination of a bounded number of quadratic phase functions, which one can prepare to be in a certain “locally equidistributed” or “locally high rank” form. (It is this reduction to the high rank case that distinguishes the “expensive” argument from the “cheap” one.) Because ${A}$ has no progressions of length ${4}$ , the count of progressions of length ${4}$ weighted by ${f}$ will also be small; by combining this with the theory of equidistribution of quadratic phase functions, one can then conclude that there will be a subspace on which ${f}$ has increased density.

The error in the paper was to conclude from this that the original function ${1_A}$ also had increased density on the same subspace; it turns out that the manner in which ${f}$ approximates ${1_A}$ is not strong enough to deduce this latter conclusion from the former. (One can strengthen the nature of approximation until one restores such a conclusion, but only at the price of deteriorating the quantitative bounds on ${r_4(F^n)}$ one gets at the end of the day to be worse than the cheap argument.)

After trying unsuccessfully to repair this error, we eventually found an alternate argument, based on earlier papers of ourselves and of Bergelson-Host-Kra, that avoided the density increment method entirely and ended up giving a simpler proof of a stronger result than (1), and also gives the explicit value of ${c = 2^{-22}}$ for the exponent ${c}$ in (1). In fact, it gives the following stronger result:

Theorem 1 Let ${A}$ be a subset of ${F^n}$ of density at least ${\alpha}$ , and let ${\epsilon>0}$ . Then there is a subspace ${W}$ of ${F^n}$ of codimension ${O( \epsilon^{-2^{20}})}$ such that the number of (possibly degenerate) progressions ${a, a+r, a+2r, a+3r}$ in ${A \cap W}$ is at least ${(\alpha^4-\epsilon)|W|^2}$ .

The bound (1) is an easy consequence of this theorem after choosing ${\epsilon := \alpha^4/2}$ and removing the degenerate progressions from the conclusion of the theorem.

The main new idea is to work with a local Koopman-von Neumann theorem rather than a global one, trading a relatively weak global approximation to ${1_A}$ with a significantly stronger local approximation to ${1_A}$ on a subspace ${W}$ . This is somewhat analogous to how sometimes in graph theory it is more efficient (from the point of view of quantative estimates) to work with a local version of the Szemerédi regularity lemma which gives just a single regular pair of cells, rather than attempting to regularise almost all of the cells. This local approach is well adapted to the inverse ${U^3}$ theorem we use (which also has this local aspect), and also makes the reduction to the high rank case much cleaner. At the end of the day, one ends up with a fairly large subspace ${W}$ on which ${A}$ is quite dense (of density ${\alpha-O(\epsilon)}$ ) and which can be well approximated by a “pure quadratic” object, namely a function of a small number of quadratic phases obeying a high rank condition. One can then exploit a special positivity property of the count of length four progressions weighted by pure quadratic objects, essentially due to Bergelson-Host-Kra, which then gives the required lower bound.

7 comments

Comments feed for this article

9 May, 2012 at 7:22 am

magnus carlsen

Dear Terry,

you should add in your references that the paper “New bounds on cap sets” (reference number 1)
is published in the journal of the AMS

[Will do, thanks – T.]

11 May, 2012 at 9:21 am

Anonymous

You could help young mathematicians by posting the referee report for the original paper along with the story behind the discovery of the gap in the proof of the main result.

28 May, 2012 at 10:11 am

reader

Sanders’ paper [13] has been published: MR 2012f:11019. So has Tao & Ziegler [16]: doi:10.1007/s00026-011-0124-3.

[Thanks, this will be added to the next revision of the ms. -T.]

23 May, 2024 at 8:47 pm

Despite being almost trivial linear algebra, I’m struggling a bit to understand your proof of Lemma 4.7, and was hoping you could give a few words of clarification. In particular, should there be a hypothesis that of quadratic functions that define the quadratic factor in the hypothesis have sufficiently large rank? You run what you call the “rank reduction process” until the rank of various linear combinations of the quadratic factors exceed r + d. At each stage you reduce the number of quadratic functions. You conclude that this must stop before you run out of quadratic functions. But I don’t see why this ensures you find a subset with high rank. Couldn’t you just continue until you get down to one quadratic function which has rank less than r+d? More succinctly, I don’t see how the argument works if you start with a factor defined by a single quadratic function, and seek a local quadratic factor with a higher rank.

This isn’t particularly relevant to the question, but I’m also not sure why you call the process of locating a factor with higher rank the “rank reduction process”.

24 May, 2024 at 2:45 pm

Terence Tao

It is quite permissible to end up with 0 quadratic polynomials at the final stage of the algorithm, in which case it will be of rank at least $r$ for any $r$ .

I realize that when referring to this argument as a “rank reduction” argument we were using a different application of the concept of rank which in this context is quite confusing. In linear algebra, when one has a collection of vectors $v_1,\dots,v_n$ , the rank of the collection is sometimes defined to be the dimension of the linear space spanned by those vectors (or the rank of the matrix which contains the coordinates of these vectors as the rows). It is the span of the quadratic component of the factor that is being reduced by this algorithm; but of course we are using rank for another purpose in this argument. “quadratic complexity reduction process” may be a more appropriate term here.

24 May, 2024 at 4:47 pm

I’m surely missing something here. Say that phi is defined by a full rank 100 by 100 matrix. Now rank( lambda phi) = 100 for all lambda neq 0. If I understand the algorithm (which it seems I do not), then the next stage is to restrict to the kernel of phi, which is just the trivial zero dimensional space. In what sense is this, say, rank > 1000?

25 May, 2024 at 7:11 am

From Definition 4.4, A collection $\phi_1,\dots,\phi_d$ of quadratic forms will automatically have rank at least $r$ for any $r$ if one has $d=0$ ; this is an example of a vacuously true statement, since in this case there are no non-trivial linear combinations of the $\phi_1,\dots,\phi_d$ to be tested.

	Anonymous on The Poisson-Dirichlet process,…
	Anonymous on Two announcements: AI for Math…
	Anonymous on A problem involving power…
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on Marton’s conjecture in a…
	Terence Tao on Analysis I
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on On product representations of…
	Anonymous on On product representations of…
	Terence Tao on A problem involving power…
	Anonymous on 275A, Notes 3: The weak and st…
	Marcel Goh on Marton’s conjecture in a…
	Adam Fennell on Analysis I
	FunSearch: Making ne… on Open question: best bounds for…
	Mohammed Mannan on 254A, Notes 1: Lie groups, Lie…

New bounds for Szemeredi’s theorem, Ia: Progressions of length 4 in finite field geometries revisited

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

7 comments

Leave a comment Cancel reply

For commenters

New bounds for Szemeredi’s theorem, Ia: Progressions of length 4 in finite field geometries revisited

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

7 comments

Leave a comment Cancel reply

For commenters