Additive limits

12 October, 2014 in expository, math.CO, math.GR, math.LO | Tags: additive combinatorics, locally compact abelian groups, Loeb measure, nonstandard analysis | by Terence Tao

In graph theory, the recently developed theory of graph limits has proven to be a useful tool for analysing large dense graphs, being a convenient reformulation of the Szemerédi regularity lemma. Roughly speaking, the theory asserts that given any sequence ${G_n = (V_n, E_n)}$ of finite graphs, one can extract a subsequence ${G_{n_j} = (V_{n_j}, E_{n_j})}$ which converges (in a specific sense) to a continuous object known as a “graphon” – a symmetric measurable function ${p\colon [0,1] \times [0,1] \rightarrow [0,1]}$ . What “converges” means in this context is that subgraph densities converge to the associated integrals of the graphon ${p}$ . For instance, the edge density

$\displaystyle \frac{1}{|V_{n_j}|^2} |E_{n_j}|$

converge to the integral

$\displaystyle \int_0^1 \int_0^1 p(x,y)\ dx dy,$

the triangle density

$\displaystyle \frac{1}{|V_{n_j}|^3} \lvert \{ (v_1,v_2,v_3) \in V_{n_j}^3: \{v_1,v_2\}, \{v_2,v_3\}, \{v_3,v_1\} \in E_{n_j} \} \rvert$

converges to the integral

$\displaystyle \int_0^1 \int_0^1 \int_0^1 p(x_1,x_2) p(x_2,x_3) p(x_3,x_1)\ dx_1 dx_2 dx_3,$

the four-cycle density

$\displaystyle \frac{1}{|V_{n_j}|^4} \lvert \{ (v_1,v_2,v_3,v_4) \in V_{n_j}^4: \{v_1,v_2\}, \{v_2,v_3\}, \{v_3,v_4\}, \{v_4,v_1\} \in E_{n_j} \} \rvert$

converges to the integral

$\displaystyle \int_0^1 \int_0^1 \int_0^1 \int_0^1 p(x_1,x_2) p(x_2,x_3) p(x_3,x_4) p(x_4,x_1)\ dx_1 dx_2 dx_3 dx_4,$

and so forth. One can use graph limits to prove many results in graph theory that were traditionally proven using the regularity lemma, such as the triangle removal lemma, and can also reduce many asymptotic graph theory problems to continuous problems involving multilinear integrals (although the latter problems are not necessarily easy to solve!). See this text of Lovasz for a detailed study of graph limits and their applications.

One can also express graph limits (and more generally hypergraph limits) in the language of nonstandard analysis (or of ultraproducts); see for instance this paper of Elek and Szegedy, Section 6 of this previous blog post, or this paper of Towsner. (In this post we assume some familiarity with nonstandard analysis, as reviewed for instance in the previous blog post.) Here, one starts as before with a sequence ${G_n = (V_n,E_n)}$ of finite graphs, and then takes an ultraproduct (with respect to some arbitrarily chosen non-principal ultrafilter ${\alpha \in\beta {\bf N} \backslash {\bf N}}$ ) to obtain a nonstandard graph ${G_\alpha = (V_\alpha,E_\alpha)}$ , where ${V_\alpha = \prod_{n\rightarrow \alpha} V_n}$ is the ultraproduct of the ${V_n}$ , and similarly for the ${E_\alpha}$ . The set ${E_\alpha}$ can then be viewed as a symmetric subset of ${V_\alpha \times V_\alpha}$ which is measurable with respect to the Loeb ${\sigma}$ -algebra ${{\mathcal L}_{V_\alpha \times V_\alpha}}$ of the product ${V_\alpha \times V_\alpha}$ (see this previous blog post for the construction of Loeb measure). A crucial point is that this ${\sigma}$ -algebra is larger than the product ${{\mathcal L}_{V_\alpha} \times {\mathcal L}_{V_\alpha}}$ of the Loeb ${\sigma}$ -algebra of the individual vertex set ${V_\alpha}$ . This leads to a decomposition

$\displaystyle 1_{E_\alpha} = p + e$

where the “graphon” ${p}$ is the orthogonal projection of ${1_{E_\alpha}}$ onto ${L^2( {\mathcal L}_{V_\alpha} \times {\mathcal L}_{V_\alpha} )}$ , and the “regular error” ${e}$ is orthogonal to all product sets ${A \times B}$ for ${A, B \in {\mathcal L}_{V_\alpha}}$ . The graphon ${p\colon V_\alpha \times V_\alpha \rightarrow [0,1]}$ then captures the statistics of the nonstandard graph ${G_\alpha}$ , in exact analogy with the more traditional graph limits: for instance, the edge density

$\displaystyle \hbox{st} \frac{1}{|V_\alpha|^2} |E_\alpha|$

(or equivalently, the limit of the ${\frac{1}{|V_n|^2} |E_n|}$ along the ultrafilter ${\alpha}$ ) is equal to the integral

$\displaystyle \int_{V_\alpha} \int_{V_\alpha} p(x,y)\ d\mu_{V_\alpha}(x) d\mu_{V_\alpha}(y)$

where ${d\mu_V}$ denotes Loeb measure on a nonstandard finite set ${V}$ ; the triangle density

$\displaystyle \hbox{st} \frac{1}{|V_\alpha|^3} \lvert \{ (v_1,v_2,v_3) \in V_\alpha^3: \{v_1,v_2\}, \{v_2,v_3\}, \{v_3,v_1\} \in E_\alpha \} \rvert$

(or equivalently, the limit along ${\alpha}$ of the triangle densities of ${E_n}$ ) is equal to the integral

$\displaystyle \int_{V_\alpha} \int_{V_\alpha} \int_{V_\alpha} p(x_1,x_2) p(x_2,x_3) p(x_3,x_1)\ d\mu_{V_\alpha}(x_1) d\mu_{V_\alpha}(x_2) d\mu_{V_\alpha}(x_3),$

and so forth. Note that with this construction, the graphon ${p}$ is living on the Cartesian square of an abstract probability space ${V_\alpha}$ , which is likely to be inseparable; but it is possible to cut down the Loeb ${\sigma}$ -algebra on ${V_\alpha}$ to minimal countable ${\sigma}$ -algebra for which ${p}$ remains measurable (up to null sets), and then one can identify ${V_\alpha}$ with ${[0,1]}$ , bringing this construction of a graphon in line with the traditional notion of a graphon. (See Remark 5 of this previous blog post for more discussion of this point.)

Additive combinatorics, which studies things like the additive structure of finite subsets ${A}$ of an abelian group ${G = (G,+)}$ , has many analogies and connections with asymptotic graph theory; in particular, there is the arithmetic regularity lemma of Green which is analogous to the graph regularity lemma of Szemerédi. (There is also a higher order arithmetic regularity lemma analogous to hypergraph regularity lemmas, but this is not the focus of the discussion here.) Given this, it is natural to suspect that there is a theory of “additive limits” for large additive sets of bounded doubling, analogous to the theory of graph limits for large dense graphs. The purpose of this post is to record a candidate for such an additive limit. This limit can be used as a substitute for the arithmetic regularity lemma in certain results in additive combinatorics, at least if one is willing to settle for qualitative results rather than quantitative ones; I give a few examples of this below the fold.

It seems that to allow for the most flexible and powerful manifestation of this theory, it is convenient to use the nonstandard formulation (among other things, it allows for full use of the transfer principle, whereas a more traditional limit formulation would only allow for a transfer of those quantities continuous with respect to the notion of convergence). Here, the analogue of a nonstandard graph is an ultra approximate group ${A_\alpha}$ in a nonstandard group ${G_\alpha = \prod_{n \rightarrow \alpha} G_n}$ , defined as the ultraproduct of finite ${K}$ -approximate groups ${A_n \subset G_n}$ for some standard ${K}$ . (A ${K}$ -approximate group ${A_n}$ is a symmetric set containing the origin such that ${A_n+A_n}$ can be covered by ${K}$ or fewer translates of ${A_n}$ .) We then let ${O(A_\alpha)}$ be the external subgroup of ${G_\alpha}$ generated by ${A_\alpha}$ ; equivalently, ${A_\alpha}$ is the union of ${A_\alpha^m}$ over all standard ${m}$ . This space has a Loeb measure ${\mu_{O(A_\alpha)}}$ , defined by setting

$\displaystyle \mu_{O(A_\alpha)}(E_\alpha) := \hbox{st} \frac{|E_\alpha|}{|A_\alpha|}$

whenever ${E_\alpha}$ is an internal subset of ${A_\alpha^m}$ for any standard ${m}$ , and extended to a countably additive measure; the arguments in Section 6 of this previous blog post can be easily modified to give a construction of this measure.

The Loeb measure ${\mu_{O(A_\alpha)}}$ is a translation invariant measure on ${O(A_{\alpha})}$ , normalised so that ${A_\alpha}$ has Loeb measure one. As such, one should think of ${O(A_\alpha)}$ as being analogous to a locally compact abelian group equipped with a Haar measure. It should be noted though that ${O(A_\alpha)}$ is not actually a locally compact group with Haar measure, for two reasons:

There is not an obvious topology on ${O(A_\alpha)}$ that makes it simultaneously locally compact, Hausdorff, and ${\sigma}$ -compact. (One can get one or two out of three without difficulty, though.)
The addition operation ${+\colon O(A_\alpha) \times O(A_\alpha) \rightarrow O(A_\alpha)}$ is not measurable from the product Loeb algebra ${{\mathcal L}_{O(A_\alpha)} \times {\mathcal L}_{O(A_\alpha)}}$ to ${{\mathcal L}_{O(\alpha)}}$ . Instead, it is measurable from the coarser Loeb algebra ${{\mathcal L}_{O(A_\alpha) \times O(A_\alpha)}}$ to ${{\mathcal L}_{O(\alpha)}}$ (compare with the analogous situation for nonstandard graphs).

Nevertheless, the analogy is a useful guide for the arguments that follow.

Let ${L(O(A_\alpha))}$ denote the space of bounded Loeb measurable functions ${f\colon O(A_\alpha) \rightarrow {\bf C}}$ (modulo almost everywhere equivalence) that are supported on ${A_\alpha^m}$ for some standard ${m}$ ; this is a complex algebra with respect to pointwise multiplication. There is also a convolution operation ${\star\colon L(O(A_\alpha)) \times L(O(A_\alpha)) \rightarrow L(O(A_\alpha))}$ , defined by setting

$\displaystyle \hbox{st} f \star \hbox{st} g(x) := \hbox{st} \frac{1}{|A_\alpha|} \sum_{y \in A_\alpha^m} f(y) g(x-y)$

whenever ${f\colon A_\alpha^m \rightarrow {}^* {\bf C}}$ , ${g\colon A_\alpha^l \rightarrow {}^* {\bf C}}$ are bounded nonstandard functions (extended by zero to all of ${O(A_\alpha)}$ ), and then extending to arbitrary elements of ${L(O(A_\alpha))}$ by density. Equivalently, ${f \star g}$ is the pushforward of the ${{\mathcal L}_{O(A_\alpha) \times O(A_\alpha)}}$ -measurable function ${(x,y) \mapsto f(x) g(y)}$ under the map ${(x,y) \mapsto x+y}$ .

The basic structural theorem is then as follows.

Theorem 1 (Kronecker factor) Let ${A_\alpha}$ be an ultra approximate group. Then there exists a (standard) locally compact abelian group ${G}$ of the form

$\displaystyle G = {\bf R}^d \times {\bf Z}^m \times T$

for some standard ${d,m}$ and some compact abelian group ${T}$ , equipped with a Haar measure ${\mu_G}$ and a measurable homomorphism ${\pi\colon O(A_\alpha) \rightarrow G}$ (using the Loeb ${\sigma}$ -algebra on ${O(A_\alpha)}$ and the Baire ${\sigma}$ -algebra on ${G}$ ), with the following properties:

(i) ${\pi}$ has dense image, and ${\mu_G}$ is the pushforward of Loeb measure ${\mu_{O(A_\alpha)}}$ by ${\pi}$ .

(ii) There exists sets ${\{0\} \subset U_0 \subset K_0 \subset G}$ with ${U_0}$ open and ${K_0}$ compact, such that
$\displaystyle \pi^{-1}(U_0) \subset 4A_\alpha \subset \pi^{-1}(K_0). \ \ \ \ \ (1)$

(iii) Whenever ${K \subset U \subset G}$ with ${K}$ compact and ${U}$ open, there exists a nonstandard finite set ${B}$ such that
$\displaystyle \pi^{-1}(K) \subset B \subset \pi^{-1}(U). \ \ \ \ \ (2)$

(iv) If ${f, g \in L}$ , then we have the convolution formula
$\displaystyle f \star g = \pi^*( (\pi_* f) \star (\pi_* g) ) \ \ \ \ \ (3)$

where ${\pi_* f,\pi_* g}$ are the pushforwards of ${f,g}$ to ${L^2(G, \mu_G)}$ , the convolution ${\star}$ on the right-hand side is convolution using ${\mu_G}$ , and ${\pi^*}$ is the pullback map from ${L^2(G,\mu_G)}$ to ${L^2(O(A_\alpha), \mu_{O(A_\alpha)})}$ . In particular, if ${\pi_* f = 0}$ , then ${f*g=0}$ for all ${g \in L}$ .

One can view the locally compact abelian group ${G}$ as a “model “or “Kronecker factor” for the ultra approximate group ${A_\alpha}$ (in close analogy with the Kronecker factor from ergodic theory). In the case that ${A_\alpha}$ is a genuine nonstandard finite group rather than an ultra approximate group, the non-compact components ${{\bf R}^d \times {\bf Z}^m}$ of the Kronecker group ${G}$ are trivial, and this theorem was implicitly established by Szegedy. The compact group ${T}$ is quite large, and in particular is likely to be inseparable; but as with the case of graphons, when one is only studying at most countably many functions ${f}$ , one can cut down the size of this group to be separable (or equivalently, second countable or metrisable) if desired, so one often works with a “reduced Kronecker factor” which is a quotient of the full Kronecker factor ${G}$ . Once one is in the separable case, the Baire sigma algebra is identical with the more familiar Borel sigma algebra.

Given any sequence of uniformly bounded functions ${f_n\colon A_n^m \rightarrow {\bf C}}$ for some fixed ${m}$ , we can view the function ${f \in L}$ defined by

$\displaystyle f := \pi_* \hbox{st} \lim_{n \rightarrow \alpha} f_n \ \ \ \ \ (4)$

as an “additive limit” of the ${f_n}$ , in much the same way that graphons ${p\colon V_\alpha \times V_\alpha \rightarrow [0,1]}$ are limits of the indicator functions ${1_{E_n}\colon V_n \times V_n \rightarrow \{0,1\}}$ . The additive limits capture some of the statistics of the ${f_n}$ , for instance the normalised means

$\displaystyle \frac{1}{|A_n|} \sum_{x \in A_n^m} f_n(x)$

converge (along the ultrafilter ${\alpha}$ ) to the mean

$\displaystyle \int_G f(x)\ d\mu_G(x),$

and for three sequences ${f_n,g_n,h_n\colon A_n^m \rightarrow {\bf C}}$ of functions, the normalised correlation

$\displaystyle \frac{1}{|A_n|^2} \sum_{x,y \in A_n^m} f_n(x) g_n(y) h_n(x+y)$

converges along ${\alpha}$ to the correlation

$\displaystyle \int_G \int_G f(x) g(y) h(x+y)\ d\mu_G(x) d\mu_G(y),$

the normalised ${U^2}$ Gowers norm

$\displaystyle ( \frac{1}{|A_n|^3} \sum_{x,y,z,w \in A_n^m: x+w=y+z} f_n(x) \overline{f_n(y)} \overline{f_n(z)} f_n(w))^{1/4}$

converges along ${\alpha}$ to the ${U^2}$ Gowers norm

$\displaystyle ( \int_{G \times G \times G} f(x) \overline{f(y)} \overline{f(z)} f_n(x+y-z)\ d\mu_G(x) d\mu_G(y) d\mu_G(z))^{1/4}$

and so forth. We caution however that some correlations that involve evaluating more than one function at the same point will not necessarily be preserved in the additive limit; for instance the normalised ${\ell^2}$ norm

$\displaystyle (\frac{1}{|A_n|} \sum_{x \in A_n^m} |f_n(x)|^2)^{1/2}$

does not necessarily converge to the ${L^2}$ norm

$\displaystyle (\int_G |f(x)|^2\ d\mu_G(x))^{1/2},$

but can converge instead to a larger quantity, due to the presence of the orthogonal projection ${\pi_*}$ in the definition (4) of ${f}$ .

An important special case of an additive limit occurs when the functions ${f_n\colon A_n^m \rightarrow {\bf C}}$ involved are indicator functions ${f_n = 1_{E_n}}$ of some subsets ${E_n}$ of ${A_n^m}$ . The additive limit ${f \in L}$ does not necessarily remain an indicator function, but instead takes values in ${[0,1]}$ (much as a graphon ${p}$ takes values in ${[0,1]}$ even though the original indicators ${1_{E_n}}$ take values in ${\{0,1\}}$ ). The convolution ${f \star f\colon G \rightarrow [0,1]}$ is then the ultralimit of the normalised convolutions ${\frac{1}{|A_n|} 1_{E_n} \star 1_{E_n}}$ ; in particular, the measure of the support of ${f \star f}$ provides a lower bound on the limiting normalised cardinality ${\frac{1}{|A_n|} |E_n + E_n|}$ of a sumset. In many situations this lower bound is an equality, but this is not necessarily the case, because the sumset ${2E_n = E_n + E_n}$ could contain a large number of elements which have very few ( ${o(|A_n|)}$ ) representations as the sum of two elements of ${E_n}$ , and in the limit these portions of the sumset fall outside of the support of ${f \star f}$ . (One can think of the support of ${f \star f}$ as describing the “essential” sumset of ${2E_n = E_n + E_n}$ , discarding those elements that have only very few representations.) Similarly for higher convolutions of ${f}$ . Thus one can use additive limits to partially control the growth ${k E_n}$ of iterated sumsets of subsets ${E_n}$ of approximate groups ${A_n}$ , in the regime where ${k}$ stays bounded and ${n}$ goes to infinity.

Theorem 1 can be proven by Fourier-analytic means (combined with Freiman’s theorem from additive combinatorics), and we will do so below the fold. For now, we give some illustrative examples of additive limits.

Example 2 (Bohr sets) We take ${A_n}$ to be the intervals ${A_n := \{ x \in {\bf Z}: |x| \leq N_n \}}$ , where ${N_n}$ is a sequence going to infinity; these are ${2}$ -approximate groups for all ${n}$ . Let ${\theta}$ be an irrational real number, let ${I}$ be an interval in ${{\bf R}/{\bf Z}}$ , and for each natural number ${n}$ let ${B_n}$ be the Bohr set

$\displaystyle B_n := \{ x \in A^{(n)}: \theta x \hbox{ mod } 1 \in I \}.$

In this case, the (reduced) Kronecker factor ${G}$ can be taken to be the infinite cylinder ${{\bf R} \times {\bf R}/{\bf Z}}$ with the usual Lebesgue measure ${\mu_G}$ . The additive limits of ${1_{A_n}}$ and ${1_{B_n}}$ end up being ${1_A}$ and ${1_B}$ , where ${A}$ is the finite cylinder

$\displaystyle A := \{ (x,t) \in {\bf R} \times {\bf R}/{\bf Z}: x \in [-1,1]\}$

and ${B}$ is the rectangle

$\displaystyle B := \{ (x,t) \in {\bf R} \times {\bf R}/{\bf Z}: x \in [-1,1]; t \in I \}.$

Geometrically, one should think of ${A_n}$ and ${B_n}$ as being wrapped around the cylinder ${{\bf R} \times {\bf R}/{\bf Z}}$ via the homomorphism ${x \mapsto (\frac{x}{N_n}, \theta x \hbox{ mod } 1)}$ , and then one sees that ${B_n}$ is converging in some normalised weak sense to ${B}$ , and similarly for ${A_n}$ and ${A}$ . In particular, the additive limit predicts the growth rate of the iterated sumsets ${kB_n}$ to be quadratic in ${k}$ until ${k|I|}$ becomes comparable to ${1}$ , at which point the growth transitions to linear growth, in the regime where ${k}$ is bounded and ${n}$ is large.

If ${\theta = \frac{p}{q}}$ were rational instead of irrational, then one would need to replace ${{\bf R}/{\bf Z}}$ by the finite subgroup ${\frac{1}{q}{\bf Z}/{\bf Z}}$ here.

Example 3 (Structured subsets of progressions) We take ${A_n}$ be the rank two progression

$\displaystyle A_n := \{ a + b N_n^2: a,b \in {\bf Z}; |a|, |b| \leq N_n \},$

where ${N_n}$ is a sequence going to infinity; these are ${4}$ -approximate groups for all ${n}$ . Let ${B_n}$ be the subset

$\displaystyle B_n := \{ a + b N_n^2: a,b \in {\bf Z}; |a|^2 + |b|^2 \leq N_n^2 \}.$

Then the (reduced) Kronecker factor can be taken to be ${G = {\bf R}^2}$ with Lebesgue measure ${\mu_G}$ , and the additive limits of the ${1_{A_n}}$ and ${1_{B_n}}$ are then ${1_A}$ and ${1_B}$ , where ${A}$ is the square

$\displaystyle A := \{ (a,b) \in {\bf R}^2: |a|, |b| \leq 1 \}$

and ${B}$ is the circle

$\displaystyle B := \{ (a,b) \in {\bf R}^2: a^2+b^2 \leq 1 \}.$

Geometrically, the picture is similar to the Bohr set one, except now one uses a Freiman homomorphism ${a + b N_n^2 \mapsto (\frac{a}{N_n}, \frac{b}{N_n})}$ for ${a,b = O( N_n )}$ to embed the original sets ${A_n, B_n}$ into the plane ${{\bf R}^2}$ . In particular, one now expects the growth rate of the iterated sumsets ${k A_n}$ and ${k B_n}$ to be quadratic in ${k}$ , in the regime where ${k}$ is bounded and ${n}$ is large.

Example 4 (Dissociated sets) Let ${d}$ be a fixed natural number, and take

$\displaystyle A_n = \{0, v_1,\dots,v_d,-v_1,\dots,-v_d \}$

where ${v_1,\dots,v_d}$ are randomly chosen elements of a large cyclic group ${{\bf Z}/p_n{\bf Z}}$ , where ${p_n}$ is a sequence of primes going to infinity. These are ${O(d)}$ -approximate groups. The (reduced) Kronecker factor ${G}$ can (almost surely) then be taken to be ${{\bf Z}^d}$ with counting measure, and the additive limit of ${1_{A_n}}$ is ${1_A}$ , where ${A = \{ 0, e_1,\dots,e_d,-e_1,\dots,-e_d\}}$ and ${e_1,\dots,e_d}$ is the standard basis of ${{\bf Z}^d}$ . In particular, the growth rates of ${k A_n}$ should grow approximately like ${k^d}$ for ${k}$ bounded and ${n}$ large.

Example 5 (Random subsets of groups) Let ${A_n = G_n}$ be a sequence of finite additive groups whose order is going to infinity. Let ${B_n}$ be a random subset of ${G_n}$ of some fixed density ${0 \leq \lambda \leq 1}$ . Then (almost surely) the Kronecker factor here can be reduced all the way to the trivial group ${\{0\}}$ , and the additive limit of the ${1_{B_n}}$ is the constant function ${\lambda}$ . The convolutions ${\frac{1}{|G_n|} 1_{B_n} * 1_{B_n}}$ then converge in the ultralimit (modulo almost everywhere equivalence) to the pullback of ${\lambda^2}$ ; this reflects the fact that ${(1-o(1))|G_n|}$ of the elements of ${G_n}$ can be represented as the sum of two elements of ${B_n}$ in ${(\lambda^2 + o(1)) |G_n|}$ ways. In particular, ${B_n+B_n}$ occupies a proportion ${1-o(1)}$ of ${G_n}$ .

Example 6 (Trigonometric series) Take ${A_n = G_n = {\bf Z}/p_n {\bf C}}$ for a sequence ${p_n}$ of primes going to infinity, and for each ${n}$ let ${\xi_{n,1},\xi_{n,2},\dots}$ be an infinite sequence of frequencies chosen uniformly and independently from ${{\bf Z}/p_n{\bf Z}}$ . Let ${f_n\colon {\bf Z}/p_n{\bf Z} \rightarrow {\bf C}}$ denote the random trigonometric series

$\displaystyle f_n(x) := \sum_{j=1}^\infty 2^{-j} e^{2\pi i \xi_{n,j} x / p_n }.$

Then (almost surely) we can take the reduced Kronecker factor ${G}$ to be the infinite torus ${({\bf R}/{\bf Z})^{\bf N}}$ (with the Haar probability measure ${\mu_G}$ ), and the additive limit of the ${f_n}$ then becomes the function ${f\colon ({\bf R}/{\bf Z})^{\bf N} \rightarrow {\bf R}}$ defined by the formula

$\displaystyle f( (x_j)_{j=1}^\infty ) := \sum_{j=1}^\infty e^{2\pi i x_j}.$

In fact, the pullback ${\pi^* f}$ is the ultralimit of the ${f_n}$ . As such, for any standard exponent ${1 \leq q < \infty}$ , the normalised ${l^q}$ norm

$\displaystyle (\frac{1}{p_n} \sum_{x \in {\bf Z}/p_n{\bf Z}} |f_n(x)|^q)^{1/q}$

can be seen to converge to the limit

$\displaystyle (\int_{({\bf R}/{\bf Z})^{\bf N}} |f(x)|^q\ d\mu_G(x))^{1/q}.$

The reader is invited to consider combinations of the above examples, e.g. random subsets of Bohr sets, to get a sense of the general case of Theorem 1.

It is likely that this theorem can be extended to the noncommutative setting, using the noncommutative Freiman theorem of Emmanuel Breuillard, Ben Green, and myself, but I have not attempted to do so here (see though this recent preprint of Anush Tserunyan for some related explorations); in a separate direction, there should be extensions that can control higher Gowers norms, in the spirit of the work of Szegedy.

Note: the arguments below will presume some familiarity with additive combinatorics and with nonstandard analysis, and will be a little sketchy in places.

— 1. Proof of theorem —

By Freiman’s theorem for arbitrary abelian groups (see this paper of Green and Ruzsa), we can find an ultra coset progression ${H+P}$ such that

$\displaystyle A_\alpha \subset H+P \subset A_\alpha^m$

for some standard ${m}$ ; we abbreviate the latter inclusion as ${H+P \subset A_\alpha^{O(1)}}$ . By an ultra coset progression, we mean the sumset of a nonstandard finite group ${H}$ and a nonstandard generalised arithmetic progression

$\displaystyle P = \{ l_1 a_1 + \dots + l_r a_r: |l_i| \leq L_i \hbox{ for } i=1,\dots,r \}$

with ${r}$ (known as the rank) standard, the generators ${a_1,\dots,a_r}$ in ${O(A_\alpha)}$ and the dimensions ${L_1,\dots,L_r}$ being nonstandard natural numbers. (To get the containment ${H+P \subset A_\alpha^{O(1)}}$ , one can first use the Bogulybov lemma to get a large ultra coset progression ${H+P'}$ inside ${A_\alpha^4}$ , so that ${A_\alpha}$ can be covered by ${O(1)}$ translates of ${H+P'}$ ; one can then add these translates to the generators of ${P'}$ to obtain an ultra coset progression ${H+P}$ with the required properties.

We call the ultra coset progression ${O(1)}$ -proper if the sums ${h + l_1 a_1 + \dots + l_r a_r}$ for ${h \in H}$ and ${l_i = O(L_i)}$ for ${i=1,\dots,r}$ are all distinct. If ${H+P}$ fails to be ${O(1)}$ -proper, then we can find a containment

$\displaystyle H+P \subset H'+P' \subset A_\alpha^{O(1)}$

where the coset progression ${H'+P'}$ has strictly smaller rank than ${H+P}$ ; see e.g. Lemma 5.1 of this paper of Van Vu and myself). Iterating this fact, we see that we may assume without loss of generality that ${H+P}$ is ${O(1)}$ -proper. In particular, the group ${O(A_\alpha)}$ can now be parameterised by the sums ${h + l_1 a_1 + \dots + l_r a_r}$ with ${l_i = O(L_i)}$ for ${i=1,\dots,r}$ , with each element of ${O(A_\alpha)}$ having exactly one representation of this form.

The dimensions ${L_1,\dots,L_r}$ are either bounded (and thus standard natural numbers) or unbounded. After permuting the generators if necessary, we may assume that ${L_1,\dots,L_d}$ are unbounded and ${L_{d+1},\dots,L_{d+m}}$ are bounded for some ${d,m}$ with ${d+m=r}$ . We then have an external surjective group homomorphism ${\pi_0\colon O(A_\alpha) \rightarrow {\bf R}^d \times {\bf Z}^m}$ defined by

$\displaystyle \pi_0( h + l_1 a_1 + \dots + l_r a_r ) := ( \hbox{st} \frac{l_1}{N_1}, \dots, \hbox{st} \frac{l_d}{N_d}, l_{d+1},\dots,l_{d+m} );$

this will end up being the non-compact portion of the projection map ${\pi\colon O(A_\alpha) \rightarrow G}$ that we will eventually construct. The image ${\pi_0(A_\alpha)}$ is precompact in ${{\bf R}^d \times {\bf Z}^m}$ (in fact it is compact, thanks to countable saturation).

Now we perform some Fourier analysis on ${O(A_\alpha)}$ (analogous to the usual theory of Fourier analysis on locally compact abelian groups). Define a frequency to be a measurable homomorphism ${\xi\colon x \mapsto \xi \cdot x}$ from ${O(A_\alpha)}$ to ${{\bf R}/{\bf Z}}$ , and let ${\widehat{O(A_\alpha)}}$ denote the space of such frequencies; this is an additive group, which should be thought of as a “Pontryagin dual” to ${O(A_\alpha)}$ (even though ${O(A_\alpha)}$ is not a locally compact group). Meanwhile, we have the (genuine) Pontryagin dual ${{\bf R}^d \times ({\bf R}/{\bf Z})^m}$ of ${{\bf R}^d \times {\bf Z}^m}$ , using the identification

$\displaystyle (\xi_1,\dots,\xi_d,\theta_1,\dots,\theta_m) \cdot (x_1,\dots,x_d,a_1,\dots,a_m) := (\xi_1 x_1 + \dots + \xi_d x_d)$

$\displaystyle \hbox{ mod } 1 + a_1\theta_1 + \dots + a_m \theta_m.$

The homomorphism ${\pi_0\colon O(A_\alpha) \rightarrow {\bf R}^d \times {\bf Z}^m}$ then induces a dual homomorphism ${\hat \pi_0\colon {\bf R}^d \times ({\bf R}/{\bf Z})^m \rightarrow \widehat{O(A_\alpha)}}$ , defined by the formula

$\displaystyle \hat \pi_0(\xi) \cdot x := \xi \cdot \pi_0(x)$

for all ${\xi \in {\bf R}^d \times ({\bf R}/{\bf Z})^m}$ and ${x \in O(A_\alpha)}$ . This homomorphism ${\hat \pi_0}$ is easily seen to be injective. If we let ${\Gamma_0}$ denote the cokernel of this map, then ${\Gamma_0}$ is an abelian group (which we will view as a discrete group) and we have the short exact sequence

$\displaystyle 0 \rightarrow {\bf R}^d \times ({\bf R}/{\bf Z})^m \rightarrow \widehat{O(A_\alpha)} \rightarrow \Gamma_0 \rightarrow 0.$

Observe that ${{\bf R}^d \times ({\bf R}/{\bf Z})^m}$ is a divisible group. From this and a Zorn’s lemma argument we can split this short exact sequence, lifting ${\Gamma_0}$ up to a subgroup ${\Gamma}$ of ${\widehat{O(A_\alpha)}}$ , so that the latter group can be viewed as the direct sum of ${\Gamma}$ and ${{\bf R}^d \times ({\bf R}/{\bf Z})^m}$ .

Let ${T}$ be the Pontryagin dual of ${\Gamma}$ , that is to say the space of all homomorphisms ${t\colon \xi \mapsto \xi \cdot t}$ from ${\Gamma}$ to ${{\bf R}/{\bf Z}}$ (with no measurability or continuity hypotheses imposed). This is a compact abelian group (it is a closed subset of ${({\bf R}/{\bf Z})^\Gamma}$ , which is compact by Tychonoff’s theorem). Set ${G := {\bf R}^d \times {\bf Z}^m \times T}$ . We have a homomorphism ${\pi\colon O( A_\alpha ) \rightarrow G}$ , defined by

$\displaystyle \pi( x ) := ( \pi_0(x), (\xi \mapsto \xi \cdot x) ).$

We claim that ${\pi}$ has dense image. Since ${\pi_0}$ is surjective, it suffices to show that the map from ${x}$ to ${(\xi \mapsto \xi \cdot x)}$ has dense image from ${o(A_\alpha)}$ to ${T}$ , where

$\displaystyle o(A_\alpha) = \{ h + l_1 a_1 + \dots + l_r a_r: h \in H, l_i \in o(L_i) \hbox{ for } i=1,\dots, d\}$

is the kernel of ${\pi_0}$ . The closure of the image of ${o(A_\alpha)}$ is a compact subgroup of ${T}$ , so this map did not have dense image, there would exist a non-trivial ${\xi}$ in the Pontryagin dual ${\hat T = \Gamma}$ of ${T}$ which annihilates all of ${o(A_\alpha)}$ . The map ${x \mapsto \xi \cdot x}$ then factors through ${{\bf R}^d \times {\bf Z}^m}$ and thus can be identified with an element of ${{\bf R}^d \times ({\bf R}/{\bf Z})^m}$ ; but ${{\bf R}^d \times ({\bf R}/{\bf Z})^m}$ and ${\Gamma}$ only intersect at ${0}$ , a contradiction. Thus ${\pi}$ has dense image.

It is a routine matter to verify that ${\pi}$ is measurable (noting that the Baire algebra is generated by the product of measurable sets in ${{\bf R}^d \times ({\bf R}/{\bf Z})^m}$ and cylinder sets in ${T}$ ), that ${\pi(A_\alpha)}$ is precompact, and that the inverse image of any compact set is contained in ${A_\alpha^m}$ for some standard ${m}$ . From this and the Riesz representation theorem, we can define a Haar measure ${\mu_G}$ on ${G}$ by defining

$\displaystyle \int_G f(x)\ d\mu_G(x) := \int_{O(A_\alpha)} f(\pi(x))\ d\mu_{O(A_\alpha)}$

for all continuous, compactly supported functions ${f\colon G \rightarrow {\bf C}}$ ; the translation invariance of this measure follows from the surjectivity of ${\pi}$ . From Urysohn’s lemma and the inner and outer regularity of Haar measures, one can then show that ${\mu_G}$ is the pushforward of Loeb measure ${\mu_{O(A_\alpha)}}$ under ${\pi_*}$ .

Now we show the convolution property (3). First suppose that ${\pi_* f = 0}$ , which in particular implies that

$\displaystyle \int_{O(A_\alpha)} f(x) e^{-2\pi i \xi \cdot x}\ d\mu_{O(A_\alpha)}(x) = 0$

for all ${\xi \in \widehat{O(A_\alpha)}}$ , since the function ${x \mapsto e^{-2\pi i \xi \cdot x}}$ factors through ${G}$ . By the Loeb measure construction, we can write ${f}$ as the limit (in ${L^1}$ ) of functions ${\hbox{st} f_j}$ , where ${f_j\colon A_\alpha^m \rightarrow {}^* {\bf C}}$ are uniformly bounded nonstandard functions and ${m}$ is some standard natural number. Then we have

$\displaystyle \lim_{j \rightarrow \infty} \sup_{\xi \in \widehat{O(\alpha)}} |\int_{O(A_\alpha)} \hbox{st} f_j(x) e^{-2\pi i \xi \cdot x}\ d\mu_{O(A_\alpha)}(x)| = 0,$

which in particular implies that

$\displaystyle \lim_{j \rightarrow \infty} \hbox{st} \sup_{\xi} \frac{1}{|A_\alpha|} |\sum_{x \in A_\alpha^m} f_j(x) e^{-2\pi i \xi \cdot x}| = 0$

where ${\xi}$ ranges over all nonstandard maps ${\xi\colon A_\alpha^m \rightarrow {}^* {\bf R}/{\bf Z}}$ of the form

$\displaystyle \xi( h + l_1 a_1 + \dots + l_r a_r ) := \xi_0(h) + l_1 \theta_1 + \dots + l_r \theta_r$

for some ${\theta_1,\dots,\theta_r \in {}^* {\bf R}/{\bf Z}}$ and nonstandard homomorphism ${\xi_0\colon H \rightarrow {}^* {\bf R}/{\bf Z}}$ . From (nonstandard) Fourier analysis, we conclude that

$\displaystyle \lim_{j \rightarrow \infty} \hbox{st} \frac{1}{|A_\alpha|^3} \sum_{x \in A_\alpha^{m+m'}} |\sum_{y \in A_\alpha^m} f_j(y) g(x-y)|^2 = 0$

for any bounded nonstandard function ${g\colon A_\alpha^{m'} \rightarrow {}^* {\bf C}}$ , or equivalently that

$\displaystyle \lim_{j \rightarrow \infty} \int_{O(A_\alpha)} |\hbox{st} f_j \star \hbox{st} g(x)|^2\ d\mu_{O(A_\alpha)}(x) = 0$

and thus on taking limits we see that ${f \star \hbox{st} g = 0}$ , and on taking further limits we see that ${f*g=0}$ for any ${g \in L}$ , as required. This proves (3) when ${\pi_* f=0}$ ; similarly when ${\pi_* g = 0}$ . To finish off the general case of (3), it suffices to show that

$\displaystyle \pi^* f \star \pi^* g = \pi^* (f \star g)$

for bounded measurable ${f,g\colon G \rightarrow {\bf C}}$ . By Fourier decomposition, we may assume that ${f}$ takes the form

$\displaystyle f(x,t) = f_0(x) e^{2\pi i \xi \cdot t}$

for some ${\xi \in \Gamma = \hat T}$ and some continuous compactly supported ${f_0\colon {\bf R}^d \times {\bf Z}^m \rightarrow {\bf C}}$ , and similarly

$\displaystyle g(x,t) = g_0(x) e^{2\pi i \eta \cdot t}$

for some ${\eta \in \Gamma}$ and continuous compactly supported ${g_0\colon {\bf R}^d \times {\bf Z}^m\rightarrow {\bf C}}$ .

If ${\xi \neq \eta}$ , then ${\xi \cdot h \neq \eta \cdot h}$ for some ${h \in T}$ , and one can use this to show that ${f \star g}$ and ${\pi^* f \star \pi^* g}$ both vanish. Thus we may assume that ${\xi = \eta}$ ; using modulation symmetry we may then assume that ${\xi=\eta=0}$ . It thus suffices to show that

$\displaystyle \pi^*_0 f \star \pi^*_0 g = \pi^*_0 (f \star g).$

A direct calculation shows that the left and right hand sides agree up to constants; but both sides also have integral ${(\int_G f\ d\mu_G) (\int_G g\ d\mu_G)}$ when integrated against ${\mu_{O(A_\alpha)}}$ , so they must agree identically.

Now, we prove the inclusions (1). The outer inclusion ${4A_\alpha \subset \pi^{-1}( K_0 )}$ comes from the compactness (or precompactness) of ${\pi(A_\alpha)}$ . For the inner inclusion, we note from (3) and the positive measure and symmetry of ${A_\alpha}$ that ${1_{A_\alpha} * 1_{A_\alpha}}$ is the pullback ${\pi^*}$ of a continuous function on ${G}$ that is positive at the origin, and thus also bounded away from zero on a neighbourhood ${U}$ of the origin. This implies that the set ${2A_\alpha}$ has full measure in ${\pi^{-1}(U)}$ . We then let ${U_0}$ be a smaller symmetric neighbourhood of the origin such that ${U_0 + U_0 \subset U}$ . We then see that for any ${x \in \pi^{-1}(U_0)}$ , the sets ${2A_\alpha}$ and ${x - 2A_\alpha}$ both have full measure in ${\pi^{-1}(U_0)}$ , and hence ${x}$ lies in ${4A_\alpha}$ . This gives the inner inclusion ${\pi^{-1}(U_0) \subset 4A_\alpha}$ of (1).

Finally, we show the regularity claim (2). Given ${K \subset U \subset G}$ , we may apply Urysohn’s lemma to find non-negative bounded continuous functions ${f, g\colon G \rightarrow {\bf C}}$ such that ${f \ast g}$ is supported in ${U}$ and is at least ${1}$ on ${K}$ . Letting ${\tilde f, \tilde g \in L}$ be the pullbacks of ${f, g}$ by ${\pi}$ , we conclude using (3) that ${\tilde f \ast \tilde g}$ is at least ${1}$ on ${K}$ and vanishes outside of ${U}$ . Approximating ${\tilde f, \tilde g}$ in ${L^2}$ by bounded nonstandard functions ${F, G}$ supported in ${A_\alpha^{O(1)}}$ , we conclude that ${F \ast G}$ is at least ${2/3}$ (say) on ${K}$ and less than ${1/3}$ (say) outside of ${U}$ . If one then sets ${B}$ to be the non-standard set where ${F*G > 1/2}$ , we obtain the claim.

— 2. Sample applications of theorem —

In this section we illustrate how this theorem can be used to reprove some existing results in additive combinatorics, reducing them to various statements in continuous harmonic analysis. We begin with a qualitative version of a result of Croot and Sisask on almost periods, which reduces to the classical fact that the convolution of two square-integrable functions is continuous.

Proposition 7 (Croot-Sisask) Let ${A}$ be a ${K}$ -approximate group in an additive group ${G}$ , let ${B, C}$ be subsets of ${A}$ , and let ${0 < \varepsilon < 1}$ . Then there exists a subset ${S}$ of ${4A}$ with ${|S| \gg_{K,\varepsilon} |A|}$ such that

$\displaystyle \| 1_B \star 1_C - 1_{s+B} \star 1_C \|_{\ell^2} \leq \varepsilon |A|^{3/2}$

for all ${s \in S}$ (using the non-normalised convolution ${\star}$ ).

The Croot-Sisask argument in fact gives a quantitative lower bound of exponential type on ${S}$ , but such bounds are not available through the qualitative limiting arguments given here. The Croot-Sisask argument also works in non-commutative groups ${G}$ ; it is likely the arguments here would also extend to that setting once one developed a non-commutative version of Theorem 1, but we have not investigated that here.

Proof: By the usual transfer arguments, it suffices to show that when ${A}$ is an ultra approximate group, ${B,C}$ are non-standard subsets of ${A}$ , and ${0 < \varepsilon < 1}$ , there exists a nonstandard subset ${S}$ of ${4A}$ with ${|S| \gg |A|}$ such that

$\displaystyle \| 1_B \star 1_C - 1_{s+B} \star 1_C \|_{L^2(\mu_{O(A)})} \leq \varepsilon \ \ \ \ \ (5)$

for all ${s \in S}$ (using the normalised convolution ${\star}$ ). But by (3), ${1_B \star 1_C}$ is the pullback via ${\pi_*}$ of a continuous compactly supported function, so (5) holds for ${s}$ in ${\pi^{-1}(U)}$ for some neighbourhood ${U}$ of the identity, and thus by (2) it also holds for all ${s}$ in some nonstandard ${S \subset O(A)}$ of positive Loeb measure. The claim follows. $\Box$

Now we give a proof of Roth’s theorem (in the averaged form of Varnavides), at least for groups of odd order.

Proposition 8 (Roth’s theorem) Let ${\delta > 0}$ , let ${G}$ be a finite group of odd order, and let ${A \subset G}$ be such that ${|A| \geq \delta |G|}$ . Then there are ${\gg_\delta |G|^2}$ pairs ${(a,r) \in G^2}$ such that ${a,a+r,a+2r \in A}$ .

Proof: By the usual transfer arguments, it suffices to show that when ${G_\alpha}$ is a nonstandard finite group of odd order, and ${A}$ is a nonstandard subset of ${G_\alpha}$ with ${|A| \gg |G_\alpha|}$ , then there are ${\gg |G_\alpha|^2}$ pairs ${(a,r) \in G_\alpha^2}$ such that ${a,a+r,a+2r \in A}$ ; equivalently, we need to show that

$\displaystyle \langle 1_A * 1_A, 1_{2\cdot A} \rangle_{L^2} > 0 \ \ \ \ \ (6)$

where ${2 \cdot A := \{ 2a: a \in A \}}$ . As ${A}$ has positive measure, ${\pi_*(1_A)}$ is not identically zero. By a version of the Lebesgue differentiation theorem, we can then find a point ${x_0}$ in the Kronecker factor group ${G}$ such that ${\pi_*(1_A)}$ has positive density on every precompact neighbourhood of ${x_0}$ , and is bounded away from zero on a subset of a symmetric open precompact neighbourhood ${x_0+U}$ of ${x_0}$ of density greater than ${1/2}$ . From this and (3) we see that ${1_A * 1_A}$ is bounded away from zero on almost all of ${\pi^{-1}(2x_0 + V)}$ for some neighbourhood ${2x_0+V}$ of ${2x_0}$ . Also, as ${G_\alpha}$ has odd order, the map ${x \mapsto 2x}$ is a measure-preserving map on ${G_\alpha}$ , it must also be so on ${G}$ , and so we conclude that ${1_{2 \cdot A}}$ has positive measure in ${\pi^{-1}(2x_0+V)}$ , and (6) follows. $\Box$

Finally, we give a more advanced application of additive limits, namely reproving a lemma of Eberhard, Green, and Manners.

Proposition 9 For every ${\varepsilon>0}$ there is ${\delta>0}$ such that if ${A \subset \{1,\dots,N\}}$ is such that ${\lvert \{ 1_A \star 1_A \geq \delta N\} \rvert \leq 4|A| - \varepsilon N}$ , then there is an arithmetic progression ${P \subset \{1,\dots,N\}}$ such that ${|P| \gg_\varepsilon N}$ and ${|A \cap P| \geq (\frac{1}{2} + \frac{\varepsilon}{5})|P|}$ .

Proof: Here, we perform the transfer step more explicitly, as it is slightly trickier here. Suppose for contradiction that the claim failed, then there exists an ${\varepsilon>0}$ and a sequence ${A_n \subset \{1,\dots,N_n\}}$ with

$\displaystyle \lvert \{ 1_{A_n} \star 1_{A_n} \geq \frac{1}{n} N_n\} \rvert \leq 4|A_n| - \varepsilon N_n \ \ \ \ \ (7)$

but such that there is no arithmetic progression ${P \subset \{1,\dots,N_n\}}$ with ${|P| \geq N_n/n}$ such that ${|A_n \cap P| \geq (\frac{1}{2} - \frac{\varepsilon}{5}) |P|}$ . Note that ${N_n}$ must go to infinity (otherwise one could take ${P}$ to consist of a single element of ${A_n}$ , which must be non-empty from (7). Taking ultraproducts, we arrive at a nonstandard subset ${A}$ of ${\{1,\dots,N\}}$ for some unbounded natural number ${N}$ , such that

$\displaystyle \mu_{O(N)}(\{ 1_A \star 1_A > 0 \} ) \leq 4 \mu_{O(N)}(A) - \varepsilon \ \ \ \ \ (8)$

and such that there is no nonstandard arithmetic progression ${P \subset \{1,\dots,N\}}$ with ${\mu_{O(N)}(P) > 0}$ and ${\mu_{O(N)}( A \cap P ) \geq (\frac{1}{2} + 0.21 \varepsilon) \mu_{O(N)}(P)}$ (say). Here ${\mu_{O(N)}}$ is the Loeb measure associated with the approximate group ${\{-N/2,\dots,N/2\}}$ and the group ${O(N) := \{ n \in {}^* {\bf N}: n = O(N) \}}$ that it generates. By inspection of the proof of Theorem 1, the Kronecker factor of this group can be taken to be ${G = {\bf R} \times T}$ for some compact group ${T}$ , with projection ${\pi\colon O(N) \rightarrow {\bf R} \times T}$ given by ${\pi(n) := (\hbox{st} \frac{n}{N}, \pi_T(n))}$ for some measurable map ${\pi_T\colon O(N) \rightarrow T}$ , and Haar measure ${\mu_G}$ given by the product of Lebesgue measure on ${{\bf R}}$ and the Haar probability measure ${\mu_T}$ on ${T}$ . If we let ${f := \pi_* 1_A}$ , then ${f}$ is supported in ${[0,2] \times T}$ and takes values in ${[0,1]}$ , and from (3) and (8) we see that the set

$\displaystyle E := \{ (x,t) \in {\bf R} \times T: f \star f(x,t) > 0 \}$

is such that

$\displaystyle \mu_G(E) \leq 4 \int_G f\ d\mu_G - \varepsilon.$

We can view ${f}$ as a measurable function ${\tilde f\colon {\bf R} \rightarrow L^1(T)}$ , defined by ${\tilde f(x)(t) := f(x,t)}$ , and similarly the indicator function ${1_E}$ can be viewed as a measurable function ${\tilde 1_E\colon {\bf R} \rightarrow L^1(T)}$ defined by ${\tilde 1_E(x)(t) := 1_E(x,t)}$ . Being measurable, ${\tilde f}$ may be approximated in ${L^1}$ by piecewise constant functions. One can then adapt the proof of the Lebesgue differentiation theorem to show that almost all ${x \in {\bf R}}$ are a Lebesgue point of ${\tilde f}$ , in the sense that

$\displaystyle \lim_{r \rightarrow 0} \frac{1}{2r} \int_{x-r}^{x+r} \| \tilde f(y)-\tilde f(x) \|_{L^1(T)}\ dy = 0.$

Similarly, almost all ${x \in {\bf R}}$ is a Lebesgue point of ${\tilde 1_E}$ in the sense that

$\displaystyle \lim_{r \rightarrow 0} \frac{1}{2r} \int_{x-r}^{x+r} |\tilde 1_E(y)-\tilde 1_E(x)|\ dy = 0.$

From this, we see that for almost all ${x \in {\bf R}}$ , we have the inclusion

$\displaystyle \{ t \in T: (\tilde f(x) \star \tilde f(x))(t) > 0 \} \subset \{ t \in T: \tilde 1_E(2x)(t) > 0 \} \ \ \ \ \ (9)$

up to null sets in ${T}$ , where the convolution is now with respect to the Haar probability measure ${\mu_T}$ . On the other hand, from Fubini’s theorem we have

$\displaystyle \int_{\bf R} (\int_T \tilde f(x)\ d\mu_T)\ dx = \int_G f\ d\mu_G$

and

$\displaystyle \int_{\bf R} (\int_T \tilde 1_E(2x)\ d\mu_T)\ dx = \mu_G(E)/2 \leq 2 \int_G f\ d\mu_G - \varepsilon/2.$

Also ${\int_T \tilde f(x)\ d\mu_T}$ is supported on ${[0,2]}$ . Thus by the pigeonhole principle, we may find an ${x_* \in (0,2)}$ such that

$\displaystyle \int_T \tilde 1_E(2x_*)\ d\mu_T \leq 2 \int_T \tilde f(x_*)\ d\mu_T - \varepsilon/2$

and such that (9) holds up to null sets, and such that ${x_*}$ is a Lebesgue point for ${\tilde f}$ . If we fix this ${x_*}$ and now set ${F := \tilde f(x)}$ and ${B := \{ t \in T: F*F(t) > 0\}}$ , we thus have

$\displaystyle \mu_T( B ) \leq 2 \int_T F\ d\mu_T - \varepsilon/2. \ \ \ \ \ (10)$

At this point it is convenient to split the compact abelian group ${T}$ as

$\displaystyle 0 \rightarrow T_0 \rightarrow T \rightarrow T/T_0 \rightarrow 0$

where ${T_0}$ is the connected component of the identity, thus ${T/T_0}$ is a totally disconnected group. Let ${\tilde F}$ be the pushforward of ${F}$ to ${T/T_0}$ via the projection map ${\pi_0\colon T \rightarrow T/T_0}$ , thus ${\tilde F\colon T/T_0 \rightarrow [0,1]}$ is a measurable function of total integral ${\int_T F\ d\mu_T}$ . We claim that

$\displaystyle \|\tilde F \|_{L^\infty(T/T_0)} > \frac{1}{2} + 0.24 \varepsilon. \ \ \ \ \ (11)$

To see this, suppose for contradiction that

$\displaystyle \|\tilde F \|_{L^\infty(T/T_0)} \leq \frac{1}{2} + 0.24 \varepsilon.$

We may disintegrate

$\displaystyle F = \int_{T/T_0} \nu_y\ d\mu_{T/T_0}(y)$

where ${y \mapsto \nu_y}$ is a measurable map from ${T/T_0}$ to finite measures on ${T}$ , such that for almost every ${y}$ , ${\nu_y}$ is supported on ${\pi_0^{-1}(y)}$ and has total mass ${\tilde F(y)}$ . For almost every ${y_0}$ and ${y}$ , we then have that

$\displaystyle \nu_y * \nu_{y_0}$

is supported in ${B}$ . By Fubini’s theorem we have

$\displaystyle \int_{T/T_0} \mu_{\pi_0^{-1}(y)}( B )\ d\mu_{T/T_0}(y) = \mu_T(B)$

where ${\mu_{\pi_0^{-1}(y)}}$ is the Haar probability measure on ${\pi_0^{-1}(y)}$ . From (10), we conclude that for almost every ${y_0}$ , there is a positive measure set of ${y}$ such that

$\displaystyle \mu_{\pi_0^{-1}(y)}( B ) \leq 2 \tilde F( y - y_0 ) - \varepsilon/2 < 1$

for a positive measure set of ${y}$ (in particular ${\tilde F(y-y_0)>0}$ ), and that ${\nu_{y-y_0} * \nu_{y_0}}$ is supported in ${B}$ . On the other hand, ${\nu_{y-y_0}}$ and ${\nu_{y_0}}$ are supported on sets of measure at least ${\tilde F(y-y_0)}$ and ${F(y_0)}$ . Applying Kemperman’s theorem (see this previous post) , we conclude that

$\displaystyle \tilde F(y-y_0) + \tilde F(y_0) \leq 2 \tilde F( y - y_0 ) - \varepsilon/2$

for almost every ${y_0}$ with ${\tilde F(y_0)>0}$ , and for a positive measure set of ${y}$ . But this leads to a contradiction if we take ${\tilde F(y_0)}$ to be within ${\varepsilon/2}$ of the essential supremum of ${\tilde F}$ . This proves (11).

As ${T/T_0}$ is totally disconnected, we can express the origin as the intersection of open subgroups. From this, (11), and a Lebesgue differentiation argument, we may find a coset ${y_0+K_0}$ of an open subgroup ${K_0}$ of ${T/T_0}$ such that

$\displaystyle \int_{y_0+K_0} \tilde F\ d\mu_{T/T_0} > (\frac{1}{2} + 0.24 \varepsilon) \mu_{T/T_0}(y_0+K_0).$

Letting ${t_0+K}$ be a pullback of ${y_0+K_0}$ to ${T}$ , we thus have

$\displaystyle \int_{t_0+K} F\ d\mu_T > (\frac{1}{2} + 0.24 \varepsilon) \mu_{T}(t_0+K).$

Since ${x_*}$ is a Lebesgue point for ${\tilde f}$ , we may thus find a neighbourhood ${I}$ of ${x_*}$ in ${{\bf R}}$ such that

$\displaystyle \int_{I \times (t_0+K)} f\ d\mu_{{\bf R} \times T} > (\frac{1}{2} + 0.24 \varepsilon) \mu_{{\bf R} \times T}( I \times (t_0+K) )$

or equivalently

$\displaystyle \mu_{O(N)}( A \cap \pi^{-1}(I \times (t_0+K)) ) > (\frac{1}{2} + 0.24 \varepsilon) \mu_{O(N)}( \pi^{-1}(I \times (t_0+K)) ).$

To finish the proof of the claim, it then suffices to show that ${\pi^{-1}(I \times (t_0+K))}$ differs from a nonstandard arithmetic progression by a set of arbitrarily small Loeb measure.

Consider the quotient homomorphism ${\phi\colon O(N) \rightarrow T/K}$ formed by first using ${\pi}$ to project ${O(N)}$ to ${{\bf R} \times T}$ , then projecting to ${T}$ , then to ${T/K}$ . This is a Loeb measurable map, and thus the pointwise limit (up to null sets) by a nonstandard function. But observe that for ${x \in O(N)}$ , one has ${\phi(x) = a}$ if and only if ${\phi(x+y) = \phi(y) + a}$ for almost every ${y \in \{1,\dots,N\}}$ . In particuar, if ${\phi'}$ is a nonstandard function which is sufficiently close to ${\phi}$ , then ${\phi(x)=a}$ if and only if ${a}$ is the most common value of ${\phi'(x+y)-\phi'(y)}$ for ${y \in \{1,\dots,N\}}$ . Using this, one can find a representative of ${\phi}$ that is precisely a nonstandard function on ${\{-100N,\dots,100N\}}$ (say). Thus ${\phi}$ is now a nonstandard map from ${\{-100N,\dots,100N\}}$ to the standard finite group ${G/K}$ , and from construction one can check that ${\phi(x+y)=\phi(x)+\phi(y)}$ for all (and not merely almost all) ${x,y \in \{-10N, \dots,10N\}}$ . From this it is easy to see that ${\phi}$ is periodic with some bounded period, and that the level sets of ${\phi}$ are infinite nonstandard arithmetic progressions of bounded spacing. The claim then follows. $\Box$

15 comments

Comments feed for this article

12 October, 2014 at 4:00 pm

Anonymous

A few comments:

* “What “converges” means in this context means that” –> “What “converges” means in this context is that”
* Use “ $\lvert$ ” and “ $\rvert$ ”, reps., for absolut values instead of “ $|$ ” to get correct horixontal spacing
* Should there be four integrals instead of three in the sixth displayed expression?
* Use “ $\colon$ ” instead of “ $:$ ” to get correct horixontal spacing

[Corrected, thanks – T.]

12 October, 2014 at 4:02 pm

Anonymous

The last comment should be as follows:

* Use “ $\colon$ ” instead of “ $:$ ” for maps to get correct horizontal spacing

13 October, 2014 at 2:52 am

Mike

Click to access 0712.2749.pdf

13 October, 2014 at 12:28 pm

Francisco Javier Thayer

Consider a hyperfinite graph G for which some rescaling of the graph distance of G has the property that its infinitesimal collapse is locally compact. (infinitesimal collapse means the space obtained identifying elements infinitely close to each other — which yields always a complete metric space in which of course distances between points may be infinite). Such a rescaling may not exist, but it does for instance if the graph has polynomial growth for some rescaling. The infinitesimal collapse in this case is a length space and has a natural measure obtained by push-down of Loeb measure. Does this collapse have any relation to the graph-limit construction?

13 October, 2014 at 1:51 pm

Terence Tao

The two limits are analogous, but correspond to different scaling regimes. For the usual graph limits, one works with dense graphs, in which there are $n$ vertices for some large $n$ , and the number of edges is comparable to $n^2$ ; in particular, the typical graph distance between two vertices is of the order of $O(1)$ (in fact, unless the graph is pretty much k-partite for some bounded k, the graph distance between most pairs of vertices will be 1 or 2). For the asymptotic cone construction, one works with Cayley graphs in which the number of edges is instead comparable to $n$ , and the graph distance between two vertices would be polynomial in $n$ (if one is in the polynomial growth case). One then normalises the graph distance to be $O(1)$ and takes a limit.

For bounded degree graphs, there is another notion of graph limit, based on what happens with local neighbourhoods of randomly chosen vertices at graph distance O(1). For Cayley graphs of finitely generated groups, this comes down to taking pointwise limits of the set of relations between generators, for instance ${\bf Z}/N{\bf Z}$ with generators $\pm 1$ converges in this sense to ${\bf Z}$ with the generators $\pm 1$ . This would also correspond to the asymptotic cone construction if one did not rescale the metric beforehand.

14 October, 2014 at 3:02 am

Chris Eagle

“The set ${E_\alpha}$ can then be viewed as a symmetric subset of ${E_\alpha \times E_\alpha}$”: surely ${E_\alpha}$ is a subset of $V_\alpha \times V_\alpha}$?

[Corrected, thanks – T.]

20 October, 2014 at 6:45 am

Sean Eberhard

Dear Terry,

This is a nice way of looking at these things. Has such a soft proof of Roth’s theorem (reduction to Lebesgue differentiation theorem) appeared before? I’ve always wanted to see such a proof.

In the proof of prop 4, by restricting to the symmetric case $A+A$ you get away with just the trivial case of Brunn-Minkowski $\mu(A+A)\geq\mu(2\cdot A)= 4\mu(A)$ in $\mathbf{R}^2$ , but if you wanted a similar result for $A+B$ or even $A-A$ I guess you would actually need to say “Brunn-Minkowski” at some point. In general I tend to think of prop 4 as an incarnation of Brunn-Minkowski. (By the way, in the statement of prop 4 I think you want $+\epsilon/5$ , not $-\epsilon/5$ .)

25 October, 2014 at 6:47 am

Terence Tao

Thanks for the correction!

Regarding Roth’s theorem, certainly the ergodic theory approach to Roth’s theorem can be routed through the Lebesgue differentiation theorem after reducing to the Kronecker factor there, which is also a compact group, and computing the second Host-Kra measure mu^{[2]} on that group. And in the graph theory approach there is an analogous proof (I think in Lovasz’s book, though I don’t have it handy right now) of Roth’s theorem via Lebesgue differentiation on the unit interval [0,1].

27 October, 2014 at 7:39 am

Sean Eberhard

Thanks for the references.

By the way, is it really true in Theorem 1 that $\pi$ is always surjective? In other words, in the proof, is it really true that the image of $o(A_\alpha)$ is a compact subgroup of $T$ ? I do agree that the image of $\pi$ is always dense, and this seems to be enough for you (i.e., this still implies that the push-forward measure is translation-invariant). If $T$ happens to be metrizable then $\pi$ is surjective by countable saturation.

27 October, 2014 at 9:22 am

Terence Tao

Oh, thanks for pointing out that subtlety! I am so used to dealing with metrisable situations that I forgot that on non-metrisable spaces countable saturation is sometimes insufficient to guarantee closed images. But as you say, dense image is sufficient for the current application.

20 January, 2015 at 4:02 pm

Sean Eberhard

Correct me if I’m wrong but in fact I think $\pi$ is *injective*. If we consider just the ultrafinite genuine group case for the moment, then there are lots of measurable characters on the ultraproduct coming from the ultraproduct of the dual groups: enough to separate points, by finite Fourier analysis. This implies that $\pi$ is injective.

Somewhat related to this, I’m not sure I believe that $\pi$ is (Loeb, Borel)-measurable. Though indeed the inverse images of the cylinder sets are measurable, unfortunately these do not generate the Borel sigma-algebra. I don’t have an example to prove this point though, so maybe I’ve just overlooked something. However I do beleive that $\pi$ is (Loeb,Baire)-measurable, since by Stone-Weierstrass every continuous function on $T$ is the $L^1$ limit of a sequence of character sums.

20 January, 2015 at 4:34 pm

Terence Tao

Yes, you’re right; I had forgotten about the distinctions between Baire and Borel in the inseparable case; I’ve updated the post to correct this. Regarding injectivity, I think you are right for the “universal” group T constructed in the blog post, but in practice one usually quotients down to a more manageable T (in particular, to a separable T, in order to avoid all these technical issues coming from inseparability), at which point injectivity is lost (but surjectivity is gained!). (But conclusion (iv) of the theorem need not hold for all functions f,g after such a quotienting; one can only save this conclusion for a separable space of functions, though this is usually plenty for applications.)

20 October, 2014 at 2:07 pm

Balazs Szegedy

I think that the following paper on mine (more recent than the one that you cite) is more relevant to this post:

http://arxiv.org/abs/1203.2260

In this paper I give a generalisation of the graph limit setting to the additive setting. I discuss how to take limits of subsets (or more generally functions) in compact abelian groups. Quite surprisingly the limit objects are defined on nil manifolds that are non-commutative structures.

Another very recent paper (joint with Lovasz) is about the non commutative setting:

http://arxiv.org/abs/1406.4958

Balazs

[Corrected, thanks – T.]

22 September, 2018 at 2:36 am

Group limits – Random Permutations

[…] few months ago on his blog Terry Tao explained how one could, by analogy with the theory of graph limits, replace the explicit […]

19 January, 2020 at 1:50 pm

The Kronecker factor of an ultrafinite group – Random Permutations

[…] few months ago on his blog Terry Tao explained how one could, by analogy with the theory of graph limits, replace the explicit […]

	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on 275A, Notes 3: The weak and st…
	Alex Gunning on A symmetric formulation of the…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on It ought to be common knowledg…
	Ring Theory Intervie… on Reading seminar: “Stable…
	Anonymous on Work hard
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…

Additive limits

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

15 comments

Leave a comment Cancel reply

For commenters

Additive limits

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

15 comments

Leave a comment Cancel reply

For commenters