In a previous post, we discussed the Szemerédi regularity lemma, and how a given graph could be regularised by partitioning the vertex set into random neighbourhoods. More precisely, we gave a proof of
Lemma 1 (Regularity lemma via random neighbourhoods) Let
. Then there exists integers
with the following property: whenever
be a graph on finitely many vertices, if one selects one of the integers
at random from
, then selects
vertices
uniformly from
at random, then the
vertex cells
(some of which can be empty) generated by the vertex neighbourhoods
for
, will obey the regularity property
with probability at least
, where the sum is over all pairs
for which
is not
-regular between
and
. [Recall that a pair
is
-regular for
if one has
for any
and
with
, where
is the density of edges between
and
.]
The proof was a combinatorial one, based on the standard energy increment argument.
In this post I would like to discuss an alternate approach to the regularity lemma, which is an infinitary approach passing through a graph-theoretic version of the Furstenberg correspondence principle (mentioned briefly in this earlier post of mine). While this approach superficially looks quite different from the combinatorial approach, it in fact uses many of the same ingredients, most notably a reliance on random neighbourhoods to regularise the graph. This approach was introduced by myself back in 2006, and used by Austin and by Austin and myself to establish some property testing results for hypergraphs; more recently, a closely related infinitary hypergraph removal lemma developed in the 2006 paper was also used by Austin to give new proofs of the multidimensional Szemeredi theorem and of the density Hales-Jewett theorem (the latter being a spinoff of the polymath1 project).
For various technical reasons we will not be able to use the correspondence principle to recover Lemma 1 in its full strength; instead, we will establish the following slightly weaker variant.
Lemma 2 (Regularity lemma via random neighbourhoods, weak version) Let
. Then there exist an integer
with the following property: whenever
be a graph on finitely many vertices, there exists
such that if one selects
vertices
uniformly from
at random, then the
vertex cells
generated by the vertex neighbourhoods
for
, will obey the regularity property (1) with probability at least
.
Roughly speaking, Lemma 1 asserts that one can regularise a large graph with high probability by using
random neighbourhoods, where
is chosen at random from one of a number of choices
; in contrast, the weaker Lemma 2 asserts that one can regularise a large graph
with high probability by using some integer
from
, but the exact choice of
depends on
, and it is not guaranteed that a randomly chosen
will be likely to work. While Lemma 2 is strictly weaker than Lemma 1, it still implies the (weighted) Szemerédi regularity lemma (Lemma 2 from the previous post).
— 1. The graph correspondence principle —
The first key tool in this argument is the graph correspondence principle, which takes a sequence of (increasingly large) graphs and uses random sampling to extract an infinitary limit object, which will turn out to be an infinite but random (and, crucially, exchangeable) graph. This concept of a graph limit is related to (though slightly different from) the “graphons” used as graph limits in the work of Lovasz and Szegedy, or the ultraproducts used in the work of Elek and Szegedy. It also seems to be related to the concept of an elementary limit that I discussed in this post, though this connection is still rather tentative.
The correspondence works as follows. We start with a finite, deterministic graph . We can then form an infinite, random graph
from this graph by the following recipe:
- The vertex set of
will be the integers
.
- For every integer
, we randomly select a vertex
in
, uniformly and independently at random. (Note that there will be many collisions, i.e. integers
for which
, but these collisions will become asymptotically negligible in the limit
.)
- We then define the edge set
of
by declaring
to be an edge on
if and only if
is an edge in
(which in particular requires
).
More succinctly, is the pullback of
under a random map from
to
.
The random graph captures all the “local” information of
, while obscuring all the “global” information. For instance, the edge density of
is essentially just the probability that a given edge, say
, lies in
. (There is a small error term due to the presence of collisions, but this goes away in the limit
.) Similarly, the triangle density of
is essentially the probability that a given triangle, say
, lies in
. On the other hand, it is difficult to read off global properties of
, such as being connected or
-colourable, just from
.
At first glance, it may seem a poor bargain to trade in a finite deterministic graph for an infinite random graph
, which is a more complicated and less elementary object. However, there are three major advantages of working with
rather than
:
- Exchangeability. The probability distribution of
has a powerful symmetry or exchangeability property: if one takes the random graph
and interchanges any two vertices in
, e.g.
and
, one obtains a new graph which is not equal to
, but nevertheless has the same probability distribution as
, basically because the
were selected in an iid manner. More generally, given any permutation
, the pullback
of
by
has the same probability distribution as
; thus we have a measure-preserving action of the symmetric group
, which places us in the general framework of ergodic theory.
- Limits. The space of probability measures on the space
of infinite graphs is sequentially compact; given any sequence
of infinite random graphs, one can find a subsequence
which converges in the vague topology to another infinite random graph. What this means is that given any event
on infinite graphs that involve only finitely many of the edges, the probability that
obeys
converges to the probability that
obeys
. (Thus, for instance, the probability that
contains the triangle
will converge to the probability that
contains the same triangle.) Note that properties that involve infinitely many edges (e.g. connectedness) need not be preserved under vague limits.
- Factors. The underlying probability space for the random variable
is the space
of infinite graphs, and it is natural to give this space the Borel
-algebra
, which is the
-algebra generated by the cylinder events “
” for
. But this
-algebra also has a number of useful sub-
-algebras or factors, representing various partial information on the graph
. In particular, given any subset
of
, one can create the factor
, defined as the
-algebra generated by the events “
” for
. Thus for instance, the event that
contains the triangle is measurable in
, but not in
. One can also look at compound factors such as
, the factor generated by the union of
and
. For instance, the event that
contains the edges
is measurable in
, but the event that
contains the triangle
is not.
The connection between the infinite random graph and partitioning through random neighbourhoods comes when contemplating the relative difference between a factor such as
and
(say). The latter factor is generated by the former factor, together with the events “
” for
. But observe if
is generated from a finite deterministic graph
, then
lies in
if and only if
lies in the vertex neighbourhood of
. Thus, if one uses the vertex neighbourhoods of
to subdivide the original vertex set
into
cells of varying sizes, the factor
is generated from
, together with the random variable that computes which of these
cells the random vertex
falls into. We will see this connection in more detail later in this post, when we use the correspondence principle to prove Lemma 2.
Combining the exchangeability and limit properties (and noting that the vague limit of exchangeable random graphs is still exchangeable), we obtain
Lemma 3 (Graph correspondence principle) Let
be a sequence of finite deterministic graphs, and let
be their infinite random counterparts. Then there exists a subsequence
such that
converges in the vague topology to an exchangeable infinite random graph
.
We can illustrate this principle with three main examples, two from opposing extremes of the “dichotomy between structure and randomness”, and one intermediate one.
Example 1 (Random example) Let
be a sequence of
-regular graphs of edge density
, where
,
, and
as
. Then any graph limit
of this sequence will be an Erdös-Rényi graph
, where each edge
lies in
with an independent probability of
.
Example 2 (Structured example) Let
be a sequence of complete bipartite graphs, where the two cells of the bipartite graph have vertex density
and
respectively, with
and
. Then any graph limit
of this sequence will be a random complete bipartite graph, constructed as follows: first, randomly colour each vertex
of
red with probability
and blue with probability
, independently for each vertex. Then define
to be the complete bipartite graph between the red vertices and the blue vertices.
Example 3 (Random+structured example) Let
be a sequence of incomplete bipartite graphs, where the two cells of the bipartite graph have vertex density
and
respectively, and the graph
is
-regular between these two cells with edge density
, with
,
,
, and
. Then any graph limit
of this sequence will be a random bipartite graph, constructed as follows: first, randomly colour each vertex
of
red with probability
and blue with probability
, independently for each vertex. Then define
to be the bipartite graph between the red vertices and the blue vertices, with each edge between red and blue having an independent probability of
of lying in
.
One can use the graph correspondence principle to prove statements about finite deterministic graphs, by the usual compactness and contradiction approach: argue by contradiction, create a sequence of finite deterministic graph counterexamples, use the correspondence principle to pass to an infinite random exchangeable limit, and obtain the desired contradiction in the infinitary setting. (See this earlier blog post of mine for further discussion.) This will be how we shall approach the proof of Lemma 2.
— 2. The infinitary regularity lemma —
To prove the finitary regularity lemma via the correspondence principle, one must first develop an infinitary counterpart. We will present this infinitary regularity lemma (first introduced in this paper) shortly, but let us motivate it by a discussion based on the three model examples of infinite exchangeable graphs from the previous section.
First, consider the “random” graph from Example 1. Here, we observe that the events “
” are jointly independent of each other, thus for instance
More generally, we see that the factors for all distinct
are independent, which means that
whenever and the
are distinct.
Next, we consider the “structured” graph from Example 2, where we take
to avoid degeneracies. In contrast to the preceding example, the events “
” are now highly dependent; for instance, if
and
, then this forces
to lie outside of
, despite the fact that the events “
” each occur with a non-zero probability of
. In particular, the factors
are not jointly independent.
However, one can recover a conditional independence by introducing some new factors. Specifically, let be the factor generated by the event that the vertex
is coloured red. Then we see that the factors
now become conditionally jointly independent, relative to the base factor
, which means that we have conditional independence identities such as
Indeed, once one fixes (conditions) the information in (i.e. once one knows what colour the vertices
are), the events “
” for
either occur with probability
(if
have distinct colours) or probability
(if
have the same colour), and so the conditional independence is trivially true.
A similar phenomenon holds for the “random+structured” graph from Example 3, with
. Again, the factors
are not jointly independent in an absolute sense, but once one introduces the factors
based on the colour of the vertex
, we see once again that the
become conditionally jointly independent relative to the
.
These examples suggest, more generally, that we should be able to regularise the graph (or more precisely, the system of edge factors
) by introducing some single-vertex factors
, with respect to which the edge factors become conditionally independent; this is the infinitary analogue of a finite graph becoming
-regular relative to a suitably chosen partition of the vertex set into cells.
Now, in Examples 2, 3 we were able to obtain this regularisation because the vertices of the graph were conveniently coloured for us (red or blue). But for general infinite exchangeable graphs , such a vertex colouring is not provided to us, so how is one to generate the vertex factors
?
The key trick – which is the infinitary analogue of using random neighbourhoods to regularise a finitary graph – is to sequester half of the infinite vertices in – e.g. the negative vertices
– away as “reference” or “training” vertices, and then and colorise the remaining vertices
of the graph based on how that vertex interacts with the reference vertices. More formally, we define
for
by the formula
We then have
Lemma 4 (Infinitary regularity lemma) Let
be a infinite exchangeable random graph. Then the
for natural numbers
are conditinally jointly independent relative to the
. More precisely, if
is a set of natural numbers,
is a subset of
, and
is a
-measurable event for all
, then
Proof: By induction on , it suffices to show that for any
, the event
and the event
are independent relative to
.
By relabeling we may take and
for some
. We use the exchangeability of
(and Hilbert’s hotel) to observe that the random variables
and
have the same distribution; in particular, they have the same norm. By Pythagoras’ theorem, they must therefore be equal almost surely; furthermore, for any intermediate
-algebra
between
and
,
is also equal almost surely to the above two expressions. (The astute reader will observe that we have just run the “energy increment argument”; in the infinitary world, it is somewhat slicker than in the finitary world, due to the convenience of the Hilbert’s hotel trick, and the fact that the existence of orthogonal projections (and in particular, conditional expectation) is itself encoding an energy increment argument.)
As a special case of the above observation, we see that
In particular, this implies that is conditionally independent of every event measurable in
, relative to
, and the claim follows.
We remark that the same argument also allows one to easily regularise infinite exchangeable hypergraphs; see my paper for details. In fact one can go further and obtain a structural theorem for these hypergraphs generalising de Finetti’s theorem, and also closely related to the graphons of Lovasz and Szegedy; see this paper of Austin for details.
— 3. Proof of finitary regularity lemma —
Having proven the infinitary regularity lemma, we now use the correspondence principle and the compactness and contradiction argument to recover the finitary regularity lemma, Lemma 2.
Suppose this lemma failed. Carefully negating all the quantifiers, this means that there exists , a sequence
going to infinity, and a sequence of finite deterministic graphs
such that for every
, if one selects vertices
uniformly from
, then the
vertex cells
generated by the vertex neighbourhoods
for
, will obey the regularity property (1) with probability less than
.
We convert each of the finite deterministic graphs to an infinite random exchangeable graph
; invoking the ocrrespondence principle and passing to a subsequence if necessary, we can assume that this graph converges in the vague topology to an exchangeable limit
. Applying the infinitary regularity lemma to this graph, we see that the edge factors
for natural numbers
are conditionally jointly independent relative to the vertex factors
.
Now for any distinct natural numbers , let
be the indicator of the event “
lies in
“, thus
when
lies in
and
otherwise. Clearly
is
-measurable. We can write
where
and
The exchangeability of ensures that
are exchangeable with respect to permutations of the natural numbers, in particular
and
.
By the infinitary regularity lemma, the are jointly independent relative to the
, and also have mean zero relative to these factors, so in particular they are infinitely pseudorandom in the sense that
Meanwhile, the random variable is measurable with respect to the factor
, which is the limit of the factors
as
increases. Thus, given any
(to be chosen later), one can find an approximation
to
, bounded between
and
, which is
-measurable for some
, and such that
We can also impose the symmetry condition . Now let
be an extremely small number (depending on
) to be chosen later. Then one can find an approximation
to
, bounded between
and
, which is
-measurable for some
, and such that
Again we can impose the symmetry condition . We can then extend
by exchangeability, so that
for all distinct natural numbers . By the triangle inequality we then have
The bounds (2), (3) apply to the limiting infinite random graph . On the other hand, all the random variables appearing in (2), (3) involve at most finitely many of the edges of the graph. Thus, by vague convergence, the bounds (2), (3) also apply to the graph
for sufficiently large
.
Now we unwind the definitions to move back to the finite graphs . Observe that, when applied to the graph
, one has
where is a symmetric function which is constant on the pairs of cells
generated the vertex neighbourhoods of
. Similarly,
for some symmetric function . The estimate (2) can then be converted to a uniformity estimate on
while the estimate (3) can be similarly converted to
If one then repeats the arguments in the preceding blog post, we conclude (if is sufficiently small depending on
, and
is sufficiently small depending on
,
,
) that for
of the choices for
, the partition
induced by the corresponding vertex neighbourhoods will obey (1). But this contradicts the construction of the
, and the claim follows.

1 comment
Comments feed for this article
3 December, 2012 at 5:35 pm
The spectral proof of the Szemeredi regularity lemma « What’s new
[...] are many proofs of this lemma, which is actually not that difficult to establish; see for instance these previous blog posts for some examples. In this post I would like to record one further proof, based [...]