254A, Lecture 15: The Furstenberg-Zimmer structure theorem and the Furstenberg recurrence theorem

5 March, 2008 in 254A - ergodic theory, math.DS | Tags: Furstenberg recurrence theorem, randomness, structure, Zorn's lemma | by Terence Tao

In this lecture – the final one on general measure-preserving dynamics – we put together the results from the past few lectures to establish the Furstenberg-Zimmer structure theorem for measure-preserving systems, and then use this to finish the proof of the Furstenberg recurrence theorem.

— The Furstenberg-Zimmer structure theorem —

Let $X = (X,{\mathcal X},\mu,T)$ be a measure-preserving system, and let $Y = (Y, {\mathcal Y},\nu,S)$ be a factor. In Theorem 2 of the previous lecture, we showed that if X was not a weakly mixing extension of Y, then we could find a non-trivial compact extension Z of Y (thus $L^2(Z)$ is a non-trivial superspace of $L^2(Y)$ ). Combining this with Zorn’s lemma (and starting with the trivial factor Y = pt), one obtains

Theorem 1. (Furstenberg-Zimmer structure theorem) Let $(X,{\mathcal X},\mu,T)$ be a measure-preserving system. Then there exists an ordinal $\alpha$ and a factor $Y_\beta = (Y_\beta, {\mathcal Y}_\beta, \nu_\beta, S_\beta)$ for every $\beta \leq \alpha$ with the following properties:

$Y_\emptyset$ is a point.

For every successor ordinal $\beta+1 \leq \alpha$ , $Y_{\beta+1}$ is a compact extension of $Y_\beta$ .

For every limit ordinal $\beta \leq \alpha$ , $Y_\beta$ is the inverse limit of the $Y_\gamma$ for the $\gamma < \beta$ , in the sense that $L^2(Y_\beta)$ is the closure of $\bigcup_{\gamma < \beta} L^2(Y_\gamma)$ .

X is a weakly mixing extension of $Y_\alpha$ .

This theorem should be compared with Furstenberg’s structure theorem for distal systems in topological dynamics (Theorem 2 from Lecture 7). Indeed, in analogy to that theorem, the factors $Y_\beta$ are known as distal measure-preserving systems. The result was proven independently by Furstenberg and by Zimmer.

Exercise 1. Deduce Theorem 1 from Theorem 2 of the previous lecture. $\diamond$

Remark 1. Since the Hilbert spaces $L^2(Y_\beta)$ are increasing inside the separable Hilbert space $L^2(X)$ , it is not hard to see that the ordinal $\alpha$ must be at most countable. Conversely, a result of Beleznay and Foreman shows that every countable ordinal can appear as the minimal length of a Furstenberg tower of a given system. Thus, in some sense, the complexity of a system can be as great as any countable ordinal. This is because the structure theorem roots out every last trace of structure from the system, so much so that every remaining function orthogonal to the final factor $L^2(Y_\alpha)$ is weakly mixing. But in many applications one does not need so much weak mixing; for instance to establish k-fold recurrence for a function f, it would be enough to obtain weak mixing control on just a few combinations of f (such as $T^h f \overline{f}$ ), as we already saw in the proof of Roth’s theorem in Lecture 12. In fact, it is not hard to show that to prove Furstenberg’s recurrence theorem for a fixed k, one only needs to analyse the first k-2 steps of the Furstenberg tower. As one consequence of this, it is possible to avoid the use of Zorn’s lemma (and the axiom of choice) in the proof of the recurrence theorem. $\diamond$

Remark 2. Analogues of the structure theorem exist for other actions, such as the action of ${\Bbb Z}^d$ on a measure space (which can equivalently be viewed as the action of d commuting shifts $T_1,\ldots,T_d: X \to X$ ). There is a new feature in this case, though: instead of having a tower of purely compact extensions, followed by one weakly mixing extension at the end, one instead has a tower of hybrid extensions (known as primitive extensions), each one of which is compact along one subgroup of ${\Bbb Z}^d$ and weakly mixing along a complementary subgroup. See for instance Furstenberg’s book for details. $\diamond$

— The Furstenberg recurrence theorem —

The Furstenberg recurrence theorem asserts that every measure-preserving system $(X,{\mathcal X},\mu,T)$ has the uniform multiple recurrence (UMR) property, thus

$\liminf_{N \to \infty} \frac{1}{N} \sum_{n=0}^{N-1} \int_X f T^n f \ldots T^{(k-1)n} f\ d\mu > 0$ (1)

whenever $k \geq 1$ and $f \in L^\infty(X)$ is non-negative with positive mean. The UMR property is trivially true for a point, and we have already shown that UMR is preserved by compact extensions (Theorem 1 of Lecture 13) and by weakly mixing extensions (Theorem 1 of Lecture 14). The former result lets us climb the successor ordinal steps of the tower in Theorem 1, while the latter lets us jump from the final distal system $Y_\alpha$ to X. But to clinch the proof of the recurrence theorem, we also need to deal with the limit ordinals. More precisely, we need to prove

Theorem 2. (Limits of chains) Let $(Y_\beta)_{\beta \in B}$ be a totally ordered family of factors of a measure-preserving system X (thus $L^2(Y_\beta)$ is increasing with $\beta$ , and let Y be the inverse limit of the $Y_\beta$ . If each of the $Y_\beta$ obeys the UMR, then Y does also.

With this theorem, the Furstenberg recurrence theorem (Theorem 1 from Lecture 11) follows from the previous theorems and transfinite induction.

The main difficulty in establishing Theorem 2 is that while each $Y_\beta$ obeys the UMR separately, we do not know that this property holds uniformly in $\beta$ . The main new observation needed to establish the theorem is that there is another way to leverage the UMR from a factor to an extension… if the support of the function f is sufficiently “dense”. We motivate this by first considering the unconditional case.

Proposition 1. (UMR for densely supported functions) Let $(X,{\mathcal X},\mu,T)$ be a measure-preserving system, let $k \geq 1$ be an integer, and let $f \in L^\infty(X)$ be a non-negative function whose support $\{ x: f(x) > 0\}$ has measure greater than $1-1/k$ . Then (1) holds.

Proof. By monotone convergence, we can find $\varepsilon > 0$ such that $f(x) > \varepsilon$ for all x outside of a set E of measure at most $1/k - \varepsilon$ . For any n, this implies that $f(x) T^n f(x) \ldots T^{(k-1) n} f(x) > \varepsilon^k$ for all x outside of the set $E \cup T^n E \cup \ldots \cup T^{(k-1)n} E$ , which has measure at most $1-k\varepsilon$ . In particular we see that

$\int_X f T^n f \ldots T^{(k-1) n} f\ d\mu > k \varepsilon^{k+1}$ (2)

for all n, and the claim follows. $\Box$

As with the other components of the proof of the recurrence theorem, we will need to upgrade the above proposition to a “relative” version:

Proposition 2. (UMR for relatively densely supported functions) Let $(X,{\mathcal X},\mu,T)$ be an extension of a factor $(Y,{\mathcal Y},\nu, S)$ with the UMR, let $k \geq 1$ be an integer, and let $f \in L^\infty(X)$ be a non-negative function whose support $\Omega := \{ x: f(x) > 0\}$ is such that the set $\{ y \in Y: {\Bbb E}(1_\Omega|Y) > 1-1/k \}$ has positive measure in Y. Then (1) holds.

Proof. By monotone convergence again, we can find $\varepsilon > 0$ such that the set $E := \{ x: f(x) > \varepsilon \}$ is such that the set $F := \{ y \in Y: {\Bbb E}(1_E|Y) > 1-1/k+\varepsilon \}$ has positive measure. Since Y has the UMR, this implies that (1) holds for $1_F$ . In other words, there exists $c > 0$ such that

$\nu( F \cap T^n F \cap \ldots \cap T^{(k-1)n} F ) > c$ (3)

for all n in a set of positive lower density.

Now we turn to f. We have the pointwise lower bound $f(x) \geq \varepsilon 1_E(x)$ , and so

$f T^n f \ldots T^{(k-1)n} f(x) \geq \varepsilon^k 1_{E \cap T^n E \cap \ldots \cap T^{(k-1)n} E}(x)$ . (4)

We have the crude lower bound

$1_{E \cap T^n E \cap \ldots \cap T^{(k-1)n} E}(x) \geq 1 - \sum_{j=0}^{k-1} 1_{T^{jn} E^c}(x)$ ; (5)

inserting this into (4) and taking conditional expectations, we conclude

${\Bbb E}( f T^n f \ldots T^{(k-1)n} f | Y) (y) \geq \varepsilon^k ( 1 - \sum_{j=0}^{k-1} {\Bbb E}( 1_{T^{jn} E^c} | Y)(y))$ (6)

a.e. On the other hand, we have

${\Bbb E}( 1_{T^{jn} E^c} | Y) = 1 - {\Bbb E}( 1_{T^{jn} E} | Y) = 1 - T^{jn} {\Bbb E}(1_E|Y)$ . (7)

By definition of F, we thus see that if y lies in $F \cap T^n F \cap \ldots \cap T^{(k-1)n} F$ , then

${\Bbb E}( f T^n f \ldots T^{(k-1)n} f | Y) (y) \geq \varepsilon^k \times k \varepsilon$ . (8)

Integrating this and using (3), we obtain

$\int_X f T^n f \ldots T^{(k-1)n} f\ d\mu \geq c \varepsilon^k \times k \varepsilon$ (9)

for all n in a set of positive lower density, and (1) follows. $\Box$

Proof of Theorem 2. Let $f \in L^\infty(Y)$ be non-negative with positive mean $\int_X f\ d\mu = c > 0$ ; we may normalise f to be bounded by 1. Since Y is the inverse limit of the $Y_\beta$ , we see that the orthogonal projections ${\Bbb E}(f|Y_\beta)$ converge in $L^2(X)$ norm to ${\Bbb E}(f|Y) = f$ . Thus, for any $\varepsilon$ , we can find $\beta$ such that

$\| f - {\Bbb E}(f|Y_\beta) \|_{L^2(X)} \leq \varepsilon$ . (10)

Now ${\Bbb E}(f|Y_\beta)$ has the same mean c as f, and is also bounded by 1. Thus the set $E := \{ y: {\Bbb E}(f|Y_\beta)(y) \geq c/2 \}$ must have measure at least c/2 in $Y_\beta$ . Now if $\Omega := \{ x: f(x) > 0 \}$ , then we have the pointwise bound

$|f - {\Bbb E}(f|Y_\beta)| \geq \frac{c}{2} 1_{\Omega^c} 1_E$ ; (11)

squaring this and taking conditional expectations we obtain

${\Bbb E}(|f - {\Bbb E}(f|Y_\beta)|^2|Y_\beta)(y) \geq \frac{c^2}{4} (1 - {\Bbb E}(1_{\Omega}|Y_\beta)(y)) 1_E(y)$ , (12)

and so by (10) and Markov’s inequality we see that $1 - {\Bbb E}(1_{\Omega}|Y_\beta)(y) 1_E(y) 1-1/k$ on a set of positive measure. The claim now follows from Proposition 2. $\Box$

The proof of the Furstenberg recurrence theorem (and thus Szemerédi’s theorem) is finally complete.

Remark 3. The same type of argument yields many further recurrence theorems, and thus (by the correspondence principle) many combinatorial results also. For instance, in the original paper of Furstenberg it was noted that the above arguments allow one to strengthen (1) to

$\liminf_{N \to \infty} \inf_M \frac{1}{N} \sum_{n=M}^{M+N-1} \int_X f T^n f \ldots T^{(k-1)n} f\ d\mu > 0$ , (13)

which allows one to conclude that in a set A of positive upper density, the set of n for which $A \cap (A+n) \cap \ldots \cap (A+(k-1)n)$ has positive upper density is syndetic for every k. One can also extend the argument to higher dimensions, and to polynomial recurrence without too many changes in the structure of the proof. But some more serious modifications to the argument are needed for other recurrence results involving IP systems or Hales-Jewett type results; see Lecture 10 for more discussion. $\diamond$

[Update, Mar 6: Statement of Proposition 1 corrected.]

17 comments

Comments feed for this article

6 March, 2008 at 7:14 am

Lior

In the statement of Prop 1, $1/k$ should be $1-1/k$ .

6 March, 2008 at 8:51 am

Terence Tao

Dear Lior: thanks for the correction!

7 January, 2009 at 10:10 pm

陶哲轩遍历论：第十五讲 « Liu Xiaochuan’s Weblog

[…] (注：遍历论为陶哲轩教授于今年年初的一门课程，我尝试将所有习题做出来，这是第十五讲的唯一一个习题，就是Zorn引理的使用。这里是本讲的链接。) […]

7 January, 2009 at 10:51 pm

Nate Chandler

Just a small correction: In the second paragraph, you wrote “In Theorem 2 of the previous lecture, we showed that X was not a weakly mixing extension of Y, then we could find a non-trivial compact extension Z of Y”. I think you meant “In Theorem 2 of the previous lecture, we showed that if X was not a weakly mixing extension of Y, then we could find a non-trivial compact extension Z of Y”.

8 January, 2009 at 5:51 am

liuxiaochuan

Dear Professor:

In the left hand side of (12), if the expectation is taken with respect to $Y_\beta$ , then ${\Bbb E}(|f - {\Bbb E}(f|Y_\beta)|^2|Y_{\beta})(y)$

8 January, 2009 at 9:15 am

Terence Tao

Thanks for the corrections!

27 February, 2010 at 5:34 am

ERT8: Weak Mixing Systems « Disquisitiones Mathematicae

[…] denote this by and is called a factor of . Theorem 1 (Furstenberg structural theorem) Given a mps , there exists an ordinal and a family of factors of , for every , such […]

1 May, 2011 at 3:34 pm

ERT16: Compact extensions « Disquisitiones Mathematicae

[…] when looked through the spectral lenses. It is a matter of fact that this dichotomy turns Furstenberg-Zimmer structural theorem direct, as we’ll prove in […]

3 June, 2011 at 7:55 pm

The Furstenberg multiple recurrence theorem and finite extensions « What’s new

[…] as a consequence of the Furstenberg-Zimmer structure theorem for measure-preserving systems. See these lecture notes for further […]

20 June, 2014 at 9:11 pm

An abstract ergodic theorem, and the Mackey-Zimmer theorem | What's new

[…] measure-preserving systems (as well as subsequent refinements of this theory by Host and Kra); see this previous blog post for some relevant discussion. One can obtain similar descriptions of non-ergodic extensions via the […]

9 April, 2017 at 8:00 pm

yaoxiao

Dear professor Tao,
by (10) and markov inequality, $1-E(1_{\Omega}|Y)(y)1_{E}(y)<\frac{1}{k}$ on a set of $O_{c}(\epsilon)$ , did you here mean that out a set of … Thanks!

[Corrected, thanks – T.]

14 March, 2019 at 12:36 pm

Maths student

Dear Prof. Tao,

I’d really appreciate an answer to this question, and I hope that the question is sufficiently non-stupid. I’m not yet done with the proof of the Furstenberg recurrence theorem, but I noticed that in some way it was a bit similar to the proof of von Neumann’s theorem, that is to say, the first proof that was given in your book. Namely, we decompose a space into two types of elements which behave contrary to each other.

Hence, I have been asking myself whether it might be possible to generalise the Furstenberg recurrence to a Hilbert space setting, where the Integral might be replaced by some sort of scalar products against Dirac measures. (A more plausible, yet less far-reaching generalisation might be given by replacing the shift map of a dynamical system by an operator on a suitable $L^2$ space, perhaps a space given by Bochner integrable functions.)

18 March, 2019 at 9:04 am

Terence Tao

The Furstenberg recurrence theorem requires the ability to take higher order correlations $\int_X f_1 \dots f_k\ d\mu$ of $k$ different functions with $k$ potentially larger than 2; a Hilbert space structure only gives $k=2$ correlations and so it does not seem possible to even phrase the Furstenberg recurrence theorem in purely Hilbert space-theoretic terms, let alone prove it. (On the other hand, higher order Hilbert spaces can be used to at least describe the closely related concept of a Gowers-Host-Kra seminorm; see https://terrytao.wordpress.com/2010/05/19/higher-order-hilbert-spaces/ ).

One natural abstract framework to describe these recurrence theorems is that of commutative tracial von Neumann algebras. One can then try to extend these theorems to the noncommutative case, but unfortunately the theorems largely break down in that case: https://terrytao.wordpress.com/2009/12/29/nonconventional-ergodic-averages-and-multiple-recurrence-for-von-neumann-dynamical-systems/

18 March, 2019 at 11:14 am

Maths student

Thanks a lot for your reply. I had questioned myself about whether there is a space that admits “pair”ings of any order, and whether such a concept would (at least given a higher cardinality) be a generalisation of some additional spaces (which would have to be found). Perhaps an appropriate generalisation would require the given forms to satisfy some consistency conditions. These are just random ideas; when I have time, I’ll give it more thought.

18 March, 2019 at 11:45 am

Maths student

In particular, the negative examples Prof. Tao gave jointly with Austin and Eisner prove that one would have to choose one’s category in a way that excludes the non-commutative systems Prof. Tao referred to.

4 December, 2022 at 11:54 pm

Hi, thank you for this proof!
I have one question.
Can you elaborate on why the ordinal \alpha you mentioned in the structure theorem must be at most countable?

Thanks in advanced

5 December, 2022 at 1:58 pm

Terence Tao

If there is an increasing set $H_\beta, \beta < \alpha$ of strictly increasing Hilbert subspaces
of $H$ indexed by an uncountable ordinal $\alpha$ then the Hilbert space $H$ has an uncountable orthonormal basis (formed by choosing an orthonormal basis for each $H_\beta \ominus \bigcup_{\gamma < \beta} H_\gamma$ , which is non-trivial for successor ordinals $\beta$ ). So if $H$ is separable it can only admit a countable tower of such spaces.

inside a separable Hilbert space $H$ then

	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Erratum for “An inverse…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on A Banach algebra proof of the…
	Anonymous on A Banach algebra proof of the…
	Aleksandar on 245C, Notes 4: Sobolev sp…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Terence Tao on 245C, Notes 4: Sobolev sp…

254A, Lecture 15: The Furstenberg-Zimmer structure theorem and the Furstenberg recurrence theorem

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

17 comments

Leave a comment Cancel reply

For commenters

254A, Lecture 15: The Furstenberg-Zimmer structure theorem and the Furstenberg recurrence theorem

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

17 comments

Leave a comment Cancel reply

For commenters