245B, Notes 9: The Baire category theorem and its Banach space consequences

1 February, 2009 in 245B - Real analysis, math.FA, math.GN, math.MG | Tags: Baire category theorem, closed graph theorem, non-complemented subspace, open mapping theorem, uniform boundedness principle | by Terence Tao

The notion of what it means for a subset E of a space X to be “small” varies from context to context. For instance, in measure theory, when $X = (X, {\mathcal X}, \mu)$ is a measure space, one useful notion of a “small” set is that of a null set: a set E of measure zero (or at least contained in a set of measure zero). By countable additivity, countable unions of null sets are null. Taking contrapositives, we obtain

Lemma 1. (Pigeonhole principle for measure spaces) Let $E_1, E_2, \ldots$ be an at most countable sequence of measurable subsets of a measure space X. If $\bigcup_n E_n$ has positive measure, then at least one of the $E_n$ has positive measure.

Now suppose that X was a Euclidean space ${\Bbb R}^d$ with Lebesgue measure m. The Lebesgue differentiation theorem easily implies that having positive measure is equivalent to being “dense” in certain balls:

Proposition 1. Let $E$ be a measurable subset of ${\Bbb R}^d$ . Then the following are equivalent:

E has positive measure.

For any $\varepsilon > 0$ , there exists a ball B such that $m( E \cap B ) \geq (1-\varepsilon) m(B)$ .

Thus one can think of a null set as a set which is “nowhere dense” in some measure-theoretic sense.

It turns out that there are analogues of these results when the measure space $X = (X, {\mathcal X}, \mu)$ is replaced instead by a complete metric space $X = (X,d)$ . Here, the appropriate notion of a “small” set is not a null set, but rather that of a nowhere dense set: a set E which is not dense in any ball, or equivalently a set whose closure has empty interior. (A good example of a nowhere dense set would be a proper subspace, or smooth submanifold, of ${\Bbb R}^d$ , or a Cantor set; on the other hand, the rationals are a dense subset of ${\Bbb R}$ and thus clearly not nowhere dense.) We then have the following important result:

Theorem 1. (Baire category theorem). Let $E_1, E_2, \ldots$ be an at most countable sequence of subsets of a complete metric space X. If $\bigcup_n E_n$ contains a ball B, then at least one of the $E_n$ is dense in a sub-ball B’ of B (and in particular is not nowhere dense). To put it in the contrapositive: the countable union of nowhere dense sets cannot contain a ball.

Exercise 1. Show that the Baire category theorem is equivalent to the claim that in a complete metric space, the countable intersection of open dense sets remain dense. $\diamond$

Exercise 2. Using the Baire category theorem, show that any non-empty complete metric space without isolated points is uncountable. (In particular, this shows that Baire category theorem can fail for incomplete metric spaces such as the rationals ${\Bbb Q}$ .) $\diamond$

To quickly illustrate an application of the Baire category theorem, observe that it implies that one cannot cover a finite-dimensional real or complex vector space ${\Bbb R}^n, {\Bbb C}^n$ by a countable number of proper subspaces. One can of course also establish this fact by using Lebesgue measure on this space. However, the advantage of the Baire category approach is that it also works well in infinite dimensional complete normed vector spaces, i.e. Banach spaces, whereas the measure-theoretic approach runs into significant difficulties in infinite dimensions. This leads to three fundamental equivalences between the qualitative theory of continuous linear operators on Banach spaces (e.g. finiteness, surjectivity, etc.) to the quantitative theory (i.e. estimates):

The uniform boundedness principle, that equates the qualitative boundedness (or convergence) of a family of continuous operators with their quantitative boundedness.
The open mapping theorem, that equates the qualitative solvability of a linear problem Lu = f with the quantitative solvability.
The closed graph theorem, that equates the qualitative regularity of a (weakly continuous) operator T with the quantitative regularity of that operator.

Strictly speaking, these theorems are not used much directly in practice, because one usually works in the reverse direction (i.e. first proving quantitative bounds, and then deriving qualitative corollaries); but the above three theorems help explain why we usually approach qualitative problems in functional analysis via their quantitative counterparts.

— Proof of Baire category theorem —

Assume that the Baire category theorem failed; then it would be possible to cover a ball $B(x_0,r_0)$ in a complete metric space by a countable family $E_1, E_2, E_3, \ldots$ of nowhere dense sets.

We now invoke the following easy observation: if E is nowhere dense, then every ball B contains a subball B’ which is disjoint from E. Indeed, this follows immediately from the definition of a nowhere dense set.

Invoking this observation, we can find a ball $B(x_1,r_1)$ in $B(x_0,r_0/10)$ (say) which is disjoint from $E_1$ ; we may also assume that $r_1 \leq r_0/10$ by shrinking $r_1$ as necessary. Then, inside $B(x_1,r_1/10)$ , we can find a ball $B(x_2,r_2)$ which is also disjoint from $E_2$ , with $r_2 \leq r_1/10$ . Continuing this process, we end up with a nested sequence of balls $B(x_n,r_n)$ , each of which are disjoint from $E_1,\ldots,E_n$ , and such that $B(x_n,r_n) \subset B(x_{n-1},r_{n-1}/10)$ and $r_n \leq r_{n-1}/10$ for all $n=1,2,\ldots$ .

From the triangle inequality we have $d(x_n,x_{n-1}) \leq 2 r_{n-1} / 10 \leq 2 \times 10^{-n} r_0$ , and so the sequence $x_n$ is a Cauchy sequence. As X is complete, $x_n$ converges to a limit x. Summing the geometric series, one verifies that $x \in B(x_{n-1},r_{n-1})$ for all $n=1,2,\ldots$ , and in particular $x$ is an element of $B(x_0,r_0)$ which avoids all of $E_1, E_2, E_3, \ldots$ , a contradiction. $\Box$

We can illustrate the analogy between the Baire category theorem and the measure-theoretic analogs by introducing some further definitions. Call a set E meager or of the first category if it can be expressed (or covered) by a countable union of nowhere dense sets, and of the second category if it is not meager. Thus, the Baire category theorem shows that any subset of a complete metric space with non-empty interior is of the second category, which may help explain the name for the property. Call a set co-meager or residual if its complement is meager, and call a set Baire or almost open if it differs from an open set by a meager set (note that a Baire set is unrelated to the Baire $\sigma$ -algebra). Then we have the following analogy between complete metric space topology, and measure theory:

Complete non-empty metric space X	Measure space X of positive measure
first category (meager)	zero measure (null )
second category	positive measure
residual (co-meager)	full measure (co-null)
Baire	measurable

Nowhere dense sets are meager, and meager sets have empty interior. Contrapositively, sets with dense interior
are residual, and residual sets are somewhere dense. Taking complements instead of contrapositives, we see that open dense sets are co-meager,and co-meager sets are dense.

While there are certainly many analogies between meager sets and null sets (for instance, both classes are closed under countable unions, or under intersections with arbitrary sets), the two concepts can differ in practice. For instance, in the real line ${\Bbb R}$ with the standard metric and measure space structures, the set

$\bigcup_{n=1}^\infty (q_n - 2^{-n}, q_n + 2^{-n}),$ (1)

where $q_1, q_2, \ldots$ is an enumeration of the rationals, is open and dense, but has Lebesgue measure at most 2; thus its complement has infinite measure in ${\Bbb R}$ but is nowhere dense (hence meager). As a variant of this, the set

$\bigcap_{m=1}^\infty \bigcup_{n=1}^\infty (q_n - 2^{-n}/m, q_n + 2^{-n}/m),$ (2)

is a null set, but is the intersection of countably many open dense sets and is thus co-meager.

Exercise 3. A real number x is Diophantine if for every $\varepsilon > 0$ there exists $c_\varepsilon > 0$ such that $|x - \frac{a}{q}| \geq \frac{c_\varepsilon}{|q|^{2+\varepsilon}}$ for every rational number $\frac{a}{q}$ . Show that the set of Diophantine real numbers has full measure but is meager. $\diamond$

Remark 1. If one assumes some additional axioms of set theory (e.g. the continuum hypothesis), it is possible to show that the collection of meager subsets of ${\Bbb R}$ and the collection of null subsets of ${\Bbb R}$ (viewed as $\sigma$ -ideals of the collection of all subsets of ${\Bbb R}$ ) are isomorphic; this is the Sierpinski-Erdös theorem, which we will not prove here. Roughly speaking, this theorem tells us that any “effective” first-order statement which is true about meager sets will also be true about null sets, and conversely. $\diamond$

— The uniform boundedness principle —

As mentioned in the introduction, the Baire category theorem implies various equivalences between qualitative and quantitative properties of linear transformations between Banach spaces. (Lemma 1 of Notes 3 already gives a prototypical such equivalence between a qualitative property (continuity) and a quantitative one (boundedness).)

Theorem 2. (Uniform boundedness principle) Let X be a Banach space, let Y be a normed vector space, and let $(T_\alpha)_{\alpha \in A}$ be a family of continuous linear operators $T_\alpha: X \to Y$ . Then the following are equivalent:

(Pointwise boundedness) For every $x \in X$ , the set $\{ T_\alpha x: \alpha \in A \}$ is bounded.

(Uniform boundedness) The operator norms $\{ \|T_\alpha \|_{op}: \alpha \in A \}$ are bounded.

The uniform boundedness principle is also known as the Banach-Steinhaus theorem.

Proof. It is clear that 2. implies 1.; now assume 1 holds and let us obtain 2.

For each $n = 1, 2, \ldots$ , let $E_n$ be the set

$E_n := \{ x \in X: \| T_\alpha x \|_Y \leq n \hbox{ for all } \alpha \in A \}$ . (3)

The hypothesis 1 is nothing more than the assertion that the $E_n$ cover X, and thus by the Baire category theorem one of the $E_n$ must be dense in a ball. Since the $T_\alpha$ are continuous, the $E_n$ are closed, and so one of the $E_n$ contains a ball. Since $E_n - E_n \subset E_{2n}$ , we see that one of the $E_n$ contains a ball centred at the origin. Dilating n as necessary, we see that one of the $E_n$ contains the unit ball $B(0,1)$ . But then all the $\|T_\alpha\|_{op}$ are bounded by n, and the claim follows. $\Box$

Exercise 4. Give counterexamples to show that the uniform boundedness principle fails if one relaxes the assumptions in any of the following ways:

X is merely a normed vector space rather than a Banach space (i.e. completeness is dropped).
The $T_\alpha$ are not assumed to be continuous.
The $T_\alpha$ are allowed to be nonlinear rather than linear.

Thus completeness, continuity, and linearity are all essential for the uniform boundedness principle to apply. $\diamond$

Remark 2. It is instructive to establish the uniform boundedness principle more “constructively” without the Baire category theorem (though the proof of the Baire category theorem is still implicitly present), as follows. Suppose that 2 fails, then $\|T_\alpha\|_{op}$ is unbounded. We can then find a sequence $\alpha_n \in A$ such that $\| T_{\alpha_{n+1}} \|_{op} > 100^n \| T_{\alpha_n} \|_{op}$ (say) for all n. We can then find unit vectors $x_n$ such that $\| T_{\alpha_n} x_n \|_Y \geq \frac{1}{2} \| T_{\alpha_n} \|_{op}$ .

We can then form the absolutely convergent (and hence conditionally convergent, by completeness) sum $x = \sum_{n=1}^\infty \epsilon_n 10^{-n} x_n$ for some choice of signs $\epsilon_n = \pm 1$ recursively as follows: once $\epsilon_1,\ldots,\epsilon_{n-1}$ have been chosen, choose the sign $\epsilon_n$ so that

$\|\sum_{m=1}^n \epsilon_m 10^{-m} T_{\alpha_n} x_m \|_Y \geq \| 10^{-n} T_{\alpha_n} x_n \|_Y \geq \frac{1}{2} 10^{-n} \| T_{\alpha_n} \|_{op}$ . (4)

From the triangle inequality we soon conclude that

$\| T_{\alpha_n} x \|_Y \geq \frac{1}{4} 10^{-n} \| T_{\alpha_n} \|_{op}.$ (5)

But by hypothesis, the RHS is unbounded in n, contradicting 1. $\Box$

A common way to apply the uniform boundedness principle is via the following corollary:

Corollary 1. (Uniform boundedness principle for norm convergence) Let $X$ and $Y$ be Banach spaces, and let $(T_n)_{n=1}^\infty$ be a family of continuous linear operators $T_n: X \to Y$ . Then the following are equivalent:

(Pointwise convergence) For every $x \in X$ , $T_n x$ converges strongly in $Y$ as $n \to \infty$ .

(Pointwise convergence to a continuous limit) There exists a continuous linear $T: X \to Y$ such that for every $x \in X$ , $T_n x$ converges strongly in $Y$ to $Tx$ as $n \to \infty$ .

(Uniform boundedness + dense subclass convergence) The operator norms $\{ \|T_n\|: n = 1,2,\ldots \}$ are bounded, and for a dense set of $x$ in $X$ , $T_n x$ converges strongly in $Y$ as $n \to \infty$ .

Proof. Clearly 2. implies 1., and as convergent sequences are bounded, we see from Theorem 2 that 1. implies 3. The implication of 2 from 3 follows by a standard limiting argument and is left as an exercise. $\Box$

Remark 3. The same equivalences hold if one replaces the sequence $(T_n)_{n=1}^\infty$ by a net $(T_\alpha)_{\alpha \in A}$ . $\diamond$

Example 1 (Fourier inversion formula). For any $f \in L^2({\Bbb R})$ and N > 0, define the Dirichlet summation operator

$S_N f(x) := \int_{-N}^N \hat f(\xi) e^{2\pi i x \xi}\ d\xi$ (4)

where $\hat f$ is the Fourier transform of f, defined on smooth compactly supported functions $f \in C^\infty_0({\Bbb R})$ by the formula $\hat f(\xi) := \int_{-\infty}^\infty f(x) e^{-2\pi i x \xi}\ dx$ and then extended to $L^2$ by the Plancherel theorem. Using the Plancherel identity, we can verify that the operator norms $\|S_N\|_{op}$ are uniformly bounded (indeed, they are all 1); also, one can check that for $f \in C^\infty_0({\Bbb R})$ , that $S_N f$ converges in $L^2$ norm to f as $N \to \infty$ . As $C^\infty_0({\Bbb R})$ is known to be dense in $L^2({\Bbb R)}$ , this implies that $S_N f$ converges in $L^2$ norm to f for every $f \in L^2({\Bbb R})$ .

This argument only used the “easy” implication of Corollary 1, namely the deduction of 2. from 3. The “hard” implication using the Baire category theorem was not directly utilised. However, from a metamathematical standpoint, that implication is important because it tells us that the above strategy to prove convergence in norm of the Fourier inversion formula on $L^2$ – i.e. to obtain uniform operator norms on the partial sums, and to establish convergence on a dense subclass of “nice” functions – is in some sense the only strategy available to prove such a result. $\diamond$

Remark 4. There is a partial analogue of Corollary 1 for the question of pointwise almost everywhere convergence rather than norm convergence, known as Stein’s maximal principle (discussed for instance in this previous blog post of mine). For instance, it reduces Carleson’s theorem on the pointwise almost everywhere convergence of Fourier series to the boundedness of a certain maximal function (the Carleson maximal operator) related to Fourier summation, although the latter task is again quite non-trivial. (As in Example 1, the role of the maximal principle is meta-mathematical rather than direct.) $\diamond$

Of course, if we omit some of the hypotheses, it is no longer true that pointwise boundedness and uniform boundedness are the same. For instance, if we let $c_0({\Bbb N})$ be the space of complex sequences with only finitely many non-zero entries and with the uniform topology, and let $\lambda_n: c_0({\Bbb N}) \to {\Bbb C}$ be the map $(a_m)_{m=1}^\infty \to n a_n$ , then the $\lambda_n$ are pointwise bounded but not uniformly bounded; thus completeness of X is important. Also, even in the one-dimensional case $X=Y={\Bbb R}$ , the uniform boundedness principle can easily be seen to fail if the $T_\alpha$ are non-linear transformations rather than linear ones. $\diamond$

— The open mapping theorem —

A map $f: X \to Y$ between topological spaces X and Y is said to be open if it maps open sets to open sets. This is similar to, but slightly different, from the more familiar property of being continuous, which is equivalent to the inverse image of open sets being open. For instance, the map $f: {\Bbb R} \to {\Bbb R}$ defined by $f(x) := x^2$ is continuous but not open; conversely, the function $g: {\Bbb R}^2 \to {\Bbb R}$ defined by $g(x,y) := \hbox{sgn}(y)+x$ is discontinuous but open.

We have seen that it is quite possible for non-linear continuous maps to fail to be open. But for linear maps between Banach spaces, the situation is much better:

Theorem 3. (Open mapping theorem) Let $L: X \to Y$ be a continuous linear transformation between two Banach spaces X and Y. Then the following are equivalent:

L is surjective.

L is open.

(Qualitative solvability) For every $f \in Y$ there exists a solution $u \in X$ to the equation $Lu = f$ .

(Quantitative solvability) There exists a constant $C > 0$ such that for every $f \in Y$ there exists a solution $u \in X$ to the equation $Lu = f$ , which obeys the bound $\|u\|_X \leq C \|f\|_Y$ .

(Quantitative solvability for a dense subclass) There exists a constant $C > 0$ such that for a dense set of f in Y, there exists a solution $u \in X$ to the equation $Lu = f$ , which obeys the bound $\|u\|_X \leq C \|f\|_Y$ .

Proof. Clearly 4. implies 3., which is equivalent to 1., and it is easy to see from linearity that 2. and 4. are equivalent (cf. the proof of Lemma 1 from Notes 3). 4. trivially implies 5., while to obtain 4. from 5., observe that if E is any dense subset of the Banach space Y, then any f in Y can be expressed as an absolutely convergent series $f = \sum_n f_n$ of elements in E (since one can iteratively approximate the residual $f - \sum_{n=1}^{N-1} f_n$ to arbitrary accuracy by an element of E for $N=1,2,3,\ldots$ ), and the claim easily follows. So it suffices to show that 3. implies 4.

For each n, let $E_n \subset Y$ be the set of all $f \in Y$ for which there exists a solution to Lu=f with $\|u\|_X \leq n \|f\|_Y$ . From the hypothesis 3, we see that $\bigcup_n E_n = Y$ . Since Y is complete, the Baire category theorem implies that there is some $E_n$ which is dense in some ball $B(f_0,r)$ in Y. In other words, the problem Lu=f is approximately quantitatively solvable in the ball $B(f_0,r)$ in the sense that

For every $\varepsilon > 0$ and every $f \in B(f_0,r)$ , there exists an approximate solution u with $\| Lu - f \|_Y \leq \varepsilon$ and $\|u\|_X \leq n \|Lu \|_Y$ , and thus $\|u\|_X \leq n r + n \varepsilon$ .

By subtracting two such approximate solutions, we conclude that

For any $f \in B(0,2r)$ and any $\varepsilon > 0$ , there exists $u \in X$ with $\|Lu - f \|_Y \leq 2\varepsilon$ and $\|u\|_X \leq 2nr + 2 n \varepsilon$ .

Since L is homogeneous, we can rescale and conclude that

For any $f \in Y$ and any $\varepsilon > 0$ there exists $u \in X$ with $\|Lu - f \|_Y \leq 2 \varepsilon$ and $\|u\|_X \leq 2n \|f\|_Y + 2n \varepsilon$ .

In particular, setting $\varepsilon = \frac{1}{4} \|f\|_Y$ (treating the case f=0 separately), we conclude that

For any $f \in Y$ , we may write $f = Lu + f'$ , where $\| f'\|_Y \leq \frac{1}{2} \|f\|_Y$ and $\|u\|_X \leq \frac{5}{2} n \|f\|_Y$ .

We can iterate this procedure and then take limits (now using the completeness of X rather than Y) to obtain a solution to Lu=f for every $f \in Y$ with $\|u\|_X \leq 5 n \|f\|_Y$ , and the claim follows. $\Box$

Remark 5. The open mapping theorem provides metamathematical justification for the method of a priori estimates for solving linear equations such as $Lu = f$ for a given datum $f \in Y$ and for an unknown $u \in X$ , which is of course a familiar problem in linear PDE. The a priori method assumes that f is in some dense class of nice functions (e.g. smooth functions) in which solvability of Lu=f is presumably easy, and then proceeds to obtain the a priori estimate $\|u\|_X \leq C \|f\|_Y$ for some constant C. Theorem 3 then assures that Lu=f is solvable for all f in Y (with a similar bound). As before, this implication does not directly use the Baire category theorem, but that theorem helps explain why this method is “not wasteful”. $\diamond$

A pleasant corollary of the open mapping theorem is that, as with ordinary linear algebra or with arbitrary functions, invertibility is the same thing as bijectivity:

Corollary 2. Let $T: X \to Y$ be a continuous linear operator between two Banach spaces X, Y. Then the following are equivalent:

(Qualitative invertibility) T is bijective.

(Quantitative invertibility) T is bijective, and $T^{-1}: Y \to X$ is a continuous (hence bounded) linear transformation.

Remark 6. The claim fails without the completeness hypotheses on X and Y. For instance, consider the operator $T: c_c({\Bbb N}) \to c_c({\Bbb N})$ defined by $T (a_n)_{n=1}^\infty := (\frac{a_n}{n})_{n=1}^\infty$ , where we give $c_c({\Bbb N})$ the uniform norm. Then T is continuous and bijective, but $T^{-1}$ is unbounded. $\diamond$

Exercise 5. Show that Corollary 2 can still fail if we drop the completeness hypothesis on just X, or just Y. $\diamond$

Exercise 6. Suppose that $L: X \to Y$ is a surjective continuous linear transformation between Banach spaces. By using the open mapping theorem, show that the transpose map $L^*: Y^* \to X^*$ is bounded from below, i.e. there exists $c > 0$ such that $\| L^* \lambda \|_{X^*} \geq c \|\lambda \|_{Y^*}$ for all $\lambda \in Y^*$ . Conclude that $L^*$ is an isomorphism between $Y^*$ and $L^*(Y^*)$ . $\diamond$

Let L be as in Theorem 3, so that the problem Lu=f is both qualitatively and quantitatively solvable. A standard application of Zorn’s lemma (similar to that used to prove the Hahn-Banach theorem) shows that the problem Lu=f is also qualitatively linearly solvable, in the sense that there exists a linear transformation $S: Y \to X$ such that $LSf = f$ for all $f \in Y$ (i.e. S is a right-inverse of L). In view of the open mapping theorem, it is then tempting to conjecture that L must also be quantitatively linearly solvable, in the sense that there exists a continuous linear transformation $S: Y \to X$ such that $LSf = f$ for all $f \in Y$ . By Corollary 2, we see that this conjecture is true when the problem Lu=f is determined, i.e. there is exactly one solution u for each datum f. Unfortunately, the conjecture can fail when Lu=f is underdetermined (more than one solution u for each f); we discuss this in the appendix to these notes. On the other hand, the situation is much better for Hilbert spaces:

Exercise 7. Suppose that $L: H \to H'$ is a surjective continuous linear transformation between Hilbert spaces. Show that there exists a continuous linear transformation $S: H' \to H$ such that $LS = I$ . Furthermore, we can ensure that the range of S is orthogonal to the kernel of L, and that this condition determines S uniquely. $\diamond$

Remark 7. In fact, Hilbert spaces are essentially the only type of Banach space for which we have this nice property, due to the Lindenstrauss-Tzafriri solution of the complemented subspaces problem. $\diamond$

Exercise 8. Let M and N be closed subspaces of a Banach space X. Show that the following statements are equivalent:

(Qualitative complementation) Every x in X can be expressed in the form m+n for $m \in M, n \in N$ in exactly one way.
(Quantitative complementation) Every x in X can be expressed in the form m+n for $m \in M, n \in N$ in exactly one way. Furthermore there exists C > 0 such that $\|m\|_X, \|n\|_X \leq C \|x\|_X$ all x.

When either of these two properties hold, we say that M (or N) is a complemented subspace, and that N is a complement of M (or vice versa). $\diamond$

The property of being complemented is closely related to that of quantitative linear solvability:

Exercise 9. Let $L: X \to Y$ be a surjective bounded linear map between Banach spaces. Show that there exists a bounded linear map $S: Y \to X$ such that $LSf = f$ for all $f \in Y$ if and only if the kernel $\{ u \in X: Lu=0\}$ is a complemented subspace of X. $\diamond$

Exercise 10. Show that any finite-dimensional or closed finite co-dimensional subspace of a Banach space is complemented. $\diamond$

Remark 8. The problem of determining whether a given closed subspace of a Banach space is complemented or not is, in general, quite difficult. However, non-complemented subspaces do exist in abundance; some example are given in the apendix, and the Lindenstrauss-Tzafriri theorem referred to in in Remark 7 asserts that any Banach space not isomorphic to a Hilbert space contains at least one non-complemented subspace. There is also a remarkable construction of Gowers and Maurey of a Banach space such that every subspace, other than those ruled out by Exercise 10, are uncomplemented. $\diamond$

— The closed graph theorem —

Recall that a map $T: X \to Y$ between two metric spaces is continuous if and only if, whenever $x_n$ converges to x in X, $Tx_n$ converges to Tx in Y. We can also define the weaker property of being closed: an map $T: X \to Y$ is closed if and only if whenever $x_n$ converges to x in X, and $Tx_n$ converges to a limit y in Y, then y is equal to Tx; equivalently, T is closed if its graph $\{ (x,Tx): x \in X \}$ is a closed subset of $X \times Y$ . This is weaker than continuity because it has the additional requirement that the sequence $Tx_n$ is already convergent. (Despite the name, closed operators are not directly related to open operators.)

Example 2. Let $T: c_c({\Bbb N}) \to c_c({\Bbb N})$ be the transformation $T( a_m )_{m=1}^\infty := (ma_m)_{m=1}^\infty$ . This transformation is unbounded and hence discontinuous, but one easily verifies that it is closed. $\diamond$

As Example 2 shows, being closed is often a weaker property than being continuous. However, the remarkable closed graph theorem shows that as long as the domain and range of the operator are both Banach spaces, the two statements are equivalent:

Theorem 4. (Closed graph theorem) Let $T: X \to Y$ be a linear transformation between two Banach spaces. Then the following are equivalent:

T is continuous.

T is closed.

(Weak continuity) There exists some topology ${\mathcal F}$ on Y, weaker than the norm topology (i.e. containing fewer open sets) but still Hausdorff, for which $T: X \to (Y, {\mathcal F})$ is continuous.

Proof. It is clear that 1 implies 3 (just take ${\mathcal F}$ to equal the norm topology). To see why 3 implies 2, observe that if $x_n \to x$ in X and $Tx_n \to y$ in norm, then $Tx_n \to y$ in the weaker topology ${\mathcal F}$ as well; but by weak continuity $Tx_n \to Tx$ in ${\mathcal F}$ . Since Hausdorff topological spaces have unique limits, we have Tx=y and so T is closed.

Now we show that 2 implies 1. If T is closed, then the graph $\Gamma := \{ (x,Tx): x \in X \}$ is a closed linear subspace of the Banach space $X \times Y$ and is thus also a Banach space. On the other hand, the projection map $\pi: (x,Tx) \mapsto x$ from $\Gamma$ to X is clearly a continuous linear bijection. By Corollary 2, its inverse $x \mapsto (x,Tx)$ is also continuous, and so T is continuous as desired. $\Box$

We can reformulate the closed graph theorem in the following fashion:

Corollary 3. Let X, Y be Banach spaces, and suppose we have some continuous inclusion $Y \subset Z$ of Y into a Hausdorff topological vector space Z. Let $T: X \to Z$ be a continuous linear transformation. Then the following are equivalent.

(Qualitative regularity) For all $x \in X$ , $Tx \in Y$ .

(Quantitative regularity) For all $x \in X$ , $Tx \in Y$ , and furthermore $\|Tx\|_Y \leq C \|x\|_X$ for some $C > 0$ independent of x.

(Quantitative regularity on a dense subclass) For all x in a dense subset of X, $Tx \in Y$ , and furthermore $\|Tx\|_Y \leq C \|x\|_X$ for some $C > 0$ independent of x.

Proof. Clearly 2. implies 3. or 1. If we have 3., then T extends uniquely to a bounded linear map from X to Y, which must agree with the original continuous map from X to Z since limits in the Hausdorff space Z are unique, and so 3. implies 2. Finally, if 1. holds, then we can view T as a map from X to Y, which by Theorem 4 is continuous, and the claim now follows from Lemma 1 from Notes 3. $\Box$

In practice, one should think of Z as some sort of “low regularity” space with a weak topology, and Y as a “high regularity” subspace with a stronger topology. Corollary 3 motivates the method of a priori estimates to establish the Y-regularity of some linear transform Tx of an arbitrary element x in a Banach space X, by first establishing the a priori estimate $\|Tx\|_Y \leq C \|x\|_X$ for a dense subclass of “nice” elements of X, and then using the above corollary (and some weak continuity of T in a low regularity space) to conclude. The closed graph theorem provides the metamathematical explanation as to why this approach is at least as powerful as any other approach to proving regularity.

Example 3. Let $1 \leq p \leq 2$ , and let p’ be the dual exponent of p. To prove that the Fourier transform $\hat f$ of a function $f \in L^p({\Bbb R})$ necessarily lies in $L^{p'}({\Bbb R})$ , it suffices to prove the Hausdorff-Young inequality

$\| \hat f \|_{L^{p'}({\Bbb R})} \leq C_p \|f\|_{L^p({\Bbb R})}$ (5)

for some constant $C_p$ and all f in some suitable dense subclass of $L^p({\Bbb R})$ (e.g. the space $C^\infty_0({\Bbb R})$ of smooth functions of compact support), together with the “soft” observation that the Fourier transform is continuous from $L^p({\Bbb R})$ to the space of tempered distributions, which is a Hausdorff space into which $L^{p'}({\Bbb R})$ embeds continuously. One can replace the Hausdorff-Young inequality here by countless other estimates in harmonic analysis to obtain similar qualitative regularity conclusions. $\diamond$

— Appendix: Nonlinear solvability (optional) —

In this appendix we give an example of a linear equations Lu=f which can only be quantitatively solved in a nonlinear fashion. We will use a number of basic tools which we will only cover later in this course, and so this material is optional reading.

Let $X = \{0,1\}^{\Bbb N}$ be the infinite discrete cube with the product topology; by Tychonoff’s theorem, this is a compact Hausdorff space. The Borel $\sigma$ -algebra is generated by the cylinder sets

$E_n := \{ (x_m)_{m=1}^\infty \in \{0,1\}^{\Bbb N}: x_n = 1 \}$ . (6)

(From a probabilistic view point, one can think of X as the event space for flipping a countably infinite number of coins, and $E_n$ as the event that the $n^{th}$ coin lands as heads.)

Let $M(X)$ be the space of finite Borel measures on X; this can be verified to be a Banach space. There is a map $L: M(X) \to \ell^\infty({\Bbb N})$ defined by

$L( \mu ) := ( \mu(E_n) )_{n=1}^\infty$ . (7)

This is a continuous linear transformation. The equation $Lu=f$ is quantitatively solvable for every $f \in \ell^\infty({\Bbb N})$ . Indeed, if f is an indicator function $f = 1_A$ , then $f = L \delta_{x_A}$ , where $x_A \in \{0,1\}^{\Bbb N}$ is the sequence that equals 1 on A and 0 outside of A, and $\delta_{x_A}$ is the Dirac mass at A. The general case then follows by expressing a bounded sequence as an integral of indicator functions (e.g. if f takes values in [0,1], we can write $f = \int_0^1 1_{\{f > t\}}\ dt$ ). Note however that this is a nonlinear operation, since the indicator $1_{\{f>t\}}$ depends nonlinearly on f.

We now claim that the equation $Lu=f$ is not quantitatively linearly solvable, i.e. there is no bounded linear map $S: \ell^\infty({\Bbb N}) \to M(X)$ such that LSf = f for all $f \in \ell^\infty({\Bbb N})$ . This fact was first observed by Banach and Mazur; we shall give two proofs, one of a “soft analysis” flavour and one of a “hard analysis” flavour.

We begin with the “soft analysis” proof, starting with a measure-theoretic result which is of independent interest.

Theorem 5. (Nikodym convergence theorem) Let $(X, {\mathcal B})$ be a measurable space, and let $\sigma_n: {\mathcal B} \to {\Bbb R}$ be a sequence of signed finite measures which is weakly convergent in the sense that $\sigma_n(E)$ converges to some limit $\sigma(E)$ for each $E \in {\mathcal B}$ . Then:

The $\sigma_n$ are uniformly countably additive, which means that for any sequence $E_1, E_2, \ldots$ of disjoint measurable sets, the series $\sum_{m=1}^\infty |\sigma_n(E_m)|$ converges uniformly in n.

$\sigma$ is a signed finite measure.

Proof. It suffices to prove the first part, since this easily implies that $\sigma$ is also countably additive, and is thence a signed finite measure. Suppose for contradiction that the claim failed, then one could find disjoint $E_1, E_2, \ldots$ and $\varepsilon > 0$ such that one has $\limsup_{n \to \infty} \sum_{m=M}^\infty |\sigma_n(E_m)| > \varepsilon$ for all M. We now construct disjoint sets $A_1, A_2, \ldots$ , each consisting of the union of a finite collection of the $E_j$ , and an increasing sequence $n_1, n_2, \ldots$ of positive integers, by the following recursive procedure:

Initialise $k=0$ .
Suppose recursively that $n_1 < \ldots < n_{2k}$ and $A_1,\ldots,A_k$ has already been constructed for some $k \geq 0$ .
Choose $n_{2k+1} > n_{2k}$ so large that for all $n \geq n_{2k+1}$ , $\sigma_n(A_1 \cup \ldots \cup A_k)$ differs from $\sigma(A_1 \cup \ldots \cup A_k)$ by at most $\varepsilon/10$ .
Choose $M_k$ so large that $M_k$ is larger than j for any $E_j \subset A_1 \cup \ldots \cup A_k$ , and such that $\sum_{m=M_k}^\infty |\sigma_{n_j}(E_m)| \leq \varepsilon / 100^{k+1}$ for all $1 \leq j \leq 2k+1$ .
Choose $n_{2k+2} > n_{2k+1}$ so that $\sum_{m=M_k}^\infty |\sigma_{n_{2k+2}}(E_m)| > \varepsilon$ .
Pick $A_{k+1}$ to be a finite union of the $E_j$ with $j \geq M_k$ such that $|\sigma_{n_{2k+2}}(A_{k+1})| > \varepsilon/2$ .
Increment k to k+1 and then return to Step 2.

It is then a routine matter to show that if $A := \bigcup_{j=1}^\infty A_j$ , then $|\sigma_{n_{2k+2}}(A) - \sigma_{n_{2k+1}}(A)| \geq \varepsilon/10$ for all j, contradicting the hypothesis that $\sigma_j$ is weakly convergent to $\sigma$ . $\Box$

Exercise 11. (Schur’s property for $\ell^1$ ) Show that if a sequence in $\ell^1({\Bbb N})$ is convergent in the weak topology, then it is convergent in the strong topology. $\diamond$

We return now to the map $S: \ell^\infty({\Bbb N}) \to M(X)$ . Consider the sequence $a_n \in c_0({\Bbb N}) \subset \ell^\infty$ defined by $a_n := (1_{m \leq n})_{m=1}^\infty$ , i.e. each $a_n$ is the sequence consisting of n 1’s followed by an infinite number of 0’s. As the dual of $c_0({\Bbb N})$ is isomorphic to $\ell^1({\Bbb N})$ , we see from the dominated convergence theorem that $a_n$ is a weakly Cauchy sequence in $c_0({\Bbb N})$ , in the sense that $\lambda(a_n)$ is Cauchy for any $\lambda \in c_0({\Bbb N})^*$ . Applying S, we conclude that $S(a_n)$ is weakly Cauchy in $M(X)$ . In particular, using the bounded linear functionals $\mu \mapsto \mu(E)$ on M(X), we see that $S(a_n)(E)$ converges to some limit $\mu(E)$ for all measurable sets E. Applying the Nikodym convergence theorem we see that $\mu$ is also a signed finite measure. We then see that $S(a_n)$ converges in the weak topology to $\mu$ . (One way to see this is to define $\nu := \sum_{n=1}^\infty 2^{-n} |S(a_n)| + |\mu|$ , then $\nu$ is finite and $S(a_n), \mu$ are all absolutely continuous with respect to $\nu$ ; now use the Radon-Nikodym theorem (see Notes 1) and the fact that $L^1(\nu)^* \equiv L^\infty(\nu)$ .) On the other hand, as $LS=I$ and L and S are both bounded, S is a Banach space isomorphism between $c_0$ and $S(c_0)$ . Thus $S(c_0)$ is complete, hence closed, hence weakly closed (by Hahn-Banach), and so $\mu = S(a)$ for some $a \in c_0$ . By Hahn-Banach again, this implies that $a_n$ converges weakly to $a \in c_0$ . But this is easily seen to be impossible, since the constant sequence $(1)_{m=1}^\infty$ does not lie in $c_0$ , and the claim follows.

Now we give the “hard analysis” proof. Let $e_1, e_2, \ldots$ be the standard basis for $\ell^\infty({\Bbb N})$ , let N be a large number, and consider the random sums

$S( \varepsilon_1 e_1 + \ldots + \varepsilon_N e_N )$ (8)

where $\varepsilon_n \in \{-1,1\}$ are iid random signs. Since the $\ell^\infty$ norm of $\varepsilon_1 e_1 + \ldots + \varepsilon_N e_N$ is 1, we have

$\| S( \varepsilon_1 e_1 + \ldots + \varepsilon_N e_N ) \|_{M(X)} \leq C$ (9)

for some constant C independent of N. On the other hand, we can write $S(e_n) = f_n \nu$ for some finite measure $\nu$ and some $f_n \in L^1(\nu)$ using Radon-Nikodym as in the previous proof, and then

$\| \varepsilon_1 f_1 + \ldots + \varepsilon_N f_N \|_{L^1(\nu)} \leq C.$ (10)

Taking expectations and applying Khintchine’s inequality we conclude

$\| (\sum_{n=1}^N |f_n|^2)^{1/2} \|_{L^1(\nu)} \leq C'$ (11)

for some constant C’ independent of N. By Cauchy-Schwarz this implies that

$\| \sum_{n=1}^N |f_n| \|_{L^1(\nu)} \leq C' \sqrt{N}$ (12)

But as $\|f_n\|_{L^1(\nu)} = \|S(e_n)\|_{M(X)} \geq c$ for some constant c > 0 independent of N, we obtain a contradiction for N large enough, and the claim follows.

Remark 9. The phenomenon of nonlinear quantitative solvability actually comes up in many applications of interest. For instance, consider the Fefferman-Stein decomposition theorem, which asserts that any $f \in BMO({\Bbb R})$ of bounded mean oscillation can be decomposed as $f = g + Hh$ for some $g, h \in L^\infty({\Bbb R})$ , where H is the Hilbert transform. This theorem was first proven by using the duality of the Hardy space $H^1({\Bbb R})$ and BMO (and by using Exercise 13 from Notes 6), and by using the fact that a function f is in $H^1({\Bbb R})$ if and only if f and Hf both lie in $L^1({\Bbb R})$ . From the open mapping theorem we know that we can pick g, h so that the $L^\infty$ norms of g, h are bounded by a multiple of the BMO norm of f. But it turns out not to be possible to pick g and h in a bounded linear manner in terms of f, although this is a little tricky to prove. (Uchiyama famously gave an explicit construction of g, h in terms of f, but the construction was highly nonlinear; see my blog post on the topic.)

An example in a similar spirit was given more recently by Bourgain and Brezis, who considered the problem of solving the equation $\hbox{div} u = f$ on the d-dimensional torus ${\Bbb T}^d$ for some function $f: {\Bbb T}^d \to {\Bbb C}$ on the torus with mean zero, and with some unknown vector field $u: {\Bbb T}^d \to {\Bbb C}^d$ , where the derivatives are interpreted in the weak sense. They showed that if $d \geq 2$ and $f \in L^d({\Bbb T}^d)$ , then there existed a solution u to this problem with $u \in W^{1,d} \cap C^0$ , despite the failure of Sobolev embedding at this endpoint. Again, the open mapping theorem allows one to choose u with norm bounded by a multiple of the norm of f, but Bourgain and Brezis also show that one cannot select u in a bounded linear fashion depending on f. $\diamond$

Question. All of the above constructions of non-complemented closed subspaces, or of linear problems that can only be quantitatively solved nonlinearly, were quite involved. Is there a “soft” or “elementary” way to see that closed subspaces of Banach spaces exist which are not complemented, or (equivalently) that surjective continuous linear maps between Banach spaces do not always enjoy a continuous linear right-inverse? I do not have a good answer to this question. $\diamond$

[Update, Feb 4: definition of “residual” corrected.]

71 comments

Comments feed for this article

2 February, 2009 at 2:11 am

Anonymous

Dear Terence,
you write
“There is also a remarkable construction of Gowers and Maurey of a Banach space such that every subspace, other than those ruled out by Exercise 10, are complemented. ”
Shouldn’t the sentence read : “There is also a remarkable construction of Gowers and Maurey of a Banach space such that every subspace, other than those ruled out by Exercise 10, is UNcomplemented” ?

Achille Talon

[Oops – corrected, thanks!]

2 February, 2009 at 11:58 pm

Combinations and Permutations

I know this is a bit off subject but I am a graduate student at UNLV as well as a weekly math based podcast called Combinations and Permutations where we start with a mathematical topic and spin off onto as many tangents as we can. You can follow the previous link to the blog page of our podcast, search for us on iTunes, or take a trip over to our host site http://cppodcast.libsyn.com. Give us a try I do think that you will enjoy what you hear.

4 February, 2009 at 1:11 pm

Anonymous

Dr. Tao,

The term “residual” has been defined in two different ways here. Is this a mistake or an unfortunate terminology ambiguity?

[Oops – that was a mistake, now fixed, hopefully. -T.]

6 February, 2009 at 4:15 am

Ulrich

Dear Terence, in your post “Remark 3”, Baire Category you mentioned, that the theorem of Banach-Steinaus (Corollary 3) is also valid for a net. But this can’t be true in general: E.g. if one takes $A = \mathbb{Z}$ and set
$T_{n} = I$ for $n > 0$ and equal $nI$ otherwise where
$I$ is the identity operator on an infinite dimensional Banach space. Then the net converges but is not bounded (I hope I entered the LaTeX code correctly)

6 February, 2009 at 7:38 am

Terence Tao

Ah, right: convergent nets are not necessarily bounded, only bounded for sufficiently large indices. One might hope to salvage Banach-Steinhaus replacing bounded by “bounded for sufficiently large”, but the $E_n$ cease to be closed now, and my guess is that the statement fails, though I can’t immediately think of a counterexample. In any case, I’ll remove the comment.

9 February, 2009 at 9:54 am

245B, Notes 10: Compactness in topological spaces « What’s new

[…] & PDE conferences pageBooksOn writingDoes one have to be a genius to do maths?About245B, Notes 9: The Baire category theorem and its Banach space consequencesWhy global regularity for Navier-Stokes is hard […]

14 February, 2009 at 2:06 am

liuxiaochuan

Dear Professor Tao:

In the following paragraph of Exercise 11, why is $S(c_0)$ closed?

By the way, in Theorem 5, the measures $\sigma$ becomes $\mu$ in the proof.

Xiaochuan

14 February, 2009 at 8:11 am

Terence Tao

Dear Xiaochuan,

Thanks for the correction!

The identity LS=I (and the boundedness of both L and S) shows that S is a Banach space isomorphism between $c_0$ and $S(c_0)$ . Thus $S(c_0)$ is complete and therefore closed.

14 February, 2009 at 5:46 pm

liuxiaochuan

Professor, why is $c_0$ complete, since $a_n\to a$ and a doesnot lie in $c_0$ ?

14 February, 2009 at 6:04 pm

Terence Tao

Dear Liuxiaochuan, that is the contradiction that establishes that S does not exist. (The completeness of $c_0$ is nothing more than the assertion that the uniform limit of any sequence of sequences converging to zero, also converges to zero.)

16 February, 2009 at 7:28 am

实分析0-10 « Liu Xiaochuan’s Weblog

[…] 第九节探讨Baire纲定理以及由此引发的几个大定理：一致有界原理，开映射定理，闭图像定理。把这些神奇的定理复习了一遍，加深了理解。要说Banach的工作，最著名的就就是这么几个定理，当真是刺激的数学好长时间的发展。Banach空间上的几何是非常奇特的，应该有很多人在做专门的研究。Tim Gowers在20几岁的时候就提出过有趣的Banach空间做反例，本节有提到。最后一个部分有一个非线性的应用。非线性偏微分方程我上学期花过一个多月下功夫，可是依然半途而废，好不遗憾，我认真的把这部分看完了，希望今后能够捡起来。 […]

21 February, 2009 at 9:34 pm

245B, Notes 11: The strong and weak topologies « What’s new

[…] can rephrase the uniform boundedness principle for convergence (Corollary 1 from Notes 9) as follows: Proposition 7 (Uniform boundedness principle) Let be a sequence of bounded linear […]

10 March, 2009 at 12:28 am

Samir

Dear Professor Tao,

I’m not sure I see how, in Theorem 3 (the Open Mapping Theorem), that L is open implies quantitative solvability (i.e. 2 => 4). How does this follow from linearity? Any help would be greatly appreciated!

Samir

10 March, 2009 at 7:28 am

Terence Tao

Dear Samir,

If L is open, then the image of the unit ball is open, and thus contains some open ball centred at the origin of some radius r. This means that if f has norm less than r, then there exists u of norm less than 1 such that Lu=f. By linearity, this means that if f has any norm, then there exists a solution to Lu=f such that u has norm at most $\|f\|/r$ .

10 March, 2009 at 8:48 pm

Samir

Dear Professor Tao,

In hindsight, the equivalence is simple, as you’d said. I wish I’d seen it myself earlier. Thanks for your help, as always!

Samir

27 March, 2009 at 11:18 am

Anonymous

Another notion of residual sets still occurs in the paragraph following the table comparing complete metric spaces and measurable spaces. [Corrected, thanks – T]

31 August, 2009 at 10:11 pm

Phi. Isett

Dear Professor Tao,

Some tiny things: Remark 1.. “(viewed as … ” has no end parentheses.

Proof of theorem 3: “that if E is any sense” should be “dense”. I think you also meant “of elements in E” instead of “of elements in Y” during the same proof.

Should Y be a Banach space in corollary 3?

Is there a standard quantitative way by which to prove injectivity of a linear map (when an estimate of the form $\| x \| \leq C \| Tx \|$ is unavailable)? I know the Fourier transform does not quite work this way (but it seems like having an inversion formula available is a bit special). It might be slightly related to your question in a dual sense..

For example, if you take an inclusion map $i : X \to Y$ of a subspace, the transpose between the dual spaces is surjective, but this fact requires the Hahn Bahnach theorem. It somehow seems completely silly that $i^t$ would have a nice right-inverse. Maybe this helps?

Thank you for the very helpful entry!
~Phil

1 September, 2009 at 1:31 am

Terence Tao

Thanks for the corrections!

Injectivity does not seem to have any good quantitative counterpart, as far as I know; the bound $\|x\| \leq C \|Tx\|$ certainly implies injectivity but is substantially stronger than it.

22 December, 2009 at 1:19 am

The double Duhamel trick and the in/out decomposition « What’s new

[…] equivalent to the non-complemented nature of a certain subspace of a Banach space; see these lecture notes of mine and this old blog post for some discussion.) So one could imagine a sophisticated nonlinear […]

12 February, 2010 at 9:17 am

Groß oder klein – dies ist manchmal die Frage – « UGroh's Weblog

[…] Zusammenfassung, die man in diesem Zusammenhang einfach durcharbeiten muss findet sich auf dem Blog von Terence Tao. Dieser Blog enthält viele Beispiele und Anwendungen der verschiedenen Prinzipien, die wir […]

10 May, 2010 at 8:48 pm

Anonymous

This question might be silly or unanswerable but I will ask it anyway.

In the above you explain that in the linear case one can approach qualitative statements through quantitative means and in using this approach ones doesn’t lose anything. In the limited reading I have done it seems that in the non-linear case (e.g. non-linear PDE) people expend effort in showing that qualitative properties of solutions are in fact equivalent to some quantitative property.

Of course I can see the appeal of the quantitative approach becuase not only do you get the qualitative result you also obtain estimates. My questions are:

1. Why do people fall back on the quantitative approach? Is it ‘easier’ to prove the quantitative statement than the qualitative one (even though it seems it should be the other way around since you are asking for more i.e. estimates)?

2. If it is easier then why is it easier?

Thanks in advance for your time.

10 May, 2010 at 8:51 pm

Anonymous

I should clarify something in my above question.

When I say “Why do people fall back on the quantitative approach?” what I really mean to ask is why do you hardly see people tackling the qualitative formulation directly as opposed to first reformulating it in a quantitative fashion.

10 May, 2010 at 10:03 pm

Terence Tao

In my opinion, the reason that the quantitative framework is more powerful is that it is more expressive, and so one can take advantage of more precise tools. Suppose for instance that one has two sequences $x_n, y_n$ of real numbers, with $x_n \to 0$ and $y_n \to \infty$ as $n \to \infty$ . What happens to $x_n y_n$ ? With a qualitative approach, one does not know – it could go to zero, to infinity, to something else, or not converge at all. But if one proceeds quantitatively instead, by establishing upper and lower bounds on $x_n$ and $y_n$ , then one can often get good enough bounds on $x_n y_n$ to settle the question.

More generally, there are any number of deep estimates whose proof involves a large number of delicate quantitative steps that do not have an obvious qualitative analogue, for instance by carefully comparing different upper and lower bounds to show that various error terms are dominated by a main term. In some cases one can encapsulate these arguments in a clean qualitative form (e.g. by asserting that some operation is continuous or bounded) but often one is not so lucky, and so one has to get one’s hands dirty.

8 July, 2010 at 11:21 am

Kestutis Cesnavicius

Several typos:

1. In the second sentence after the table “non-empty interior” should be “dense interior”.
2. In Exercise 3 the first appearance of $c$ should be $c_{\epsilon}$ .
3. In the first sentence of Exercise 4 the word “if” is missing.
4. In display (4) a bunch of indices $n$ should be $m$ .
5. In Remark 5 “some constant Y” should be “some constant C”.
6. In Exercise 9 I guess $L$ should be bounded linear.
7. In Exercise 10 “finite co-dimensional” should be “closed finite co-dimensional”.
8. In the end of the opening paragraph of “The closed graph theorem” an opening parenthesis is missing.
9. Shortly after display (7) the set Z of integers in the exponent should be the set N of natural numbers.
10. In the last line of the proof of Theorem 5 I believe that the indices $2k + 2$ and $2k + 1$ were meant to be $n_{2k + 2}$ and $n_{2k + 1}$ .

Thank you for a great set of notes!

[Corrected, thanks – T.]

24 January, 2011 at 8:17 pm

yucao

Prof. Tao,

In the Baire Category Theorem, “at least one of the $E_n$ is dense in a sub-ball B’ of B “, do you mean $\overline{E_n} contains sub-ball B'$ or $\overline{E_n}=B'$ ? The term seems strange here. Is it a custom to use “dense” in this way? On the other hand, is the ball closed or open? Or it does not matter here?

24 January, 2011 at 9:46 pm

yucao

I think you have already explain this in Remark 5,

245A, Notes 1: Lebesgue measure

24 January, 2011 at 9:22 pm

yucao

In the proof of “uniform boundedness, “The hypothesis 1 is nothing more than the assertion that the $E_n$ cover X” should be “The hypothesis 1 is nothing more than the assertion that the $\cup_nE_n$ cover X,”?

24 January, 2011 at 9:44 pm

yucao

Sorry for the stupid question. “the $E_n$ cover X” means “ $U_n E_n$ covers X.

19 August, 2011 at 7:11 am

Jack

Prof. Tao,

For exercise 1, do we have to use the proposition that a set $E$ is nowhere dense if and only if $\overline{E}^c$ is open and dense? This is what I learned from your topology 121 notes(http://www.math.ucla.edu/~tao/resource/general/121.1.00s/compact.pdf).

It seems that De Morgan will not help anything here.

Besides, it’s a little surprising that the opposite of “nowhere dense” is “dense” but not “somewhere dense” according to your lecture notes.

19 August, 2011 at 8:07 am

Terence Tao

Yes, one can use that proposition to establish Exercise 1.

I think you may be conflating two different notions of “opposite”, namely “logical negation” and “set complement“. The negation of “E is nowhere dense” is “E is somewhere dense”. But the complement of a nowhere dense set is an everywhere dense set. This is similar to how the complement of a closed set is an open set, but how the negation of “E is closed” is _not_ “E is open” (since there exist sets that are neither open nor closed, and there also exist sets that are both open and closed).

20 August, 2011 at 8:30 am

Jack

Hmm. In your notes, you use that proposition to prove that a countable number of nowhere dense sets cannot cover a complete metric space. But in Exercise 1, one needs to show that “a countable number of nowhere dense sets cannot cover a ball”, which seems a stronger conclusion. I am wondering if one needs some special technique here.

20 August, 2011 at 9:00 am

Terence Tao

A closed ball in a complete metric space is still complete.

20 August, 2011 at 8:27 pm

Jack

Hmm, the complement of a nowhere dense set is an everywhere dense set. But the complement of everywhere dense set is not necessarily nowhere dense set, right? So the relationship is not the exact the same as that of closed and open sets.

8 September, 2011 at 2:10 pm

254A, Notes 2: Building Lie structure from representations and metrics « What’s new

[…] (Hint: mimic the proof of the open mapping theorem for Banach spaces, as discussed for instance in these notes. In particular, take advantage of the Baire category […]

6 October, 2011 at 11:41 pm

Dirk

In the proof of the Nikodym convergence theorem the notation changes from $\sigma_n$ to $\mu_n$ . Unfortunately this typo is also present in the book where I noticed it…

[Corrected, thanks – T.]

7 October, 2011 at 12:27 am

An elementry proof of Schur’s Theorem « regularize

[…] this alternative proof with Exercise 11 (right after Theorem 5 [Nikodym convergence theorem]) in this blog post by Terry where one shall take a similar path to proof Schur’s […]

25 February, 2012 at 3:22 pm

Rex

As a pedagogical question, how many in-class lectures does it usually require for you to cover the content of one of your blog posts such as this one? Surely it is not possible to pack such an immense amount of material into a single 90-minute lecture.

20 November, 2012 at 6:30 pm

The closed graph theorem in various categories « What’s new

[…] qualitative and quantitative notions of regularity preservation properties of an operator ; see this blog post for further […]

1 September, 2013 at 6:44 am

Sylvester Eriksson-Bique

I could be wrong, but Corollary 1 seems to be incorrect as stated. Particularly the implication 3-> 2. As stated in 3, the operators T_n:X->Y converge on a dense set, but don’t necessarily “patch up” in the limit to anything. That is, one should assume that T_n->T on a dense subset and that the norms are bounded. Then 2 follows from 3. OR, one can assume Y a Banach space, in which case uniform boundedness and dense convergence implies that there exists such an operator T. The problem is if Y is not complete.

Example: X=L^1 and \phi_n \to \delta_0 (as n\to \infty) is a compactly supported, positive, smooth approximation of unity (as in http://en.wikipedia.org/wiki/Mollifier). Furthermore let Y=C(R,L^1), the continuous real valued functions on \R with bounded L^1 norm. This space is normed by L^1, and as it’s completion is L^1 it is not Banach. Now consider the operators: T_n: f\to f * \phi_n. By Young’s inequality and since \int \phi_n = 1, \phi_n>=0, then ||T_n(f)||_{L^1}=||f*\phi_n||_L_1<= ||f||_{L_1}||\phi_n||_L_1=||f||_L_1. In particular ||T_n||2 would be the only part requiring this. Another fix: Adjust the assumptions and assume in addition to convergence that there is some a priori limiting operator. Say if we assume that T_n \to T on a dense set and $T: X\to Y$ is some bounded linear operator, then the problem also disappears. In the above example e.g. take T_n f = f*\phi_n – f*\phi_{n+1}, then $T_n(y)\to 0$ on a dense set and in fact T_n(x)\to 0 everywhere.

1 September, 2013 at 6:47 am

Sylvester Eriksson-Bique

Sorry, copied my comment incorrectly:

1 September, 2013 at 6:49 am

Sylvester Eriksson-Bique

Bloody HTML:

Example: X=L^1 and \phi_n \to \delta_0 (as n\to \infty) is a compactly supported, positive, smooth approximation of unity (as in http://en.wikipedia.org/wiki/Mollifier). Furthermore let Y=C(R,L^1), the continuous real valued functions on \R with bounded L^1 norm. This space is normed by L^1, and as it’s completion is L^1 it is not Banach. Now consider the operators: T_n: f\to f * \phi_n. By Young’s inequality and since \int \phi_n = 1, \phi_n>=0, then ||T_n(f)||_{L^1}=||f*\phi_n||_L_1 \leq ||f||_{L_1}||\phi_n||_L_1=||f||_L_1. In particular ||T_n|| \leq 1, and we have uniform boundedness. Moreover for any continuous function $f$ (which are dense in X), we have by standard results for mollifiers that T_n(f)=f*\phi_n \to f in L^1, and since the entire sequence is in C(R), then the convergence is strong in Y. Thus we have dense convergence. But now let $f=\xhi(0,1)$, the characteristic function of the unit interval. Clearly f \nin Y, but f \in X. Also, T_n(f)\to f in L^1, and hence the limiting function is “not in” Y, and there is no limit for the operators T_n:X\to Y.

Fixes: Obviously if you replace Y with a banach space L^1, then T_n\to Id (Identity). 3\to2 would be the only part requiring this. Another fix: Adjust the assumptions and assume in addition to convergence that there is some a priori limiting operator. Say if we assume that T_n \to T on a dense set and $T: X\to Y$ is some bounded linear operator, then the problem also disappears. In the above example e.g. take T_n f = f*\phi_n – f*\phi_{n+1}, then $T_n(y)\to 0$ on a dense set and in fact $T_n(x)\to 0$ everywhere.

1 September, 2013 at 2:14 pm

Terence Tao

Thanks, I have corrected the corollary accordingly (by requiring the range Y to be a Banach space).

27 December, 2014 at 7:22 pm

Anonymous

“We now invoke the following easy observation: if E is nowhere dense, then every ball B contains a subball B’ which is disjoint from E. Indeed, this follows immediately from the definition of a nowhere dense set.”

By “disjoint”, do you mean the intersection is empty or the distance is positive?

27 December, 2014 at 8:05 pm

Anonymous

I don’t see why “this follows immediately from the definition of a nowhere dense set”…

Suppose there is a ball $B$ such that every subball $B'$ of $B$ is such that $B'\cap E\not=\emptyset$ . How would this contradicts the definition of $E$ ?

Following the definition, one can see that $\overline{E}^c$ is a nonempty open set, and thus there exists a ball $B$ which is disjoint from $E$ . It seems that the quoted observation above is stronger.

27 December, 2014 at 8:50 pm

Terence Tao

If $B$ has the above property, then every element of $B$ is an adherent point of $E$ , and thus $\overline{E}$ contains the ball $B$ , contradicting the nowhere dense nature of $E$ .

16 October, 2015 at 12:47 pm

Anonymous

In the proof of Theorem 2:

The hypothesis 1 is nothing more than the assertion that the $E_n$ cover $X$ , and thus by the Baire category theorem must be dense in a ball.

Do you mean that one of the $E_n$ (instead “all of them”) must be dense in some ball? Or do you mean (trivially) that the union of them is dense in a ball?

[Corrected, thanks – T.]

17 October, 2015 at 12:02 am

Anonymous

In the statement of theorem 5, “then” is missing.

[Corrected, thanks – T.]

31 October, 2016 at 3:54 am

Sunting

dear prof tao. i don’t know how to use the triangle inequality to get (4) to get(5) in Remark 2. i think the “Tam” in the left side of (4) should be “Tan” instead(then it is possible to use the triangle inequality). but i am not sure.

[Corrected, thanks – T.]

1 November, 2016 at 4:40 am

Sunting

in example 2. c0(N) is interpreted as sequence (a_n) and a_n goes to 0 as n is going to infinity. so i don’t think T is well-defined. since T(an) may not be in c0(N)?! we pick (an)=(1,1/2,1/3,…), then T(an)=(1,1,…) which is not in c0(N)

[ $c_0$ here should have been $c_c$ – T.]

9 November, 2016 at 11:23 pm

Ian

Professor Tao:

What is the Hahn-Banach Theorem used for in Exercise 6?

[It is not used, and the reference to it has been removed – T.]

9 January, 2017 at 4:00 pm

Anonymous

If a measure space also has the metric space structure, are there any relations between null sets and nowhere dense sets?

9 January, 2017 at 4:37 pm

Anonymous

Oh, this is a stupid question. Examples have been given in the notes.

9 January, 2017 at 4:14 pm

Anonymous

In Theorem 1: Is “the countable union of nowhere dense sets cannot contain a ball” the same as “the countable union of nowhere dense sets is nowhere dense”? (Just like the countable union of null sets is null sets.)

9 January, 2017 at 4:47 pm

Anonymous

At the end of the proof of Baire category theorem, I think you mean “in particular $x$ is an element of $B(x_0,r_0)$ ” instead of B.

[Corrected, thanks – T.]

27 February, 2017 at 5:49 am

1) It seems to me that the remark 3 contains a mistake. Convergent nets need not be bounded in general. We only have the (very useful) 3) ->2)->1). The implication 1) -> 3) seems problematic.

2) In the proof of Corollary, it is “Theorem 2” instead of “Theorem 1”.

3) In Corollary 1, a “Tx” is not in mathematical mode.

[Corrected, thanks. I do not see a quick way to recover the implication of 1 from 3, so it is probably best to just delete the remark. -T.]

5 April, 2017 at 12:34 am

Jochen Wengenroth

There is another interesting aspect of the open mapping theorem: It is enough to require “quantitative approximate solutions”: There are $a \in [0,1)$ and $b\in[0,\infty)$ such that for each $y \in Y$ there is $x \in X$ with $\|y-L(x)\| \le a \|y\|$ and $\|x\| \le b \|y\|$ . Then you get quantitative solvability with $C=b/(1-a)$ .

For example, the Tietze-Urysohn theorem can be proved with this condition for $a=2/3$ and $b=1/3$ . There are also non-linear versions for complete metric spaces of this principle (Schauder lemma).

11 May, 2018 at 5:31 am

Anonymous

In Example 1, it is said that

“As $C^\infty_0({\Bbb R})$ is known to be dense in $L^2({\Bbb R)}$ “.

I can’t find this result in the previous notes. Would you elaborate?

(As I go through the 245ABC series of notes, I find that results used are not necessarily in a strictly logical order, which is emphasized strongly in your two volumes of Analysis books. Is it intentional?)

12 May, 2018 at 1:15 pm

Terence Tao

See Exercise 1 of https://terrytao.wordpress.com/2009/04/19/245c-notes-3-distributions/ (this should also help answer your other question). I do eventually plan to revise the book form of these notes, in which I will change the order in which certain topics are introduced. (One reason for the discrepancy is that I actually taught and wrote notes for 245BC before I taught 245A.)

10 August, 2018 at 8:36 am

Debajyoti Choudhuri

Dear Prof Tao,

I am not sure but I cannot see why there has to exist at least one $E_n$ such that it is dense in a subball $B’$.

Thank you
DJ

[Take contrapositives (or argue by contradiction) – T.]

10 August, 2018 at 11:56 am

Debajyoti Choudhuri

I am sorry for this naive question but then shouldn’t it be the other way, i.e. there exists a subball B’ which is sitting in some E_n as a dense subset?.

10 August, 2018 at 10:45 pm

Debajyoti Choudhuri

or should it be “Let E_1, E_2, \ldots be an at most countable sequence of nowhere dense subsets…” instead of “Let E_1, E_2, \ldots be an at most countable sequence of subsets…”?.

11 August, 2018 at 12:50 pm

Terence Tao

Depends on whether you are using the first form of the Baire Category theorem or the second (they are essentially contrapositives of each other). The proof given in the blog post is for the second form, but this quickly implies the first form also.

First form: Let $E_1,E_2,\dots$ be an at most countable sequence of subsets of a complete metric space $X$ . If $\bigcup_n E_n$ contains a ball $B$ , then one of the $E_n$ is dense in a sub-ball $B'$ of $B$ .

Second form: Let $E_1,E_2,\dots$ be an at most countable sequence of nowhere dense subsets of a complete metric space $X$ . Then $\bigcup_n E_n$ cannot contain a ball.

11 August, 2018 at 9:12 pm

Debajyoti Choudhuri

Thank you very much.:)

10 August, 2018 at 8:39 am

Debajyoti Choudhuri

PS: My question is for the Theorem 1 (Baire category theorem).

31 May, 2019 at 2:42 am

Anonymous

how do we prove that a metric space X is of 2nd Baire category. Like what is it that we are suppose to show

14 May, 2020 at 7:26 am

247B, Notes 4: almost everywhere convergence of Fourier series | What's new

[…] Calderón) is similar in spirit to the uniform boundedness principle (see e.g. Corollary 1 of this previous blog post). The restriction is needed for just one implication (from (iii) to (ii)) in the arguments below, […]

29 November, 2020 at 3:14 am

Anonymous

Sir, in the proof of open mapping theorem I am not getting the following point :-
By subtracting two such approximate solutions, we conclude that
For any f \in B(0,2r) and any \varepsilon > 0, there exists u \in X with \|Lu – f \|_Y \leq 2\varepsilon and \|u\|_X \leq 2nr + 2 n \varepsilon.
Can you help me littile bit to understand this conclusion.
Thank you.

30 November, 2020 at 11:29 am

Terence Tao

Any $f \in B(0,2r)$ can be written as the difference $f = f_1 - f_2$ of two elements $f_1,f_2$ of $B(f_0,r)$ . Apply the hypothesis to find approximate solutions $u_1, u_2$ to $Lu_1 \approx f_1, Lu_2 \approx f_2$ respectively and then subtract.

8 April, 2023 at 3:52 am

Mayoorathy Vishagan

Dear Prof. Tao,
Can you please explain a bit what you mean in “general” and in this case by the term “regularity”. Thanks.

10 April, 2023 at 4:39 am

In remark (2), I don’t see how you can choose the signs such that the inequality holds? Are we using some property of conditional sums?

14 April, 2023 at 10:27 am

Terence Tao

This is a vector-valued version of the pigeonhole principle, and can be proven by contradiction and the triangle inequality.

	Anonymous on Erratum for “An inverse…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on A Banach algebra proof of the…
	Anonymous on A Banach algebra proof of the…
	Aleksandar on 245C, Notes 4: Sobolev sp…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Terence Tao on 245C, Notes 4: Sobolev sp…
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on What is a gauge?
	Terence Tao on Erratum for “An inverse…
	Terence Tao on 275A, Notes 3: The weak and st…

245B, Notes 9: The Baire category theorem and its Banach space consequences

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

71 comments

Leave a comment Cancel reply

For commenters

245B, Notes 9: The Baire category theorem and its Banach space consequences

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

71 comments

Leave a comment Cancel reply

For commenters