Algebraic probability spaces

28 June, 2014 in expository, math.CA, math.CT, math.DS, math.PR, math.RA | Tags: ergodic theory, Gelfand-Naimark-Segal construction, Loomis-Sikorski theorem, tracial algebra | by Terence Tao

As laid out in the foundational work of Kolmogorov, a classical probability space (or probability space for short) is a triplet ${(X, {\mathcal X}, \mu)}$ , where ${X}$ is a set, ${{\mathcal X}}$ is a ${\sigma}$ -algebra of subsets of ${X}$ , and ${\mu: {\mathcal X} \rightarrow [0,1]}$ is a countably additive probability measure on ${{\mathcal X}}$ . Given such a space, one can form a number of interesting function spaces, including

the (real) Hilbert space ${L^2(X, {\mathcal X}, \mu)}$ of square-integrable functions ${f: X \rightarrow {\bf R}}$ , modulo ${\mu}$ -almost everywhere equivalence, and with the positive definite inner product ${\langle f, g\rangle_{L^2(X, {\mathcal X}, \mu)} := \int_X f g\ d\mu}$ ; and
the unital commutative Banach algebra ${L^\infty(X, {\mathcal X}, \mu)}$ of essentially bounded functions ${f: X \rightarrow {\bf R}}$ , modulo ${\mu}$ -almost everywhere equivalence, with ${\|f\|_{L^\infty(X, {\mathcal X}, \mu)}}$ defined as the essential supremum of ${|f|}$ .

There is also a trace ${\tau = \tau_\mu: L^\infty(X, {\mathcal X}, \mu) \rightarrow {\bf C}}$ on ${L^\infty}$ defined by integration: ${\tau(f) := \int_X f\ d\mu}$ .

One can form the category ${\mathbf{Prb}}$ of classical probability spaces, by defining a morphism ${\phi: (X, {\mathcal X}, \mu) \rightarrow (Y, {\mathcal Y}, \nu)}$ between probability spaces to be a function ${\phi: X \rightarrow Y}$ which is measurable (thus ${\phi^{-1}(E) \in {\mathcal X}}$ for all ${E \in {\mathcal Y}}$ ) and measure-preserving (thus ${\mu(\phi^{-1}(E)) = \nu(E)}$ for all ${E \in {\mathcal Y}}$ ).

Let us now abstract the algebraic features of these spaces as follows; for want of a better name, I will refer to this abstraction as an algebraic probability space, and is very similar to the non-commutative probability spaces studied in this previous post, except that these spaces are now commutative (and real).

Definition 1 An algebraic probability space is a pair ${({\mathcal A}, \tau)}$ where

${{\mathcal A}}$ is a unital commutative real algebra;

${\tau: {\mathcal A} \rightarrow {\bf R}}$ is a homomorphism such that ${\tau(1)=1}$ and ${\tau( f^2 ) \geq 0}$ for all ${f \in {\mathcal A}}$ ;

Every element ${f}$ of ${{\mathcal A}}$ is bounded in the sense that ${\sup_{k \geq 1} \tau( f^{2k} )^{1/2k} < \infty}$ . (Technically, this isn’t an algebraic property, but I need it for technical reasons.)

A morphism ${\phi: ({\mathcal A}_1, \tau_1) \rightarrow ({\mathcal A}_2, \tau_2)}$ is a homomorphism ${\phi^*: {\mathcal A}_2 \rightarrow {\mathcal A}_1}$ which is trace-preserving, in the sense that ${\tau_1(\phi^*(f)) = \tau_2(f)}$ for all ${f \in {\mathcal A}_2}$ .

For want of a better name, I’ll denote the category of algebraic probability spaces as ${\mathbf{AlgPrb}}$ . One can view this category as the opposite category to that of (a subcategory of) the category of tracial commutative real algebras. One could emphasise this opposite nature by denoting the algebraic probability space as ${({\mathcal A}, \tau)^{op}}$ rather than ${({\mathcal A},\tau)}$ ; another suggestive (but slightly inaccurate) notation, inspired by the language of schemes, would be ${\hbox{Spec}({\mathcal A},\tau)}$ rather than ${({\mathcal A},\tau)}$ . However, we will not adopt these conventions here, and refer to algebraic probability spaces just by the pair ${({\mathcal A},\tau)}$ .

By the previous discussion, we have a covariant functor ${F: \textbf{Prb} \rightarrow \textbf{AlgPrb}}$ that takes a classical probability space ${(X, {\mathcal X}, \mu)}$ to its algebraic counterpart ${(L^\infty(X, {\mathcal X},\mu), \tau_\mu)}$ , with a morphism ${\phi: (X, {\mathcal X}, \mu) \rightarrow (Y, {\mathcal Y}, \nu)}$ of classical probability spaces mapping to a morphism ${F(\phi): (L^\infty(X, {\mathcal X},\mu), \tau_\mu) \rightarrow (L^\infty(Y, {\mathcal Y},\nu), \tau_\nu)}$ of the corresponding algebraic probability spaces by the formula

$\displaystyle F(\phi)^* f := f \circ \phi$

for ${f \in L^\infty(Y, {\mathcal Y}, \nu)}$ . One easily verifies that this is a functor.

In this post I would like to describe a functor ${G: \textbf{AlgPrb} \rightarrow \textbf{Prb}}$ which partially inverts ${F}$ (up to natural isomorphism), that is to say a recipe for starting with an algebraic probability space ${({\mathcal A}, \tau)}$ and producing a classical probability space ${(X, {\mathcal X}, \mu)}$ . This recipe is not new – it is basically the (commutative) Gelfand-Naimark-Segal construction (discussed in this previous post) combined with the Loomis-Sikorski theorem (discussed in this previous post). However, I wanted to put the construction in a single location for sake of reference. I also wanted to make the point that ${F}$ and ${G}$ are not complete inverses; there is a bit of information in the algebraic probability space (e.g. topological information) which is lost when passing back to the classical probability space. In some future posts, I would like to develop some ergodic theory using the algebraic foundations of probability theory rather than the classical foundations; this turns out to be convenient in the ergodic theory arising from nonstandard analysis (such as that described in this previous post), in which the groups involved are uncountable and the underlying spaces are not standard Borel spaces.

Let us describe how to construct the functor ${G}$ , with details postponed to below the fold.

Starting with an algebraic probability space ${({\mathcal A}, \tau)}$ , form an inner product on ${{\mathcal A}}$ by the formula ${\langle f, g \rangle := \tau(fg)}$ , and also form the spectral radius ${\rho(f) :=\lim_{k \rightarrow \infty} \tau(f^{2^k})^{1/2^k}}$ .
The inner product is clearly positive semi-definite. Quotienting out the null vectors and taking completions, we arrive at a real Hilbert space ${L^2 = L^2({\mathcal A},\tau)}$ , to which the trace ${\tau}$ may be extended.
Somewhat less obviously, the spectral radius is well-defined and gives a norm on ${{\mathcal A}}$ . Taking ${L^2}$ limits of sequences in ${{\mathcal A}}$ of bounded spectral radius gives us a subspace ${L^\infty = L^\infty({\mathcal A},\tau)}$ of ${L^2}$ that has the structure of a real commutative Banach algebra.
The idempotents ${1_E}$ of the Banach algebra ${L^\infty}$ may be indexed by elements ${E}$ of an abstract ${\sigma}$ -algebra ${{\mathcal B}}$ .
The Boolean algebra homomorphisms ${\delta_x: {\mathcal B} \rightarrow \{0,1\}}$ (or equivalently, the real algebra homomorphisms ${\iota_x: L^\infty \rightarrow {\bf R}}$ ) may be indexed by elements ${x}$ of a space ${X}$ .
Let ${{\mathcal X}}$ denote the ${\sigma}$ -algebra on ${X}$ generated by the basic sets ${\overline{E} := \{ x \in X: \delta_x(E) = 1 \}}$ for every ${E \in {\mathcal B}}$ .
Let ${{\mathcal N}}$ be the ${\sigma}$ -ideal of ${{\mathcal X}}$ generated by the sets ${\bigcap_n \overline{E_n}}$ , where ${E_n \in {\mathcal B}}$ is a sequence with ${\bigcap_n E_n = \emptyset}$ .
One verifies that ${{\mathcal B}}$ is isomorphic to ${{\mathcal X}/{\mathcal N}}$ . Using this isomorphism, the trace ${\tau}$ on ${L^\infty}$ can be used to construct a countably additive measure ${\mu}$ on ${{\mathcal X}}$ . The classical probability space ${(X, {\mathcal X}, \mu)}$ is then ${G( {\mathcal A}, \tau )}$ , and the abstract spaces ${L^2, L^\infty}$ may now be identified with their concrete counterparts ${L^2(X, {\mathcal X}, \mu)}$ , ${L^\infty(X, {\mathcal X}, \mu)}$ .
Every algebraic probability space morphism ${\phi: ({\mathcal A}_1,\tau_1) \rightarrow ({\mathcal A}_2,\tau_2)}$ generates a classical probability morphism ${G(\phi): (X_1, {\mathcal X}_1, \mu_1) \rightarrow (X_2, {\mathcal X}_2, \mu_2)}$ via the formula
$\displaystyle \delta_{G(\phi)(x_1)}( E_2 ) = \delta_{x_1}( \phi^*(E_2) )$

using a pullback operation ${\phi^*}$ on the abstract ${\sigma}$ -algebras ${{\mathcal B}_1, {\mathcal B}_2}$ that can be defined by density.

Remark 1 The classical probability space ${X}$ constructed by the functor ${G}$ has some additional structure; namely ${X}$ is a ${\sigma}$ -Stone space (a Stone space with the property that the closure of any countable union of clopen sets is clopen), ${{\mathcal X}}$ is the Baire ${\sigma}$ -algebra (generated by the clopen sets), and the null sets are the meager sets. However, we will not use this additional structure here.

The partial inversion relationship between the functors ${F: \textbf{Prb} \rightarrow \textbf{AlgPrb}}$ and ${G: \textbf{AlgPrb} \rightarrow \textbf{Prb}}$ is given by the following assertion:

There is a natural transformation from ${F \circ G: \textbf{AlgPrb} \rightarrow \textbf{AlgPrb}}$ to the identity functor ${I: \textbf{AlgPrb} \rightarrow \textbf{AlgPrb}}$ .

More informally: if one starts with an algebraic probability space ${({\mathcal A},\tau)}$ and converts it back into a classical probability space ${(X, {\mathcal X}, \mu)}$ , then there is a trace-preserving algebra homomorphism of ${{\mathcal A}}$ to ${L^\infty( X, {\mathcal X}, \mu )}$ , which respects morphisms of the algebraic probability space. While this relationship is far weaker than an equivalence of categories (which would require that ${F \circ G}$ and ${G \circ F}$ are both natural isomorphisms), it is still good enough to allow many ergodic theory problems formulated using classical probability spaces to be reformulated instead as an equivalent problem in algebraic probability spaces.

Remark 2 The opposite composition ${G \circ F: \textbf{Prb} \rightarrow \textbf{Prb}}$ is a little odd: it takes an arbitrary probability space ${(X, {\mathcal X}, \mu)}$ and returns a more complicated probability space ${(X', {\mathcal X}', \mu')}$ , with ${X'}$ being the space of homomorphisms ${\iota_x: L^\infty(X, {\mathcal X}, \mu) \rightarrow {\bf R}}$ . while there is “morally” an embedding of ${X}$ into ${X'}$ using the evaluation map, this map does not exist in general because points in ${X}$ may well have zero measure. However, if one takes a “pointless” approach and focuses just on the measure algebras ${({\mathcal X}, \mu)}$ , ${({\mathcal X}', \mu')}$ , then these algebras become naturally isomorphic after quotienting out by null sets.

Remark 3 An algebraic probability space captures a bit more structure than a classical probability space, because ${{\mathcal A}}$ may be identified with a proper subset of ${L^\infty}$ that describes the “regular” functions (or random variables) of the space. For instance, starting with the unit circle ${{\bf R}/{\bf Z}}$ (with the usual Haar measure and the usual trace ${\tau(f) = \int_{{\bf R}/{\bf Z}} f}$ ), any unital subalgebra ${{\mathcal A}}$ of ${L^\infty({\bf R}/{\bf Z})}$ that is dense in ${L^2({\bf R}/{\bf Z})}$ will generate the same classical probability space ${G( {\mathcal A}, \tau )}$ on applying the functor ${G}$ , namely one will get the space ${({\bf R}/{\bf Z})'}$ of homomorphisms from ${L^\infty({\bf R}/{\bf Z})}$ to ${{\bf R}}$ (with the measure induced from ${\tau}$ ). Thus for instance ${{\mathcal A}}$ could be the continuous functions ${C( {\bf R}/{\bf Z} )}$ , the Wiener algebra ${A({\bf R}/{\bf Z})}$ or the full space ${L^\infty({\bf R}/{\bf Z})}$ , but the classical space ${G( {\mathcal A}, \tau )}$ will be unable to distinguish these spaces from each other. In particular, the functor ${F \circ G}$ loses information (roughly speaking, this functor takes an algebraic probability space and completes it to a von Neumann algebra, but then forgets exactly what algebra was initially used to create this completion). In ergodic theory, this sort of “extra structure” is traditionally encoded in topological terms, by assuming that the underlying probability space ${X}$ has a nice topological structure (e.g. a standard Borel space); however, with the algebraic perspective one has the freedom to have non-topological notions of extra structure, by choosing ${{\mathcal A}}$ to be something other than an algebra ${C(X)}$ of continuous functions on a topological space. I hope to discuss one such example of extra structure (coming from the Gowers-Host-Kra theory of uniformity seminorms) in a later blog post (this generalises the example of the Wiener algebra given previously, which is encoding “Fourier structure”).

A small example of how one could use the functors ${F, G}$ is as follows. Suppose one has a classical probability space ${(X, {\mathcal X}, \mu)}$ with a measure-preserving action of an uncountable group ${\Gamma}$ , which is only defined (and an action) up to almost everywhere equivalence; thus for instance for any set ${E}$ and any ${g, h \in \Gamma}$ , ${T^{gh} E}$ and ${T^g T^h E}$ might not be exactly equal, but only equal up to a null set. For similar reasons, an element ${E}$ of the invariant factor ${{\mathcal X}^\Gamma}$ might not be exactly invariant with respect to ${\Gamma}$ , but instead one only has ${T^g E}$ and ${E}$ equal up to null sets for each ${g \in \Gamma}$ . One might like to “clean up” the action of ${\Gamma}$ to make it defined everywhere, and a genuine action everywhere, but this is not immediately achievable if ${\Gamma}$ is uncountable, since the union of all the null sets where something bad occurs may cease to be a null set. However, by applying the functor ${F}$ , each shift ${T^g: X \rightarrow X}$ defines a morphism ${T^g: L^\infty(X, {\mathcal X}, \mu) \rightarrow L^\infty(X, {\mathcal X}, \mu)}$ on the associated algebraic probability space (i.e. the Koopman operator), and then applying ${G}$ , we obtain a shift ${T^g: X' \rightarrow X'}$ on a new classical probability space ${(X', {\mathcal X}', \mu')}$ which now gives a genuine measure-preserving action of ${\Gamma}$ , and which is equivalent to the original action from a measure algebra standpoint. The invariant factor ${{\mathcal X}^\Gamma}$ now consists of those sets in ${{\mathcal X}'}$ which are genuinely ${\Gamma}$ -invariant, not just up to null sets. (Basically, the classical probability space ${(X', {\mathcal X}', \mu')}$ contains a Boolean algebra ${\overline{\mathcal B}}$ with the property that every measurable set ${A \in {\mathcal X}'}$ is equivalent up to null sets to precisely one set in ${\overline{\mathcal B}}$ , allowing for a canonical “retraction” onto ${\overline{\mathcal B}}$ that eliminates all null set issues.)

More indirectly, the functors ${F, G}$ suggest that one should be able to develop a “pointless” form of ergodic theory, in which the underlying probability spaces are given algebraically rather than classically. I hope to give some more specific examples of this in later posts.

— 1. Details —

We now flesh out the construction of ${G}$ that was sketched above. The arguments here are drawn from these two previous blog posts, with some minor simplifications coming from the commutativity of the algebraic probability space.

We begin with an algebraic probability space ${({\mathcal A}, \tau)}$ . As indicated, we then give ${{\mathcal A}}$ an inner product ${\langle, \rangle}$ on ${{\mathcal A}}$ by the formula

$\displaystyle \langle f, g \rangle := \tau( fg ).$

By construction we see that this is a positive semi-definite inner product. We let ${L^2 = L^2({\mathcal A},\tau)}$ be the associated completion of ${{\mathcal A}}$ after quotienting out by null vectors, thus ${L^2}$ is a real Hilbert space and we have an isometry ${\iota: {\mathcal A} \rightarrow L^2}$ with dense image. We use ${\| \|_{L^2}}$ to denote the norm on ${L^2}$ , thus

$\displaystyle \| \iota(f) \|_{L^2} = \tau(f^2)^{1/2}$

for ${f \in {\mathcal A}}$ .

From the Cauchy-Schwarz inequality, we see that

$\displaystyle |\tau(fg)| \leq \tau(f^2)^{1/2} \tau(g^2)^{1/2}$

for all ${f, g \in {\mathcal A}}$ . In particular, since ${\tau(1)=1}$ , we have

$\displaystyle |\tau(f)| \leq \tau(f^2)^{1/2}$

which implies that ${\tau(f^{2^k})^{1/2^k}}$ is a non-decreasing function of ${k=0,1,\dots}$ . As each ${f \in {\mathcal A}}$ is assumed to be bounded, we thus have a well-defined spectral radius

$\displaystyle \rho(f) := \lim_{k \rightarrow \infty} \tau(f^{2^k})^{1/2^k}.$

By monotonicity, we have

$\displaystyle |\tau(f)| \leq \| \iota(f)\|_{L^2} \leq \rho(f). \ \ \ \ \ (1)$

Also, from many applications of Cauchy-Schwarz we have

$\displaystyle |\tau(f^j g^k)| \leq \tau(f^{j+k})^{\frac{j}{j+k}} \tau(g^{j+k})^{\frac{k}{j+k}}$

for ${f,g \in {\mathcal A}}$ and ${j,k \geq 0}$ with ${j+k}$ summing to a power of two; in particular

$\displaystyle |\tau(f^j g^k)| \leq \rho(f)^j \rho(g)^k,$

which by the binomial theorem gives

$\displaystyle \rho(f+g) \leq \rho(f) + \rho(g)$

and also

$\displaystyle \rho(fg) \leq \rho(f) \rho(g),$

thus ${\rho}$ is an algebra norm on ${\iota({\mathcal A})}$ ; a similar argument also gives the inequality.

$\displaystyle \| \iota(fg) \|_{L^2} \leq \rho(f) \|\iota(g)\|_{L^2}.$

From this, we see that for each ${f \in {\mathcal A}}$ , the multiplication operator ${g \mapsto fg}$ on ${{\mathcal A}}$ induces a self-adjoint bounded linear operator on ${L^2}$ ; of operator norm at most ${\rho(f)}$ ; in fact, from the definition of ${\rho(f)}$ we see that this operator cannot have norm strictly less than ${\rho(f)}$ . Thus we have identified ${\iota({\mathcal A})}$ as a commutative normed algebra with a subalgebra of the space ${B(L^2)}$ of bounded linear operators on ${L^2}$ , with the operator norm.

We now define ${L^\infty = L^\infty({\mathcal A},\tau)}$ to be the space of functions ${f}$ in ${L^2}$ which are limits (in ${L^2}$ ) of sequences ${f_n}$ in ${\iota({\mathcal A})}$ of uniformly bounded spectral radius. The associated multiplication operators ${g \mapsto f_n g}$ are then uniformly bounded in operator norm, and converge in ${L^2}$ for fixed ${g \in \iota({\mathcal A})}$ ; thus ${f}$ defines a multiplication operator ${g \mapsto fg}$ that is a self-adjoint bounded linear operator on ${L^2}$ , which one can check to be independent of the choice of sequence. This identifies each element of ${L^\infty}$ with a self-adjoint element of ${B(H)}$ ; we then define the ${L^\infty}$ norm of an element ${f \in L^\infty}$ to be its operator norm in ${B(H)}$ , thus this extends the spectral radius ${\rho}$ on ${\iota({\mathcal A})}$ . Specialising ${g}$ to ${1}$ we see that this identification of ${L^\infty}$ with a subset of ${B(H)}$ is injective, and this gives ${L^\infty}$ the structure of a commutative Banach algebra, with ${L^2}$ being a module over ${L^\infty}$ . From construction, we also see that every closed ball in ${L^\infty}$ is also closed in ${L^2}$ .

Define an idempotent element of ${L^\infty}$ to be an element ${1_E}$ such that ${1_E^2 = 1_E}$ ; we let ${{\mathcal B}}$ denote an index set for the set of idempotents ${\{ 1_E: E \in {\mathcal B} \}}$ . We have the following basic density result, which ensures an ample supply of idempotents:

Lemma 2 The linear space spanned by the idempotents is dense in ${L^\infty}$ .

Proof: Let ${f \in L^\infty}$ ; our task is to approximate ${f}$ to arbitrary accuracy in ${L^\infty}$ norm by a finite linear combination of idempotents.

We view ${f}$ as a bounded self-adjoint linear operator on ${L^2}$ , which contains the unit vector ${1}$ . By the spectral theorem, we can find a Radon probability measure ${\mu_f}$ on ${[-\|f\|_{L^\infty}, \|f\|_{L^\infty}]}$ such that

$\displaystyle \langle f^k 1, 1 \rangle = \int x^k\ d\mu_f(x)$

for all ${k}$ , and in particular that

$\displaystyle \| P(f) \|_{L^2} = \int P(x)^2\ d\mu_f(x)$

for any polynomial ${P: {\bf R} \rightarrow {\bf R}}$ . Also we see that

$\displaystyle \|P(f)\|_{L^\infty} = \mu_f-\sup |P(x)|$

where ${\mu_f-\sup}$ denotes the ${\mu_f}$ -essential supremum. From this and a density argument, we have an ${L^\infty}$ functional calculus: given any bounded Borel function ${F: \hbox{supp}(\mu) \rightarrow {\bf R}}$ , we can find ${F(f) \in L^\infty}$ such that

$\displaystyle \| F(f) \|_{L^2} = \int F(x)^2\ d\mu_f(x)$

and

$\displaystyle \|F(f)\|_{L^\infty} = \mu_f-\sup |F(x)|,$

and such that the map ${F \mapsto F(f)}$ is a homomorphism. In particular, if ${F}$ is an indicator function, then ${F(f)}$ is an idempotent. Approximating the identity function in ${L^2(\mu_f)}$ by a finite combination of indicator functions, we obtain the claim. $\Box$

Now, we can give ${{\mathcal B}}$ the structure of an abstract Boolean algebra by defining intersection

$\displaystyle 1_{E \cap F} := 1_E 1_F$

and complement

$\displaystyle 1_{E^c} := 1 - 1_E$

and then defining union by de Morgan’s law ${E \cup F := (E^c \cap F^c)^c}$ . One can verify (somewhat tediously) that ${{\mathcal B}}$ obeys the axioms of an abstract Boolean algebra, and acquires an ordering ${\subseteq}$ in the usual manner, with minimal element ${1_\emptyset = 0}$ and maximal element ${1_X = 1}$ . If ${E \subset F}$ , then a short computation shows that

$\displaystyle \| 1_E - 1_F \|_{L^2}^2 = \tau(1_E) - \tau(1_F).$

In particular, if ${E_1 \supset E_2 \supset \dots}$ is a decreasing sequence in ${{\mathcal B}}$ , then the ${1_{E_n}}$ are Cauchy in ${L^2}$ , and thus converge to another idempotent ${1_E}$ ; we write ${E := \bigcap_{n=1}^\infty E_n}$ , and observe that this is the greatest lower bound of the ${E_n}$ . Similarly, any increasing sequence ${E_1 \subset E_2 \subset \dots}$ has a least upper bound ${\bigcup_{n=1}^\infty E_n}$ .

Now we consider the Boolean homomorphisms ${\delta_x: {\mathcal B} \rightarrow \{0,1\}}$ from ${{\mathcal B}}$ to the two-element Boolean algebra, or equivalently the space of finitely additive Boolean measures on ${{\mathcal B}}$ . We index this space by ${X}$ , thus ${\{ \delta_x: x \in X \}}$ is the space of Boolean homomorphisms. From Lemma 2, every such Boolean homomorphism ${\delta_x: {\mathcal B} \rightarrow \{0,1\}}$ uniquely determines a algebra homomorphism ${\iota_x: L^\infty \rightarrow {\bf R}}$ , and conversely every homomorphism comes from exactly one such homomorphism, thus ${\{ \iota_x: x \in X\}}$ is the space of algebra homomorphisms from ${L^\infty}$ to ${{\bf R}}$ .

One can identify ${X}$ with a closed subspace of the product space ${\{0,1\}^{\mathcal B}}$ , and so by Tychonoff’s theorem ${X}$ is a compact space. Every element ${E \in {\mathcal B}}$ of the abstract Boolean algebra ${{\mathcal B}}$ induces a subset ${\overline{E}}$ of ${X}$ defined by

$\displaystyle \overline{E} := \{ x \in X: \delta_x(E) = 1 \} = \{ x \in X: \iota_x( 1_E ) = 1 \}.$

The map ${E \mapsto \overline{E}}$ is easily seen to be a Boolean homomorphism. Let ${{\mathcal X}}$ be the ${\sigma}$ -algebra generated by the sets ${\overline{\mathcal B} := \{ \overline{E}: E \in {\mathcal B} \}}$ . Define a basic null set to be a subset of ${X}$ of the form ${\bigcap_{n=1}^\infty \overline{E_n}}$ with ${E_1 \supset E_2 \supset \dots}$ in ${{\mathcal B}}$ such that ${\bigcap_{n=1}^\infty E_n = \emptyset}$ , and let ${{\mathcal N}}$ be the collection of countable unions of basic null sets. This is a ${\sigma}$ -ideal of ${{\mathcal B}}$ , so we may form the quotient ${\sigma}$ -algebra ${{\mathcal B}/{\mathcal N}}$ . The map ${E \mapsto \overline{E} \hbox{ mod } {\mathcal N}}$ can be easily verified to be a ${\sigma}$ -algebra homomorphism (not just a boolean algebra homomorphism). We claim that this homomorphism is bijective (and thus an isomorphism). Surjectivity is clear from construction. For injectivity, suppose for contradiction that there was ${E \in {\mathcal B}}$ with ${E \neq \emptyset}$ such that ${\overline{E}}$ was in ${{\mathcal N}}$ , that is to say that ${\overline{E}}$ could be covered by a countable sequence of intersections ${\bigcap_{n=1}^\infty \overline{E_{n,m}}}$ with ${E_{1,m} \supset E_{2,m} \subset \dots}$ and ${\bigcap_{n=1}^\infty E_{n,m} = \emptyset}$ .

By induction, we may find ${n_1, n_2, \dots}$ such that ${\overline{E}}$ is not covered by ${\overline{E_{n_1,1}} \cup \dots \cup \overline{E_{n_m,m}}}$ for each ${m}$ . If we let ${E'_m := E \backslash \bigcup_{i=1}^m E_{n_i,i}}$ , we thus see that ${E'_1 \supset E'_2 \supset \dots}$ with each ${E'_m \neq \emptyset}$ , but ${\bigcap_{m=1}^\infty \overline{E'_m}}$ non-empty for all ${m}$ . But from the ultrafilter lemma, ${\overline{E'_m}}$ is non-empty for each ${m}$ , and ${\overline{E'_m}}$ is also closed, so we obtain a contradiction from compactness.

From the above isomorphism, we see that every element ${A}$ of ${{\mathcal X}}$ differs (up to an element of ${{\mathcal N}}$ ) by a unique set ${\overline{E}}$ in ${\overline{\mathcal B}}$ . We then define the measure ${\mu(A)}$ of ${A}$ by the formula

$\displaystyle \mu(A) := \tau(1_E).$

One can check that this gives a countably additive probability measure. We may now associate to each finite linear combination ${f}$ of idempotents, an element ${\overline{f}}$ of ${L^\infty(X,{\mathcal X}, \mu)}$ in such a way that the map ${f \mapsto \overline{f}}$ is an algebra homomorphism with

$\displaystyle \tau(f) = \int_X \overline{f}\ d\mu$

which implies that ${\|f\|_{L^\infty}}$ is the essential supremum of ${\overline{f}}$ , and ${\|f\|_{L^2}}$ is the ${L^2(X, {\mathcal X}, \mu)}$ norm of ${\overline{f}}$ . This and Lemma 2 allow us to define maps from ${L^\infty}$ to ${L^\infty(X, {\mathcal X}, \mu)}$ and ${L^2}$ to ${L^2( X, {\mathcal X}, \mu)}$ , which one easily verifies to be isomorphisms of Banach algebras and Hilbert spaces respectively.

We have now constructed the action of the functor ${G}$ on algebraic probability spaces. To finish the construction of ${G}$ , we have to describe the classical probability morphism ${G(\phi): (X_1, {\mathcal X}_1, \mu_1) \rightarrow (X_2, {\mathcal X}_2, \mu_2)}$ associated to an algebraic probability space morphism ${\phi: ({\mathcal A}_1,\tau_1) \rightarrow ({\mathcal A}_2,\tau_2)}$ . The pullback map ${\phi^*: {\mathcal A}_2 \rightarrow {\mathcal A}_1}$ preserves the ${L^2}$ norm and spectral radius, and thus also extends to a Hilbert space isometry ${\phi^*: L^2({\mathcal A}_2,\tau_2) \rightarrow L^2({\mathcal A}_1, \tau_1)}$ and a Banach algebra isometry ${\phi^*: L^\infty({\mathcal A}_2, \tau_2) \rightarrow L^\infty({\mathcal A}_1, \tau_1)}$ . As a consequence, we also have a ${\sigma}$ -algebra homomorphism ${\phi^*: {\mathcal B}_2 \rightarrow {\mathcal B}_1}$ . We then define ${G(\phi) \in X_2}$ by the formula

$\displaystyle \delta_{G(\phi)(x_1)}( E_2 ) = \delta_{x_1}( \phi^*(E_2) )$

for all ${x_1 \in X_1}$ and ${E_2 \in {\mathcal B}_2}$ ; one verifies that this indeed defines ${G(\phi)}$ as an element of ${X_2}$ (i.e., ${\delta_{G(\phi)(x_1)}: {\mathcal B}_2 \rightarrow \{0,1\}}$ is a Boolean algebra homomorphism. It is then a routine but tedious matter to check that ${G(\phi)}$ is a classical probability morphism and that ${G}$ is a functor.

Finally, the homomorphism ${\iota: {\mathcal A} \rightarrow L^\infty({\mathcal A}, \tau)}$ can viewed as a morphism from the abstract probability space ${(L^\infty({\mathcal A}, \tau), \tau) = F \circ G( {\mathcal A}, \tau )}$ to ${({\mathcal A}, \tau)}$ . Given a morphism ${\phi: ({\mathcal A}_1,\tau_1) \rightarrow ({\mathcal A}_2,\tau_2)}$ , the pullback maps ${\phi^*: {\mathcal A}_2 \rightarrow {\mathcal A}_1}$ and ${\phi^*: L^\infty({\mathcal A}_2, \tau_2) \rightarrow L^\infty({\mathcal A}_1, \tau_1)}$ are intertwined by these morphisms, so we have a natural transformation from ${F \circ G}$ to the identity functor, as claimed.

14 comments

Comments feed for this article

28 June, 2014 at 10:41 pm

Will Sawin

Once you quotient out by the null sets, are these adjoint functors?

29 June, 2014 at 9:35 am

Terence Tao

There does seem to be some relationship of this form (although to state it properly, one would need an intermediate category between $\textbf{Prb}$ and $\textbf{AlgPrb}$ , consisting of measure algebras in which the null sets have been quotiented out). Given a classical probability space $X$ and an algebraic probability space ${\mathcal A}$ , a morphism from $F(X)$ to ${\mathcal A}$ is essentially a representation of ${\mathcal A}$ in $L^\infty(X)$ . One can then represent $L^2({\mathcal A})$ in $L^2(X)$ , and thence $L^\infty({\mathcal A})$ in $L^\infty(X)$ , which one can use to write $G({\mathcal A})$ as a factor of $X$ on the level of measure algebras (i.e. after quotienting by null sets). These steps are all reversible, and I’m pretty sure they’re natural. So it looks like one can identify $\textup{Hom}(F(X),{\mathcal A})$ with $\textup{Hom}(X, G({\mathcal A}))$ if we apply the appropriate forgetful functors for quotienting out null sets.

30 June, 2014 at 2:04 am

In Definition 1, what do you mean by a *-homomorphism? I do not see any involution… Isn’t a unital algebra morphism enough? Thanks.

[Oops, this is a leftover from a previous iteration of the post; corrected now. -T.]

30 June, 2014 at 7:40 pm

Amy Huang

Why the morphis needs to be trace-preserving?

[By definition – T.]

2 July, 2014 at 2:02 pm

vznvzn

congratulations! :idea: :!:

3 July, 2014 at 9:28 am

$3M Breakthrough Prizes now awarded in mathematics! | Turing Machine

[…] CONGRATULATIONS! :idea: :!: […]

7 July, 2014 at 8:30 pm

omar aboura

There is an extra space in “Somewh at less obviously”.

[Corrected, thanks – T.]

8 July, 2014 at 2:42 am

Thank you so much for writing this. I’d heard about the random variable / probability space divide in probability theory, and I had always assumed it was parallel to the ring / variety dichotomy in algebraic geometry. So it’s nice to see it all laid out. So would you say then that working probabilists think in terms of algebraic probablility spaces rather than (usual) probability space? Or that algebraic probability spaces are the true objects of study in probability theory? Or am I missing the point?

I’m really looking forward to the follow-up posts. I’d be particularly interested to hear more about the boundedness axiom in your definition of algebraic probability space. You say you need it only for technical reasons. So is there any chance of removing it or replacing it with something more palatable from the algebraic or category theoretic perspective while maintaining the ability to ‘do probability theory’ on such a thing?

A second question: In algebraic geometry, a ring/scheme is not much more general than a nilpotent-free ring/variety. This is because all rings with nilpotents can be obtained by intersecting varieties together, as long as you do it in the algebraically natural manner, i.e. just tensor coordinate rings together and don’t follow that by modding out by nilpotents (=ignore tangency information when intersecting). Further, infinitely generated rings can be obtained as limits of finitely generated rings. So finitely generated rings without nilpotents, the objects of classical algebraic geometry, reassuringly determine the whole scheme-theoretic story as long as you take a suitably enlightened point of view. Is there any counterpart to this in the world of probability spaces? Is there some sense in which every algebraic probability space can be built out usual ones by applying some routine constructions?

8 July, 2014 at 10:39 am

Terence Tao

When working with noncommutative probability (e.g. with quantum probability, or with free probability) then one certainly takes the algebraic perspective (although often with some additional analytic axioms beyond the boundedness axiom here, e.g. that the algebra of random variables forms a von Neumann algebra). Indeed, much as in algebraic geometry, in the noncommutative setting it becomes quite unwieldy to try to work in a classical setting of spaces of points. See https://terrytao.wordpress.com/2010/02/10/245a-notes-5-free-probability/ for more discussion. [Note that in the noncommutative framework, one cannot easily reduce to the consideration of purely real (or more precisely, self-adjoint) random variables, since the space of such random variables is no longer closed under multiplication, and must instead work in a *-algebra of random variables rather than a real algebra, with a trace that is now complex-linear instead of real-linear. So the axiomatisation in the noncommutative case is a little different from the one given in this post.]

With more traditional probability, the philosophy is indeed largely to suppress the precise nature of the sample space and to work primarily with concepts flowing from the algebra of random variables, e.g. expectation, variance, correlation, and independence. See https://terrytao.wordpress.com/2010/01/01/254a-notes-0-a-review-of-probability-theory/ for more discussion. However, for some applications (particularly in ergodic theory) it has traditionally been convenient to impose some additional topological regularity hypotheses on the probability space (e.g. that it is a standard Borel space) in order to use some tools from descriptive set theory and classical analysis (e.g. Riesz representation, disintegration of measures, measurable sections). I believe though that by pursuing a primarily algebraic approach, most of these additional topological assumptions may be dropped, albeit at the cost of making the arguments look rather different (as one has to make everything that is currently based on a classical point-theoretic interpretation of the sample space “pointless” or “point-free”). I hope to give some examples of this in future posts.

As regards the second question, certainly in many applications (e.g. in the ergodic theory of actions of countable groups) one can often reduce to probability spaces which are separable (countably generated), which is perhaps analogous to the finitely generated rings in the algebraic geometry context. (Nilpotency does not arise as an issue in commutative probability (the spectral theorem eliminates the possibility of nilpotents when one commutes with one’s adjoint), though of course one could consider it in noncommutative contexts. But the dividing line between normal (commuting with one’s adjoint) and non-normal is typically more important than the dividing line between nilpotent and non-nilpotent, at least in the context of free probability which is the subfield of noncommutative probability that I am the most familiar with; I think this flows from the heavy reliance on the trace in algebraic probability theory, which does not have much of an analogue in algebraic geometry since one doesn’t do much integration on varieties from a purely algebraic viewpoint, whereas one is always taking expectations of random variables.)

15 July, 2014 at 4:22 am

Real analysis relative to a finite measure space | What's new

[…] standard Borel spaces (in fact, I will take them to be algebraic probability spaces, as defined in this previous post), in which case disintegrations are not available. However, it appears that the stochastic analysis […]

31 August, 2014 at 6:05 am

jingpyi

Reblogged this on jingpyi.

3 October, 2015 at 2:58 pm

275A, Notes 1: Integration and expectation | What's new

[…] the foundations of probability theory on the sample space models as discussed in Notes 0. (See this previous post for one such axiomatic […]

29 June, 2019 at 3:13 pm

Symmetric functions in a fractional number of variables, and the multilinear Kakeya conjecture | What's new

[…] commutative von Neumann algebra type structure (or the abstract probability structure discussed in this previous post) and then interpret it as a genuine measure space rather than as a virtual one. (This failure of […]

20 December, 2021 at 10:55 pm

Anonymous

What is the conditions on \tau_{\mu} so the one-to-one relation between the Classical and the Algebraic probability spaces holds? Could any functional other than expectation work?

	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Pointwise ergodic theorems for…
	Anonymous on 275A, Notes 3: The weak and st…
	Terence Tao on Pointwise ergodic theorems for…
	Terence Tao on Erratum for “An inverse…
	Anonymous on Notes on the B+B+t theore…
	Anonymous on Pointwise ergodic theorems for…
	Anonymous on Erratum for “An inverse…
	Erratum for “A… on An inverse theorem for the Gow…
	Anonymous on Analysis II
	Anonymous on Notes on the B+B+t theore…
	Anonymous on Twisted convolution and the se…
	Anonymous on A generalized Cauchy-Schwarz i…
	Notes on the B+B+t t… on Ultrafilters, nonstandard anal…

Algebraic probability spaces

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

14 comments

Leave a comment Cancel reply

For commenters

Algebraic probability spaces

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

14 comments

Leave a comment Cancel reply

For commenters