This is an adaptation of a talk I gave recently for a program at IPAM. In this talk, I gave a (very informal and non-rigorous) overview of Hrushovski’s use of model-theoretic techniques to establish new Freiman-type theorems in non-commutative groups, and some recent work in progress of Ben Green, Tom Sanders and myself to establish combinatorial proofs of some of Hrushovski’s results.
This is the last reading seminar of this quarter for the Hrushovski paper. Anush Tserunyan continued working through her notes on stable theories. We introduced the key notion of non-forking extensions (in the context of stable theories, at least) of types when constants are added; these are extensions which are “as generic as possible” with respect to the constants being added. The existence of non-forking extensions can be used for instance to generate Morley sequences – sequences of indiscernibles which are “in general position” in some sense.
Starting in the winter quarter (Monday Jan 4, to be precise), I will be giving a graduate course on random matrices, with lecture notes to be posted on this blog. The topics I have in mind are somewhat fluid, but my initial plan is to cover a large fraction of the following:
- Central limit theorem, random walks, concentration of measure
- The semicircular and Marcenko-Pastur laws for bulk distribution
- A little bit on the connections with free probability
- The spectral distribution of GUE and gaussian random matrices; theory of determinantal processes
- A little bit on the connections with orthogonal polynomials and Riemann-Hilbert problems
- Singularity probability and the least singular value; connections with the Littlewood-Offord problem
- The circular law
- Universality for eigenvalue spacing; Erdos-Schlein-Yau delocalisation of eigenvectors and applications
If time permits, I may also cover
- The Tracy-Widom law
- Connections with Dyson Brownian motion and the Ornstein-Uhlenbeck process; the Erdos-Schlein-Yau approach to eigenvalue spacing universality
- Conjectural connections with zeroes of the Riemann zeta function
Depending on how the course progresses, I may also continue it into the spring quarter (or else have a spring graduate course on a different topic – one potential topic I have in mind is dynamics on nilmanifolds and applications to combinatorics).
Ben Green, Tamar Ziegler and I have just uploaded to the arXiv our paper “An inverse theorem for the Gowers norm“. This paper establishes the next case of the inverse conjecture for the Gowers norm for the integers (after the
case, which was done by Ben and myself a few years ago). This conjecture has a number of combinatorial and number-theoretic consequences, for instance by combining this new inverse theorem with previous results, one can now get the correct asymptotic for the number of arithmetic progressions of primes of length five in any large interval
.
To state the inverse conjecture properly requires a certain amount of notation. Given a function and a shift
, define the multiplicative derivative
and then define the Gowers norm of a function
to (essentially) be the quantity
where we extend f by zero outside of . (Actually, we use a slightly different normalisation to ensure that the function 1 has a
norm of 1, but never mind this for now.)
Informally, the Gowers norm measures the amount of bias present in the
multiplicative derivatives of
. In particular, if
for some polynomial
, then the
derivative of
is identically 1, and so is the Gowers norm.
However, polynomial phases are not the only functions with large Gowers norm. For instance, consider the function , which is what we call a quadratic bracket polynomial phase. This function isn’t quite quadratic, but it is close enough to being quadratic (because one has the approximate linearity relationship
holding a good fraction of the time) that it turns out that third derivative is trivial fairly often, and the Gowers norm
is comparable to 1. This bracket polynomial phase can be modeled as a nilsequence
, where
is a polynomial orbit on a nilmanifold
, which in this case has step 2. (The function F is only piecewise smooth, due to the discontinuity in the floor function
, so strictly speaking we would classify this as an almost nilsequence rather than a nilsequence, but let us ignore this technical issue here.) In fact, there is a very close relationship between nilsequences and bracket polynomial phases, but I will detail this in a later post.
The inverse conjecture for the Gowers norm, GI(s), asserts that such nilsequences are the only obstruction to the Gowers norm being small. Roughly speaking, it goes like this:
Inverse conjecture, GI(s). (Informal statement) Suppose that
is bounded but has large
norm. Then there is an s-step nilsequence
of “bounded complexity” that correlates with f.
This conjecture is trivial for s=0, is a short consequence of Fourier analysis when s=1, and was proven for s=2 by Ben and myself. In this paper we establish the s=3 case. An equivalent formulation in this case is that any bounded function of large
norm must correlate with a “bracket cubic phase”, which is the product of a bounded number of phases from the following list
(*)
for various real numbers .
It appears that our methods also work in higher step, though for technical reasons it is convenient to make a number of adjustments to our arguments to do so, most notably a switch from standard analysis to non-standard analysis, about which I hope to say more later. But there are a number of simplifications available on the s=3 case which make the argument significantly shorter, and so we will be writing the higher s argument in a separate paper.
The arguments largely follow those for the s=2 case (which in turn are based on this paper of Gowers). Two major new ingredients are a deployment of a normal form and equidistribution theory for bracket quadratic phases, and a combinatorial decomposition of frequency space which we call the sunflower decomposition. I will sketch these ideas below the fold.
is the fundamental equation of motion for (non-relativistic) quantum mechanics, modeling both one-particle systems and -particle systems for
. Remarkably, despite being a linear equation, solutions
to this equation can be governed by a non-linear equation in the large particle limit
. In particular, when modeling a Bose-Einstein condensate with a suitably scaled interaction potential
in the large particle limit, the solution can be governed by the cubic nonlinear Schrödinger equation
I recently attended a talk by Natasa Pavlovic on the rigorous derivation of this type of limiting behaviour, which was initiated by the pioneering work of Hepp and Spohn, and has now attracted a vast recent literature. The rigorous details here are rather sophisticated; but the heuristic explanation of the phenomenon is fairly simple, and actually rather pretty in my opinion, involving the foundational quantum mechanics of -particle systems. I am recording this heuristic derivation here, partly for my own benefit, but perhaps it will be of interest to some readers.
This discussion will be purely formal, in the sense that (important) analytic issues such as differentiability, existence and uniqueness, etc. will be largely ignored.
After a one-week hiatus, we are resuming our reading seminar of the Hrushovski paper. This week, we are taking a break from the paper proper, and are instead focusing on the subject of stable theories (or more precisely, -stable theories), which form an important component of the general model-theoretic machinery that the Hrushovski paper uses. (Actually, Hrushovski’s paper needs to work with more general theories than the stable ones, but apparently many of the tools used to study stable theories will generalise to the theories studied in this paper.)
Roughly speaking, stable theories are those in which there are “few” definable sets; a classic example is the theory of algebraically closed fields (of characteristic zero, say), in which the only definable sets are boolean combinations of algebraic varieties. Because of this paucity of definable sets, it becomes possible to define the notion of the Morley rank of a definable set (analogous to the dimension of an algebraic set), together with the more refined notion of Morley degree of such sets (analogous to the number of top-dimensional irreducible components of an algebraic set). Stable theories can also be characterised by their inability to order infinite collections of elements in a definable fashion.
The material here was presented by Anush Tserunyan; her notes on the subject can be found here. Let me also repeat the previous list of resources on this paper (updated slightly):
- Henry Towsner’s notes (which most of Notes 2-4 have been based on; updated to remove references to homogeneity, using only countable saturation);
- Alex Usvyatsov’s notes on the derivation of Corollary 1.2 (broadly parallel to the notes here);
- Lou van den Dries’ notes (covering most of what we have done so far, and also material on stable theories); and
- Anand Pillay’s sketch of a simplified proof of Theorem 1.1.
[A little bit of advertising on behalf of my maths dept. Unfortunately funding for this scholarship was secured only very recently, so the application deadline is extremely near, which is why I am publicising it here, in case someone here may know of a suitable applicant. - T.]
UCLA Mathematics has launched a new scholarship to be granted to an entering freshman who has an exceptional background and promise in mathematics. The UCLA Math Undergraduate Merit Scholarship provides for full tuition, and a room and board allowance. To be considered for fall 2010, candidates must apply on or before November 30, 2009. Details and online application for the scholarship are available here.
Eligibility Requirements:
- 12th grader applying to UCLA for admission in Fall of 2010.
- Outstanding academic record and standardized test scores.
- Evidence of exceptional background and promise in mathematics, such as: placing in the top 25% in the U.S.A. Mathematics Olympiad (USAMO) or comparable (International Mathematics Olympiad level) performance on a similar national competition.
- Strong preference will be given to International Mathematics Olympiad medalists.
In the course of the ongoing logic reading seminar at UCLA, I learned about the property of countable saturation. A model of a language
is countably saturated if, every countable sequence
of formulae in
(involving countably many constants in
) which is finitely satisfiable in
(i.e. any finite collection
in the sequence has a solution
in
), is automatically satisfiable in
(i.e. there is a solution
to all
simultaneously). Equivalently, a model is countably saturated if the topology generated by the definable sets is countably compact.
Update, Nov 19: I have learned that the above terminology is not quite accurate; countable saturation allows for an uncountable sequence of formulae, as long as the constants used remain finite. So, the discussion here involves a weaker property than countable saturation, which I do not know the official term for. If one chooses a special type of ultrafilter, namely a “countably incomplete” ultrafilter, one can recover the full strength of countable saturation, though it is not needed for the remarks here. Most models are not countably saturated. Consider for instance the standard natural numbers as a model for arithmetic. Then the sequence of formulae “
” for
is finitely satisfiable in
, but not satisfiable.
However, if one takes a model of
and passes to an ultrapower
, whose elements
consist of sequences
in
, modulo equivalence with respect to some fixed non-principal ultrafilter
, then it turns out that such models are automatically countably compact. Indeed, if
are finitely satisfiable in
, then they are also finitely satisfiable in
(either by inspection, or by appeal to Los’s theorem and/or the transfer principle in non-standard analysis), so for each
there exists
which satisfies
. Letting
be the ultralimit of the
, we see that
satisfies all of the
at once.
In particular, non-standard models of mathematics, such as the non-standard model of the natural numbers, are automatically countably saturated.
This has some cute consequences. For instance, suppose one has a non-standard metric space (an ultralimit of standard metric spaces), and suppose one has a standard sequence
of elements of
which are standard-Cauchy, in the sense that for any standard
one has
for all sufficiently large
. Then there exists a non-standard element
such that
standard-converges to
in the sense that for every standard
one has
for all sufficiently large
. Indeed, from the standard-Cauchy hypothesis, one can find a standard
for each standard
that goes to zero (in the standard sense), such that the formulae “
” are finitely satisfiable, and hence satisfiable by countable saturation. Thus we see that non-standard metric spaces are automatically “standardly complete” in some sense.
This leads to a non-standard structure theorem for Hilbert spaces, analogous to the orthogonal decomposition in Hilbert spaces:
Theorem 1 (Non-standard structure theorem for Hilbert spaces) Let
be a non-standard Hilbert space, let
be a bounded (external) subset of
, and let
. Then there exists a decomposition
, where
is “almost standard-generated by
” in the sense that for every standard
, there exists a standard finite linear combination of elements of
which is within
of
, and
is “standard-orthogonal to
” in the sense that
for all
.
Proof: Let be the infimum of all the (standard) distances from
to a standard linear combination of elements of
, then for every standard
one can find a standard linear combination
of elements of
which lie within
of
. From the parallelogram law we see that
is standard-Cauchy, and thus standard-converges to some limit
, which is then almost standard-generated by
by construction. An application of Pythagoras then shows that
is standard-orthogonal to every element of
.
This is the non-standard analogue of a combinatorial structure theorem for Hilbert spaces (see e.g. Theorem 2.6 of my FOCS paper). There is an analogous non-standard structure theorem for -algebras (the counterpart of Theorem 3.6 from that paper) which I will not discuss here, but I will give just one sample corollary:
Theorem 2 (Non-standard arithmetic regularity lemma) Let
be a non-standardly finite abelian group, and let
be a function. Then one can split
, where
is standard-uniform in the sense that all Fourier coefficients are (uniformly)
, and
is standard-almost periodic in the sense that for every standard
, one can approximate
to error
in
norm by a standard linear combination of characters (which is also bounded).
This can be used for instance to give a non-standard proof of Roth’s theorem (which is not much different from the “finitary ergodic” proof of Roth’s theorem, given for instance in Section 10.5 of my book with Van Vu). There is also a non-standard version of the Szemerédi regularity lemma which can be used, among other things, to prove the hypergraph removal lemma (the proof then becomes rather close to the infinitary proof of this lemma in this paper of mine). More generally, the above structure theorem can be used as a substitute for various “energy increment arguments” in the combinatorial literature, though it does not seem that there is a significant saving in complexity in doing so unless one is performing quite a large number of these arguments.
One can also cast density increment arguments in a nonstandard framework. Here is a typical example. Call a non-standard subset of a non-standard finite set
dense if one has
for some standard
.
Theorem 3 Suppose Szemerédi’s theorem (every set of integers of positive upper density contains an arithmetic progression of length
) fails for some
. Then there exists an unbounded non-standard integer
, a dense subset
of
with no progressions of length
, and with the additional property that
for any subprogression
of
of unbounded size (thus there is no sizeable density increment on any large progression).
Proof: Let be a (standard) set of positive upper density which contains no progression of length
. Let
be the asymptotic maximal density of
inside a long progression, thus
. For any
, one can then find a standard integer
and a standard subset
of
of density at least
such that
can be embedded (after a linear transformation) inside
, so in particular
has no progressions of length
. Applying the saturation property, one can then find an unbounded
and a set
of
of density at least
for every standard
(i.e. of density at least
) with no progressions of length
. By construction, we also see that for any subprogression
of
of unbounded size,
hs density at most
for any standard
, thus has density at most
, and the claim follows.
This can be used as the starting point for any density-increment based proof of Szemerédi’s theorem for a fixed , e.g. Roth’s proof for
, Gowers’ proof for arbitrary
, or Szemerédi’s proof for arbitrary
. (It is likely that Szemerédi’s proof, in particular, simplifies a little bit when translated to the non-standard setting, though the savings are likely to be modest.)
I’m also hoping that the recent results of Hrushovski on the noncommutative Freiman problem require only countable saturation, as this makes it more likely that they can be translated to a non-standard setting and thence to a purely finitary framework.
Let be a finite subset of a non-commutative group
. As mentioned previously on this blog (as well as in the current logic reading seminar), there is some interest in classifying those
which obey small doubling conditions such as
or
. A full classification here has still not been established. However, I wanted to record here an elementary argument (based on Exercise 2.6.5 of my book with Van Vu, which in turn is based on this paper of Izabella Laba) that handles the case when
is very close to
:
Proposition 1 If
, then
and
are both finite groups, which are conjugate to each other. In particular,
is contained in the right-coset (or left-coset) of a group of order less than
.
Remark 1 The constant
is completely sharp; consider the case when
where
is the identity and
is an element of order larger than
. This is a small example, but one can make it as large as one pleases by taking the direct product of
and
with any finite group. In the converse direction, we see that whenever
is contained in the right-coset
(resp. left-coset
) of a group of order less than
, then
(resp.
) is necessarily equal to all of
, by the inclusion-exclusion principle (see the proof below for a related argument).
Proof: We begin by showing that is a group. As
is symmetric and contains the identity, it suffices to show that this set is closed under addition.
Let . Then we can write
and
for
. If
were equal to
, then
and we would be done. Of course, there is no reason why
should equal
; but we can use the hypothesis
to boost this as follows. Observe that
and
both have cardinality
and lie inside
, which has cardinality strictly less than
. By the inclusion-exclusion principle, this forces
to have cardinality greater than
. In other words, there exist more than
pairs
such that
, which implies that
. Thus there are more than
elements
such that
for some
(since
is uniquely determined by
); similarly, there exists more than
elements
such that
for some
. Again by inclusion-exclusion, we can thus find
in
for which one has simultaneous representations
and
, and so
, and the claim follows.
In the course of the above argument we showed that every element of the group has more than
representations of the form
for
. But there are only
pairs
available, and thus
.
Now let be any element of
. Since
, we have
, and so
. Conversely, every element of
has exactly
representations of the form
where
. Since
occupies more than half of
, we thus see from the inclusion-exclusion principle, there is thus at least one representation
for which
both lie in
. In other words,
, and the claim follows.
To relate this to the classical doubling constants , we first make an easy observation:
Again, this is sharp; consider equal to
where
generate a free group.
Proof: Suppose that is an element of
for some
. Then the sets
and
have cardinality
and lie in
, so by the inclusion-exclusion principle, the two sets intersect. Thus there exist
such that
, thus
. This shows that
is contained in
. The converse inclusion is proven similarly.
Proposition 3 If
, then
is a finite group of order
, and
for some
in the normaliser of
.
The factor is sharp, by the same example used to show sharpness of Proposition 1. However, there seems to be some room for further improvement if one weakens the conclusion a bit; see below the fold.
Proof: Let (the two sets being equal by Lemma 2). By the argument used to prove Lemma 2, every element of
has more than
representations of the form
for
. By the argument used to prove Proposition 1, this shows that
is a group; also, since there are only
pairs
, we also see that
.
Pick any ; then
, and so
. Because every element of
has
representations of the form
with
,
, and
occupies more than half of
and of
, we conclude that each element of
lies in
, and so
and
.
The intersection of the groups and
contains
, which is more than half the size of
, and so we must have
, i.e.
normalises
, and the proposition follows.
Because the arguments here are so elementary, they extend easily to the infinitary setting in which is now an infinite set, but has finite measure with respect to some translation-invariant Kiesler measure
. We omit the details. (I am hoping that this observation may help simplify some of the theory in that setting.)
This week, Henry Towsner concluded his portion of reading seminar of the Hrushovski paper, by discussing (a weaker, simplified version of) main model-theoretic theorem (Theorem 3.4 of Hrushovski), and described how this theorem implied the combinatorial application in Corollary 1.2 of Hrushovski. The presentation here differs slightly from that in Hrushovski’s paper, for instance by avoiding mention of the more general notions of S1 ideals and forking.
Here is a collection of resources so far on the Hrushovski paper:
- Henry Towsner’s notes (which most of Notes 2-4 have been based on);
- Alex Usvyatsov’s notes on the derivation of Corollary 1.2 (broadly parallel to the notes here);
- Lou van den Dries’ notes (covering most of what we have done so far, and also material on stable theories); and
- Anand Pillay’s sketch of a simplified proof of Theorem 1.1.
Recent Comments