You are currently browsing the monthly archive for May 2018.
I have just uploaded to the arXiv my paper “Commutators close to the identity“, submitted to the Journal of Operator Theory. This paper resulted from some progress I made on the problem discussed in this previous post. Recall in that post the following result of Popa: if are bounded operators on a Hilbert space
whose commutator
is close to the identity in the sense that
for some , then one has the lower bound
In the other direction, for any , there are examples of operators
obeying (1) such that
In this paper we improve the upper bound to come closer to the lower bound:
Theorem 1 For any
, and any infinite-dimensional
, there exist operators
obeying (1) such that
One can probably improve the exponent somewhat by a modification of the methods, though it does not seem likely that one can lower it all the way to
without a substantially new idea. Nevertheless I believe it plausible that the lower bound (2) is close to optimal.
We now sketch the methods of proof. The construction giving (3) proceeded by first identifying with the algebra
of
matrices that have entries in
. It is then possible to find two matrices
whose commutator takes the form
for some bounded operator (for instance one can take
to be an isometry). If one then conjugates
by the diagonal operator
, one can eusure that (1) and (3) both hold.
It is natural to adapt this strategy to matrices
rather than
matrices, where
is a parameter at one’s disposal. If one can find matrices
that are almost upper triangular (in that only the entries on or above the lower diagonal are non-zero), whose commutator
only differs from the identity in the top right corner, thus
for some , then by conjugating by a diagonal matrix such as
for some
and optimising in
, one can improve the bound
in (3) to
; if the bounds in the implied constant in the
are polynomial in
, one can then optimise in
to obtain a bound of the form (4) (perhaps with the exponent
replaced by a different constant).
The task is then to find almost upper triangular matrices whose commutator takes the required form. The lower diagonals of
must then commute; it took me a while to realise then that one could (usually) conjugate one of the matrices, say
by a suitable diagonal matrix, so that the lower diagonal consisted entirely of the identity operator, which would make the other lower diagonal consist of a single operator, say
. After a lot of further lengthy experimentation, I eventually realised that one could conjugate
further by unipotent upper triangular matrices so that all remaining entries other than those on the far right column vanished. Thus, without too much loss of generality, one can assume that
takes the normal form
for some , solving the system of equations
It turns out to be possible to solve this system of equations by a contraction mapping argument if one takes to be a “Hilbert’s hotel” pair of isometries as in the previous post, though the contraction is very slight, leading to polynomial losses in
in the implied constant.
There is a further question raised in Popa’s paper which I was unable to resolve. As a special case of one of the main theorems (Theorem 2.1) of that paper, the following result was shown: if obeys the bounds
(where denotes the space of all operators of the form
with
and
compact), then there exist operators
with
such that
. (In fact, Popa’s result covers a more general situation in which one is working in a properly infinite
algebra with non-trivial centre.) We sketch a proof of this result as follows. Suppose that
and
for some
. A standard greedy algorithm argument (see this paper of Brown and Pearcy) allows one to find orthonormal vectors
for
such that for each
, one has
for some
comparable to
, and some
orthogonal to all of the
. After some conjugation (and a suitable identification of
with
, one can thus place
in a normal form
where is a isometry with infinite deficiency, and
have norm
. Setting
, it then suffices to solve the commutator equation
with ; note the similarity with (3).
By the usual Hilbert’s hotel construction, one can complement with another isometry
obeying the “Hilbert’s hotel” identity
and also ,
. Proceeding as in the previous post, we can try the ansatz
for some operators , leading to the system of equations
Using the first equation to solve for , the second to then solve for
, and the third to then solve for
, one can obtain matrices
with the required properties.
Thus far, my attempts to extend this construction to larger matrices with good bounds on have been unsuccessful. A model problem would be to express
as a commutator with
significantly smaller than
. The construction in my paper achieves something like this, but with
replaced by a more complicated operator. One would also need variants of this result in which one is allowed to perturb the above operator by an arbitrary finite rank operator of bounded operator norm.
Previous set of notes: Notes 3.
Important note: As this is not a course in probability, we will try to avoid developing the general theory of stochastic calculus (which includes such concepts as filtrations, martingales, and Ito calculus). This will unfortunately limit what we can actually prove rigorously, and so at some places the arguments will be somewhat informal in nature. A rigorous treatment of many of the topics here can be found for instance in Lawler’s Conformally Invariant Processes in the Plane, from which much of the material here is drawn.
In these notes, random variables will be denoted in boldface.
Definition 1 A real random variable
is said to be normally distributed with mean
and variance
if one has
for all test functions
. Similarly, a complex random variable
is said to be normally distributed with mean
and variance
if one has
for all test functions
, where
is the area element on
.
A real Brownian motion with base pointis a random, almost surely continuous function
(using the locally uniform topology on continuous functions) with the property that (almost surely)
, and for any sequence of times
, the increments
for
are independent real random variables that are normally distributed with mean zero and variance
. Similarly, a complex Brownian motion with base point
is a random, almost surely continuous function
with the property that
and for any sequence of times
, the increments
for
are independent complex random variables that are normally distributed with mean zero and variance
.
Remark 2 Thanks to the central limit theorem, the hypothesis that the increments
be normally distributed can be dropped from the definition of a Brownian motion, so long as one retains the independence and the normalisation of the mean and variance (technically one also needs some uniform integrability on the increments beyond the second moment, but we will not detail this here). A similar statement is also true for the complex Brownian motion (where now we need to normalise the variances and covariances of the real and imaginary parts of the increments).
Real and complex Brownian motions exist from any base point or
; see e.g. this previous blog post for a construction. We have the following simple invariances:
Exercise 3
- (i) (Translation invariance) If
is a real Brownian motion with base point
, and
, show that
is a real Brownian motion with base point
. Similarly, if
is a complex Brownian motion with base point
, and
, show that
is a complex Brownian motion with base point
.
- (ii) (Dilation invariance) If
is a real Brownian motion with base point
, and
is non-zero, show that
is also a real Brownian motion with base point
. Similarly, if
is a complex Brownian motion with base point
, and
is non-zero, show that
is also a complex Brownian motion with base point
.
- (iii) (Real and imaginary parts) If
is a complex Brownian motion with base point
, show that
and
are independent real Brownian motions with base point
. Conversely, if
are independent real Brownian motions of base point
, show that
is a complex Brownian motion with base point
.
The next lemma is a special case of the optional stopping theorem.
Lemma 4 (Optional stopping identities)
- (i) (Real case) Let
be a real Brownian motion with base point
. Let
be a bounded stopping time – a bounded random variable with the property that for any time
, the event that
is determined by the values of the trajectory
for times up to
(or more precisely, this event is measurable with respect to the
algebra generated by this proprtion of the trajectory). Then
and
and
- (ii) (Complex case) Let
be a real Brownian motion with base point
. Let
be a bounded stopping time – a bounded random variable with the property that for any time
, the event that
is determined by the values of the trajectory
for times up to
. Then
Proof: (Slightly informal) We just prove (i) and leave (ii) as an exercise. By translation invariance we can take . Let
be an upper bound for
. Since
is a real normally distributed variable with mean zero and variance
, we have
and
and
By the law of total expectation, we thus have
and
and
where the inner conditional expectations are with respect to the event that attains a particular point in
. However, from the independent increment nature of Brownian motion, once one conditions
to a fixed point
, the random variable
becomes a real normally distributed variable with mean
and variance
. Thus we have
and
and
which give the first two claims, and (after some algebra) the identity
which then also gives the third claim.
Exercise 5 Prove the second part of Lemma 4.
This is the ninth “research” thread of the Polymath15 project to upper bound the de Bruijn-Newman constant , continuing this post. Discussion of the project of a non-research nature can continue for now in the existing proposal thread. Progress will be summarised at this Polymath wiki page.
We have now tentatively improved the upper bound of the de Bruijn-Newman constant to . Among the technical improvements in our approach, we now are able to use Taylor expansions to efficiently compute the approximation
to
for many values of
in a given region, thus speeding up the computations in the barrier considerably. Also, by using the heuristic that
behaves somewhat like the partial Euler product
, we were able to find a good location to place the barrier in which
is larger than average, hence easier to keep away from zero.
The main remaining bottleneck is that of computing the Euler mollifier bounds that keep bounded away from zero for larger values of
beyond the barrier. In going below
we are beginning to need quite complicated mollifiers with somewhat poor tail behavior; we may be reaching the point where none of our bounds will succeed in keeping
bounded away from zero, so we may be close to the natural limits of our methods.
Participants are also welcome to add any further summaries of the situation in the comments below.
Previous set of notes: Notes 2. Next set of notes: Notes 4.
We now approach conformal maps from yet another perspective. Given an open subset of the complex numbers
, define a univalent function on
to be a holomorphic function
that is also injective. We will primarily be studying this concept in the case when
is the unit disk
.
Clearly, a univalent function on the unit disk is a conformal map from
to the image
; in particular,
is simply connected, and not all of
(since otherwise the inverse map
would violate Liouville’s theorem). In the converse direction, the Riemann mapping theorem tells us that every open simply connected proper subset
of the complex numbers is the image of a univalent function on
. Furthermore, if
contains the origin, then the univalent function
with this image becomes unique once we normalise
and
. Thus the Riemann mapping theorem provides a one-to-one correspondence between open simply connected proper subsets of the complex plane containing the origin, and univalent functions
with
and
. We will focus particular attention on the univalent functions
with the normalisation
and
; such functions will be called schlicht functions.
One basic example of a univalent function on is the Cayley transform
, which is a Möbius transformation from
to the right half-plane
. (The slight variant
is also referred to as the Cayley transform, as is the closely related map
, which maps
to the upper half-plane.) One can square this map to obtain a further univalent function
, which now maps
to the complex numbers with the negative real axis
removed. One can normalise this function to be schlicht to obtain the Koebe function
which now maps to the complex numbers with the half-line
removed. A little more generally, for any
we have the rotated Koebe function
that is a schlicht function that maps to the complex numbers with the half-line
removed.
Every schlicht function has a convergent Taylor expansion
for some complex coefficients with
. For instance, the Koebe function has the expansion
and similarly the rotated Koebe function has the expansion
Intuitively, the Koebe function and its rotations should be the “largest” schlicht functions available. This is formalised by the famous Bieberbach conjecture, which asserts that for any schlicht function, the coefficients should obey the bound
for all
. After a large number of partial results, this conjecture was eventually solved by de Branges; see for instance this survey of Korevaar or this survey of Koepf for a history.
It turns out that to resolve these sorts of questions, it is convenient to restrict attention to schlicht functions that are odd, thus
for all
, and the Taylor expansion now reads
for some complex coefficients with
. One can transform a general schlicht function
to an odd schlicht function
by observing that the function
, after removing the singularity at zero, is a non-zero function that equals
at the origin, and thus (as
is simply connected) has a unique holomorphic square root
that also equals
at the origin. If one then sets
it is not difficult to verify that is an odd schlicht function which additionally obeys the equation
Conversely, given an odd schlicht function , the formula (4) uniquely determines a schlicht function
.
For instance, if is the Koebe function (1),
becomes
which maps to the complex numbers with two slits
removed, and if
is the rotated Koebe function (2),
becomes
De Branges established the Bieberbach conjecture by first proving an analogous conjecture for odd schlicht functions known as Robertson’s conjecture. More precisely, we have
Theorem 1 (de Branges’ theorem) Let
be a natural number.
- (i) (Robertson conjecture) If
is an odd schlicht function, then
- (ii) (Bieberbach conjecture) If
is a schlicht function, then
It is easy to see that the Robertson conjecture for a given value of implies the Bieberbach conjecture for the same value of
. Indeed, if
is schlicht, and
is the odd schlicht function given by (3), then from extracting the
coefficient of (4) we obtain a formula
for the coefficients of in terms of the coefficients of
. Applying the Cauchy-Schwarz inequality, we derive the Bieberbach conjecture for this value of
from the Robertson conjecture for the same value of
. We remark that Littlewood and Paley had conjectured a stronger form
of Robertson’s conjecture, but this was disproved for
by Fekete and Szegö.
To prove the Robertson and Bieberbach conjectures, one first takes a logarithm and deduces both conjectures from a similar conjecture about the Taylor coefficients of , known as the Milin conjecture. Next, one continuously enlarges the image
of the schlicht function to cover all of
; done properly, this places the schlicht function
as the initial function
in a sequence
of univalent maps
known as a Loewner chain. The functions
obey a useful differential equation known as the Loewner equation, that involves an unspecified forcing term
(or
, in the case that the image is a slit domain) coming from the boundary; this in turn gives useful differential equations for the Taylor coefficients of
,
, or
. After some elementary calculus manipulations to “integrate” this equations, the Bieberbach, Robertson, and Milin conjectures are then reduced to establishing the non-negativity of a certain explicit hypergeometric function, which is non-trivial to prove (and will not be done here, except for small values of
) but for which several proofs exist in the literature.
The theory of Loewner chains subsequently became fundamental to a more recent topic in complex analysis, that of the Schramm-Loewner equation (SLE), which is the focus of the next and final set of notes.
Recent Comments