Jim Colliander, Mark Keel, Gigliola Staffilani, Hideo Takaoka, and I have just uploaded to the arXiv the paper “Weakly turbulent solutions for the cubic defocusing nonlinear Schrödinger equation“, which we have submitted to Inventiones Mathematicae. This paper concerns the numerically observed phenomenon of weak turbulence for the periodic defocusing cubic non-linear Schrödinger equation
(1)
in two spatial dimensions, thus u is a function from to
. This equation has three important conserved quantities: the mass
the momentum
and the energy
.
(These conservation laws, incidentally, are related to the basic symmetries of phase rotation, spatial translation, and time translation, via Noether’s theorem.) Using these conservation laws and some standard PDE technology (specifically, some Strichartz estimates for the periodic Schrödinger equation), one can establish global wellposedness for the initial value problem for this equation in (say) the smooth category; thus for every smooth there is a unique global smooth solution
to (1) with initial data
, whose mass, momentum, and energy remain constant for all time.
However, the mass, momentum, and energy only control three of the infinitely many degrees of freedom available to a function on the torus, and so the above result does not fully describe the dynamics of solutions over time. In particular, the three conserved quantities inhibit, but do not fully prevent the possibility of a low-to-high frequency cascade, in which the mass, momentum, and energy of the solution remain conserved, but shift to increasingly higher frequencies (or equivalently, to finer spatial scales) as time goes to infinity. This phenomenon has been observed numerically, and is sometimes referred to as weak turbulence (in contrast to strong turbulence, which is similar but happens within a finite time span rather than asymptotically).
To illustrate how this can happen, let us normalise the torus as . A simple example of a frequency cascade would be a scenario in which solution
starts off at a low frequency at time zero, e.g.
for some constant amplitude A, and ends up at a high frequency at a later time T, e.g.
for some large frequency N. This scenario is consistent with conservation of mass, but not conservation of energy or momentum and thus does not actually occur for solutions to (1). A more complicated example would be a solution supported on two low frequencies at time zero, e.g.
, and ends up at two high frequencies later, e.g.
. This scenario is consistent with conservation of mass and momentum, but not energy. Finally, consider the scenario which starts off at
and ends up at
. This scenario is consistent with all three conservation laws, and exhibits a mild example of a low-to-high frequency cascade, in which the solution starts off at frequency N and ends up with half of its mass at the slightly higher frequency
, with the other half of its mass at the zero frequency. More generally, given four frequencies
which form the four vertices of a rectangle in order, one can concoct a similar scenario, compatible with all conservation laws, in which the solution starts off at frequencies
and propagates to frequencies
.
One way to measure a frequency cascade quantitatively is to use the Sobolev norms for
; roughly speaking, a low-to-high frequency cascade occurs precisely when these Sobolev norms get large. (Note that mass and energy conservation ensure that the
norms stay bounded for
.) For instance, in the cascade from
to
, the
norm is roughly
at time zero and
at time T, leading to a slight increase in that norm for
. Numerical evidence then suggests the following
Conjecture. (Weak turbulence) There exist smooth solutions
to (1) such that
goes to infinity as
for any
.
We were not able to establish this conjecture, but we have the following partial result (“weak weak turbulence”, if you will):
Theorem. Given any
, there exists a smooth solution
to (1) such that
and
for some time T.
This is in marked contrast to (1) in one spatial dimension , which is completely integrable and has an infinite number of conservation laws beyond the mass, energy, and momentum which serve to keep all
norms bounded in time. It is also in contrast to the linear Schrödinger equation, in which all Sobolev norms are preserved, and to the non-periodic analogue of (1), which is conjectured to disperse to a linear solution (i.e. to scatter) from any finite mass data (see this earlier post for the current status of that conjecture). Thus our theorem can be viewed as evidence that the 2D periodic cubic NLS does not behave at all like a completely integrable system or a linear solution, even for small data. (An earlier result of Kuksin gives (in our notation) the weaker result that the ratio
can be made arbitrarily large when
, thus showing that large initial data can exhibit movement to higher frequencies; the point of our paper is that we can achieve the same for arbitrarily small data.) Intuitively, the problem is that the torus is compact and so there is no place for the solution to disperse its mass; instead, it must continually interact nonlinearly with itself, which is what eventually causes the weak turbulence.
As mentioned earlier, the three conservation laws of mass, momentum, and energy work to inhibit the low-to-high frequency cascade necessary to prove the theorem. For instance, one can easily show that any cascade involving three or fewer frequencies is morally incompatible with the three conservation laws (since three equations in three unknowns gives basically just one possible value for the amplitudes at the three frequencies). The simplest cascade is the four-wave model discussed earlier, in which four frequencies at the corners of a rectangle transfer their mass, energy, and momentum from two diagonally opposite corners to the other two diagonally opposite corners. One can in fact rigorously show that such four-wave cascades (plus negligible noise terms) can actually occur in (1) in the high frequency limit, by using perturbation theory to eliminate all frequencies except the four frequencies of the rectangle (thus reducing the PDE to an ODE), using symmetry to assume that diagonally opposite frequencies have the same Fourier coefficients, and then using the conservation laws to reduce the dimension of the ODE down to a scalar first-order ODE that can be solved more or less exactly. The dynamics of the resulting solution (for the model ODE, rather than the PDE that it approximates) resembles an “Arnold whisker” in the theory of Arnold diffusion – it asymptotically converges to a “two-wave” periodic orbit based on one diagonal of the rectangle as , and to a two-wave periodic orbit on the other diagonal as
, lying in the unstable manifold of the former orbit and stable manifold of the latter.
This four-wave solution can be constructed from arbitrarily small initial data, but it only gets us a little way towards our theorem, as it only bumps up the norm by a factor of about
. To establish the theorem, we needed to chain a lot of these four-wave solutions together, in the spirit of the classic paper of Arnold that established the possibility of diffusion from one invariant torus in a certain Hamiltonian system to another. In terms of the approximating ODE (which was now more complicated, although still manageable thanks to symmetry reductions), the main task was to concoct a solution that started out near one invariant torus, bounced towards another torus, did a ricochet off of that to the next torus, and so on until one reached a torus extremely far away from the original torus. The best way we could describe this procedure succinctly was by citing a classic Super Bowl commercial for McDonald’s featuring Michael Jordan and Larry Bird:
(This citation, of course, appears in the bibliography to the paper, like any other reference.)
Actually showing such a solution exists turned out to require a very delicate stability and control theory analysis, in which stable, neutral, and unstable directions were treated in very different ways. As one might imagine, the initial data has to be extremely special in order to obtain a multi-ricochet solution.
Once such a solution for the approximating ODE is constructed, one still has to embed it somehow back into the PDE (1). Here one faces a combinatorial and number-theoretic problem of being able to place a large number of frequencies in the lattice , such that certain specific quadruples of these frequencies form rectangles, and all other quadruples do not. [The frequencies turn out to be organised in a series of "generations", in which a pair of "parent" frequencies from each generation pass on their mass to a pair of "child" frequencies to the next generation, with one child inheriting virtually all of the energy, and the other child inheriting close to nothing; the children in that generation then pair up and become parents themselves, and so forth. In the first generation, all frequencies initially have the same energy (having comparable magnitudes and amplitudes), but by the end of the process, almost all the energy in the last generation is concentrated in a single, extremely large, frequency.] As is well known, the problem of constructing rectangles in the lattice is closely related to that of constructing Pythagorean triples
, and so our construction ultimately relies on some basic facts about those triples (basically, we need that the unit vectors
arising from Pythagorean triples are dense on the unit circle).
There are still some further directions to pursue here. Most obviously, the conjecture stated above remains open; the perturbation theory we use stops working after a long, but finite, amount of time, which has prevented us so far from resolving the conjecture, though the task may not be completely hopeless with current technology. The other thing is that the solutions we construct for our Theorem, much like the shots in the above video, are extremely special, and do not really indicate what is going on for generic data. The only non-trivial result we have for such data is that they do not scatter to a linear solution even after quotienting out by phase rotations, unless they are one of the explicit “one-wave” solutions . This is done by a simple compactness argument involving, of all things, Liouville’s theorem that harmonic functions of polynomial growth are in fact polynomial.

13 comments
Comments feed for this article
15 August, 2008 at 3:07 pm
Anonymous
Dear Professor Tao,
Will you begin accepting requests for items/persons that we would like to see cited in your research papers? :-D
17 August, 2008 at 5:55 am
PDEbeginner
Dear Prof. Tao,
Thank you very much for your so nice blog!
I have several questions (probably stupid) about this article:
1. Now a lot of people are working on NLS, what is the motivation for studying NLS?
2. Under the Fourier transform, we obtain a very complex ODE from NLS. My understanding is you construct in this article a very special solution according to the three conservation laws, do we have any other solutions with the three conservation laws holding?
3. Is the stable directions in the article is the high frequency Fourier modes? (My understanding is when Fourier modes are very high, the linear part (Laplacian) could probably control the nonlinear terms).
Thanks a lot,
Yours sincerely,
PDEbeginner
17 August, 2008 at 8:41 am
Terence Tao
Dear PDEbeginner,
I think there are two main reasons NLS is studied. Firstly, it is one of the simplest and most symmetric models of nonlinear dispersion and soliton formation, and it has turned out that many insights, methods, and techniques were first developed for NLS and then later generalised to other models. Secondly, NLS is an approximating model for a variety of systems in physics, most notably Bose-Einstein condensates but also various plasma fields and meson fields.
There are of course an infinite-dimensional family of solutions to NLS, wandering around in an infinite dimensional phase space; the three conservation laws cut down the dimension of this space by three, but this still leaves a lot of room to move around. But if one can restrict the number of active frequencies to a sufficiently small number, and also impose as much symmetry as one can, then one can reduce to a reasonably manageable ODE, with relatively few degrees of freedom, to describe the evolution. This is what we do in our paper, but even within that simplified dynamics, the multi-ricochet solution is extremely special, with initial data lying in a very small (but open) portion of phase space.
We don’t understand very well what happens for general solutions, though Bourgain and Kuksin (and later Tzvetkov) have constructed invariant measures for the flow here, which may well capture the “generic” behaviour of the solution (though very little is known as to how unique these measures are, and whether they are attractors in any sense).
It is true in general that for subcritical equations (such as 2D cubic NLS), the high frequency components of the solution act in a nearly linear manner for short periods of time, and in non-periodic settings these components tend to disperse to infinity, escaping the region of nonlinear effects, and thus scatter to a linear solution for large times. In the periodic setting, though, dispersion to infinity is impossible, and the high-frequency components have to stick around and interact with other frequencies indefinitely. As such, they will eventually experience nonlinear effects, although it can take some time for this to happen. (In our model, we have components with a large frequency ~N and a small amplitude
(in order to keep the
norm bounded), and it takes about
units of time before nonlinear effects begin to make their presence felt.) Because of this, we end up renormalising away the dispersive effects of the linear evolution when transforming from the PDE to the ODE, and indeed the latter only exhibit the “resonant” effects of the nonlinearity and no linear effects whatsoever. So the stability and instability is caused purely by the nonlinearity. A toy “two-generation” example of this ODE, by the way, is the
system
There are two special periodic solutions here,
and
, which trace out two invariant tori (or just invariant circles, actually). But it turns out that there is a “slider” solution
where
is a cube root of unity, which is repelled exponentially fast away from the first invariant torus and is attracted exponentially fast to the second torus, lying as it does in the unstable manifold of the latter and stable manifold of the former. This slider solution forms the model for one “hop” of our eventual multi-hop solution from one torus to the next that exhibits the turbulence, though we have to perturb this solution in just the right way in order to set up the “shot”.
17 August, 2008 at 11:11 am
PDEbeginner
Dear Prof. Tao,
Thank you very much for your so detailed answer!
I have carefully read the answer, but still have some problems:
1. Did Bourgain and Kuksin construct the invariant measure on some subspace of
? (Because you asked if the invariant measures are attractors) .
2. 2D NLS is subcritical in the Sobolev space of
? It seems it is critical in
.
3. I still didn’t understand the time
for the nonlinearity made felt. I was wondering if you could explain it in a little more detail or give some literatures to understand it.
4. There seems some problems for the toy models:
(1) the equations have no interacting terms.
(2) the two special solutions don’t solve the second equation (it seems we need to change the equations and add some interacting terms)
(3) for the ’slider solution’, when
,
, shall we change it as
?
17 August, 2008 at 2:40 pm
Terence Tao
Dear PDEbeginner,
There were some sign errors in my earlier comment which I think I have corrected now.
Regarding subcriticality, this is with respect to the regularity class of the initial data, in this case
. To see the time scale
emerging from initial data of frequency N and amplitude
, e.g.
, observe that the cubic nonlinearity
is about
as large as u itself, and so by inspecting the NLS equation we expect the nonlinear effects to become significant by time
. In this particular case we can also see this time scale by comparing the linear solution
from this data with the nonlinear solution
.
Regarding what it means for a measure to be an attractor, this is a little different from what it means for a set to be an attractor, and would presumably be stated in some sort of statistical mechanics sense, for instance one might start with a random ensemble of data, evolve it with some stochastic forcing term, and ask whether the probability distribution of that ensemble converges (in some suitable topology) to one of the standard invariant measures. This type of result is already difficult at the level of ODE and is probably a little bit beyond our current technology for dispersive PDE, though perhaps not hopelessly so.
18 August, 2008 at 11:23 am
Anonymous
Dear Prof. Tao,
Now I understand much more on the toy model and the time scale. Thank you so much for your help and your patience!
21 August, 2008 at 4:06 pm
Peter
Is there any hope of taking some kind of limit of solutions with a finite number of ricochets to obtain a solution with an infinite number of ricochets?
25 August, 2008 at 12:58 pm
Terence Tao
Dear Peter,
That’s a good question. At the level of the model ODE, I think we can construct a solution which has an infinite number of richochets (over an infinite period of time), but it is extremely unstable, and as such I don’t think we can use perturbation theory to create a corresponding solution to the PDE with infinitely many ricochets. (Also, the number theoretic portion of the argument only allows one to embed in finitely many dimensions of the ODE into the PDE at a time; if one wants more ricochets, one has to use a different embedding involving many more (and much larger) frequencies.)
It’s perhaps possible that taking some sort of superposition of the finitely richocheting solutions, that one could create a solution that is weakly turbulent, but a nontrivial amount of perturbation theory would still be needed as the underlying equation is nonlinear and so does not perfectly obey the principle of superposition.
12 September, 2008 at 6:24 am
Blogging Professors: Big Boffins with Blogs « O’Really? at Duncan.Hull.name
[...] What’s new [...]
15 September, 2008 at 5:35 am
M. Majdoub
Dear Prof. Tao,
I have two questions:
1. Have you an example where the conjecture is true?
2. It seems that your result “weak weak turbulence” still holds for other examples of NLS. What do you think?
Thanks a lot,
M. Majdoub
15 September, 2008 at 8:55 am
Terence Tao
Dear M. Majdoub,
There are slightly artificial examples of Hamiltonian systems that have been shown to exhibit weak turbulence properties, but I do not know of a “natural” dispersive PDE (such as NLS) for which this result is known (excepting of course those equations which can develop singularities in finite time). Nevertheless it is reasonable to expect that any natural dispersive PDE on bounded domains which is not completely integrable and which does not develop singularities in finite time, should exhibit weak turbulence for generic choices of initial data.
23 September, 2008 at 11:37 am
Chris Hillman
I find this all very interesting, and would particularly enjoy any further comments you might have on (i) Arnold diffusion (ii) dimensionality vs. completely integrable (iii) torus versus plane.
Minor nit: are you sure about the +1/4 in your energy density up above? Shouldn’t it be -1/4? I determined the conserved flux-density pairs for the 1D NLS (with one sign change which interchanges real and imaginary parts) following the methods given in Olver’s book and got exactly the results quoted on Dispersive Wiki (same as you wrote above) except for that sign. I did check that the divergence of each flux-density pair vanishes, of course. I would be surprised if 1D -> 2D changes the sign -1/4.
19 October, 2008 at 11:34 am
Recent Links Tagged With "solution" - JabberTags
[...] public links >> solution Weakly turbulent solutions for the cubic defocusing nonlinear Schr … Saved by ROC1993 on Sat 18-10-2008 The Spiritual Solution to Materialism Saved by RubberRuffy on [...]