Semilinear dispersive and wave equations, of which the defocusing nonlinear wave equation
is a fixed exponent, and
is a scalar field), can be viewed as a “tug of war” between a linear dispersive equation, in this case the linear wave equation
If the nonlinear term was not present, leaving only the dispersive equation (2), then as the term “dispersive” suggests, in the asymptotic limit , the solution
would spread out in space and decay in amplitude. For instance, in the model case when
and the initial position
vanishes (leaving only the initial velocity
as non-trivial initial data), the solution
for
is given by the formula
where is surface measure on the sphere
. (To avoid technical issues, let us restrict attention to classical (smooth) solutions.) Thus, if the initial velocity was bounded and compactly supported, then the solution
would be bounded by
and would thus would decay uniformly to zero as
. Similar phenomena occur for all dimensions greater than
.
Conversely, if the dispersive term was not present, leaving only the ODE (3), then one no longer expects decay; indeed, given the conserved energy for the ODE (3), we do not expect any decay at all (and indeed, solutions are instead periodic in time for each fixed
, as can easily be seen by viewing the ODE (and the energy curves) in phase space).
Depending on the relative “size” of the dispersive term and the nonlinear term
, one can heuristically describe the behaviour of a solution
at various positions at times as either being dispersion dominated (in which
), nonlinearity dominated (in which
), or contested (in which
,
are comparable in size). Very roughly speaking, when one is in the dispersion dominated regime, then perturbation theory becomes effective, and one can often show that the solution to the nonlinear equation indeed behaves like the solution to the linear counterpart, in particular exhibiting decay as
. In principle, perturbation theory is also available in the nonlinearity dominated regime (in which the dispersion is now viewed as the perturbation, and the nonlinearity as the main term), but in practice this is often difficult to apply (due to the nonlinearity of the approximating equation and the large number of derivatives present in the perturbative term), and so one has to fall back on non-perturbative tools, such as conservation laws and monotonicity formulae. The contested regime is the most interesting, and gives rise to intermediate types of behaviour that are not present in the purely dispersive or purely nonlinear equations, such as solitary wave solutions (solitons) or solutions that blow up in finite time.
In order to analyse how solutions behave in each of these regimes rigorously, one usually works with a variety of function spaces (such as Lebesgue spaces and Sobolev spaces
). As such, one generally needs to first establish a number of function space estimates (e.g. Sobolev inequalities, Hölder-type inequalities, Strichartz estimates, etc.) in order to study these equations at the formal level.
Unfortunately, this emphasis on function spaces and their estimates can obscure the underlying physical intuition behind the dynamics of these equations, and the field of analysis of PDE sometimes acquires a reputation for being unduly technical as a consequence. However, as noted in a previous blog post, one can view function space norms as a way to formalise the intuitive notions of the “height” (amplitude) and “width” (wavelength) of a function (wave).
It turns out that one can similarly analyse the behaviour of nonlinear dispersive equations on a similar heuristic level, as that of understanding the dynamics as the amplitude and wavelength
(or frequency
) of a wave. Below the fold I give some examples of this heuristic; for sake of concreteness I restrict attention to the nonlinear wave equation (1), though one can of course extend this heuristic to many other models also. Rigorous analogues of the arguments here can be found in several places, such as the book of Shatah and Struwe, or my own book on the subject.
— 1. Bump functions —
To initiate the heuristic analysis, we make the assumption that any given time , the wave
“resembles” (or is “dominated” by) a bump function
, some phase
, some frequency
, and some position
, where
is a bump function. We will leave the terms “resembles” and “dominated” deliberately vague;
might not consist entirely of this bump function, but could instead be a superposition of multiple components, with this bump function being the “strongest” of these components in some sense. It is of course possible for a solution to concentrate its mass and energy in a different configuration than a bump function; but experience has shown that the most nonlinear behaviour tends to occur when such a concentration occurs, and so this ansatz is expected to capture the “worst-case” behaviour of the solution. (Basically, if a wave splits its energy into too many distinct components, then the nonlinear effects of each component become quite weak, even when superimposed back together again.) In particular, this type of bump function dominance is often seen when the solution exhibits soliton or near-soliton like behaviour, and often occurs shortly prior to blowup (especially for equations with critical nonlinearity). There are a variety of tools to formalise these sorts of intuitions, such as concentration-compactness and the induction-on-energy method, but we will not focus on these tools here.
(One can also refine the above ansatz in a number of ways, for instance by also introducing a frequency modulation , which is particularly important in models such as the mass-critical NLS which admit a frequency modulation symmetry, but for simplicity we will not consider this more complicated situation here.)
For this analysis, we shall ignore the role of the phase and position
, focusing instead on the amplitude
and frequency
. This collapses the infinite numbers of degrees of freedom for the wave
down to just two degrees of freedom. Of course, there is a significant amount of information lost when performing this collapse – in particular, the exact PDE (1) will no longer retain its deterministic form when projected to these two coordinates – but one can still discern non-trivial features of the original dynamics from this two-dimensional viewpoint.
With the ansatz (4), the solution has magnitude comparable
to a ball of radius roughly
. As a consequence, the nonlinearity
will have magnitude about
on this ball. Meanwhile, the dispersive term
would be expected to have magnitude about
(using a crude “rise-over-run” interpretation of the derivative, or else just computing the Laplacian of (4) explicitly). We thus expect dispersion dominant behaviour when
, or in other words when
, or in other words when
is comparable to
, or in other words when
The evolution of the parameters is partly constrained by a variety of conservation laws and monotonicity formulae. Consider for instance the energy conservation law, which asserts that the energy
is conserved in time. Inserting the ansatz (4) into the right-hand side, we obtain the heuristic bound
(We only write an upper bound here for the left-hand side, and not a lower bound, to allow for the possibility that most of the energy of the wave is not invested in the bump function (4), but is instead dispersed elsewhere.) This gives us the a priori bounds
The bounds (8) can be viewed as describing a sort of “energy surface” that the parameters ,
can vary in.
It is instructive to see how these bounds interact with the criteria (5), (6), (7), for various choices of dimension and exponent
. Let us first see what happens in a supercritical setting, such as
and
, with bounded energy
. In this case, the energy conservation law gives the bounds
Meanwhile, the threshold between dispersive behaviour and nonlinear behaviour is when . We can illustrate this by performing a log-log plot between
and
:
The region below the dotted line corresponds to dispersion-dominated behaviour, and the region above corresponds to nonlinearity-dominated behaviour. The region below the solid line corresponds to the possible values of amplitude and frequency that are permitted by energy conservation.
This diagram illustrates that for low frequencies , the energy constraint ensures dispersive behaviour; but for high frequencies
, one can venture increasingly far into the nonlinearity dominated regime while still being consistent with energy conservation. In particular, energy conservation does not prevent a scenario in which the frequency and amplitude both increase to infinity in finite time, while staying inside the nonlinearity dominated regime. And indeed, global regularity for this supercritical equation is a notoriously hard open problem, analogous in many ways to the even more famous global regularity problem for Navier-Stokes (see my previous blog post on this topic).
In contrast, let us consider a subcritical setting, such as and
, again with bounded energy
. Now, the energy conservation law gives the bounds
while the threshold between dispersive behaviour and nonlinear behaviour is when . The log-log plot now becomes
We now see that for high frequencies , the energy constraint ensures dispersive behaviour; but conversely, for low frequencies
, one can have highly nonlinear behaviour. On the other hand, low frequencies cannot exhibit finite time blowup (as can be inferred from (9)); however, other non-dispersive scenarios exist, such as a soliton-type solution in which
is low-frequency but essentially constant in time, or a self-similar decay in which
and
go slowly to
as
, while staying out of the dispersion-dominated regime. Again, this is reflected in the known theory for this equation for large (but finite energy) data: global regularity is known (there is no blowup), but it is unknown whether the solution disperses like a linear solution in the limit
.
Finally, we look at a critical setting, in which and
. Here, energy conservation gives the bounds
and the threshold between dispersive and nonlinear behaviour is . Thus, when the energy
is small, one expects only dispersive behaviour; and when the energy is large, then both dispersive and contested behaviour (at both high and low frequencies) are consistent with energy conservation. The large energy case is depicted in the diagram below, with the solid line slightly above the dashed line; in the small energy case, the positions of the two lines are reversed.
Now, it turns out that for small energy, one indeed has global regularity and scattering (for both the defocusing nonlinearity and the focusing nonlinearity
), which is consistent with the above heuristics. For large energy, blowup can occur in the focusing case, but in the defocusing case what happens is that the solution can linger in the contested region for a finite period of time, but eventually the nonlinearity “concedes” to the dispersion, and the solution enters the dispersion-dominated regime and scatters. This cannot be deduced solely from energy conservation, but requires some additional inputs.
First, let us assume for contradiction that one never enters the dispersion-dominated regime, but instead remains in the contested regime throughout. Then from (9) we see that for any time
, the quantities
and
will not change in magnitude much in the time interval
. This means that one can subdivide time into intervals
, with
comparable to
on this time interval, and
comparable to
. The asymptotic behaviour of the solution is then encoded in the combinatorial structure of these intervals. For instance, a soliton-like solution would correspond to a string of intervals
, all of roughly the same size, while a finite time blowup would correspond to a shrinking sequence of intervals converging to the blowup time, whereas a slow decay at infinity would be represented by a sequence of intervals of increasing length going off to infinity. (See Chapter 5 of my book for various depictions of these bubble evolutions.)
One can also take a spacetime view instead of a temporal view, and view the solution as a string of spacetime “bubbles”, each of which has some lifespan and amplitude comparable to
, and lives on a spacetime cube of sidelength comparable to
. If the number of bubbles is finite, then the nonlinearity eventually concedes and one has dispersive behaviour; if instead the number of bubbles is infinite, then one has contested behaviour that can lead either to finite time blowup or infinite time blowup (where blowup is defined here as failure of dispersion to dominate asymptotically, rather than formation of a singularity). While “number of bubbles” is not a precise quantity, in the rigorous theory of the critical NLW, one uses more quantitative expressions, such as the
norm
or variants such as , as proxies for this concept. Note that each bubble contributes an amount comparable to unity to each of the above expressions. This may help explain why obtaining bounds for these types of norms is so key to establishing global regularity and scattering for this equation.
The next ingredient is the Morawetz inequality
which can be established by an integration by parts argument. This inequality is easiest to exploit in the model case of spherical symmetry. To be consistent with the ansatz (4), we must have in this case. We then have
and so
Each bubble of contested dynamics contributes roughly a unit amount to the integral on the left, and so the Morawetz inequality bounds the total number of bubbles and thus is a mechanism for forcing dispersive behaviour asymptotically.
In the non-spherically symmetric case, the position of the bubble can vary. However, finite speed of propagation heuristics indicate that this position cannot move faster than the speed of light, which is normalised to be
, thus
. (The situation is more complicated if one generalises the ansatz (4) to allow for the solution
to consist of a superposition of several bump functions at several different places for each point in time. However, for the critical equation and in the contested regime, each bump function absorbs an amount of energy bounded from below, and so there can only be a bounded number of such bumps existing at any given time; as such, one should morally be able to decompose the evolution into independent “particle-like” components, each of which obeys finite speed of propagation.) The Morawetz inequality then instead gives bounds such as
In the model case when stay sroughly bounded,
can only grow at most linearly in
, and the logarithmic divergence of the integral
at infinity then again forces the number of bubbles to be finite (but this time the bound is exponential in the energy, rather than polynomial; see this paper of mine, as well as this earlier paper of Nakanishi, for further discussion.
One can extend this heuristic analysis to explain why the global regularity results for the energy-critical equation can extend very slightly to the supercritical regime, and in particular (in the spherically symmetric case) to the logarithmically supercritical equation
as was done in this paper of mine (and a non-spherically symmetric analogue, with a double logarithmically supercritical nonlinearity, by Roy). This equation behaves more or less identically to the critical NLW for low frequencies , but exhibits slightly different behaviour for high frequencies
. In this regime, the dividing line between dispersive and nonlinear behaviour is now
. Meanwhile, the energy bounds (assuming bounded energy) now give
so that there is now a logarithmically wide window of opportunity for nonlinear behaviour at high frequencies.
The energy bounds also give
from (9), but we can do a little bit better if we invoke the heuristic of equipartition of energy, which states that the kinetic portion of the energy is roughly in balance with the potential portion
(where
is the antiderivative of
). There are several ways to make this heuristic precise; one is to start with the identity
which suggests (together with the fundamental theorem of calculus) that the right-hand side should average out to zero after integration in time. Using this heuristic, one is soon led to the slight improvement
of the previous bound.
The contested regions of the evolution then break up into bubbles in spacetime, each of which has some length and lifespan comparable to , and amplitude
comparable to
(in the high-frequency case
).
In contrast, the Morawetz inequality for this equation asserts that
and amplitude
with
contributes about
to the integral in (10). This quantity goes to zero as
, but very slowly; in particular, as
increases to infinity along dyadic scales
, the sum
is divergent, which explains why the nonlinearity cannot sustain an infinite chain of such bubbles. (It also suggests that perhaps the logarithmic supercriticality is not quite the right threshold here, and that some slight improvement might be possible in this regard; I believe this is being looked into right now.)




9 comments
Comments feed for this article
2 April, 2010 at 8:49 pm
timur
Thanks a lot!
In paragraph 3 of “Bump functions”, should phi be theta? And after (7), it seems that “r” is missing in “constained”.
[Corrected, thanks - T.]
3 April, 2010 at 6:00 am
Sujit
The amplitude/frequency constraint given by (8) does not treat the amplitude and frequency equally. The frequency can blow up and the amplitude can remain bounded while still preserving energy. Whereas it seems from (8) that one cannot have a situation where amplitude blows up and frequency remains bounded while preserving energy. It looks like when amplitude blows up, the frequency also has to blow up so that the integral of the amplitude multiplied with a highly oscillatory part somehow stays bounded.
Is this true or am I just reading something wrong?
3 April, 2010 at 7:09 am
Terence Tao
A scenario in which the amplitude remains bounded and the frequency goes off to infinity (and the width of the wave, i.e. the wavelength, shrinks to zero) is compatible with energy conservation; but will not actually occur in practice because one would soon enter the dispersion-dominated region (5), at which point the solution would disperse instead (at which point the ansatz (4) will be abandoned).
This is reflected in the classical fact (proven using the energy method) that smooth solutions stay smooth as long as they stay bounded in the
norm.
10 April, 2010 at 7:48 am
Akmal
Hi Dr. Tao,
I am a Computational Mathematician, and completed some 82 units of PhD level courseworks related to Computational Mathematics. At my previous school (San Diego State Univ.), I once took a course with an Erdos no. 2 (Dr. Peter Salamon), an he gave me an inspiration to pursue more studies in Pure Mathematics.
I’d like to gain more insight into PDE and analysis, and realized this is also your interest. However, I do not have a background in pure mathematics; the only course I ever took that is relevant is Advanced Calculus (intro to set theory and topology, pretty much).
Therefore I’m designing my own track in order to pursue studies in PDE and analysis by starting from:
1. Morash’s Bridge to Abstract Mathematics
2. Mendelson’s Intro. To Topology
3. Simmons’s Intro. To Topology and Modern Analysis
4. Bridge’s Real Analysis (A Constructive Approach)
5. Cohn’s Measure Theory
6. Rudin’s Real and Complex Analysis
7. Evans’s Partial Differential Equations (Grad Studies in Math)
8. Folland’s Introduction to Partial Differential Equations.
If you or anybody else here can give me an advice or critique about the way I approach this, I really appreciate your kind help.
Thank you in advance Dr. Tao
Best wishes,
Akmal
10 April, 2010 at 7:55 am
Akmal
correction: my apology, the professor I mentioned (Dr. Peter Salamon) has an Erdos Number of 1, not 2.
11 April, 2010 at 12:35 pm
Anonymous
Dear Terry,
Request: Please consider an expository post on global climate modeling, a topic with close connections with nonlinear PDE, as well
as other subjects of interest to you, such as probability and decision theory. If you have anything particularly insightful to say about this topic, many of us would greatly enjoy reading it.
21 May, 2011 at 11:31 am
Anonymous
In the heuristic bound before equation 8, why is there no term where N is differentiated in the t-variable?
21 May, 2011 at 2:38 pm
Terence Tao
One can certainly add such terms to the heuristics if desired, giving some predictions as to how fast N(t) will vary in time. But these bounds are not relevant for the discussion here as we are not trying to investigate the specific dynamics of N(t) in time. (But if one was interested in, say, blowup rates for such equations, then these heuristic computations would become relevant.)
4 August, 2011 at 6:53 pm
Localisation and compactness properties of the Navier-Stokes global regularity problem « What’s new
[...] one can guess at the existence of such a bound by using the amplitude-frequency heuristics, discussed in this post. If the Navier-Stokes solution has an amplitude and a frequency for a period of time over a [...]