Jon Bennett and I have just uploaded to the arXiv our paper “Adjoint Brascamp-Lieb inequalities”. In this paper, we observe that the family of multilinear inequalities known as the Brascamp-Lieb inequalities (or Hölder-Brascamp-Lieb inequalities) admits an adjoint formulation, and explore the theory of these adjoint inequalities and some of their consequences.
To motivate matters let us review the classical theory of adjoints for linear operators. If one has a bounded linear operator $T \colon L^p(X) \to L^q(Y)$ for some measure spaces $X, Y$ and exponents $1 \leq p, q \leq \infty$, then one can define an adjoint linear operator $T^* \colon L^{q'}(Y) \to L^{p'}(X)$ involving the dual exponents $p', q'$ (defined by $\frac{1}{p} + \frac{1}{p'} = \frac{1}{q} + \frac{1}{q'} = 1$), obeying (formally at least) the duality relation
$$\int_Y (Tf)\, g = \int_X f\, (T^* g) \qquad (1)$$
for suitable test functions $f, g$ on $X, Y$ respectively. Using the dual characterization
$$\|F\|_{L^{p'}(X)} = \sup \left\{ \left| \int_X F\, f \right| : \|f\|_{L^p(X)} \leq 1 \right\}$$
of $L^{p'}(X)$ (and similarly for $L^{q'}(Y)$), one can show that $T^*$ has the same operator norm as $T$.

There is a slightly different way to proceed using Hölder’s inequality. For sake of exposition let us make the simplifying assumption that $T$ (and hence also $T^*$) maps non-negative functions to non-negative functions, and ignore issues of convergence or division by zero in the formal calculations below. Then for any reasonable function $g$ on $Y$, we have
$$\|T^* g\|_{L^{p'}(X)}^{p'} = \int_X (T^* g)^{p'-1}\, (T^* g) = \int_Y T\big( (T^* g)^{p'-1} \big)\, g \leq \|T\|_{op} \big\| (T^* g)^{p'-1} \big\|_{L^p(X)} \|g\|_{L^{q'}(Y)} = \|T\|_{op} \|T^* g\|_{L^{p'}(X)}^{p'-1} \|g\|_{L^{q'}(Y)}$$
by (1) and Hölder; dividing out by $\|T^* g\|_{L^{p'}(X)}^{p'-1}$ we obtain $\|T^* g\|_{L^{p'}(X)} \leq \|T\|_{op} \|g\|_{L^{q'}(Y)}$, and a similar argument also recovers the reverse inequality.

The first argument also extends to some extent to multilinear operators. For instance, if one has a bounded bilinear operator $B \colon L^{p_1}(X_1) \times L^{p_2}(X_2) \to L^q(Y)$ for some exponents $1 \leq p_1, p_2, q \leq \infty$, then one can define adjoint bilinear operators $B^{*1} \colon L^{q'}(Y) \times L^{p_2}(X_2) \to L^{p_1'}(X_1)$ and $B^{*2} \colon L^{q'}(Y) \times L^{p_1}(X_1) \to L^{p_2'}(X_2)$ obeying the relations
$$\int_Y B(f_1, f_2)\, g = \int_{X_1} f_1\, B^{*1}(g, f_2)$$
and
$$\int_Y B(f_1, f_2)\, g = \int_{X_2} f_2\, B^{*2}(g, f_1),$$
with exactly the same operator norm as $B$. It is also possible, formally at least, to adapt the Hölder inequality argument to reach the same conclusion.

In this paper we observe that the Hölder inequality argument can be modified in the case of Brascamp-Lieb inequalities to obtain a different type of adjoint inequality. (Continuous) Brascamp-Lieb inequalities take the form
$$\int_{\mathbb{R}^d} \prod_{i=1}^k f_i(B_i x)^{c_i}\, dx \leq \mathrm{BL}(\mathbf{B}, \mathbf{c}) \prod_{i=1}^k \left( \int_{\mathbb{R}^{d_i}} f_i \right)^{c_i}$$
for various exponents $c_1, \dots, c_k \geq 0$ and surjective linear maps $B_i \colon \mathbb{R}^d \to \mathbb{R}^{d_i}$, where the $f_i \colon \mathbb{R}^{d_i} \to [0,+\infty)$ are arbitrary non-negative measurable functions and $\mathrm{BL}(\mathbf{B}, \mathbf{c})$ is the best constant for which this inequality holds for all such $f_i$. [There is also another inequality involving variances with respect to log-concave distributions that is also due to Brascamp and Lieb, but it is not related to the inequalities discussed here.] Well known examples of such inequalities include Hölder’s inequality and the sharp Young convolution inequality; another is the Loomis-Whitney inequality, the first non-trivial example of which is
$$\int_{\mathbb{R}^3} f_1(y,z)^{1/2} f_2(x,z)^{1/2} f_3(x,y)^{1/2}\, dx\, dy\, dz \leq \left( \int_{\mathbb{R}^2} f_1 \right)^{1/2} \left( \int_{\mathbb{R}^2} f_2 \right)^{1/2} \left( \int_{\mathbb{R}^2} f_3 \right)^{1/2} \qquad (2)$$
for all non-negative measurable $f_1, f_2, f_3 \colon \mathbb{R}^2 \to [0,+\infty)$. There are also discrete analogues of these inequalities, in which the Euclidean spaces $\mathbb{R}^d, \mathbb{R}^{d_i}$ are replaced by discrete abelian groups, and the surjective linear maps $B_i$ are replaced by discrete homomorphisms.

The operation of pulling back a function $f_i$ on $\mathbb{R}^{d_i}$ by the linear map $B_i$ to create a function $f_i \circ B_i$ on $\mathbb{R}^d$ has an adjoint pushforward map $(B_i)_*$, which takes a function $f$ on $\mathbb{R}^d$ and basically integrates it on the fibers of $B_i$ to obtain a “marginal distribution” $(B_i)_* f$ on $\mathbb{R}^{d_i}$ (possibly multiplied by a normalizing determinant factor). The adjoint Brascamp-Lieb inequalities that we obtain take the form
$$\|f\|_{L^p(\mathbb{R}^d)} \leq \mathrm{ABL}(\mathbf{B}, \mathbf{c}, \theta, p) \prod_{i=1}^k \|(B_i)_* f\|_{L^{p_i}(\mathbb{R}^{d_i})}^{\theta_i}$$
for non-negative $f \colon \mathbb{R}^d \to [0,+\infty)$ and various exponents $p, p_1, \dots, p_k$ and weights $\theta_1, \dots, \theta_k$, where $\mathrm{ABL}(\mathbf{B}, \mathbf{c}, \theta, p)$ is the optimal constant for which the above inequality holds for all such $f$; informally, such inequalities control the norm of a non-negative function in terms of its marginals. It turns out that every Brascamp-Lieb inequality generates a family of adjoint Brascamp-Lieb inequalities (with the exponent $p$ being less than or equal to a threshold determined by the data). For instance, the adjoints of the Loomis-Whitney inequality (2) are the inequalities
$$\|f\|_{L^p(\mathbb{R}^3)} \leq \|\pi_1 f\|_{L^{p_1}(\mathbb{R}^2)}^{\theta_1} \|\pi_2 f\|_{L^{p_2}(\mathbb{R}^2)}^{\theta_2} \|\pi_3 f\|_{L^{p_3}(\mathbb{R}^2)}^{\theta_3}$$
for all non-negative measurable $f \colon \mathbb{R}^3 \to [0,+\infty)$, all weights $\theta_1, \theta_2, \theta_3 \geq 0$ summing to $1$, and all exponents $p$ in a suitable range, where the exponents $p_1, p_2, p_3$ are determined from $p$ and $\theta_1, \theta_2, \theta_3$ by a scaling condition, and the $\pi_i f$ are the marginals of $f$:
$$\pi_1 f(y,z) := \int_{\mathbb{R}} f(x,y,z)\, dx, \quad \pi_2 f(x,z) := \int_{\mathbb{R}} f(x,y,z)\, dy, \quad \pi_3 f(x,y) := \int_{\mathbb{R}} f(x,y,z)\, dz.$$

One can derive these adjoint Brascamp-Lieb inequalities from their forward counterparts by a version of the Hölder inequality argument mentioned previously, in conjunction with the observation that the pushforward maps $(B_i)_*$ are mass-preserving (i.e., they preserve the $L^1$ norm on non-negative functions). Conversely, it turns out that the adjoint Brascamp-Lieb inequalities are only available when the forward Brascamp-Lieb inequalities are. In the discrete case the forward and adjoint Brascamp-Lieb constants are essentially identical, but in the continuous case they can (and often do) differ by up to a multiplicative constant. Furthermore, whereas in the forward case there is a famous theorem of Lieb that asserts that the Brascamp-Lieb constants can be computed by optimizing over gaussian inputs, the same statement is only true up to constants in the adjoint case, and in fact in most cases the gaussians will fail to optimize the adjoint inequality. The situation appears to be complicated; roughly speaking, the adjoint inequalities only use a portion of the range of possible inputs of the forward Brascamp-Lieb inequality, and this portion often misses the gaussian inputs that would otherwise optimize the inequality.
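Both the discrete analogue of the Loomis-Whitney inequality (2) (which holds with constant $1$) and the mass-preserving property of the pushforward maps are easy to sanity-check numerically. The following Python sketch is our own illustration rather than code from the paper; the grid size and the random test functions are arbitrary choices.

```python
import itertools
import math
import random

def total(f2d):
    """Total mass of a non-negative function on a 2D grid."""
    return sum(sum(row) for row in f2d)

def loomis_whitney_sides(f1, f2, f3, n):
    """Both sides of the discrete Loomis-Whitney inequality on {0,...,n-1}^3:
    sum_{x,y,z} f1(y,z)^{1/2} f2(x,z)^{1/2} f3(x,y)^{1/2}
        <= (sum f1)^{1/2} (sum f2)^{1/2} (sum f3)^{1/2}."""
    lhs = sum(math.sqrt(f1[y][z] * f2[x][z] * f3[x][y])
              for x, y, z in itertools.product(range(n), repeat=3))
    rhs = math.sqrt(total(f1) * total(f2) * total(f3))
    return lhs, rhs

def marginals(f, n):
    """The three coordinate pushforwards (marginals) of f on {0,...,n-1}^3,
    e.g. m1(y,z) = sum_x f(x,y,z); pushforward = summation over fibers."""
    m1 = [[sum(f[x][y][z] for x in range(n)) for z in range(n)] for y in range(n)]
    m2 = [[sum(f[x][y][z] for y in range(n)) for z in range(n)] for x in range(n)]
    m3 = [[sum(f[x][y][z] for z in range(n)) for y in range(n)] for x in range(n)]
    return m1, m2, m3

random.seed(0)
n = 4

def grid2():
    """A random non-negative function on the 2D grid."""
    return [[random.random() for _ in range(n)] for _ in range(n)]

# Discrete Loomis-Whitney for random non-negative inputs.
lhs, rhs = loomis_whitney_sides(grid2(), grid2(), grid2(), n)

# Pushforwards of a random non-negative function on the 3D grid preserve mass.
f3d = [[[random.random() for _ in range(n)] for _ in range(n)] for _ in range(n)]
mass = sum(f3d[x][y][z] for x, y, z in itertools.product(range(n), repeat=3))
marginal_masses = [total(m) for m in marginals(f3d, n)]
```

Constant functions give equality in the discrete Loomis-Whitney inequality, which is consistent with the indicator functions of boxes being extremizers.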
We have located a modest number of applications of the adjoint Brascamp-Lieb inequality (but hope that there will be more in the future):
- The adjoint inequalities become equalities at $p = 1$; taking a derivative in $p$ at this value (in the spirit of the replica trick in physics) we recover the entropic Brascamp-Lieb inequalities of Carlen and Cordero-Erausquin. For instance, the derivative of the adjoint Loomis-Whitney inequalities at $p = 1$ yields Shearer’s inequality.
- The adjoint Loomis-Whitney inequalities, together with a few more applications of Hölder’s inequality, imply the log-concavity of the Gowers uniformity norms on non-negative functions, which was previously observed by Shkredov and by Manners.
- Averaging the adjoint Loomis-Whitney inequalities over coordinate systems gives reverse inequalities for the X-ray transform and other tomographic transforms that appear to be new in the literature. In particular, we obtain some monotonicity of the $L^p$ norms or entropies of the $k$-plane transform in $k$ (if the exponents are chosen in a dimensionally consistent fashion).
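The entropic endpoint mentioned in the first bullet can be made concrete: for three discrete random variables, Shearer’s inequality asserts $H(X,Y,Z) \leq \frac{1}{2}\left( H(X,Y) + H(X,Z) + H(Y,Z) \right)$. The following brute-force Python check is our own illustration (the joint distribution is an arbitrary random choice, not data from the paper):

```python
import itertools
import math
import random

def entropy(dist):
    """Shannon entropy (in nats) of a probability mass function given as a dict."""
    return -sum(p * math.log(p) for p in dist.values() if p > 0)

def marginal(joint, coords):
    """Marginal of a joint pmf on tuples, keeping the given coordinate positions."""
    out = {}
    for point, p in joint.items():
        key = tuple(point[i] for i in coords)
        out[key] = out.get(key, 0.0) + p
    return out

# A random joint distribution of three binary random variables (X, Y, Z).
random.seed(1)
points = list(itertools.product(range(2), repeat=3))
weights = [random.random() for _ in points]
norm = sum(weights)
joint = {pt: w / norm for pt, w in zip(points, weights)}

h_xyz = entropy(joint)
h_pairs = [entropy(marginal(joint, c)) for c in ((0, 1), (0, 2), (1, 2))]
```

The joint entropy is also at least each pairwise entropy, so Shearer’s bound sits between $H(X,Y,Z)$ and the trivial bound $\sum$ of all three pairwise entropies.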
We also record a number of variants of the adjoint Brascamp-Lieb inequalities, including discrete variants, and a reverse inequality involving $L^p$ norms with $p$ less than $1$ rather than greater than $1$.
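To illustrate the log-concavity of the Gowers uniformity norms mentioned in the bullet list above, one can compute these norms by brute force on a cyclic group $\mathbb{Z}_N$. We use the averaging convention $\|f\|_{U^k}^{2^k} = \mathbb{E}_{x, h_1, \dots, h_k} \prod_{\omega \in \{0,1\}^k} f(x + \omega_1 h_1 + \dots + \omega_k h_k)$; the normalization and the test function below are our own choices, and the concavity is only checked on this example rather than proved:

```python
import itertools
import math

def gowers(f, k):
    """Brute-force ||f||_{U^k}^{2^k} of a non-negative function on Z_N,
    with the averaging convention: the mean over x, h_1, ..., h_k of the
    product of f over the 2^k vertices x + omega . h of a parallelepiped."""
    N = len(f)
    total, count = 0.0, 0
    for tup in itertools.product(range(N), repeat=k + 1):
        x, hs = tup[0], tup[1:]
        prod = 1.0
        for omega in itertools.product((0, 1), repeat=k):
            prod *= f[(x + sum(w * h for w, h in zip(omega, hs))) % N]
        total += prod
        count += 1
    return total / count

f = [1.0, 0.5]  # a non-negative function on Z_2
# b_k = log ||f||_{U^k}^{2^k}, for k = 1, 2, 3
b = [math.log(gowers(f, k)) for k in (1, 2, 3)]
# the norms themselves, ||f||_{U^k} = (||f||_{U^k}^{2^k})^{1/2^k}
norms = [gowers(f, k) ** (1.0 / 2 ** k) for k in (1, 2, 3)]
```

With this convention, on this example the sequence $k \mapsto \log \|f\|_{U^k}^{2^k}$ is concave, and the norms $\|f\|_{U^k}$ are non-decreasing in $k$ (the latter holds for all non-negative $f$ by Cauchy-Schwarz).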
12 comments
30 June, 2023 at 3:59 am
adj
amazing adjoints
30 June, 2023 at 8:03 am
Anonymous
Hi Terry. Love your blog. I want to ask though, why are you linking to mathscinet papers? That severely limits accessibility.
1 July, 2023 at 12:49 pm
Terence Tao
This appears to be a recent change on mathscinet’s end. I would be open to suggestions for alternatives (in particular, a comparison with Zentralblatt or with doi).
30 June, 2023 at 11:36 am
Anonymous
Is it true that whenever the adjoint and forward constants are the same (in some sense indicating “no loss of information”) then the adjoint inequalities are also optimized by gaussians?
1 July, 2023 at 12:47 pm
Terence Tao
Actually, we found that in all non-degenerate cases gaussians do *not* extremize the adjoint inequality; see Theorem 6.1. For instance, the adjoint Loomis-Whitney inequality is not optimized by gaussians, but instead by indicator functions of boxes (or, more generally, Cartesian products). This is despite the Loomis-Whitney inequality being optimized by gaussians (and with no loss in passing back and forth between the forward and adjoint inequalities). What seems to be going on here is that in order for gaussians to be good test functions for the forward inequality, the gaussians have to be related to each other in a very specific way; but for the adjoint inequality there is only one input $f$, and the dual functions associated to that input are unlikely to obey this specific relation even if $f$ is gaussian.
For the discrete analogues of these inequalities the situation is different; for both the forward and adjoint inequalities, indicator functions of subspaces serve as extremizers (Theorem 1.14).
2 July, 2023 at 9:11 am
s
Splundiferous
4 July, 2023 at 4:56 am
XG
Great results! Small thing: in the early derivation right after “Then for any reasonable function”, I think that one should replace q by p’.
6 July, 2023 at 1:46 pm
Quadratini Loacker
A small typo: Cordero-Erausquin instead of Cordero-Erasquin.
[Corrected, thanks – T.]
8 July, 2023 at 3:51 am
Tom Courtade
Unless I’m mistaken, a simple observation is that the adjoint BL inequality can be restated in terms of Rényi entropies, in a way that parallels the Cordero–Erausquin dual formulation of the BL inequalities. Namely, for a random vector $X$ with density $f$, the adjoint BL inequality is equivalently restated as an inequality relating $h_p(X)$ to the Rényi entropies $h_{p_i}(B_i X)$ of its projections, up to a suitable additive constant, where $h_p$ denotes the Rényi entropy of order $p$. As $h_p$ tends to the Shannon entropy as $p \to 1$, the connection to the Cordero–Erausquin entropic formulation of BL is fairly transparent.
This rewriting of the adjoint BL inequality also suggests what other “reverse” formulations of the adjoint BL inequality might look like (Question 10.8 in your paper), since we know their dual entropic formulations. For instance, just based on pattern matching, one might conjecture that the analogue of Barthe’s inequality would look something like this: for random vectors $X_1, \dots, X_k$ on $\mathbb{R}^{d_1}, \dots, \mathbb{R}^{d_k}$, do we have a corresponding entropic inequality in which the sup is taken over all couplings of the $X_i$’s? Taking a suitable limit of the Rényi orders would recover Barthe’s inequality.
10 July, 2023 at 2:12 pm
Terence Tao
Nice observation! We will add this to the next revision of the ms.
20 July, 2023 at 10:16 am
JojoPizza
Terence Tao, we need you in the field of Geometric Complexity Theory. Please contribute your time to advance this important field.
5 August, 2023 at 2:22 pm
Paul Pritchard
Small typo: “various” should be “variants” in the last sentence.
[Corrected, thanks – T.]