, , ,

In this work we investigate the global in time regularity properties of the Yang-Mills equations on high dimensional Minkowski space with compact semi-simple gauge group

G

. Specifically, we show that if a certain gauge covariant Sobolev norm is small, the so called critical regularity

{\dot{H}}_{A}^{\frac{n - 4}{2}}

, and the dimension satisfies

6 ⩽ n

, then a global solution exists and remains regular for all times given that the initial data is regular. This is in the same spirit as the recent result [8] for the Maxwell-Klein-Gordon system, as well as earlier results for high dimensional wave-maps (see [11] , [6] , [9] , and [7] ). Our approach shares many similarities with those works, whose underlying philosophy in basically the same. That is, to introduce Coulomb type gauges in order to treat a specific potential term as a quadratic error. In our setup, we use a non-abelian variant of the remarkable parametrix construction contained in [8] , in conjunction with a version of the Uhlenbeck lemma [13] on the existence of global Coulomb gauges. This latter result has been used for high dimensional wave-maps to globally “renormalize” the equation so that the existence theory can be treated directly through Strichartz estimates applied to multi-linear expressions.

In the present situation, as was the case with the Maxwell-Klein-Gordon system, the corresponding renormalization procedure is necessarily more involved because it needs to be done separately for each distinct direction in phase space. That is, we provide a renormalization of the Yang-Mills equations through the construction of a Fourier integral operator with

G

-valued phase. The construction and estimation of such an object relies heavily on elliptic-Coulomb theory, primary due to the difficulty one faces in that the

G

-valued phase function cannot be localized within a neighborhood of any given point on the group due to the critical nature of the problem (if you like, there is a logarithmic “twisting” of the group element as one moves around in physical space; fortunately the group is compact so this doesn't ruin things).

To get things started, we now give a simple gauge covariant description of the equations we are considering. The (hyperbolic) Yang-Mills equations arise as the evolution equations for a connection on the bundle

V = ℳ^{n} \times g

, where

ℳ^{n}

is some

n

(spatial) dimensional Minkowski space, with metric

g : = (- 1, 1, \dots, 1)

in inertial coordinates

(x^{0}, x^{i})

, and

g

is the Lie algebra of some compact semi-simple Lie group

G

. Here we are considering

V

with the

A d (G)

gauge structure. If

φ

is any section to

V

over

ℳ

, then a connection assigns to every vector-field

X

on the base

ℳ^{n}

, a derivative which we denote as

D_{X}

, such that the following Leibniz rule is satisfied for every scalar field

f

D_{X} (f φ) = X (f) φ + f D_{X} φ .

In this setup, we assume that

V

is equipped with an

A d (G)

invariant metric

〈 \cdot, \cdot 〉

which respects the action of

D

. That is, one has the formula:

\begin{matrix} d 〈 φ, ψ 〉 = 〈 D φ, ψ 〉 + 〈 φ, D ψ 〉 . \end{matrix}

(1)

In the present situation we will take

〈 \cdot, \cdot 〉

to be the Killing form on

g

. The curvature associated to

D

is the

g

valued two-form

F

which arises from the commutation of covariant derivatives and is defined via the formula:

D_{X} D_{Y} φ - D_{Y} D_{X} φ - D_{[X, Y]} φ = [F (X, Y), φ] .

We say that the connection

D

satisfies the Yang-Mills equations if its curvature is a (formal) local minima of the following Maxwell type functional:

\begin{matrix} ℒ [F] = - \frac{1}{4} \int_{ℳ^{n}} 〈 F_{α β}, F^{α β} 〉 D V_{ℳ^{n}} . \end{matrix}

(2)

The Euler-Lagrange equations of 2 read:

\begin{matrix} D^{β} F_{α β} = 0 . \end{matrix}

(3)

Also, from the fact that

F

arises as the curvature of some connection, we have that the following identity known as “Bianchi” is satisfied:

\begin{matrix} D_{[α} F_{β γ]} = 0 . \end{matrix}

(4)

From now on we will refer to the system 3 – 4 as the first order Yang–Mills equations (FYM).

As we have already mentioned, our aim is to study the regularity properties of the Cauchy problem for the (FYM) system. To describe this in a geometrically invariant way, we make use of the following splitting of the connection-curvature pair

(F, D)

: Foliating

ℳ

into the standard Cauchy hypersurfaces

t = c o n s t .

, we decompose:

(F, D) = (\underset{̲}{F}, \underset{̲}{D}) \oplus (E, D_{0}),

where

(\underset{̲}{F}, \underset{̲}{D})

denotes the portion of

(F, D)

which is tangent to the surfaces

t = c o n s t .

(i.e. the induced connection), and

(E, D_{0})

denotes respectively the interior product of

F

with the foliation generator

T = \partial_{t}

, and the normal portion of

D

. In inertial coordinates we have:

E_{i} = F_{0 i} .

On the initial Cauchy hypersurface

t = 0

we call a set

(\underset{̲}{F} (0), \underset{̲}{D} (0), E (0))

admissible Cauchy data ¹ if it satisfies the following compatibility condition:

\begin{matrix} {\underset{̲}{D}}^{i} E_{i} (0) = 0 . \end{matrix}

(5)

We define the Cauchy problem for the Yang-Mills equation to be the task of construction a connection

(F, D)

which solves 3 , and has Cauchy data equal to

(\underset{̲}{F} (0), \underset{̲}{D} (0), E (0))

In order to understand what the appropriate condition on the initial data should be (and what we would like it to be!), it is necessary to consider the following two basic mathematical features of the system 3 – 4 . The first is conservation. From the Lagrangian nature of the field equations 3 – 4 , we have the tensorial conservation law:

\begin{matrix} Q_{α β} [F] & = 〈 F_{α γ}, F_{β}^{γ} 〉 - \frac{1}{4} g_{α β} 〈 F_{γ δ}, F^{γ δ} 〉, \end{matrix}

\begin{matrix} \nabla^{α} Q_{α β} [F] & = 0, \end{matrix}

\begin{matrix}  \end{matrix}

where

\nabla

is the covariant derivative on

ℳ^{n}

. In particular, contracting

Q

with the vector-field

T = \partial_{t}

, we arrive at the following constant of motion for the system 3 – 4 :

\begin{matrix} \int_{R n} Q_{00} d x = \frac{1}{2} \int_{R n} (| E |^{2} + | \underset{̲}{F} |^{2}) d x . \end{matrix}

(6)

The second main aspect of the system 3 – 4 is that of scaling. If we perform the transformation:

\begin{matrix} (x^{0}, x^{i}) ⇝ (λ x^{0}, λ x^{i}), \end{matrix}

(7)

ℳ^{n}

, then an easy calculation shows that:

\begin{matrix} D & ⇝ λ D, & F & ⇝ λ^{2} F . \end{matrix}

(8)

\begin{matrix}  \end{matrix}

If we now define the gauge covariant (integer) Sobolev spaces:

\begin{matrix} ∥ F ∥^{2} {\dot{H}}_{A}^{s} : = \sum_{| I | = s} ∥ {\underset{̲}{D}}^{I} F ∥^{2} L^{2} (R^{n}), \end{matrix}

(9)

where for each multiindex

I = (i_{1}, \dots, i_{s})

we have that

D^{I} = D_{\partial_{i_{1}}} \dots D_{\partial_{i_{s}}}

is the repeated covariant differentiation with respect to the translation invariant spatial vector-fields

{\partial_{1}, \dots, \partial_{n}}

, then for even² spatial dimensions, the norm

{\dot{H}}_{A}^{\frac{n - 4}{2}}

is invariant with respect to the scaling transformation 8 . In particular, the conserved quantity 6 is invariant when

n = 4

and this is called the critical dimension.

Now, based on numerical evidence as well as analytical arguments, it is suspected that in general the Cauchy problem for 3 – 4 with smooth initial data will not be well behaved without size control of the critical regularities

s_{c} = \frac{n - 4}{2}

in high dimensions. What we will take this statement to mean here is simply that if

4 ⩽ n

and the

{\dot{H}}_{A}^{s_{c}}

norm of the initial data is not sufficiently small, then one can expect the existence of regular (i.e.

C_{A}^{\infty}

) sets

(\underset{̲}{F} (0), \underset{̲}{D} (0), E (0))

such that the corresponding solution to 3 – 4 will develop a singularity in finite time. By singularity development, we mean that some higher norm of the type 9 will fail to be bounded at a later time, given that it was initially; or even more specifically, that the

L^{\infty}

norm of the curvature

F

will blow up in finite time for some regular initial data sets. Since these norms are gauge covariant, this type of singularity development would correspond to an intrinsic geometric breakdown of the equations, and could not be an artifact of poorly chosen local coordinates (gauge) on

V

. This has been rigorously demonstrated in the equivariant category for the supercritical dimensions

5 ⩽ n

(see [3] ). In the critical dimension things are much less clear, although there is numerical evidence that on still has blowup for large initial data (see [2] ). This is thought to be connected with the existence of large static solutions (instantons).

One possible conjecture is that there is global regularity when the norm 6 is below the ground state energy.

Going in the other direction, it is expected that if the critical norm

{\dot{H}}_{A}^{\frac{n - 4}{2}}

is sufficiently small, then regular initial data will remain regular for all times. This can be seen as an easier preliminary step toward understanding in detail the issue of large data for dimension

n = 4

, and is furthermore an interesting problem in its own right. A central difficulty in the demonstration of this conjecture is to construct a stable set coordinates on the bundle

V

such that the Christoffel symbols of

D

are well behaved in the sense that they obey the natural range of estimates one expects for this type of problem. This is precisely what we shall do in dimensions

6 ⩽ n

through the well known procedure of using (spatial) Coulomb gauges. Unfortunately, this preliminary gauge construction is far from sufficient to close the regularity argument, and it will in fact be necessary for us to go much further and control infinitely many Coulomb gauges, each of which correspond to a distinct polarized plane wave solution to the usual (flat) wave equation

□ = \nabla^{α} \nabla_{α}

However, this does not effect the statement of our main result which is in fact quite simple:

Theorem 1.1 (Critical regularity for high dimensional Yang-Mills). Let the number of spatial dimensions be even and such that

6 ⩽ n

. Then there exists fixed constants

0 < ɛ_{0}, C

such that if

(\underset{̲}{F} (0), \underset{̲}{D} (0), E (0))

is an admissible data set which satisfies the smallness condition:

\begin{matrix} ∥ (\underset{̲}{F} (0), E (0)) ∥ {\dot{H}}_{A}^{\frac{n - 4}{2}} ⩽ ɛ_{0}, \end{matrix}

(10)

and there exists constants

M_{k} < \infty

\frac{n - 4}{2} < k \in N

such that:

\begin{matrix} ∥ (\underset{̲}{F} (0), E (0)) ∥ {\dot{H}}_{A}^{k} = M_{k}, \end{matrix}

(11)

then there exists a unique global solution to the field equations 3 – 4 with this initial data, and furthermore one has that the following inductive norm bounds hold:

\begin{matrix} ∥ F ∥ {\dot{H}}_{A}^{\frac{n - 4}{2}} & ⩽ C ɛ_{0}, \end{matrix}

\begin{matrix} ∥ F ∥ {\dot{H}}_{A}^{k} & ⩽ C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) M_{k} . \end{matrix}

\begin{matrix}  \end{matrix}

In particular, in this case

F

remains smooth (in the gauge covariant sense) and bounded for all times.

Remark 1.2. As alluded to above, we will more specifically prove the existence of a global (in space and time) spatial Coulomb gauge such that the coefficient functions of the curvature

F

, as well as the Christoffel symbols (gauge potentials) of the connection

D

are in the classical Sobolev spaces

{\dot{H}}^{s}

, and such that they satisfy appropriate angularly and spatially microlocalized Strichartz estimates. We have elected to eliminate a discussion of this from the statement from the main theorem in favor of the simpler geometric language so that the reader can at a first glance gain an idea of the content of our result without being confronted with too many technical details.

¹ Of course, this set is overdetermined as the curvature $\underset{̲}{F}$ depends completely on the connection $\underset{̲}{D}$ . Also, it is perhaps not completely obvious at first that the set $(\underset{̲}{F} (0), \underset{̲}{D} (0), E (0))$ determined uniquely a solution $(F, D)$ to 3 – 4 . For example, the initial normal derivative $D_{0} (0)$ does not need to be specified. We will show this is the case in the sequel (in particular see Proposition 5.3 ).

² For odd spatial dimensions, the above discussion needs to be modified somewhat because we will not make an attempt here to define fractional powers of the spaces ${\dot{H}}_{A}^{s}$ . Instead, what one should do is to simply put things in a Coulomb gauge and then use the usual fractional Sobolev spaces. This later approach is what we will take in the sequel, although for sake of concreteness we will only discuss the case of even dimensions. We have opted for the covariant approach in the introduction because it makes stating our main result a bit easier, and has an appealing simplicity. Also, since we shall need many specifics on how Coulomb gauges are constructed in order to create and control our parametrix, we will explain how the Coulomb gauge relates to the Cauchy problem in detail in the following two sections.

Acknowledgements

First and foremost, we would like to thank our advisors Sergiu Klainerman and Matei Machedon for their continuing support and encouragement. This subject matter as well as our own point of view owes much to them. We would also like to thank Igor Rodnianski, Terry Tao, and Daniel Tataru for many interesting and helpful conversations. This work began at the Institute for Advanced Study during the Fall 2003 semester when both authors were in attendance. The second author would like to thank Harvard University for for its hospitality during the Spring of 2004 and Winter 2005. The first author was partially supported under NSF grant DMS-0401177. The second author was supported in part by an NSF postdoctoral fellowship.

2 Some Basic Notation

We list here some of the basic conventions used in this work, as well as some constants which will be needed in the sequel. We use the usual notation

a ≲ b

, to denote that

a ⩽ C \cdot b

for some (possibly large) constant

C

which may change from line to line. Likewise we write

a ≪ b

to mean

a ⩽ C^{- 1} \cdot b

for some large constant

C

. In general,

C

will denote a large constant, but at times we will also call

C

a connection. The difference should be clear from context. Overall, we will have use for a family of small constants, which satisfy the hierarchy:

\begin{matrix} 0 < ɛ_{0} ≪ ε_{0} ≪ \tilde{ε_{0}} ≪ ℰ ≪ γ ≪ δ ≪ 1 . \end{matrix}

(12)

3 Some gauge-theoretic preliminaries

In this paper, we are working with a compact semi-simple group Lie

G

. However, all of our calculations will be carried out in a somewhat larger context. Firstly, we will assume that

G

is embedded as a subgroup of matrices of some (possibly) larger orthogonal group

O (m)

. In particular, we can identify the Lie algebra

g

with an appropriate sub-algebra of

o (m)

. This allows us to perform all of our calculations on a specific collection of matrices. Since our main computation involves complex valued integral operators, we will further need to work in the complexified algebra

C \otimes o (m)

. The Killing form

〈 \cdot, \cdot 〉

g

extends easily to this context to yield the bilinear form:

\begin{matrix} 〈 A, B 〉 = t r a c e (A B^{*}) . \end{matrix}

(13)

Notice that this is a positive definite form when restricted to the real vector space

o (m)

, and is a sesquilinear form on the corresponding complexified algebra

C \otimes o (m)

. More importantly,

〈 \cdot, \cdot 〉

A d (O (m))

invariant, and in fact the more general identity holds:

\begin{matrix} 〈 g_{1}^{- 1} A h_{1}, g_{2}^{- 1} B h_{2} 〉 = 〈 g_{2} g_{1}^{- 1} A h_{1} h_{2}^{- 1}, B 〉, \end{matrix}

(14)

for

A, B \in C \otimes o (m)

and

g_{i}, h_{i} \in O (m)

. In fact, it is not difficult to see that the form 13 extends to a sesquilinear form on all complex matrices in

M (m \times m)

, and that it can be identified with the usual matrix inner product:

\begin{matrix} 〈 A, B 〉 = \sum_{i, j} a_{i j} {\bar{b}}_{i j}, \end{matrix}

(15)

which come from considering these matrices as vectors in

C^{m^{2}}

. Furthermore, it is easy to see that the general adjoint formula 14 continues to hold in this context.

This will be of fundamental importance in the sequel. In general, we will use the notation

∥ A ∥^{2} = 〈 A, A 〉

to denote the action of this norm on any matrix. Also, notice that directly from 14 one has the isometric identity:

\begin{matrix} ∥ g A ∥ & = ∥ A ∥, & g & \in O (m) . \end{matrix}

(16)

\begin{matrix}  \end{matrix}

These are all very simple algebraic identities, but our method is incredibly sensitive to them and would collapse entirely if they did not hold.

In the context of matrices, we may compute the action of the connection

D

on sections

F

V

as follows:

D_{X} F = X^{α} (\nabla_{α} (F) + [A_{α}, F]),

Here, the gauge potentials

{A_{α}}

are

g

-valued, and are defined via the equation:

D_{α} 1_{V} = [A_{α}, 1_{V}],

where

1_{V}

denotes some chosen orthonormal frame in

V

, and we are abusively writing

F = F 1_{V}

. In shorthand notation, we write:

D = d + A,

where

d

is the usual exterior derivative on matrix valued functions. Likewise, in this notation we have the well known identity for the curvature of

D

F^{A} = d A + [A, A] .

In this last formula, we use the superscript notation to emphasize the fact that the curvature is not gauge invariant, but transforms according to the

A d (G)

action:

F ⇝ g F g^{- 1},

whenever one performs the change of frame

1_{V} ⇝ g 1_{V} g^{- 1}

. As is well known, the potentials

{A_{α}}

themselves do not transform according to

A d (G)

, but instead take on an affine group of transformations:

\begin{matrix} B = g A g^{- 1} + g d g^{- 1}, \end{matrix}

(17)

where

{B_{α}}

represents the connection

D

in the frame

g 1_{V} g^{- 1}

. In particular, the difference of two connections obeys the

A d (G)

structure, a fact we will have use for in a moment. For instance, any connection

{C_{α}}

with

F^{C} = 0

obeys

A d (G)

Furthermore, as is the basic fact of gauge theory, such connections always lead to a globally³ integrable ODE:

d g = g C,

where the solution

g

belongs to

G

. Thus, we may identify flat connections

C

with infinitesimal gauge transformations, and it is easy to see that every gauge transformation 17 leads to a flat connection which we may define as

C = g^{- 1} d g

This completes our discussion of elementary gauge theory.

It will also be necessary for us to make use of the basic facts from (non-gauge-covariant) Hodge theory. Even though the connections we work with in this paper are on the full space-time

ℳ^{n}

, our use of Hodge theory will always be restricted to time slices

{t} \times R^{n}

. In particular we use the general notation

d, d^{*}

for the exterior derivative and its adjoint acting on

g

(and more generally

M (m \times m)

) valued differential forms on

R^{n}

. To emphasize this restriction, we will use Latin indices when computing these operators. For example:

\begin{matrix} (d A)_{i j} & = \nabla_{{i} A_{j}}, & (d F)_{i j k} & = \nabla_{[i} F_{j k]}, \end{matrix}

\begin{matrix}  \end{matrix}

where

{\dots}

and

[\dots]

denote anti-symmetric and symmetric cyclic summing respectively. Also, the adjoint here is taken with respect to the Killing form 13 . In particular, we have the Hodge Laplacean:

\begin{matrix} Δ = - (d d^{*} + d^{*} d), \end{matrix}

(18)

which in our context is simply the usual scalar Laplacean acting component-wise on matrices. Finally, we have the Hodge decomposition which we write as

A = A^{d f} + A^{c f}

where:

\begin{matrix} A^{d f} & = - d^{*} d Δ^{- 1} A, \end{matrix}

\begin{matrix} A^{c f} & = - d d^{*} Δ^{- 1} A . \end{matrix}

\begin{matrix}  \end{matrix}

This decomposition is bounded on

L^{p}

spaces for

1 < p < \infty

as the operators involved are SIO's. Also, since these operators are all real, this decomposition respects the Lie algebra structure of

g

inside of

C \otimes o (m)

The last topic we cover here is the basic underpinning of much of analysis in the context of compact gauge groups. This is the remarkable Uhlenbeck lemma, which allows one to “straighten out” a connection as long as its curvature satisfies appropriate bounds. The important thing for us is that these bounds are precisely at the level of the critical regularity

{\dot{H}}_{A}^{\frac{n - 4}{2}}

. This result is:

Lemma 3.1 (Classical Uhlenbeck lemma). Let

D^{A} = d + A

be a connection with compact (matrix) group on

R^{n}

. Then there is a pair of constants

ε_{0}, C

which only depend on the dimension

n

such that if the curvature

F^{A}

D^{A}

satisfies the bound:

∥ F^{A} ∥ L^{\frac{n}{2}} ⩽ ε_{0},

then

D^{A}

is gauge equivalent to a connection

D^{B} = d + B

where the potentials

{B_{i}}

satisfy the condition:

d^{*} B = 0,

and such that the following estimate holds:

\begin{matrix} ∥ B ∥ L^{n} ⩽ C ε_{0} . \end{matrix}

(19)

In the sequel, it will be useful for us to have a somewhat more refined version of Lemma 3.1 which does not make reference to the size of the curvature, but rather to the size of the connection

{A_{α}}

itself in a critical norm which does not involve derivatives. This will allow us to prove certain connections exist more directly. Furthermore, since the basic formulas used in the proof of this result will be important in constructing our parametrix, it will set the pace for much of what follows. Finally, we mention here that our proof is a bit different from that of [13] in that it does not rely on any implicit function theorem type arguments, and is instead completely explicit being based on a simple Picard iteration.

Lemma 3.2 (Uhlenbeck lemma for small

L^{n}

perturbations of Coulomb potentials with small

L^{\frac{n}{2}}

curvature.). Let

D^{A} = d + A

be a connection on

R^{n} \times V

with compact (matrix) gauge group

G

. Then there exists constants

ε_{0}, C

such that if:

\begin{matrix} ∥ F^{A} ∥ L^{\frac{n}{2}} ⩽ ε_{0}, \end{matrix}

(20)

and such that

d + A

is gauge equivalent to

d + B

with

d^{*} B = 0

, where one has the bounds:

\begin{matrix} ∥ A ∥ L^{n} ⩽ C ε_{0}, \end{matrix}

(21)

then for every connection

d + \tilde{A}

such that:

\begin{matrix} ∥ \tilde{A} - A ∥ L^{n} ⩽ \sqrt{C} ε_{0}, \end{matrix}

(22)

there exists a gauge equivalent connection

d + \tilde{B}

such that

d^{*} \tilde{B} = 0

, and one has the same size control:

\begin{matrix} ∥ \tilde{B} ∥ L^{n} ⩽ C ε_{0} . \end{matrix}

(23)

Remark 3.3. Before continuing with proof, let us remark here that Lemma 3.2 is in fact more general that the classical Uhlenbeck Lemma. Specifically, 3.2 easily implies 3.1 with smallness condition

ε_{0} / 2

(where

ε_{0}

is determined by Lemma 3.2 ) through a straightforward induction procedure which we outline now.

First of all, from Lemma 3.2 we see that the set of all connections

d + A

with curvature such that:

\begin{matrix} ∥ F^{A} ∥ L^{\frac{n}{2}} ⩽ \frac{ε_{0}}{2}, \end{matrix}

(24)

and such that

d + A

is equivalent to

d + B

with

d^{*} B = 0

, and such that one has the bounds 19 , is an open set in the intersection of

L^{n}

with the set determined by 24 (in the sense of distributions). Therefore, if the conclusion of Lemma 3.1 were to be violated, it must then be the case that there is a smallest number

r^{*}

such that the sphere of radius

r^{*}

contains a connection

d + A

with the property that it cannot be put in the Coulomb gauge (with

L^{n}

bounds), even though the bound 24 is valid for this connection. Now, consider the set of connections

d + λ A

where

0 < (1 - λ) ≪ 1

A quick calculation shows that these have curvature:

F^{λ A} = λ F^{A} + λ (λ - 1) [A, A] .

Choose

λ

such that:

(1 - λ) ⩽ (1 + r^{*})^{- 2} \cdot \frac{ε_{0}}{2} .

By the triangle and Hölders inequality, and the definition of

r^{*}

, we have that:

∥ F^{λ A} ∥ L^{\frac{n}{2}} ⩽ ε_{0} .

Therefore, by the minimality of

r^{*}

we have that

d + λ A

can be Coulomb gauged.

Again, by the definition of

λ

, we have that:

d + A = d + λ A + \tilde{A},

where we easily have the bound (we may assume

1 ⩽ C

∥ \tilde{A} ∥ L^{n} ⩽ \sqrt{C} ε_{0} .

Therefore, by an application of Lemma 3.2 we have that

d + A

can be put in the Coulomb gauge with the 19 holds. This contradicts the minimality of

r^{*}

as was to be shown.

Proof of Lemma 3.2 . It suffices to show that

d + \tilde{A}

is gauge equivalent to

d + \tilde{B}

, where

d^{*} \tilde{B} = 0

and with the bound 23 , provided that:

\begin{matrix} ∥ \tilde{A} ∥ L^{n} ⩽ 2 \sqrt{M} ε_{0}, \end{matrix}

(25)

when

ε_{0}

chosen sufficiently small, and where

M

is some sufficiently large fixed constant which will be determined in a moment, and which will be chosen to be our

C

in the estimates 21 and 23 (the reason for the notation switch will become clear in a moment). To see this, notice that the smallness condition 22 is gauge invariant because the difference of two connections transforms according to the

A d (G)

action which fixes the Killing form used to compute

∥ \cdot ∥ L^{n}

. Therefore, we may assume from the start that the original connection

A

is in the Coulomb gauge with size control 21 . In particular, the connection

d + A

satisfies the div-curl system:

\begin{matrix} d A & = F^{A} - [A, A], & d^{*} A & = 0, \end{matrix}

\begin{matrix}  \end{matrix}

which we can integrate to form the equation:

\begin{matrix} A = - d^{*} Δ (F^{A} - [A, A]) . \end{matrix}

(26)

Everything we do now will be based on the Riesz operator bounds:

\begin{matrix} \nabla^{2} Δ^{- 1} : L^{n} & ↪ L^{n}, \end{matrix}

(27)

\begin{matrix} \nabla Δ^{- 1} : L^{\frac{n}{2}} & ↪ L^{n} . \end{matrix}

(28)

\begin{matrix}  \end{matrix}

We choose our constant

C

such that

\frac{\sqrt{M}}{8}

is the constant in (the various vector analogs) of the embeddings 27 – 28 . Using these bounds and the integral equation 26 in conjunction with the assumed smallness conditions 20 and 21 , and a round of Hölder's inequality, we have the following improved bounds for

d + A

\begin{matrix} ∥ A ∥ L^{n} ⩽ \sqrt{M} ε_{0}, \end{matrix}

(29)

as long as

ε_{0}

is chosen sufficiently small. In particular, using 22 and some addition and subtraction we have the bound 25 .

We now construct by hand the gauge transformation:

\begin{matrix} d g = g \tilde{A} - \tilde{B} g, \end{matrix}

(30)

with

d^{*} \tilde{B} = 0

. This will be done by constructing the infinitesimal gauge transformation

C = g^{- 1} d g

. A quick calculation shows that this must satisfy the following div-curl system:

\begin{matrix} d C & = - [C, C], \end{matrix}

(31a)

\begin{matrix} d^{*} C & = d^{*} \tilde{A} + [\tilde{A}, C] . \end{matrix}

(31b)

\begin{matrix}  \end{matrix}

Unfortunately, the system 31 cannot be solved constructively, say through an iteration scheme. This is because implicit in its structure is the compatibility condition

d [C, C] = 0

, which gets destroyed through (at least the usual) Picard iteration. This could be side-stepped by using an implicit function theorem type argument, but since we prefer to do things explicitly we proceed as follows: We first write the system 31 in terms of integral equations:

\begin{matrix} C^{d f} & = \frac{d^{*}}{Δ} [C, C], \end{matrix}

(32a)

\begin{matrix} C^{c f} & = \frac{d}{Δ} (- d^{*} \tilde{A} - [\tilde{A}, C]) . \end{matrix}

(32b)

\begin{matrix}  \end{matrix}

Here

C = C^{d f} + C^{c f}

denotes the Hodge decomposition of the matrix valued one-form

C

. A solution to system 32 can now be constructed from scratch via Picard iteration starting with

C^{(0)} = 0

. The condition 25 and the embeddings 27 – 28 guarantee convergence to a solution. Furthermore, because it is true for each iterate, one has the following bounds on the solution:

\begin{matrix} ∥ C ∥ L^{n} ⩽ 2 \cdot \frac{\sqrt{M}}{8} ∥ \tilde{A} ∥ L^{n} ⩽ \frac{M}{2} ε_{0} . \end{matrix}

(33)

Also, since each iterate belongs pointwise to

g

, the solution does also due to the fact that

g

is a linear (and hence closed) subspace of the matrices

M (m \times m)

. We now need to show that this

C

is indeed a solution to the original system 31 . That is, we need to establish the identity:

\begin{matrix} d d^{*} Δ^{- 1} [C, C] = - [C, C] . \end{matrix}

(34)

Notice that this does not follow algebraically from the form of the integral system 32 , because it is not a-priori clear that in fact

d [C, C] = 0

. However, this is the case, which is a consequence of the following a-priori estimate for solutions to 32 :

\begin{matrix} ∥ d d^{*} Δ^{- 1} [C, C] + [C, C] ∥ L^{\frac{n}{2}} ≲ ∥ C ∥ L^{n} \cdot ∥ d d^{*} Δ^{- 1} [C, C] + [C, C] ∥ L^{\frac{n}{2}} . \end{matrix}

(35)

Notice that 33 and 35 taken together immediately imply the identity 34 .

In order to show 35 , we first use the Hodge Laplacean 18 to write:

d d^{*} Δ^{- 1} [C, C] + [C, C] = - d^{*} Δ^{- 1} (d [C, C]) .

Next, we compute that:

\begin{matrix} (d [C, C])_{i j k} & = \nabla_{[i} [C_{j}, C_{k]}], \end{matrix}

\begin{matrix} = [\nabla_{[i} C_{j}, C_{k]}] - [\nabla_{[i} C_{k}, C_{j]}], \end{matrix}

\begin{matrix} = - [C_{[i}, (d C)_{j k]}] . \end{matrix}

\begin{matrix}  \end{matrix}

Therefore, using this last identity in conjunction with fractional integration, and using the identity from line 32a above, we have that:

\begin{matrix} ∥ d d^{*} Δ^{- 1} [C, C] + [C, C] ∥ L^{\frac{n}{2}} & = ∥ d^{*} Δ^{- 1} [C, d C] ∥ L^{\frac{n}{2}}, \end{matrix}

\begin{matrix} ≲ ∥ [C, d C] ∥ L^{\frac{n}{3}}, \end{matrix}

\begin{matrix} ⩽ ∥ [C, (d d^{*} Δ^{- 1} [C, C] + [C, C])] ∥ L^{\frac{n}{3}} + ∥ [C, [C, C]] ∥ L^{\frac{n}{3}}, \end{matrix}

\begin{matrix} ⩽ 2 ∥ C ∥ L^{n} \cdot ∥ d d^{*} Δ^{- 1} [C, C] + [C, C] ∥ L^{\frac{n}{2}} . \end{matrix}

\begin{matrix}  \end{matrix}

Notice that the last inequality here follows simply from the Jacobi identity

[C, [C, C]] = 0

To wrap things up, we only need to establish the existence of

g

on line 30 above with

d^{*} \tilde{B} = 0

, and such that we have the size control 23 (with constant

M

). Now, by design we have that

F^{C} = 0

, so we may integrate the equation:

d g = g C,

with initial conditions

g (0) = I

on all of

R^{n}

. Defining now:

\tilde{B} = g \tilde{A} g^{- 1} + g d g^{- 1},

we have that:

\begin{matrix} - d^{*} \tilde{B} & = D_{i}^{\tilde{B}} {\tilde{B}}^{i}, \end{matrix}

\begin{matrix} = g D_{i}^{\tilde{A}} (g^{- 1} {\tilde{B}}^{i} g) g^{- 1}, \end{matrix}

\begin{matrix} = g D_{i}^{\tilde{A}} ({\tilde{A}}^{i} - C^{i}) g^{- 1}, \end{matrix}

\begin{matrix} = g (- d^{*} \tilde{A} + d^{*} C - [\tilde{A}, C]) g^{- 1}, \end{matrix}

\begin{matrix} = 0, \end{matrix}

\begin{matrix}  \end{matrix}

as was to be shown. Finally, by using the bounds 25 and 33 and the definition of the potentials

{\tilde{B}}

and

{C}

we have the bound:

∥ \tilde{B} ∥ L^{n} ⩽ ∥ \tilde{A} ∥ L^{n} + ∥ C ∥ L^{n} ⩽ M ε_{0} .

This completes the proof of Lemma 3.2 . □

³ Of course this ODE is non-linear, but in the present context it also satisfies the conservation law $g g^{†} = I$ .

4 Some analytic preliminaries

We record here some useful formulas, mostly from elementary harmonic analysis, which will be used many times in the sequel. Firstly, we define the Fourier transform on

C \otimes o (m)

, which is merely the usual scalar Fourier transform acting component-wise on matrices:

\begin{matrix} \hat{A} (ξ) = \int_{R^{n}} e^{- 2 π i x \cdot ξ} A (x) d x . \end{matrix}

(36)

The Plancherel theorem with respect to the Killing form 13 reads:

\int_{R_{x}^{n}} 〈 A, B 〉 d x = \int_{R_{ξ}^{n}} 〈 \hat{A}, \hat{B} 〉 d ξ .

This follows simply from definition of the inner product 15 . While the constructions we make in the sequel are almost explicitly based on the spatial transform 36 , it will in certain places be convenient for us to work with the space-time Fourier transform:

\hat{A} (τ, ξ) = \int_{R^{n + 1}} e^{- 2 π i (t τ + x \cdot ξ)} A (t, x) d t d x .

In the sequel, we will have much use for dyadic frequency decompositions with respect to the spatial variable. For the most part, we will use a fairly loose and heuristic notation for this operation. This will help us to avoid having to come up with different symbols for multipliers which are basically the same. First of all, we let

χ (ξ)

denote some smooth bump function adapted to the unit frequency annulus

{2^{- a} ⩽ | ξ | ⩽ 2^{a}}

, where

1 ⩽ a

is some constant used to define

χ

which may change from line to line. For a dyadic number

μ \in {2^{i} | i \in Z}

, we define the rescaled cutoffs:

χ_{μ} (ξ) = χ (μ^{- 1} ξ),

and the associated Fourier multipliers

\hat{P_{μ} A} = χ_{μ} \hat{A}

. The two main facts we will need about these multipliers is the Bernstein inequality:

\begin{matrix} ∥ P_{μ} A ∥ L^{p} ≲ μ^{n (\frac{1}{q} - \frac{1}{p})} ∥ A ∥ L^{q}, \end{matrix}

(37)

which holds for all

1 ⩽ q ⩽ p ⩽ \infty

, and the Littlewood-Paley equivalence:

\begin{matrix} ∥ (\sum_{μ} | P_{μ} A |^{2})^{\frac{1}{2}} ∥ L^{p} \sim ∥ A ∥ L^{p}, \end{matrix}

(38)

which holds under the restriction

1 < p < \infty

. All of the norms above can be taken with respect to 13 .

There are two simple analysis lemmas involving derivatives and multipliers which will come in useful in the sequel. The first of these is the low frequency (operator) commutator estimate:

\begin{matrix} ∥ [A, P_{1}] \cdot F ∥ L^{p} ≲ ∥ \nabla_{x} A ∥ L^{q} \cdot ∥ F ∥ L^{r}, \end{matrix}

(39)

where

\frac{1}{p} = \frac{1}{q} + \frac{1}{r}

(see [8] ). The second is the homogeneous paraproduct estimate:

\begin{matrix} ∥ \nabla_{x}^{k} (A \cdot F) ∥ L^{p} ≲ ∥ \nabla_{x}^{k} A ∥ L^{q_{1}} \cdot ∥ F ∥ L^{r_{1}} + ∥ A ∥ L^{q_{2}} \cdot ∥ \nabla_{x}^{k} F ∥ L^{r_{2}}, \end{matrix}

(40)

for

1 < p, q_{i}, r_{i} < \infty

\frac{1}{p} = \frac{1}{q_{1}} + \frac{1}{r_{1}}

, and

\frac{1}{p} = \frac{1}{q_{2}} + \frac{1}{r_{2}}

whenever

0 < k

. This estimate is true even for non-integer

0 ⩽ k

by a simple Littlewood-Paley argument. We note here that we only use it the integer case, and there it is only employed as a convenience. For a proof of this, see e.g. Chapter 2 of [12] .

We would now like to set up a system to formalize many of the dyadic estimates which will appear in this paper. This is most easily done using the language of Besov spaces. Since we have a specific purpose for these in mind, we introduce the following notation:

\begin{matrix} ∥ A ∥^{2} {\dot{B}}_{2}^{p, (q, s)} = \sum_{μ} μ^{2 s - 2 n (\frac{1}{q} - \frac{1}{p})} ∥ P_{μ} A ∥^{2} L^{p}, \end{matrix}

(41)

This notation may seem a bit mysterious at first, but the thing to keep in mind here is that the first index

p

in some sense controls the decay, while the second double index

(q, s)

controls the scaling, which is the same as

{\dot{W}}^{s, q}

(homogeneous

L^{q}

Sobolev space). In general, the second index will be fixed, so we will strive to have

p

as low as possible (see Remark 4.2 below). This notation has the following simple significance:

{\dot{B}}_{2}^{p, (q, s)}

is the

ℓ^{2}

Besov space of Lebesgue index

p

which contains the standard Besov space

{\dot{B}}_{2}^{q, s}

defined by:

∥ A ∥^{2} {\dot{B}}_{2}^{q, s} = \sum_{μ} μ^{2 s} ∥ P_{μ} A ∥^{2} L^{q} .

This identification is a direct consequence of the Bernstein embedding 37 . In general, one has the inclusions:

\begin{matrix} {\dot{B}}_{2}^{p_{1}, (q, s)} & \subseteq {\dot{B}}_{2}^{p_{2}, (q, s)}, & q ⩽ p_{1} ⩽ p_{2} ⩽ \infty . \end{matrix}

(42)

\begin{matrix}  \end{matrix}

Furthermore, a quick application of the Littlewood-Paley identity 38 gives the Lebesgue space inclusion:

\begin{matrix} {\dot{B}}_{2}^{p, (q, n (\frac{1}{q} - \frac{1}{p}))} & \subseteq L^{p}, & 2 ⩽ p < \infty . \end{matrix}

(43)

\begin{matrix}  \end{matrix}

The reason we prefer to use this more involved notation, instead of the usual Besov norm convention is that ours allows one to tell at first glance which norms are critical, which is particularly useful in a scale invariant problem like the one of this paper. Specifically, the norms

{\dot{B}}_{2}^{p, (2, \frac{n - 2}{2})}

will play a prominent role in what follows.

It will also be necessary for us to employ the

ℓ^{1}

summing version of the norm 41 , which we label by

{\dot{B}}_{1}^{p, (q, s)}

. This will essentially be used for one purpose only, and that is that the

L^{\infty}

end-point of 43 is true for this space:

\begin{matrix} {\dot{B}}_{1}^{\infty, (q, \frac{n}{q})} & \subseteq L^{\infty}, & 1 ⩽ q ⩽ \infty . \end{matrix}

(44)

\begin{matrix}  \end{matrix}

Besov spaces are particularly well behaved with respect to the action of Riesz operators, which is exactly why we use them. In general, we define the operator

| D_{x} |^{- σ}

to be the Fourier multiplier with symbol

| ξ |^{- σ}

. The basic embedding we will use in the sequel is the following:

Lemma 4.1. One has the following bilinear estimate for Besov spaces for

0 ⩽ σ

\begin{matrix} | D_{x} |^{- σ} : {\dot{B}}_{2}^{p, (2, s_{1})} \cdot {\dot{B}}_{2}^{q, (2, s_{2})} ↪ {\dot{B}}_{1}^{r, (2, s_{3})}, \end{matrix}

(45)

where the indices

1 ⩽ p, q, r ⩽ \infty

and

σ, s_{i}

satisfy the following conditions:

\begin{matrix} s_{3} & = s_{1} + s_{2} + σ - \frac{n}{2}, & (s c a l i n g), \end{matrix}

(46)

\begin{matrix} σ + \frac{n}{2} - s_{3} & < n (\frac{1}{p} + \frac{1}{q}), & (H i g h \times H i g h), \end{matrix}

(47)

\begin{matrix} s_{1} & < \frac{n}{2} + min {n (\frac{1}{q} - \frac{1}{r}), 0}, & (L o w \times H i g h), \end{matrix}

(48)

\begin{matrix} s_{2} & < \frac{n}{2} + min {n (\frac{1}{p} - \frac{1}{r}), 0}, & (H i g h \times L o w), \end{matrix}

(49)

\begin{matrix} \frac{1}{r} & ⩽ \frac{1}{p} + \frac{1}{q}, & (L e b e s g u e) . \end{matrix}

(50)

\begin{matrix}  \end{matrix}

Remark 4.2. As will become apparent in the proof, it is possible to show frequency localized versions of the embedding 45 such that not all of the conditions 47 – 49 need to be satisfied. Indeed, we will show the following two frequency localized “improvements” are possible:

\begin{matrix} | D_{x} |^{- σ} : P_{∙ ≪ λ} ({\dot{B}}_{2}^{p, (2, s_{1})}) \cdot P_{λ} ({\dot{B}}_{2}^{q, (2, s_{2})}) & ↪ P_{λ} ({\dot{B}}_{1}^{r, (2, s_{3})}), \end{matrix}

(51)

\begin{matrix} | D_{x} |^{- σ} : P_{λ} ({\dot{B}}_{2}^{p, (2, s_{1})}) \cdot P_{λ} ({\dot{B}}_{2}^{q, (2, s_{2})}) & ↪ {(\frac{μ}{λ})}^{δ} P_{μ} ({\dot{B}}_{1}^{r, (2, s_{3})}), \end{matrix}

(52)

\begin{matrix}  \end{matrix}

where

δ = n (\frac{1}{p} + \frac{1}{q}) + s_{3} - σ - \frac{n}{2}

in estimate 52 . Estimate 51 holds whenever 46 , 48 , and 50 are satisfied. The second estimate 52 is valid whenever we have 46 , 47 , and 50 . In particular, notice that for larger

σ

this estimate requires lower values of

p, q

. This fact will have an immense bearing on the estimates we prove in the sequel, and seems to be one of the most difficult factors in lowering the dimension of the overall argument from

n = 6

(apart from even more difficult things such as null-form estimates).

Proof of estimate 45 . The proof is a simple matter of the standard technique of trichotomy. That is, we start with two test matrices

A

and

C

, and we run a frequency decomposition on the product:

A \cdot C = \sum_{λ, μ_{i}} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) .

Setting now:

γ = min {\frac{n}{2} - s_{1}, \frac{n}{2} - s_{2}, n (\frac{1}{p} + \frac{1}{q}) + s_{3} - σ - \frac{n}{2}, \frac{n}{2} + n (\frac{1}{q} - \frac{1}{r}) - s_{1}, \frac{n}{2} + n (\frac{1}{p} - \frac{1}{r}) - s_{2}},

we have from the conditions 47 – 49 that

0 < γ

. To prove 45 it suffices to show that:

\begin{matrix} \sum_{μ_{1} : μ_{1} ≪ μ_{2} λ \sim μ_{2}} λ^{s_{3} - n (\frac{1}{2} - \frac{1}{r}) - σ} ∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{r} ≲ \end{matrix}

\begin{matrix} \sum_{μ_{1} : μ_{1} ≪ μ_{2} λ \sim μ_{2}} {(\frac{μ_{1}}{μ_{2}})}^{γ} ∥ P_{μ_{1}} A ∥ {\dot{B}}^{p, (2, s_{1})} \cdot ∥ P_{μ_{2}} C ∥ {\dot{B}}^{q, (2, s_{2})}, \end{matrix}

\begin{matrix} \sum_{μ_{2} : μ_{2} ≪ μ_{1} λ \sim μ_{1}} λ^{s_{3} - n (\frac{1}{2} - \frac{1}{r}) - σ} ∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{r} ≲ \end{matrix}

\begin{matrix} \sum_{μ_{2} : μ_{2} ≪ μ_{1} λ \sim μ_{1}} {(\frac{μ_{2}}{μ_{1}})}^{γ} ∥ P_{μ_{1}} A ∥ {\dot{B}}^{p, (2, s_{1})} \cdot ∥ P_{μ_{2}} C ∥ {\dot{C}}^{q, (2, s_{2})}, \end{matrix}

\begin{matrix} \sum_{λ : μ_{2} \sim μ_{1} λ ≲ μ_{i}} λ^{s_{3} - n (\frac{1}{2} - \frac{1}{r}) - σ} ∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{r} ≲ ∥ P_{μ_{1}} A ∥ {\dot{B}}^{p, (2, s_{1})} \cdot ∥ P_{μ_{2}} C ∥ {\dot{B}}^{q, (2, s_{2})}, \end{matrix}

\begin{matrix}  \end{matrix}

That 45 follows from these three estimates is a simple consequence of Young's inequality and Cauchy-Schwartz respectively. These estimates, in turn, are all a consequence of the single fixed frequency bound:

(53) λ s 3 − n ( 1 2 − 1 r ) − σ ∥ P λ ( P μ 1 A ⋅ P μ 2 C ) ∥ L r ≲ ( λ max { μ i } ) γ ⋅ min { ( μ 1 μ 2 ) γ , ( μ 2 μ 1 ) γ } ⋅ ∥ P μ 1 A ∥ B ˙ p , ( 2 , s 1 ) ⋅ ∥ P μ 2 C ∥ B ˙ q , ( 2 , s 2 ) .

The proof of 53 is a simple matter of Hölders and Bernstein's inequalities, and counting weights. There are three cases corresponding to the three summing estimates above. In the first case, we assume that

λ ≲ μ_{1} \sim μ_{2}

. Since 53 is scale invariant, we may assume in this case that both

μ_{i} \sim 1

. Using now Hölders inequality which is permissible by 50 , followed by the Bernstein inequality, we have that:

∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{r} ≲ λ^{n (\frac{1}{p} + \frac{1}{q} - \frac{1}{r})} ∥ P_{μ_{1}} A ∥ L^{p} \cdot ∥ P_{μ_{2}} C ∥ L^{q} .

Multiplying this last estimate by the weight

λ^{s_{3} - n (\frac{1}{2} - \frac{1}{r}) - σ}

we arrive at the bound:

(L.H.S.)gen_besov_fixed_freq 53 ≲ λ^{n (\frac{1}{p} + \frac{1}{q}) + s_{3} - σ - \frac{n}{2}} ∥ P_{μ_{1}} A ∥ L^{p} \cdot ∥ P_{μ_{2}} C ∥ L^{q} .

Then 53 follows in this case from the definition of

γ

and the fact that

μ_{i} \sim 1

The other two cases, which correspond to

μ_{1} ≪ μ_{2}

or vice versa are similar, so it suffices to consider the first. In this case we rescale to

μ_{2} \sim λ \sim 1

. In the case where

r < q

we set

\frac{1}{\tilde{p}} = \frac{1}{r} - \frac{1}{q}

, and we again use Hölder and Bernstein to estimate:

∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{r} ≲ μ_{1}^{n (\frac{1}{p} - \frac{1}{\tilde{p}})} ∥ P_{μ_{1}} A ∥ L^{p} \cdot ∥ P_{μ_{2}} C ∥ L^{q} .

If it is the case that

q ⩽ r

, then we simply estimate:

∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{r} ≲ μ_{1}^{\frac{n}{p}} ∥ P_{μ_{1}} A ∥ L^{p} \cdot ∥ P_{μ_{2}} C ∥ L^{q} .

In either case, the claim 53 follows from the definition of

γ

. This completes the proof of 45 . □

Before continuing on, let us note here a slight refinement of the Besov norms 41 and the embedding 45 . This involves taking into account functions which live at frequency

≲ 1

. If we let

〈 D_{x} 〉

denote the multiplier with symbol

(1 + | ξ |^{2})^{\frac{1}{2}}

, then we form the low frequency spaces:

\begin{matrix} ∥ A ∥ {\dot{B}}_{2, 10 n}^{p, (q, s)} = ∥ 〈 D_{x} 〉^{10 n} A ∥ {\dot{B}}_{2}^{p, (q, s)}, \end{matrix}

(54)

with a similar definition for the

ℓ^{1}

version

{\dot{B}}_{1, 10 n}^{p, (q, s)}

. By a straightforward adaptation of the previous argument, it is easy to see that the embedding 45 is equally valid for these low frequency spaces. We leave the details to the reader.

It will also be necessary for us to perform various dyadic decompositions with respect to the angular frequency variable. For each fixed direction

ω

in the frequency plane

R_{ξ}^{n}

, we decompose the unit sphere

S_{ξ}^{n - 1}

into dyadic conical regions:

\begin{matrix} ℛ (ω, θ) = {η \in S_{ξ}^{n - 1} | ∠ (ω, η) \sim θ}, \end{matrix}

(55)

where

θ \in {\frac{π}{2} \cdot 2^{i} | i \in Z, i ⩽ 0}

. Here we will not bother to fix the constant in the

\sim

notation used to define the regions 55 , but we will let it change from line to line as we have done for the spatial multipliers above. We also define a smooth partition of unity adapted to these regions, which we label by

b_{θ}^{ω}

. These can always be chosen (e.g. by defining them on a larger sphere and then rescaling) so that they satisfy the differential bounds:

\begin{matrix} | (ω \cdot \nabla_{ξ})_{ω}^{k} p_{1} b_{θ}^{ω} | & ≲ 1, & | (ω^{⊥} \cdot \nabla_{ξ})^{k} p_{1} b_{θ}^{ω} | & ≲ θ^{- k}, \end{matrix}

\begin{matrix}  \end{matrix}

where the implicit constants depend on

k

but are uniform in

θ

. In particular, if we define the multipliers

\hat{ω Π_{θ} A} = b_{θ}^{ω} \hat{A}

, then the operators

ω Π_{θ} P_{μ}

are bounded on all

L^{p}

spaces uniformly in

μ

and

θ

. In fact, the following refinement of the inequality 37 holds, which we also call Bernstein:

\begin{matrix} ∥ ω Π_{θ} P_{μ} A ∥ L^{p} ≲ μ^{n (\frac{1}{q} - \frac{1}{p})} θ^{(n - 1) (\frac{1}{q} - \frac{1}{p})} ∥ A ∥ L^{q} . \end{matrix}

(56)

In all of the above inequalities, we have kept

ω

as a fixed directional value.

However, it will also be necessary for us to have an account of how our multipliers depend on this parameter. In particular, we will need to have bounds for the operators

\nabla_{ω} ω Π_{θ}

. This is easily achieved by differentiating the associated multiplier.

In fact, one has the bounds for fixed

ξ

\begin{matrix} | \nabla_{ω}^{k} b_{θ}^{ω} | ≲ θ^{- k} . \end{matrix}

(57)

The way we shall express this bound in calculations is through the following heuristic operator identity:

\begin{matrix} \nabla_{ω}^{k} ω Π_{θ} \approx θ^{- k} ω Π_{θ}, \end{matrix}

(58)

which we shall take to mean that the left hand side satisfies all

L^{p}

space bounds as the right hand side. Notice that this relation has a preferred direction (left

\Rightarrow

right).

In practice, this means that we have the bound 56 for the operator on the left hand side of 58 with the added factor of

θ^{- k}

Finally, let us end this section by making the following conventions. Firstly, it will be convenient for us at times to write

P_{μ} A = A_{μ}

for a localized object.

This should not be confused with the

μ^{t h}

component of

A

in the case that it is a one-form. This should usually be clear from context. Secondly, it will be necessary for us to ensure that certain of our multipliers have real symbol so that they respect the subalgebra

g (m) \subseteq M (m \times m)

. This will be done by taking their real part which simply symmetrizes their (real) symbols. In particular, we will denote this by:

ℜ (ω Π_{θ}) = ω {\bar{Π}}_{θ} .

Secondly, we use the following bulleted notation for the sum of various cutoffs over a given range:

\begin{matrix} P_{∙ < c} & = \sum_{μ < c} P_{μ}, & ω Π_{∙ < c} & = \sum_{θ < c} ω Π_{θ}, \end{matrix}

\begin{matrix}  \end{matrix}

etc. We will also use the notation

A_{∙ < c}

etc. for these operators applied to tensors.

Finally, we will set aside a special notation here for cutting off on angles sectors whose width depends on the frequency:

\begin{matrix} ω {\bar{Π}}^{(σ)} = \sum_{μ} ω {\bar{Π}}_{μ^{σ} < ∙} P_{μ} . \end{matrix}

(59)

Notice that this multiplier does not satisfy good bounds of the form 57 . However, it can be dealt with using the Littlewood-Paley equivalence 38 if there is a little extra room left to sum over fixed angular dyadics. This ends our description of the basic analysis we will use in this paper.

5 Gauge construction for the initial data; Reduction to a second order system and the main a-priori estimate

We now begin our proof of the main theorem 1.1 . As we have already mentioned, one of the central components of the proof is to construct a stable set of “elliptic coordinates” on the bundle

V

. The way we will do this is to construct the desired frame on the

t = 0

slice

R n \times g

. We will then show that this frame propagates as the system evolves by solving an auxiliary set of equations for the gauge potentials which respects the chosen frame automatically. The regularity of this system of equations will be provided in the usual translation invariant Sobolev spaces. We then show that our auxiliary solution is in fact a true solution to the system of equations 3 – 4 by employing a bootstrapping procedure which is similar to that used in the proof of Lemma 3.2 . The desired gauge covariant regularity, which is contained in the statement of Theorem 1.1 , will be provided by a comparison principle. These constructions are all local in time and are more or less standard. We have included them here for the convenience of the reader, the sake of completeness, and the fact that some of the formulas we develop along the way will be central to what we do in later sections.

With the local theory established, the global conclusion of Theorem 1.1 will then be a consequence of a certain a-priori estimate on the (usual Sobolev) energy of solutions to 3 – 4 in the gauge we construct. Our task will then be to show that this a-priori estimate is true for all solutions to yet another system of auxiliary equations, this time for the curvature. This can be considered to be the main estimate of the paper. The proof turns out to be quite involved, and will occupy the rest of the paper. In the next section, we will prove the main a-priori estimate itself with the help of a certain family of microlocalized space-time (Strichartz) estimates for solutions to second order covariant wave equations on bundles with connections satisfying estimates consistent with our bootstrapping assumptions.

The breakdown here is based on the Smith-Tataru (see [10] )

ℰ

-parametrix idea, which allows one to reduce the needed Strichartz estimates to proving them for a suitable family of approximate frequency localized fundamental solutions. Our rendition of this is essentially equivalent to that contained in the paper [8] .

Finally, in the remaining sections of the paper we develop the linear theory. This is by far the most involved portion of the present work, and requires the construction of some fairly sophisticated oscillatory integrals and microlocal function spaces.

This material can be read without reference to the non-linear problem, as long as one is familiar with the algebraic and analytic assumptions we make on the geometry (frequency localized connection). While these come from the non-linear problem, they are of course a bit more general.

5.1 Construction of the initial frame, and the comparison principle

The first thing we do here is to put the initial connection

\underset{̲}{D}

into the Coulomb gauge. Via the Uhlenbeck lemma 3.1 , we simply need to show that:

∥ \underset{̲}{F} ∥ L^{\frac{n}{2}} ≲ ɛ_{0},

for

ɛ_{0}

the sufficiently small parameter from line 10 (which should not be confused with the small constant from Lemma 3.1 above). This

L^{p}

bound follows immediately from the gauge covariant Sobolev embedding (for

n

even):

{\dot{H}}_{A}^{\frac{n - 4}{2}} \subseteq L^{\frac{n}{2}},

which in turn follows from repeated application of the usual single derivative Sobolev embeddings and the Kato estimate (which follows immediately from 1 and Cauchy-Schwatrz):

\begin{matrix} | d | F | | ⩽ | \underset{̲}{D} F |, \end{matrix}

(60)

where

F

is any section to

ℳ \times g

and the absolute norm

| \cdot |

is taken with respect to the Killing inner product 13 .

We may now assume that we are dealing with an initial data set:

\begin{matrix} (\underset{̲}{F} (0), \underset{̲}{D} (0), E (0)), \end{matrix}

(61)

for the system which is such that connection

\underset{̲}{D} (0) = d + \underset{̲}{A} (0)

satisfied the elliptic div-curl system:

\begin{matrix} d \underset{̲}{A} (0) + [\underset{̲}{A} (0), \underset{̲}{A} (0)] & = \underset{̲}{F} (0), & d^{*} \underset{̲}{A} (0) = 0, \end{matrix}

(62)

\begin{matrix}  \end{matrix}

and such that the compatibility condition 5 is satisfied. Furthermore, from 19 we have the bounds:

∥ \underset{̲}{A} (0) ∥ L^{n} ≲ ɛ_{0} .

We will now use this last bound to show that the initial data set 61 is in fact in the classical Sobolev spaces

{\dot{H}}^{k}

. This is a consequence of the following:

Lemma 5.1 (Comparison principle for Sobolev norms on

R n

). Let

\underset{̲}{D} = d + \underset{̲}{A}

be a connection on

R^{n}

, with

n

even, such that one has the potential and curvature bounds:

\begin{matrix} ∥ \underset{̲}{A} ∥ L^{n}, ∥ \underset{̲}{F} ∥ {\dot{H}}_{A}^{\frac{n - 4}{2}} & ⩽ ε_{0}, \end{matrix}

(63)

\begin{matrix} ∥ \underset{̲}{F} ∥ {\dot{H}}_{A}^{k} & ⩽ M_{k}, \end{matrix}

(64)

\begin{matrix}  \end{matrix}

for

\frac{n - 4}{2} < k

. Suppose also that

\underset{̲}{D}

is in the gauge

d^{*} \underset{̲}{A} = 0

. Then we have the critical classical Sobolev bounds:

\begin{matrix} ∥ \underset{̲}{F} ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ C ε_{0}, \end{matrix}

(65)

\begin{matrix} ∥ \underset{̲}{A} ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ C ε_{0} . \end{matrix}

(66)

\begin{matrix}  \end{matrix}

Furthermore, if

G

is any

g

valued function, then we have the following inductive comparison of norms:

\begin{matrix} C^{- 1} (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H^{[k^{*}, k]} & ⩽ ∥ G ∥ H_{A}^{[k^{*}, k]}, \end{matrix}

(67)

\begin{matrix} ⩽ C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H^{[k^{*}, k]}, \end{matrix}

(68)

\begin{matrix}  \end{matrix}

where the index

k^{*}

is such that

\frac{n - 4}{2} ⩽ k^{*} < n

, and where we have set:

∥ G ∥^{2} H_{A}^{[k^{*}, k]} = \sum_{k^{*} ⩽ m ⩽ k} ∥ {\underset{̲}{D}}^{m} G ∥^{2} L^{2},

to be the interval gauge-covariant Sobolev space. We use an analogous definition for the space

H^{[k^{*}, k]}

. We also have the non-inductive equivalence between

\nabla_{x} \underset{̲}{A}

and

\underset{̲}{F}

\begin{matrix} N_{k}^{- 1} ∥ \underset{̲}{A} ∥ {\dot{H}}^{k} ⩽ ∥ \underset{̲}{F} ∥ {\dot{H}}^{k - 1} ⩽ N_{k} ∥ \underset{̲}{A} ∥ {\dot{H}}^{k}, \end{matrix}

(69)

where

N_{k}

\frac{n - 2}{2} ⩽ k

, is a set of constants which depends only on the dimension and not on the constant

ε_{0}

once it is sufficiently small. In particular, combining all of this, we have the following classical Sobolev bounds on the pair

(\underset{̲}{A}, \underset{̲}{F})

\begin{matrix} ∥ \underset{̲}{F} ∥ {\dot{H}}^{k} & ⩽ C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) M_{k}, \end{matrix}

(70)

\begin{matrix} ∥ \underset{̲}{A} ∥ {\dot{H}}^{k + 1} & ⩽ C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) M_{k} . \end{matrix}

(71)

\begin{matrix}  \end{matrix}

for

\frac{n - 4}{2} < k

Proof of Lemma 5.1 . The proof will be accomplished via a series of inductions.

In what follows, we will assume the estimate 69 , whose proof follows from simple analysis of the elliptic system 62 in Besov spaces of the kind

{\dot{B}}_{2}^{p, (2, s)}

. We will perform many reductions like this in the sequel so we leave this one to the reader.

The first step is to prove the critical classical Sobolev 65 . Note that the potential bounds 66 follow from this and 69 . The inductive hypothesis that we make here is that:

\begin{matrix} ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m} \underset{̲}{F} ∥ L^{\frac{n}{k}} ≲ ε_{0}, \end{matrix}

(72)

for

k = l + m + 2 ⩽ \frac{n}{2}

whenever

0 ⩽ l ⩽ l_{0}

. Notice that this hypothesis is verified for

l_{0} = 0

on account of the assumption 63 and by applying the Kato estimate 60 in conjunction with integer Sobolev embeddings. Notice also that by applying Riesz operator estimates to the elliptic system 62 , and using the product estimate 40 along with Sobolev embeddings we have the bounds:

\begin{matrix} ∥ \nabla_{x}^{l + 1} \underset{̲}{A} ∥ L^{\frac{n}{k}} & ≲ ∥ \nabla_{x}^{l} \underset{̲}{F} ∥ L^{\frac{n}{k}} + ∥ \nabla_{x}^{l} ([\underset{̲}{A}, \underset{̲}{A}]) ∥ L^{\frac{n}{k}}, \end{matrix}

\begin{matrix} ≲ ∥ \nabla_{x}^{l} \underset{̲}{F} ∥ L^{\frac{n}{k}} + ∥ \nabla_{x}^{l} \underset{̲}{A} ∥ L^{\frac{n}{k - 1}} \cdot ∥ \underset{̲}{A} ∥ L^{n}, \end{matrix}

\begin{matrix} ≲ ∥ \nabla_{x}^{l} \underset{̲}{F} ∥ L^{\frac{n}{k}} + ε_{0} \cdot ∥ \nabla_{x}^{l + 1} \underset{̲}{A} ∥ L^{\frac{n}{k}} . \end{matrix}

\begin{matrix}  \end{matrix}

Therefore, the inductive hypothesis 72 may be assumed to also contain the estimate:

\begin{matrix} ∥ \nabla_{x}^{l + 1} \underset{̲}{A} ∥ L^{\frac{n}{k}} ≲ ε_{0}, \end{matrix}

(73)

for

k = l + 2 ⩽ \frac{n}{2}

and

l ⩽ l_{0}

. To show that 72 holds for all

l ⩽ l_{0} + 1

, we start with

l ⩽ l_{0}

and we compute using 40 and Sobolev embeddings that:

\begin{matrix} ∥ \nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} \underset{̲}{F} ∥ L^{\frac{n}{k}}, \end{matrix}

\begin{matrix} ≲ & ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m} \underset{̲}{F} ∥ L^{\frac{n}{k}} + ∥ \nabla_{x}^{l} ([\underset{̲}{A}, {\underset{̲}{D}}^{m - 1} \underset{̲}{F}]) ∥ L^{\frac{n}{k}}, \end{matrix}

\begin{matrix} ≲ & ε_{0} + ∥ \nabla_{x}^{l} \underset{̲}{A} ∥ L^{\frac{n}{l + 1}} \cdot ∥ {\underset{̲}{D}}^{m - 1} \underset{̲}{F} ∥ L^{\frac{n}{k - l - 1}} + ∥ \underset{̲}{A} ∥ L^{n} \cdot ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m - 1} \underset{̲}{F} ∥ L^{\frac{n}{k - 1}}, \end{matrix}

\begin{matrix} ≲ & ε_{0} + ε_{0} \cdot ∥ \nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} \underset{̲}{F} ∥ L^{\frac{n}{k}} . \end{matrix}

\begin{matrix}  \end{matrix}

This inductively establishes 72 and hence proves 65 .

We now show 68 . We first deal with the leftmost inequality. Our inductive hypothesis here is that:

\begin{matrix} ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m} G ∥ L^{2} ≲ C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H_{A}^{[k^{*}, k]}, \end{matrix}

(74)

where

l + m = k_{0}

for

k_{0} = k

k_{0} = k^{*}

, and for all

l ⩽ l_{0}

. To compute

\nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} G

in terms of this, we need to split into cases depending on whether or not

l + 1 < \frac{n}{2}

In the former case we compute that:

\begin{matrix} ∥ \nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} G ∥ L^{2}, \end{matrix}

\begin{matrix} ≲ & ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m} G ∥ L^{2} + ∥ \nabla_{x}^{l} ([\underset{̲}{A}, {\underset{̲}{D}}^{m - 1} G]) ∥ L^{2}, \end{matrix}

\begin{matrix} ≲ & C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H_{A}^{[k^{*}, k]} + ∥ \nabla_{x}^{l} \underset{̲}{A} ∥ L^{\frac{n}{l + 1}} \cdot ∥ {\underset{̲}{D}}^{m - 1} G ∥ L^{\frac{2 n}{n - 2 l - 2}} \end{matrix}

(75)

\begin{matrix} + ∥ \underset{̲}{A} ∥ L^{n} \cdot ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m - 1} G ∥ L^{\frac{2 n}{n - 2}}, \end{matrix}

\begin{matrix} ≲ & C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H_{A}^{[k^{*}, k]} + ε_{0} \cdot ∥ \nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} G ∥ L^{2} . \end{matrix}

\begin{matrix}  \end{matrix}

In the case where

\frac{n}{2} - 1 ⩽ l

we have the inequality:

\begin{matrix} ∥ \nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} G ∥ L^{2}, \end{matrix}

\begin{matrix} ≲ & C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H_{A}^{[k^{*}, k]} + ∥ \nabla_{x}^{l} \underset{̲}{A} ∥ L^{\frac{2 n}{n - 2}} \cdot ∥ {\underset{̲}{D}}^{m - 1} G ∥ L^{n} \end{matrix}

(76)

\begin{matrix} + ∥ \underset{̲}{A} ∥ L^{n} \cdot ∥ \nabla_{x}^{l} {\underset{̲}{D}}^{m - 1} G ∥ L^{\frac{2 n}{n - 2}}, \end{matrix}

\begin{matrix} ≲ & C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) ∥ G ∥ H_{A}^{[k^{*}, k]} + ∥ \nabla_{x}^{l + 1} \underset{̲}{A} ∥ L^{2} \cdot ∥ {\underset{̲}{D}}^{\frac{n - 2}{2} + m - 1} G ∥ L^{2} \end{matrix}

(77)

\begin{matrix} + ε_{0} \cdot ∥ \nabla_{x}^{l + 1} {\underset{̲}{D}}^{m - 1} G ∥ L^{2} . \end{matrix}

\begin{matrix}  \end{matrix}

Notice that this last line above used the

L^{2} ↪ L^{n}

gauge covariant Sobolev embedding.

To bound the second term on this line, notice that since

\frac{n}{2} - 1 ⩽ l

and we must assume that

1 ⩽ m

for the induction to make sense, we have the bound

k^{*} ⩽ \frac{n - 2}{2} + m - 1 ⩽ k

. This allows us to bound:

∥ {\underset{̲}{D}}^{\frac{n - 2}{2} + m - 1} G ∥ L^{2} ⩽ ∥ G ∥ H_{A}^{[k^{*}, k]} .

Furthermore, by placing all of these calculations within an induction on the value of

k

itself, and using the bound 69 while noting that

l ⩽ k - 1

we may assume the bound:

∥ \nabla_{x}^{l + 1} \underset{̲}{A} ∥ L^{2} ≲ ∥ \nabla_{x}^{l} \underset{̲}{F} ∥ L^{2} ≲ C (M_{\frac{n - 4}{2}}, \dots, M_{k - 1}) .

This completes our inductive proof of 74 above.

The proof of the second inequality on line 68 follows from reasoning similar as that used to prove 74 inductively. We leave it to the reader to set up the inductive hypothesis for this case and work out the details. This completes our proof of Lemma 5.1 . □

Using Lemma 5.1 and the assumed bounds 10 – 11 , we may assume that our initial data 61 is such that:

\begin{matrix} ∥ (\underset{̲}{F} (0), E (0)) ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ {\tilde{ε}}_{0}, \end{matrix}

(78)

\begin{matrix} ∥ \underset{̲}{A} (0) ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ {\tilde{ε}}_{0}, \end{matrix}

(79)

\begin{matrix} ∥ (\underset{̲}{F} (0), E (0)) ∥ {\dot{H}}^{k} & ⩽ {\tilde{M}}_{k}, \end{matrix}

(80)

\begin{matrix} ∥ \underset{̲}{A} (0) ∥ {\dot{H}}^{k + 1} & ⩽ {\tilde{M}}_{k}, \end{matrix}

(81)

\begin{matrix}  \end{matrix}

where

\frac{n - 4}{2} < k

, and the

{\tilde{M}}_{k}

depend on the

M_{k}

in some inductive way, and we also have that

\tilde{ε_{0}} ⩽ C ɛ_{0}

for some constant

C

which depends only on the dimension.

Here

M_{k}

and

ɛ_{0}

refer to the constants introduced in the statement of Theorem 1.1 .

We now decompose the initial field strength

{E_{i} (0)}

in a way that will be consistent with the evolution of the system 3 – 4 . This will be convenient for discussing the Cauchy problem. Our first step is to define the following elliptic quantity:

\begin{matrix} Δ a_{0} = - [a_{i}, \nabla^{i} a_{0}] + [a^{i}, E_{i}] . \end{matrix}

(82)

where for convenience we have labeled

{a_{i}} = {{\underset{̲}{A}}_{i} (0)}

. We then define the auxiliary set of quantities:

\begin{matrix} {\dot{a}}_{i} = E_{i} + \nabla_{i} a_{0} - [a_{0}, a_{i}] . \end{matrix}

(83)

Notice that as an immediate consequence of the constraint equation 5 , the form of 82 , and the Coulomb condition

d^{*} a = 0

, we have the secondary Coulomb condition:

\nabla^{i} {\dot{a}}_{i} = 0 .

This will turn out to be important in a moment. Now, from the definition of the quantities 82 and 83 , the already established bounds 78 – 81 , and several rounds of Sobolev embeddings, we have the following differential bounds on the quantities

{{\dot{a}}_{i}}

\begin{matrix} ∥ \dot{a} ∥ {\dot{H}}^{\frac{n - 4}{2}} ⩽ {\tilde{ε}}_{0}, \end{matrix}

(84)

\begin{matrix} ∥ \dot{a} ∥ {\dot{H}}^{k} ⩽ {\tilde{M}}_{k}, \end{matrix}

(85)

\begin{matrix}  \end{matrix}

for

\frac{n - 4}{2} < k

(after a possible slight redefinition of the constants

{\tilde{ε}}_{0}, {\tilde{M}}_{k}

via multiplication by some fixed dimensional constant). We now define a Coulomb admissible initial data set to be a collection

(\underset{̲}{F}, {a_{i}}, {{\dot{a}}_{i}})

such that:

\begin{matrix} d a + [a, a] & = \underset{̲}{F}, & d^{*} a & = 0, & d^{*} \dot{a} & = 0 . \end{matrix}

(86)

\begin{matrix}  \end{matrix}

Notice that

\underset{̲}{F}

is uniquely determined by the

{a_{i}}

, therefore we do not need to include it in the definition of initial data. We define the Coulomb-Cauchy problem to be the task of finding a space-time connection

D = d + A

such that it satisfies the set of equations:

\begin{matrix} D^{β} F_{α β} = 0, \end{matrix}

(87a)

\begin{matrix} d A + [A, A] = F, \end{matrix}

(87b)

\begin{matrix} d^{*} \underset{̲}{A} = 0, \end{matrix}

(87c)

\begin{matrix}  \end{matrix}

and such that at time

t = 0

we have that:

\begin{matrix} \underset{̲}{A} (0) & = a, & \partial_{t} \underset{̲}{A} (0) = \dot{a} . \end{matrix}

(88)

\begin{matrix}  \end{matrix}

We remark briefly here that solving the problem 86 – 88 provides a solution to the original Yang Mills system 3 – 4 with Cauchy data 61 as long as we define the collection

{\dot{a}}

according to the equations 82 – 83 . All we need to do to prove this assertion is to show that:

F_{0 i} (0) = E_{i} .

Our proof of this follows the same bootstrapping philosophy used to show the equivalence 34 in the proof of Lemma 3.2 . The claim will follow at once from equation 83 if we can first establish that:

A_{0} (0) = a_{0},

where

a_{0}

is defined by 82 . Now, from the system of equations 87 we have that the quantity

A_{0}

is elliptically determined by the equation:

\begin{matrix} Δ_{\underset{̲}{A}} A_{0} = [A_{i}, \nabla_{t} A^{i}], \end{matrix}

(89)

where

Δ_{\underset{̲}{A}} = {\underset{̲}{D}}^{i} {\underset{̲}{D}}_{i}

is the gauge covariant Laplacean. Furthermore, by using equation 83 as the definition of

E_{i}

, and substituting this into equation 82 , we have that the quantity

a_{0}

is elliptically determined by the equation:

\begin{matrix} Δ_{a} a_{0} = [a_{i}, {\dot{a}}^{i}] . \end{matrix}

(90)

By subtracting 90 from 89 at time

t = 0

we have that:

Δ_{a} (A_{0} (0) - a_{0}) = 0 .

Uniqueness now comes from the Sobolev type estimate:

∥ B ∥ L^{n} ≲ ∥ Δ_{a} B ∥ L^{\frac{n}{3}},

which follows from the smallness condition 79 and the usual Sobolev estimates.

The details of the proof are left to the reader.

Keeping the equivalence we have just established in mind, and the first inequality contained in the comparison estimates 68 and 69 , we have reduced the demonstration of Theorem 1.1 to showing the following non-gauge covariant global regularity theorem:

Theorem 5.2 (Global regularity in the Coulomb gauge). Let the number of spatial dimensions be

6 ⩽ n

. Then there exists a set of constants

{\tilde{ε}}_{0}

and

C, C_{k}

\frac{n - 2}{2} ⩽ k

such that if

(\underset{̲}{F}, {a_{i}}, {{\dot{a}}_{i}})

is a Coulomb admissible initial data set such that is satisfies the bounds:

\begin{matrix} ∥ \underset{̲}{F} ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ {\tilde{ε}}_{0}, & ∥ \underset{̲}{F} ∥ {\dot{H}}^{k} & ⩽ {\tilde{M}}_{k}, \end{matrix}

(91a)

\begin{matrix} ∥ a ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ {\tilde{ε}}_{0}, & ∥ \dot{a} ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ {\tilde{ε}}_{0}, \end{matrix}

(91b)

\begin{matrix} ∥ a ∥ {\dot{H}}^{k} & ⩽ {\tilde{M}}_{k - 1}, & ∥ \dot{a} ∥ {\dot{H}}^{k - 1} & ⩽ {\tilde{M}}_{k - 1}, \end{matrix}

(91c)

\begin{matrix}  \end{matrix}

then if

{\tilde{ε}}_{0}

is sufficiently small there exists a unique global solution

{A_{α}}

to the system 87 with this initial data. Furthermore, this solution obeys the following differential estimates:

\begin{matrix} ∥ A ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ C {\tilde{ε}}_{0}, & ∥ \partial_{t} A ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ C {\tilde{ε}}_{0}, \end{matrix}

(92a)

\begin{matrix} ∥ A ∥ {\dot{H}}^{k} & ⩽ C_{k - 1} {\tilde{M}}_{k - 1}, & ∥ \partial_{t} A ∥ {\dot{H}}^{k - 1} & ⩽ C_{k - 1} {\tilde{M}}_{k - 1}, \end{matrix}

(92b)

\begin{matrix}  \end{matrix}

5.2 Local existence in the Coulomb gauge

Our goal here is to reduce the proof of Theorem 5.2 to a certain a-priori estimate involving the energies of the field strength

F

. This amounts to proving a local existence theorem for the system 86 – 88 . The proof of this will allow us to set up a system of equations for the coulomb potentials

{A_{α}}

which will be of central importance in the sequel. We will show that:

Proposition 5.3 (Local existence in the Coulomb gauge). Let the number of spatial dimensions be

6 ⩽ n

. Then for every set of constants

C, C_{k}

\frac{n - 2}{2} ⩽ k

, there exists an

{\tilde{ε}}_{0}

which only depends on

C

with the following property: If

({a_{i}}, {{\dot{a}}_{i}})

is any set of Coulomb admissible initial data such that:

\begin{matrix} ∥ a ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ C {\tilde{ε}}_{0}, & ∥ \dot{a} ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ C {\tilde{ε}}_{0}, \end{matrix}

(93)

\begin{matrix} ∥ a ∥ {\dot{H}}^{k} & ⩽ C_{k - 1} {\tilde{M}}_{k - 1}, & ∥ \dot{a} ∥ {\dot{H}}^{k - 1} & ⩽ C_{k - 1} {\tilde{M}}_{k - 1}, \end{matrix}

(94)

\begin{matrix}  \end{matrix}

then for

{\tilde{ε}}_{0}

sufficiently small there exists a time

0 < T^{*}

, which only depends on the quantities

C {\tilde{ε}}_{0}, C_{\frac{n}{2}} {\tilde{M}}_{\frac{n}{2}}, C_{\frac{n + 2}{2}} {\tilde{M}}_{\frac{n + 2}{2}}

such that there exists a unique local solution

{A_{α}}

to the system 86 – 88 with this set of initial data. Furthermore, on the time interval

[0, T^{*}]

one has the following norm bounds on the collection

{A_{α}}

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ A (t) ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ 2 C {\tilde{ε}}_{0}, \end{matrix}

(95)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} A (t) ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ 2 C {\tilde{ε}}_{0}, \end{matrix}

(96)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ A (t) ∥ {\dot{H}}^{k} & ⩽ 2 C_{k - 1} {\tilde{M}}_{k - 1}, \end{matrix}

(97)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} A (t) ∥ {\dot{H}}^{k - 1} & ⩽ 2 C_{k - 1} {\tilde{M}}_{k - 1} . \end{matrix}

(98)

\begin{matrix}  \end{matrix}

Proof of Proposition 5.3 . The proof will be reduced to the standard procedure of energy estimates and Sobolev embeddings. Since we are assuming that the initial data has enough smoothness to cover

L^{\infty}

, this is more or less trivial. We start by plugging 87b directly into 87a . After an application of the gauge condition

d^{*} \underset{̲}{A} = 0

this yields a general second order system of equations which we write as:

\begin{matrix} □ A_{β} = - \partial_{β} \partial_{t} A_{0} + [\partial_{t} A_{0}, A_{β}] - [A_{α}, \partial^{α} A_{β}] - [A^{α}, F_{α β}] . \end{matrix}

(99)

To split this into a hyperbolic-elliptic system, we decompose the set of equations 99 into its spatial and temporal parts, and apply the Leray projection:

P = - \frac{d^{*} d}{Δ} = (I - \nabla_{x} \frac{(d i v)}{Δ}),

to the resulting spatial equation. After some rearrangement of the elliptic equation this yields the coupled system:

\begin{matrix} □ A_{i} & = P ([\partial_{t} A_{0}, A_{i}] - [A_{α}, \partial^{α} A_{i}] - [A^{α}, F_{α i}]), \end{matrix}

(100a)

\begin{matrix} Δ A_{0} & = - [A_{i}, \partial^{i} A_{0}] + [A^{i}, F_{0 i}] . \end{matrix}

(100b)

\begin{matrix}  \end{matrix}

The above system of equations can be solved locally in time with the bounds 95 – 95 through a Picard iteration scheme. We leave this as an exercise for the reader. Notice that the projection

P

can be removed in energy estimates because it is an order zero operator. Notice also that even though the smallness of the time interval

[0, T^{*}]

will not make up for estimates involving the elliptic equation 100b , the critical smallness assumption 93 allows one to obtain the bootstrapping estimates 95 – 95 if one uses Littlewood-Paley decompositions and paraproducts to make sure at least one factor in the non-linearity on the right hand side of 100b goes in a critical space. This same comment goes for bounding terms on the right hand side of 100a in energy estimates when one is bootstrapping the higher norm constants

C_{k} {\tilde{M}}_{k}

for

\frac{n + 2}{2} < k

. Again, the smallness in time makes up for the size of the first few constants

C {\tilde{ε}}_{0}, C_{\frac{n}{2}} {\tilde{M}}_{\frac{n}{2}}, C_{\frac{n + 2}{2}} {\tilde{M}}_{\frac{n + 2}{2}}

Having now produced a local solution to the system 100 with the desired properties, we have shown the conclusion of Proposition 5.3 once we show that the spatial potentials which solve 100a are in fact solutions to the spatial portion of the original second order equation 99 . This will be shown through our general strategy of coming up with a quantity which yields a critical elliptic bootstrapping estimate which will force it to be zero. This time, the desired quantity turns out to be related to the conservation of electric charge for the Yang-Mills equations. We first write the spatial portion of the non-linearity on the right hand side of 99 as a vector:

\begin{matrix} N_{i} = - \partial_{i} \partial_{t} A_{0} + [\partial_{t} A_{0}, A_{i}] - [A_{α}, \partial^{α} A_{i}] - [A^{α}, F_{α i}] . \end{matrix}

(101)

We would like to show that the equations 100 force

(I - P) N = 0

. We compute that:

(I - P) N = \nabla_{x} Δ^{- 1} (- \partial_{t} Δ A_{0} - \partial^{i} \partial^{α} [A_{α}, A_{i}] - \partial^{i} [A^{α}, F_{α i}]) .

Now, using the equation 100 to compute

\partial_{t} Δ A_{0}

, this last line becomes:

\begin{matrix} (I - P) N & = \nabla_{x} Δ^{- 1} (- \partial^{β} \partial^{α} [A_{α}, A_{β}] - \partial^{β} [A^{α}, F_{α β}]), \end{matrix}

\begin{matrix} = - \nabla_{x} Δ^{- 1} \partial^{β} [A^{α}, F_{α β}] . \end{matrix}

\begin{matrix}  \end{matrix}

where the equality of the second line follows on account of skew symmetry. We now isolate the interesting portion of the term on the right hand side of the last line above and use the Jacobi identity to compute that:

\begin{matrix} \partial^{β} [A^{α}, F_{α β}] & = \frac{1}{2} [(d A)^{α β}, F_{α β}] + [A^{α}, \partial^{β} F_{α β}], \end{matrix}

\begin{matrix} = \frac{1}{2} [[A^{α}, A^{β}], F_{α β}] - [A^{α}, [A^{β}, F_{α β}]] + [A^{α}, D^{β} F_{α β}], \end{matrix}

\begin{matrix} = [A^{α}, D^{β} F_{α β}] . \end{matrix}

\begin{matrix}  \end{matrix}

Now, again using equation 100b we have that

D^{β} F_{0 β} = 0

. Furthermore, from equation 100a we also have the identity:

D^{β} F_{i β} = - (I - P)_{i} N .

Combining all of this, we have the following equality:

\begin{matrix} (I - P) N = \nabla_{x} Δ^{- 1} [A^{i}, (I - P)_{i} N] . \end{matrix}

(102)

Finally, from the form of 101 and the already established estimates 95 – 98 as well as the boundedness properties of the operator

(1 - P)

we have that:

∥ (I - P) N (t) ∥ L^{\frac{n}{3}} < \infty,

for all times

t \in [0, T^{*}]

. However, from the smallness bound 95 , the identity 102 , and a Sobolev embedding we also have the fixed time bound:

\begin{matrix} ∥ (I - P) N ∥ L^{\frac{n}{3}} & ≲ ∥ [A^{i}, (1 - P)_{i} N] ∥ L^{\frac{n}{4}}, \end{matrix}

\begin{matrix} ⩽ ∥ A ∥ L^{n} \cdot ∥ (I - P) N ∥ L^{\frac{n}{3}}, \end{matrix}

\begin{matrix} ≲ {\tilde{ε}}_{0} \cdot ∥ (I - P) N ∥ L^{\frac{n}{3}} . \end{matrix}

\begin{matrix}  \end{matrix}

Therefore, for

{\tilde{ε}}_{0}

sufficiently small we see that we must have

(I - P) N = 0

as was to be shown. This completes the proof that the solution to 100 is a solution to the general system 99 , and therefore ends our proof of Proposition 5.3 . □

5.3 The second order curvature equation and the main a-priori estimate

Through a repeated application of the local existence theorem 5.3 , we may reduce the proof of the global existence theorem 5.2 to showing a-priori that any solution to the Coulomb system 86 – 88 which exists on a time interval

[0, T^{*}]

(possibly large!), and such that it obeys the both the initial data bounds 91a – 91c , as well as the evolution bounds 95 – 98 , in fact obeys the improved evolution bounds 92a – 92b .

Now, it turns out that the system of equations 100 is by itself not so well adapted⁴ to the proof of such an a-priori estimate. This stems from the fact that these equations are not covariant. This manifests itself in the projection operator

P

. If one were to try to write the hyperbolic system of equations 100a in terms of covariant wave operator

□_{A}

and a source term, the projection operator which is non-local would end up causing problems in various commutator terms. The way around this is to not only consider the system 100 , but to also work directly with the curvature in the equations 87a – 87b . This is possible because we are not attempting to set up an iteration scheme, but are instead merely trying to prove an a-priori estimate, so we may safely assume that the quantities we work with satisfy any equation which results from the system 87 . We will in fact use several such elliptic and hyperbolic equations. As a very rough description of this kind of philosophy, the reader may find it useful to keep in mind the following schematic:

\begin{matrix} W e a k c o n t r o l o f t h e c o n n e c t i o n & ⟹ I m p r o v e d c o n t r o l o f t h e c u r v a t u r e, \end{matrix}

\begin{matrix} ⟹ I m p r o v e d c o n t r o l o f t h e c o n n e c t i o n, \end{matrix}

\begin{matrix} ⟹ W e a k c o n t r o l o f t h e c o n n e c t i o n f o r l o n g e r t i m e s . \end{matrix}

\begin{matrix}  \end{matrix}

To provide the improved control on the curvature, we will employ a second order equation for it. To derive this, we write the Bianchi identities 87b in the form 4 and then contract this expression with the covariant derivative

D

. This yields the equations:

\begin{matrix} 0 & = D^{γ} (D_{α} F_{β γ} + D_{γ} F_{α β} + D_{β} F_{γ α}), \end{matrix}

\begin{matrix} = □_{A} F_{α β} + [F_{α}^{γ}, F_{β γ}] + [F_{β}^{γ}, F_{γ α}], \end{matrix}

\begin{matrix} = □_{A} F_{α β} - 2 [F_{α γ}, F_{β}^{γ}] . \end{matrix}

(103)

\begin{matrix}  \end{matrix}

In addition to 103 and the system 100 , it will also be useful for us to employ a secondary elliptic equation. This will be for the quantity

\partial_{t} A_{0}

\begin{matrix} \partial_{t} A_{0} = Δ^{- 1} \partial^{i} (- [A_{i}, \partial_{t} A_{0}] + [A_{0}, \partial_{t} A_{i}] + [A^{α}, F_{i α}]) . \end{matrix}

(104)

This equation follows immediately from differentiating the equation 100b with respect to time, and then applying the conservation law

\nabla^{α} [A^{β}, F_{α β}] = 0

to the resulting expression. We are now ready to state our main a-priori estimate:

Theorem 5.4 (Main a-priori estimate for the curvature of the Coulomb system 86 – 88 ). Let the space-time connection

D = d + A

R^{(n + 1)}

, where

6 ⩽ n

, be given such that it satisfies the following system of equations on some finite time interval

[0, T^{*}]

\begin{matrix} □_{A} F_{α β} & = 2 [F_{α γ}, F_{β}^{γ}], \end{matrix}

(105a)

\begin{matrix} d A + [A, A] & = F, \end{matrix}

(105b)

\begin{matrix} d^{*} \underset{̲}{A} & = 0, \end{matrix}

(105c)

\begin{matrix} □ A_{i} & = P ([\partial_{t} A_{0}, A_{i}] - [A_{α}, \partial^{α} A_{i}] - [A^{α}, F_{α i}]), \end{matrix}

(105d)

\begin{matrix} Δ A_{0} & = \partial^{i} [A_{0}, A_{i}] + [A^{i}, F_{0 i}], \end{matrix}

(105e)

\begin{matrix} Δ (\partial_{t} A_{0}) & = \partial^{i} (- [A_{i}, (\partial_{t} A_{0})] + [A_{0}, \partial_{t} A_{i}] + [A^{α}, F_{i α}]) . \end{matrix}

(105f )

\begin{matrix}  \end{matrix}

Here we have split

{A_{α}} = (A_{0}, {{\underset{̲}{A}}_{i}})

. Let there also be given a set of fixed constants

L, N, L_{k}, N_{k}

for the indices

\frac{n - 2}{2} ⩽ k

, such that at time

t = 0

we have the initial bounds:

\begin{matrix} ∥ F (0) ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ {\tilde{ε}}_{0}, & ∥ \partial_{t} F (0) ∥ {\dot{H}}^{\frac{n - 6}{2}} & ⩽ L {\tilde{ε}}_{0}, \end{matrix}

(106)

\begin{matrix} ∥ F (0) ∥ {\dot{H}}^{k} & ⩽ {\tilde{M}}_{k}, & ∥ \partial_{t} F (0) ∥ {\dot{H}}^{k - 1} & ⩽ L_{k} {\tilde{M}}_{k} . \end{matrix}

(107)

\begin{matrix}  \end{matrix}

Then if

\tilde{ε_{0}}

is chosen as to be sufficiently small on line 106 above, there exists a collection constants

C, C_{k}

, which only depend on the dimension and the collection

L, N, L_{k}, N_{k}

but not on

{\tilde{ε}}_{0}

(once it is small enough) or the collection

{\tilde{M}}_{k}

, such that if at later times we have the bounds:

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \underset{̲}{A} (t) ∥ {\dot{H}}^{\frac{n - 2}{2}} & ⩽ 2 N C {\tilde{ε}}_{0}, & {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} \underset{̲}{A} (t) ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ 2 N C {\tilde{ε}}_{0}, \end{matrix}

(108)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ F (t) ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ 2 N C {\tilde{ε}}_{0}, & {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} F (t) ∥ {\dot{H}}^{\frac{n - 6}{2}} & ⩽ 2 N C {\tilde{ε}}_{0}, \end{matrix}

(109)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \underset{̲}{A} (t) ∥ {\dot{H}}^{k} & < \infty, & {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} \underset{̲}{A} (t) ∥ {\dot{H}}^{k - 1} & < \infty, \end{matrix}

(110)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ F (t) ∥ {\dot{H}}^{k} & < \infty, & {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} F (t) ∥ {\dot{H}}^{k - 1} & < \infty, \end{matrix}

(111)

\begin{matrix}  \end{matrix}

the following set of stronger bounds holds:

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ F (t) ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ N^{- 1} C {\tilde{ε}}_{0}, & {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} F (t) ∥ {\dot{H}}^{\frac{n - 6}{2}} & ⩽ N^{- 1} C {\tilde{ε}}_{0}, \end{matrix}

(112)

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ F (t) ∥ {\dot{H}}^{k} & ⩽ N_{k}^{- 1} C_{k} {\tilde{M}}_{k}, & {sup}_{0 ⩽ t ⩽ T^{*}} ∥ \partial_{t} F (t) ∥ {\dot{H}}^{k - 1} & ⩽ N_{k}^{- 1} C_{k} {\tilde{M}}_{k} . \end{matrix}

(113)

\begin{matrix}  \end{matrix}

Remark 5.5. The bounds involving 111 and 113 express the fact that the control we provide here is at the critical level. That is, bounds on the higher norms are completely irrelevant in the bootstrapping procedure, except for the fact that they are finite. The only place where we need higher norms to accomplish anything here is in the local existence theorem 5.3 . The way we will prove Theorem 5.4 is by first establishing control at the critical level through a bootstrapping argument. The control of the higher norms will then be provided through an a-priori estimate who's proof is essentially identical to that of the critical bootstrapping bound, and will therefore be left to the reader.

Remark 5.6. The reader my find it useful to have a brief description of the various constants appearing in Proposition 5.3 and Theorem 5.4 . The constants

L, L_{k}, N, N_{k}

are input into the a-priori machine, and these are meant to cover the transition to and from estimates involving the connection and curvature. The set

L, L_{k}

is only needed to deal with the initial data. This is necessary because we must have an account of bounds involving the quantities

\partial_{t} F

. The other constants

N, N_{k}

govern comparison type estimates similar to 69 . The constants

C, C_{k}

are byproducts of the proof of the a-priori estimate itself. These will very much depend on the

L, L_{k}, N, N_{k}

, but are independent of

{\tilde{ε}}_{0}

when it is small enough. Finally, the main adjusting parameter

{\tilde{ε}}_{0}

has two important roles. First and foremost, it is needed to prove the a-priori estimate itself. However, it has a second purpose which is also crucial, and that is to keep the dependence of

C, C_{k}

L, L_{k}, N, N_{k}

from creating a feedback loop. Specifically, we need our various comparison estimates to have constants which do not depend on the large constants

C, C_{k}

. Since the critical energy of the curvature can grow by a factor of

C

, we will need the extra influence of

{\tilde{ε}}_{0}

to make sure this does not cycle back to

L, L_{k}, N, N_{k}

Proof that Theorem 5.4 and Proposition 5.3 together imply Theorem 5.2 . The proof here is more or less straightforward and will be largely left to the reader. Everything relies on two sets of estimates. The first has to do with showing that the initial data bounds 91a – 91c imply the initial control assumed in 106 – 107 . This is just a matter of bounding the time derivatives

\partial_{t} F

, and is why we have included the set of auxiliary constants

L, L_{k}

. Using now the field equations 3 – 4 (we have not included them in the system 105 , but we may assume they hold), we have the general schematic identity at time

t = 0

\begin{matrix} \partial_{t} F (0) = \nabla_{x} F (0) + [a, F (0)], \end{matrix}

(114)

where we have generically set

a = (a_{0}, {a_{i}})

. Therefore, to establish the control 106 – 107 , we only need to prove the estimates:

\begin{matrix} ∥ [a, F (0)] ∥ {\dot{H}}^{\frac{n - 6}{2}} & ≲ {\tilde{ε}}_{0}, & ∥ [a, F (0)] ∥ {\dot{H}}^{k - 1} & ≲ {\tilde{M}}_{k}, \end{matrix}

(115)

\begin{matrix}  \end{matrix}

assuming that the bounds 91a – 91c hold. Notice that while these initial norms do not contain estimates on the quantities

E_{i} = F_{0 i} (0)

, we originally had bounds on this from lines 78 – 81 above. Also, any estimates on

a_{0}

which are needed in this process can be provided, for instance, through the equation 82 . Since the proof of estimate 115 is a straightforward paraproduct type bound, similar to what was done in the proof of Lemma 5.1 above, we leave it to the interested reader (see below for more details).

The second set of estimates we need to prove here has to do with the relationship between the later time norms 108 – 113 and the ones 95 – 98 contained in the proof of the local existence proposition. Since our global regularity proof is by iteration of this latter result, we need to first show that the weak control 95 – 98 implies the bootstrapping assumption 108 – 111 . This assertion is trivial for norms involving the potentials 108 and 110 , as well as the larger norms 111 just by applying the definition of curvature. Therefore, we only need to see that 95 – 96 implies the bounds 109 . We first establish the desired bounds for the undifferentiated term

F

For the spatial curvature and potentials

(\underset{̲}{F}, \underset{̲}{A})

, this is just the comparison principle form line 69 , and we can assume that the constants

N, N_{k}

are large enough to cover that case. To deal with potentials involving time derivatives of

\underset{̲}{A}

or the temporal potential

A_{0}

we have the following general calculation:

\begin{matrix} ∥ F ∥ {\dot{H}}^{\frac{n - 4}{2}} & ⩽ ∥ d A ∥ {\dot{H}}^{\frac{n - 4}{2}} + ∥ [A, A] ∥ {\dot{H}}^{\frac{n - 4}{2}}, \end{matrix}

\begin{matrix} ≲ ∥ A ∥ {\dot{H}}^{\frac{n - 2}{2}} + ∥ A ∥^{2} {\dot{H}}^{\frac{n - 2}{2}}, \end{matrix}

\begin{matrix}  \end{matrix}

where the quadratic term follows from paraproduct decompositions, Hölders inequality, and Sobolev embeddings as in the proof of 69 . The desired result now follows from the smallness of

{\tilde{ε}}_{0}

and the fact that we may assume the constant

C

in line 95 does not depend on it. To establish the estimate for the quantity

\partial_{t} F

, we use the later time version of the identity 114 , as well as the estimate which is responsible for the first estimate on line 115 above, which is:

∥ [A, F] ∥ {\dot{H}}^{\frac{n - 6}{2}} ≲ ∥ A ∥ {\dot{H}}^{\frac{n - 2}{2}} \cdot ∥ F ∥ {\dot{H}}^{\frac{n - 4}{2}} .

By again assume that the constant

{\tilde{ε}}_{0}

is sufficiently small with respect to

C

we have the desired bound.

The final thing we need to do here is to show that the improved bounds 112 – 113 imply the assumed estimates of the local existence theorem 93 – 94 . This is again a comparison estimate either identical or similar to 69 . Note that we only need to bound the spatial portion of the potentials

{A_{α}}

and their time derivatives. The undifferentiated terms can be bounded directly by 69 because we may assume that the constant

{\tilde{ε}}_{0}

on line 112 is small enough that the critical estimate 63 holds.

To deal with the time differentiated potentials

\partial_{t} \underset{̲}{A}

, one can simply differentiate the Hodge system 105b – 105c with respect to time and then apply essentially the same proof as was used to produce 69 . The details of this are left to the ambitious reader. □

⁴ Strictly speaking, this is not entirely true. This can be seen from the fact that if one looks at the localized commutator $[□_{A}, P] P_{λ}$ , where the connection ${A_{α}}$ is assumed to be of much lower frequency than $λ$ , then this is essentially a “derivative falls on low” interaction which can be handled with the available Strichartz estimates in $5 ⩽ n$ dimensions. We have elected instead to follow a formulation of the YM system which is based on the curvature because of its conceptual appeal. However, in lower dimensions, it may be best to work directly with the connection ${A_{α}}$ , in part to help mitigate bad $H i g h \times H i g h \Rightarrow L o w$ frequency interactions which come from the quadratic term on the right hand side of 103 .

6 Proof of the Main Bootstrapping Estimate

We are now ready to begin our proof of the (improved) main critical a-priori estimate 112 . In order to do this, we will need to bootstrap in a function space which is much stronger than the energy type spaces of Theorem 5.4 . This will cost us another bootstrapping procedure, but this will be easy to set up because it will be clear the extra norms we create have good bounds on some very small initial time interval due to the fact that we are assuming the higher energy boundedness 111 and that these norms involve integration in time. All of the norms we construct here will be of Strichartz type, with an

ℓ^{2}

Besov structure in the spatial variable.

It will also be necessary for us to include an angular square sum structure in many of the estimates we prove. This may seem a bit odd at first because we will not need such bounds directly in our proof of Theorem 5.4 . These extra bounds will instead be used to give the fine control which is needed to handle the linear part of the problem. At each fixed frequency, we form the square-sum norms:

\begin{matrix} ∥ P_{λ} A ∥ S L^{P} = {sup}_{θ ≲ 1} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥^{ω_{0}} Π_{θ} P_{λ} A ∥^{2} L^{p})}^{\frac{1}{2}}, \end{matrix}

(116)

where

Γ_{φ}

is taken to be a (uniformly) finitely overlapping set of spherical caps such that

S^{n - 1} = \cup_{φ} Γ_{φ}

, each of which has size

\sim θ

and constructed such a way that one has the bounds:

{(\sum_{φ : ω_{0} \in Γ_{φ}} ∥^{ω_{0}} Π_{θ} P_{λ} A ∥ L^{2})}^{\frac{1}{2}} ≲ ∥ P_{λ} A ∥ L^{2},

independent of the size of

θ

. Here we take the condition

ω_{0} \in Γ_{φ}

to mean that the variable

ω_{0}

is essentially in the center of that spherical cap

Γ_{φ}

. The exact placement is not essential. Notice that by construction, these norms are contained in the usual

L^{p}

spaces because we can assume that one set of angular sectors we are summing over contains the whole sphere.

Next, using the same prescription that defined the Besov spaces 41 , we define the angular square sum Besov spaces to be:

\begin{matrix} ∥ A ∥ S {\dot{B}}_{2}^{p, (q, s)} = {(\sum_{λ} λ^{2 s - 2 n (\frac{1}{q} - \frac{1}{p})} ∥ P_{λ} A ∥^{2} S L^{p})}^{\frac{1}{2}} . \end{matrix}

(117)

We now define the main dispersive component of the function spaces we will be working with. These are

L_{t}^{2}

based Strichartz spaces, built on the norms 117 and 41 . These are all defined on a finite time interval

[0, T^{*}]

, which will for the most part be left implicit:

\begin{matrix} ∥ A ∥ {\dot{Z}}^{s} & = ∥ A ∥ L_{t}^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, s + \frac{1}{2})}) [0, T^{*}], \end{matrix}

(118)

\begin{matrix} ∥ A ∥ S {\dot{Z}}^{s} & = ∥ A ∥ L_{t}^{2} (S {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, s + \frac{1}{2})}) [0, T^{*}] . \end{matrix}

(119)

\begin{matrix}  \end{matrix}

To gain some intuition about these spaces, notice that they all scale like

L^{\infty} ({\dot{H}}^{s})

under the change of variables 7 . Therefore, they all scale like solutions to the wave equations with

{\dot{H}}^{s}

initial data. Indeed, these spaces are consistent with the available range of Strichartz estimates for the usual scalar wave equation, and it will be our goal to show that one has bounds on the norm 119 for solutions of the covariant wave operator on the left hand side of 105 .

To form the overall spaces we will bootstrap in, we add the above space-time norms to the energy type norms used in the statement of the main a-priori estimate 5.4 :

\begin{matrix} {\dot{X}}^{s} & = L^{\infty} [0, T^{*}] ({\dot{H}}^{s}) \cap S {\dot{Z}}^{s}, \end{matrix}

(120)

\begin{matrix} {\dot{Y}}^{s} & = L^{\infty} [0, T^{*}] ({\dot{H}}^{s}) \cap {\dot{Z}}^{s} . \end{matrix}

(121)

\begin{matrix}  \end{matrix}

It will also be necessary for us to estimate time derivatives in the above spaces.

Since differentiation will decrease the scaling by one unit, we use the norms:

∥ A ∥ {\dot{X}}^{s} \times \partial_{t}^{- 1} ({\dot{X}}^{s - 1}) = ∥ A ∥ {\dot{X}}^{s} + ∥ \partial_{t} A ∥ {\dot{X}}^{s - 1},

with an analogous definition for

{\dot{Y}}^{s} \times \partial_{t}^{- 1} ({\dot{Y}}^{s - 1})

6.1 Proof of the Critical Bootstrapping Estimate

We are now ready to prove the critical component of Theorem 5.4 (we will now change notation from

{\tilde{ε}}_{0}

back to

ε_{0}

Proposition 6.1 (Critical bootstrapping estimate in the

{\dot{X}}^{s}

spaces). Let the dimension be

6 ⩽ n

. Let the collection

(F, A)

be a space-time connection curvature pair which obeys the general smoothness conditions 110 – 111 , and which satisfies the system of equations 105 . Let

L, N

be given constants such that one has the initial bounds:

\begin{matrix} ∥ F (0) ∥ {\dot{H}}^{\frac{n - 4}{2}} + ∥ \partial_{t} F (0) ∥ {\dot{H}}^{\frac{n - 6}{2}} ⩽ L ε_{0} . \end{matrix}

(122)

Then there exists a constant

C

which depends only on

L, N

and the dimension such that if one has the bootstrapping bounds on a time interval

[0, T^{*}]

\begin{matrix} {sup}_{0 ⩽ t ⩽ T^{*}} ∥ (\underset{̲}{A}, \partial_{t} \underset{̲}{A}) (t) ∥ {\dot{H}}^{\frac{n - 2}{2}} \times {\dot{H}}^{\frac{n - 4}{2}} & ⩽ 2 N C ε_{0}, \end{matrix}

(123)

\begin{matrix} ∥ F ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) & ⩽ 2 N C ε_{0}, \end{matrix}

(124)

\begin{matrix}  \end{matrix}

then for

ε_{0}

sufficiently small, we have that the following improved bounds on the same time interval

[0, T^{*}]

\begin{matrix} ∥ F ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ⩽ N^{- 1} C ε_{0} . \end{matrix}

(125)

The proof of Proposition 6.1 will be accomplished through the standard use of Littlewood-Paley paraproduct decompositions, and the application of space-time estimates. All of the linear bounds we will need are provided by the following, which is the main technical result of this work:

Theorem 6.2 (Gauge covariant angular square-sum Strichartz estimates for Yang-Mills connections). Let the number of dimensions be such that

6 ⩽ n

, and let

d + \tilde{\underset{̲}{A}}

be a space-time connection defined defined on all of Minkowski space

ℳ^{n + 1}

such that it satisfies the conditions:

\begin{matrix} {\tilde{\underset{̲}{A}}}_{0} & = 0 & (T e m p o r a l G a u g e), \end{matrix}

(126a)

\begin{matrix} d^{*} \tilde{\underset{̲}{A}} & = 0 & (C o u l o m b G a u g e), \end{matrix}

(126b)

\begin{matrix} P_{| ξ | ≪ | τ |} (\tilde{\underset{̲}{A}}) & = 0 & (S p a c e - t i m e f r e q u e n c y l o c a l i z a t i o n), \end{matrix}

(126c)

\begin{matrix} ∥ \tilde{\underset{̲}{A}} ∥ {\dot{X}}^{\frac{n - 2}{2}} & ⩽ ℰ & (S p a c e - t i m e e s t i m a t e), \end{matrix}

(126d)

\begin{matrix} □ \tilde{\underset{̲}{A}} & = \tilde{P} ([B, H]) & (S t r u c t u r e e q u a t i o n), \end{matrix}

(126e)

\begin{matrix} ∥ (B, H) ∥ {\dot{Y}}^{\frac{n - 2}{2}} \times {\dot{Y}}^{\frac{n - 4}{2}} & ⩽ ℰ & (S t r u c t u r e e s t i m a t e s), \end{matrix}

(126f )

\begin{matrix}  \end{matrix}

where

(B, H)

is an auxiliary set of

g

valued functions defined on all of

ℳ^{n + 1}

. The symbol

\tilde{P}

denotes a composition of the Leray projection

P

with some frequency cutoff function which is bounded on all mixed Lebesgue-Besov spaces of the type

L^{p} ({\dot{B}}_{2}^{p, (2, s)})

. We assume also that the connection

d + \tilde{\underset{̲}{A}}

satisfies the general smoothness bounds:

\begin{matrix} {sup}_{- T^{*} ⩽ t ⩽ T^{*}} ∥ \tilde{\underset{̲}{A}} (t) ∥ {\dot{H}}^{k} & < \infty, & \frac{n - 2}{2} < k, \end{matrix}

(127)

\begin{matrix}  \end{matrix}

for each fixed time

T^{*}

. Let now

F

be any other

g

valued function which satisfies the inhomogeneous equation:

\begin{matrix} □_{\tilde{\underset{̲}{A}}} F = G, \end{matrix}

(128)

with Cauchy data:

\begin{matrix} F (0) & = f, & \partial_{t} F (0) = \dot{f} . \end{matrix}

(129)

\begin{matrix}  \end{matrix}

Then if the constant

ℰ

in lines 126d and 126f above is sufficiently small, one has the following family of space-time estimates:

\begin{matrix} ∥ F ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ≲ ∥ (f, \dot{f}) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}} + ∥ G ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) . \end{matrix}

(130)

Remark 6.3. In the above Theorem, the Strichartz estimates have a preferred scaling. This is consistent with the application we have in mind. In general, it is not possible to prove estimates of the type 130 for higher Sobolev indices without assuming that the connection

\tilde{\underset{̲}{A}}

itself has more regularity. In the case where

\tilde{\underset{̲}{A}}

does have better regularity, a proof similar to that given after Proposition 7.1 below can be used to show estimates for those higher norms.

Proof of Proposition 6.1 . The proof requires another bootstrapping argument. This will be done on subintervals

[0, T^{* *}] \subseteq [0, T^{*}]

. Using the initial bounds 122 and the general smoothness assumption 111 we may assume that for

T^{* *} ≪ 1

we have the estimate 124 . Therefore, it suffices to prove that 124 implies 125 on all subintervals

[0, T^{* *}]

. But this is just the same as proving Proposition 6.1 itself since

T^{*}

is arbitrary.

The proof will be accomplished in a series of steps. Our first goal will be to derive

{\dot{X}}^{s}

and

{\dot{Z}}^{s}

type bounds for the connection

d + A

. We will then split this connection into a sum of two pieces

d + \tilde{A} + \tilde{\tilde{A}}

, where the potentials

\tilde{A}

satisfy the criteria of Theorem 6.2 and the remainder term

\tilde{\tilde{A}}

obeys the better

L^{1} (L^{\infty})

space-time estimate. This is enough to be able to write the equation 105a schematically as:

\begin{matrix} □_{\tilde{A}} F = [\nabla \tilde{\tilde{A}}, F] + [\tilde{\tilde{A}}, \nabla F] + [\tilde{A}, [\tilde{\tilde{A}}, F]] + [\tilde{\tilde{A}}, [\tilde{\tilde{A}}, F]] + [F, F] . \end{matrix}

(131)

One is then in a position where Theorem 6.2 can be applied directly, and we only need to choose our constant

C

depending on

L, N

and the constant which appears on line 130 . The key thing is that the dangerous term

[\tilde{\tilde{A}}, \nabla F]

can safely be put in

L^{1} ({\dot{H}}^{\frac{n - 6}{2}})

using the improved space-time estimate for

\tilde{\tilde{A}}

and the energy estimate for

F

. Throughout the proof we will use the usual splitting

{A_{α}} = (A_{0}, \underset{̲}{A})

d + A

into its temporal and spatial components.

∙

{\dot{X}}^{\frac{n - 2}{2}}

estimates for

{{\underset{̲}{A}}_{i}}

Here we write

\underset{̲}{F}

for the spatial components of the field strength and use the Hodge system 105b – 105c to write schematically:

\begin{matrix} \underset{̲}{A} = \nabla_{x} Δ^{- 1} (- \underset{̲}{F} + [\underset{̲}{A}, \underset{̲}{A}]) . \end{matrix}

(132)

As a preliminary first step, we will show that the potentials

{{\underset{̲}{A}}_{i}}

can be estimated in

{\dot{Y}}^{\frac{n - 2}{2}}

with bounds comparable to

N C ε_{0}

. Now, it is not too difficult to see directly from the definition that:

\nabla_{x} Δ^{- 1} : {\dot{Y}}^{\frac{n - 4}{2}} ↪ {\dot{Y}}^{\frac{n - 2}{2}} .

Next, notice that we have the bilinear estimate:

\begin{matrix} \nabla_{x} Δ^{- 1} : L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}) \cdot {\dot{Y}}^{\frac{n - 2}{2}} ↪ {\dot{Y}}^{\frac{n - 2}{2}}, \end{matrix}

(133)

which follows integrating the bound 45 . Note that in this case, the range restrictions 46 – 50 are easily satisfied. Therefore, using the critical bounds 123 as well as the general smoothness criteria 110 (so that in particular we may assume the

{\dot{Y}}^{\frac{n - 2}{2}}

norm of

{{\underset{̲}{A}}_{i}}

is finite) we see we may absorb the quadratic term on the right hand side of 132 onto the left in the desired estimates.

Our task is now to show the more restrictive

{\dot{X}}^{\frac{n - 2}{2}}

estimates for the potentials

{{\underset{̲}{A}}_{i}}

. Again from the definition, it is not hard to see that we have the embedding:

\nabla_{x} Δ^{- 1} : {\dot{X}}^{\frac{n - 4}{2}} ↪ {\dot{X}}^{\frac{n - 2}{2}} .

Therefore, keeping in mind the

{\dot{Y}}^{\frac{n - 2}{2}}

bounds just proved, we see that is suffices to be able to show the bilinear estimate:

\begin{matrix} \nabla_{x} Δ^{- 1} : {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 2}{2}} ↪ {\dot{X}}^{\frac{n - 2}{2}} . \end{matrix}

(134)

The main issue here is, of course, to be able to include the angular square sum structure. This turns out to be very simple. Notice first that by orthogonality and the general nesting 42 we have the inclusion (on any finite time interval

[0, T^{*}]

L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}) \cap L^{2} ({\dot{H}}^{\frac{n - 1}{2}}) \subseteq {\dot{X}}^{\frac{n - 2}{2}} .

Therefore, to conclude 134 we see that it suffices to be able to show the set of bilinear estimates:

\begin{matrix} \nabla_{x} Δ^{- 1} : {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 2}{2}} ↪ L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}), \end{matrix}

(135)

\begin{matrix} \nabla_{x} Δ^{- 1} : {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 2}{2}} ↪ L^{2} ({\dot{H}}^{\frac{n - 1}{2}}) . \end{matrix}

(136)

\begin{matrix}  \end{matrix}

The first of these embedding follows easily from:

\nabla_{x} Δ^{- 1} : L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}) \cdot L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}) ↪ L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}),

which in turn follows directly from 45 . The second estimate 136 above is more bilinear in nature. It follows from applying a trichotomy and then summing the following two fixed frequency bilinear inclusions:

\begin{matrix} \nabla_{x} Δ^{- 1} : P_{∙ ≪ λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})})) \cdot P_{λ} (L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}})) & ↪ P_{λ} (L^{2} ({\dot{H}}^{\frac{n - 1}{2}})) . \end{matrix}

(137)

\begin{matrix} \nabla_{x} Δ^{- 1} : P_{λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})})) \cdot P_{λ} (L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}})) & ↪ {(\frac{μ}{λ})}^{δ} P_{μ} (L^{2} ({\dot{H}}^{\frac{n - 1}{2}})), \end{matrix}

(138)

\begin{matrix}  \end{matrix}

where we have set

δ = n (\frac{n - 2}{n - 1}) - \frac{3}{2}

to be the “gap” constant. The estimates 137 – 138 follow directly from the frequency localized bounds 51 – 52 . Note that in this case, the various positivity conditions are satisfied.

∙

{\dot{Y}}^{\frac{n - 2}{2}} \times {\dot{Y}}^{\frac{n - 4}{2}}

bounds for the pair

(A_{0}, \partial_{t} A_{0})

Our first step here is to deal with the variable

A_{0}

. We integrate equation 105e and write it schematically as:

\begin{matrix} A_{0} = Δ^{- 1} (\nabla_{x} [A_{0}, \underset{̲}{A}] + [\underset{̲}{A}, F]) . \end{matrix}

(139)

The desired estimate now follows by constructing

A_{0}

from scratch by iteration, using the already established estimates and bilinear embedding 133 and the following:

\begin{matrix} Δ^{- 1} : {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 4}{2}} ↪ {\dot{Y}}^{\frac{n - 2}{2}} . \end{matrix}

(140)

This last embedding follows in turn from the pair of estimates:

\begin{matrix} Δ^{- 1} : L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}) \cdot L^{\infty} ({\dot{H}}^{\frac{n - 4}{2}}) & ↪ L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}}), \end{matrix}

\begin{matrix} Δ^{- 1} : L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) \cdot L^{\infty} ({\dot{H}}^{\frac{n - 4}{2}}) & ↪ L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) . \end{matrix}

\begin{matrix}  \end{matrix}

Both of these are easy consequences of 45 and we leave the numerology to the reader.

To establish the

{\dot{Y}}^{\frac{n - 4}{2}}

bound for

\partial_{t} A_{0}

, we can use the equation 105f to treat it as a separate variable. In that equation we have quantities of the form

\partial_{t} \underset{̲}{A}

. We can use the curvature equation 105b to swap this for spatial derivatives as follows:

\begin{matrix} \partial_{t} \underset{̲}{A} = \nabla_{x} A_{0} - [A_{0}, \underset{̲}{A}] + F . \end{matrix}

(141)

This allows us to write schematically:

\begin{matrix} (\partial_{t} A_{0}) = \nabla_{x} Δ^{- 1} ([A, (\partial_{t} A_{0})] + [A, \nabla_{x} A] + [A, [A, A]] + [A, F]), \end{matrix}

(142)

where

A

now denotes any of the full set of potentials

{A_{α}}

which we have estimated in the space

{\dot{Y}}^{\frac{n - 2}{2}}

. We may now iterate the equation 142 in the space

{\dot{Y}}^{\frac{n - 4}{2}}

to constructively obtain the desired bounds using the bilinear embedding:

\nabla_{x} Δ^{- 1} : {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 4}{2}} ↪ {\dot{Y}}^{\frac{n - 4}{2}} .

which follows from differentiating 140 above. Notice that the needed inclusion

[A, A] ↪ {\dot{Y}}^{\frac{n - 4}{2}}

follows, for instance, from differentiating the embedding 133 .

∙

Splitting the spatial potentials Our next goal is to split the spatial potentials

{{\underset{̲}{A}}_{i}}

into a sum of two pieces which are each more easily managed. This will be done using the “structure” equation 105d . Using the formula 141 to get rid of terms of the form

\partial_{t} \underset{̲}{A}

on the right hand side of this equation, and using the various

{\dot{Y}}^{s}

space embeddings we have just shown (on the time interval

[0, T^{*}]

), we may write this equation in the schematic form:

\begin{matrix} □ \underset{̲}{A} = P ([B, H]), \end{matrix}

(143)

where the quantities

(B, H)

obey the estimate:

∥ (B, H) ∥ {\dot{Y}}^{\frac{n - 2}{2}} \times {\dot{Y}}^{\frac{n - 4}{2}} ≲ N C ε_{0},

where the implicit constant in the above inequality comes from the estimates just shown. Using Duhamel's principle and (sharp) time cutoffs, we now extend 143 to all possible times. This is done simply by writing:

\begin{matrix} \underset{̲}{A} (t) = {\underset{̲}{A}}^{(0)} (t) + \int_{0}^{t} \frac{sin ((t - s) \sqrt{- Δ})}{\sqrt{- Δ}} P ([B, H]) (s) \cdot χ_{[0, T^{*}]} (s) d s, \end{matrix}

(144)

where

{\underset{̲}{A}}^{(0)}

denotes to propagation of

(\underset{̲}{A} (0), \partial_{t} \underset{̲}{A} (0))

as a solution to the free scalar wave equation. Also, here

χ_{[0, T^{*}]}

denotes the indicator function of the time interval

[0, T^{*}]

. This implies that we have the condition:

\begin{matrix} □ \underset{̲}{A} (t) & = 0, & t & < 0, & T^{*} & < t . \end{matrix}

\begin{matrix}  \end{matrix}

Now, from the bootstrapping assumption 123 we have the pair of bounds:

\begin{matrix} ∥ (\underset{̲}{A} (0), \partial_{t} \underset{̲}{A} (0)) ∥ {\dot{H}}^{\frac{n - 2}{2}} \times {\dot{H}}^{\frac{n - 4}{2}} & ⩽ N C ε_{0}, \end{matrix}

\begin{matrix} ∥ (\underset{̲}{A} (T^{*}), \partial_{t} \underset{̲}{A} (T^{*})) ∥ {\dot{H}}^{\frac{n - 2}{2}} \times {\dot{H}}^{\frac{n - 4}{2}} & ⩽ N C ε_{0} . \end{matrix}

\begin{matrix}  \end{matrix}

Therefore, using the bounds we have just shown in conjunction with the usual Strichartz estimates for the wave equation, we have that this extension of the potentials

{{\underset{̲}{A}}_{i}}

satisfies the bounds:

∥ \underset{̲}{A} ∥ {\dot{X}}^{\frac{n - 2}{2}} ≲ N C ε_{0} .

Notice that the angular square function structure inherent in the

{\dot{X}}^{s}

norms is provided automatically by the fact that the usual wave equation commutes with the angular cutoffs

ω Π_{θ}

Our next step to introduce the space–time frequency cutoff

S_{| τ | ≲ | ξ |}

, which cuts off smoothly on the region

| τ | ≲ | ξ |

. That is, the compound multipliers

P_{λ} S_{| τ | ≲ | ξ |}

all have

L^{1}

kernels with uniform bounds. We denote by

S_{| ξ | ≪ | τ |} = I - S_{| τ | ≲ | ξ |}

. Our decomposition of

{{\underset{̲}{A}}_{i}}

is now given by the formula:

\begin{matrix} \tilde{\underset{̲}{A}} & = S_{| τ | ≲ | ξ |} \underset{̲}{A}, & \tilde{\tilde{\underset{̲}{A}}} & = S_{| ξ | ≪ | τ |} \underset{̲}{A} . \end{matrix}

\begin{matrix}  \end{matrix}

We now need to show that both the potential sets

{\tilde{{\underset{̲}{A}}_{i}}}

and

{\tilde{\tilde{{\underset{̲}{A}}_{i}}}}

obey good

{\dot{X}}^{\frac{n - 2}{2}}

estimates. Since the original collection of extended potentials does, we only need to prove this assertion for one of these sets. This is most easily shown for the collection

{\tilde{{\underset{̲}{A}}_{i}}}

. As we have already mentioned, the cutoffs

P_{λ} S_{| τ | ≲ | ξ |}

are bounded on all mixed Lebesgue spaces. Therefore, the entire multiplier

S_{| τ | ≲ | ξ |}

is bounded on any mixed Lebesgue-Besov space of the type

L^{q} ({\dot{B}}_{2}^{p, (2, s)})

. This implies that this multiplier is in fact bounded on the

{\dot{X}}^{s}

spaces, which is enough to support our claim.

Finally, we would like to prove two fixed frequency multiplier estimates which will be useful in the sequel when dealing with the two sets of potentials

{\tilde{{\underset{̲}{A}}_{i}}}

and

{\tilde{\tilde{{\underset{̲}{A}}_{i}}}}

. The first is:

\begin{matrix} ∥ \partial_{t} P_{λ} S_{| τ | ≲ | ξ |} A ∥ L^{p} ≲ & λ ∥ A ∥ L^{p} & 1 ⩽ p ⩽ \infty . \end{matrix}

(145)

\begin{matrix}  \end{matrix}

This is easily demonstrated by rescaling to frequency

λ = 1

and using the

L^{1}

bound on the convolution kernel of

\partial_{t} P_{λ} S_{| τ | ≲ | ξ |}

. Combining this with the remarks made above, we see that we have the estimate:

∥ \partial_{t} \tilde{\underset{̲}{A}} ∥ {\dot{X}}^{\frac{n - 4}{2}} ≲ N C ε_{0} .

In particular, from everything we have shown, the potential set

{\tilde{{\underset{̲}{A}}_{i}}}

satisfies all of the requirements 126 of Theorem 6.2 when

ε_{0}

is sufficiently small.

The second fixed frequency multiplier bound that will be of use shortly is the space–time estimate:

\begin{matrix} ∥ Ξ^{- 1} P_{λ} S_{| ξ | ≪ | τ |} A ∥ L^{q} (L^{p}) ≲ λ^{- 2} ∥ A ∥ L^{q} (L^{p}) . \end{matrix}

(146)

Here

Ξ

is the multiplier with symbol

Ξ (τ, ξ) = τ^{2} - | ξ |^{2}

. To prove this, we employ a family of Littlewood-Paley space-time cutoffs which we denote by

S_{μ}

. By this we mean that the space-time frequency support of these is supported where

| τ | + | ξ | \sim μ

. As usual, these are all chosen so as to have uniform

L^{1}

bounds on their convolution kernels. Using the support restrictions of the

S_{| ξ | ≪ | τ |}

multiplier, we have the formula:

P_{λ} S_{| ξ | ≪ | τ |} A = \sum_{μ : λ ≲ μ} P_{λ} S_{μ} S_{| ξ | ≪ | τ |} A .

Therefore, by dyadic summing and the boundedness of the multiplier

P_{λ}

, to prove 146 it suffices to be able to show that:

∥ Ξ^{- 1} S_{| ξ | ≪ | τ |} S_{μ} A ∥ L^{q} (L^{p}) ≲ μ^{2} ∥ A ∥ L^{q} (L^{p}) .

This last bound follows easily from rescaling to frequency

μ = 1

and the appropriate differential bounds on the symbol of

Ξ^{- 1} S_{| ξ | ≪ | τ |}

which we leave to the reader.

∙

L^{1} (L^{\infty})

bounds for the potentials

{{\tilde{\tilde{A}}}_{α}} = (A_{0}, {\tilde{\tilde{\underset{̲}{A}}}})

Our goal here is to show the

ℓ^{1}

type Besov estimate:

\begin{matrix} ∥ (A_{0}, {\tilde{\tilde{\underset{̲}{A}}}}) ∥ L^{1} ({\dot{B}}_{1}^{\infty, (2, \frac{n}{2})}) ≲ N C ε_{0} . \end{matrix}

(147)

By repeatedly using the estimate 146 , we have that the multiplier

Ξ^{- 1} Δ S_{| ξ | ≪ | τ |}

is bounded on the space

L^{1} ({\dot{B}}_{1}^{\infty, (2, \frac{n}{2})})

. Furthermore, from all of the estimates we have shown above, and by distributing the derivative in the first term on the right hand side of 139 , we see that the right hand side of the schematics 139 and 143 are equivalent. Therefore, we have the following heuristic schematic for the potentials

{{\tilde{\tilde{A}}}_{α}}

\tilde{\tilde{A}} = Δ^{- 1} ([B, H]),

where the pair

(B, H)

enjoys the bounds:

∥ (B, H) ∥ {\dot{Y}}^{\frac{n - 2}{2}} \times {\dot{Y}}^{\frac{n - 4}{2}} ≲ N C ε_{0} .

The bound 147 now follows from the bilinear estimate:

Δ^{- 1} : {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 4}{2}} ↪ L^{1} ({\dot{B}}_{1}^{\infty, (2, \frac{n}{2})}) .

This in turn follows from the product estimate:

Δ^{- 1} : L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) \cdot L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}) ↪ L^{1} ({\dot{B}}_{1}^{\infty, (2, \frac{n}{2})}) .

This last estimate follows at once from 45 . The check on the conditions 46 – 50 is left to the reader.

∙

Improving the curvature This is the final part of the proof of Proposition 6.1 .

Recalling the schematic 131 and using the Strichartz estimates 130 , our goal here is to show the following four bounds:

\begin{matrix} ∥ [\nabla \tilde{\tilde{A}}, F] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) & ≲ N^{2} C^{2} ε_{0}^{2}, \end{matrix}

(148)

\begin{matrix} ∥ [\tilde{\tilde{A}}, \nabla F] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) & ≲ N^{2} C^{2} ε_{0}^{2}, \end{matrix}

(149)

\begin{matrix} ∥ [\tilde{A}, [\tilde{\tilde{A}}, F]] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) & ≲ N^{2} C^{2} ε_{0}^{2}, \end{matrix}

(150)

\begin{matrix} ∥ [\tilde{\tilde{A}}, [\tilde{\tilde{A}}, F]] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) & ≲ N^{2} C^{2} ε_{0}^{2}, \end{matrix}

(151)

\begin{matrix} ∥ [F, F] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) & ≲ N^{2} C^{2} ε_{0}^{2} . \end{matrix}

(152)

\begin{matrix}  \end{matrix}

For

ε_{0}

sufficiently small, this will be enough for us to conclude the improved bootstrapping estimates 125 by choosing

C

to be such that

\frac{1}{2} (L N)^{- 1} C

is equal to the constant appearing on the right hand side of estimate 130 . This works because the implicit constants which appear in 148 – 152 above have only been manufactured in the estimates of this proof, and can all be chosen to be independent of

N

and

C

ε_{0}

is chosen small enough.

To prove these bounds, first notice that the estimates 148 and 150 – 152 are essentially identical. This follows from the equivalence (in terms of

{\dot{Y}}^{s}

spaces)

\nabla \tilde{\tilde{A}} \approx F

. We also have the equivalences

[\tilde{A}, \tilde{\tilde{A}}] \approx F

and

[\tilde{\tilde{A}}, \tilde{\tilde{A}}] \approx F

. These are given by the inclusion:

\begin{matrix} {\dot{Y}}^{\frac{n - 2}{2}} \cdot {\dot{Y}}^{\frac{n - 2}{2}} \subseteq {\dot{Y}}^{\frac{n - 4}{2}} . \end{matrix}

(153)

This is easily demonstrated, as we have already mentioned, by differentiating the inclusion 133 and using the boundedness of

\nabla_{x}^{2} Δ^{- 1}

on the various

{\dot{Y}}^{s}

component spaces. Therefore, to prove 148 and 150 – 152 we only need to know that:

\begin{matrix} L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}) \cdot L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}) ↪ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) . \end{matrix}

(154)

This is yet again a consequence of our general Besov calculus 45 , and we leave the various additions to the reader.

Our final task here is to prove the estimate 149 . This needs to be frequency decomposed using a trichotomy. Specifically, we have the following set of fixed frequency estimates in the three cases (note that in the first two estimates below the square summing needs to be done inside the time integral):

\begin{matrix} P_{∙ ≪ λ} (L^{1} ({\dot{B}}_{1}^{\infty, (n, \frac{n}{2})})) \cdot P_{λ} (L^{\infty} ({\dot{H}}^{\frac{n - 6}{2}})) & ↪ P_{λ} (L^{1} ({\dot{H}}^{\frac{n - 6}{2}})), \end{matrix}

(155)

\begin{matrix} P_{λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) \cdot P_{∙ ≪ λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})})) & ↪ P_{λ} (L^{1} ({\dot{H}}^{\frac{n - 6}{2}})), \end{matrix}

(156)

\begin{matrix} P_{λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) \cdot P_{λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})})) & ↪ {(\frac{μ}{λ})}^{δ} P_{μ} (L^{1} ({\dot{H}}^{\frac{n - 6}{2}})), \end{matrix}

(157)

\begin{matrix}  \end{matrix}

where the quantity

δ

in the last estimate 157 above can be computed to be

δ = n (\frac{n - 3}{n - 1}) - 3

. The estimate 155 follows from inspection. The latter two estimates 156 – 157 follow from 51 – 51 of Remark 4.2 . This completes the proof of Proposition 6.1 . □

7 Reduction to Approximate Half-Wave Operators

This is a preliminary technical section where we reduce the proof of the Strichartz estimates 130 to a more easily managed form. This material more or less standard, and we again follow closely what was done in [8] . Our first step here is to reduce the proof of Theorem 6.2 to the following:

Proposition 7.1 (Existence of a fixed frequency parametrix). Let the number of dimensions be

6 ⩽ n

, and let

d + \underset{̲}{A} ∙ ≪ λ

be a connection which satisfies the conditions 126 . In addition assume that we have the frequency localization condition:

\begin{matrix} P_{λ ≲ ∙} (\underset{̲}{A} ∙ ≪ λ) = 0, \end{matrix}

(158)

where

P_{λ ≲ ∙}

is a frequency cutoff on the region where

2^{- 10 a} λ ⩽ | ξ |

, where

1 ⩽ a

is some fixed parameter. Then if the constant

ℰ

on lines 126d and 126f is sufficiently small, there exists a family of approximate propagation operators

W_{\underset{̲}{A} ∙ ≪ λ}^{λ} (s)

(or just

W_{s}^{λ}

for short) such that if

(f_{λ}, g_{λ})

is any set of

λ

–frequency initial data with Fourier support in the region

2^{- a} λ ⩽ | ξ | ⩽ 2^{a} λ

, the following estimates hold:

\begin{matrix} ∥ W_{s}^{λ} (f_{λ}, g_{λ}) ∥ {\dot{X}}^{0} \times \partial_{t}^{- 1} ({\dot{X}}^{- 1}) & ≲ E^{\frac{1}{2}} (f_{λ}, g_{λ}), \end{matrix}

(159a)

\begin{matrix} ∥ W_{s}^{λ} (f_{λ}, g_{λ}) (s) - f_{λ} ∥ L^{2} & ≲ ℰ^{\frac{1}{2}} E^{\frac{1}{2}} (f_{λ}, g_{λ}), \end{matrix}

(159b)

\begin{matrix} ∥ \partial_{t} W_{s}^{λ} (f_{λ}, g_{λ}) (s) - g_{λ} ∥ L^{2} & ≲ λ ℰ^{\frac{1}{2}} E^{\frac{1}{2}} (f_{λ}, g_{λ}), \end{matrix}

(159c)

\begin{matrix} ∥ □_{\underset{̲}{A} ∙ ≪ λ} W_{s}^{λ} (f_{λ}, g_{λ}) ∥ L^{1} (L^{2}) & ≲ λ ℰ E^{\frac{1}{2}} (f_{λ}, g_{λ}) . \end{matrix}

(159d)

\begin{matrix}  \end{matrix}

Here we have set

E (f_{λ}, g_{λ})

to the

L^{2}

normalized energy:

E (f_{λ}, g_{λ}) = ∥ f_{λ} ∥^{2} L^{2} + λ^{- 2} ∥ g_{λ} ∥^{2} L^{2} .

Finally, we have that the frequency support of the parametrix is contained in the set

2^{- 2 a} λ ⩽ | ξ | ⩽ 2^{2 a} λ

, where

a

is as above.

Proof that Proposition 7.1 implies Theorem 6.2 . The first step here is to reduce the estimate 130 to the case where

G \equiv 0

. This is done in the usual way via Duhamel's principle. We define the true propagation operator

U_{s} (t)

via the formulas:

\begin{matrix} U_{s} (s) (f, g) & = f, & \partial_{t} U_{s} (s) (f, g) & = g, \end{matrix}

\begin{matrix}  \end{matrix}

and:

□_{\underset{̲}{A}} U_{s} (f, g) = 0,

We then have that:

\begin{matrix} F (t) = U_{0} (t) (f, \dot{f}) + \int_{0}^{t} U_{s} (t) (0, G (s)) d s, \end{matrix}

(160)

solves the problem 128 – 129 . In particular, by Minkowski's triangle inequality we easily have that:

∥ \int_{0}^{t} U_{s} (t) (0, G (s)) d s ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ⩽ \int_{0}^{\infty} ∥ U_{s} (0, G (s)) ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) d s .

Therefore, we are trying to show:

\begin{matrix} ∥ U_{s} (f, g) ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ⩽ C ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}}, \end{matrix}

(161)

for any pair of functions

(f, g)

and any initial time

s

. Since it is easy to see that the conditions 126 are translation invariant, it suffices to show this estimates for

s = 0

The estimate 161 will be shown using a bootstrapping procedure. This will be done inside of the compact intervals

[0, T^{*}]

. What we will do is to first assume that 161 is true for all

0 ⩽ s ⩽ T^{*}

on all time intervals of the form

[0, s]

and

[s, T^{*}]

, where the constant on the left hand side of 161 is replaced by

2 C

. Our goal is then to improve the constant by proving the desired bound 161 on the time subintervals of

[0, T^{*}]

. Once this is accomplished, we can easily extend the bound 161

to all subintervals of a slightly larger time interval

[0, T^{*} + γ]

, where the constant

0 < γ ≪ 1

is determined by the bound 127 . This is provided by the usual local existence theory based on energy and

L^{\infty}

estimates. Once this is done, the bootstrapping closes. Notice again that, by using the local existence theory and the bound 127 , we may begin the argument for some very small time interval

[0, γ]

We are now assuming that 161 holds on our time interval

[0, T^{*}]

with constant

2 C

which we will decide on in a moment. We are working with a solution:

\begin{matrix} □_{\underset{̲}{A}} F = 0, \end{matrix}

(162)

where the connection

d + \underset{̲}{A}

satisfies 126 , and where we have the initial data:

\begin{matrix} F (0) & = f, & \partial_{t} F (0) = g . \end{matrix}

(163)

\begin{matrix}  \end{matrix}

We now split this initial data into a sum frequency localized pieces:

\begin{matrix} f & = \sum_{λ} P_{λ} (f) = \sum_{λ} f_{λ}, \end{matrix}

\begin{matrix} g & = \sum_{λ} P_{λ} (g) = \sum_{λ} g_{λ}, \end{matrix}

\begin{matrix}  \end{matrix}

and then repeatedly use Proposition 7.1 to construct an approximate solution to 162 – 163 as follows:

\tilde{F} = \sum_{λ} {\tilde{F}}_{λ} = \sum_{λ} W_{0}^{λ} (f_{λ}, g_{λ}) .

By summing over the parametrix estimate 159a we automatically have that:

∥ \tilde{F} ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ⩽ \frac{1}{2} C ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}},

where

C

is some fixed constant. We choose this to be our definition of the constant on the right hand side of 161 . Thus, our goal is to conclude that:

\begin{matrix} ∥ F - \tilde{F} ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ⩽ \frac{1}{2} C ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}} . \end{matrix}

(164)

To do this, we use the Duhamel formula 160 to express everything in terms of the operators

U_{s} (t)

F (t) - \tilde{F} (t) = U_{0} (t) (f - \tilde{F} (0), g - \partial_{t} \tilde{F} (0)) - \int_{0}^{t} U_{s} (t) (0, □_{\underset{̲}{A}} \tilde{F} (s)) d s .

By combining the assumed estimate 161 and the approximation bounds 159b – 159c , we have that:

∥ U_{0} (f - \tilde{F} (0), g - \partial_{t} \tilde{F} (0)) ∥ {\dot{X}}^{\frac{n - 4}{2}} \times \partial_{t}^{- 1} ({\dot{X}}^{\frac{n - 6}{2}}) ≲ C ℰ^{\frac{1}{2}} ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}} .

Therefore, by using Minkowski's triangle inequality and again using the bootstrapping assumption 161 , we see that in order to conclude 164 we only need to show the following remainder estimate on the time interval

[0, T^{*}]

\begin{matrix} ∥ □_{\underset{̲}{A}} \tilde{F} ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) ≲ C ℰ ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}} . \end{matrix}

(165)

To show the estimate 165 , we use a family of frequency cutoffs:

I = P_{∙ ≪ λ} + P_{λ ≲ ∙},

for each scale

λ

such that they all have

L^{1}

kernels with uniform bounds, and such that the cutoff

P_{∙ ≪ λ}

is consistent with the definition of

d + \underset{̲}{A} ∙ ≪ λ

in the statement of Proposition 7.1 . This allows us to schematically write:

(166) □ A ̲ F ~ = ∑ λ ( □ A ̲ ∙ ≪ λ F ~ λ + [ ∇ x A ̲ λ ≲ ∙ , F ~ λ ] + [ A ̲ λ ≲ ∙ , ∇ x F ~ λ ] + [ [ A ̲ ∙ ≪ λ , A ̲ λ ≲ ∙ ] , F ~ λ ] + [ [ A ̲ λ ≲ ∙ , A ̲ λ ≲ ∙ ] , F ~ λ ] ) .

The bound 165 for the term

\sum_{λ} □_{\underset{̲}{A} ∙ ≪ λ} {\tilde{F}}_{λ}

is a direct consequence of repeatedly applying the estimate 159d while using the fact that each term in this sum is supported in frequency where

| ξ | \sim λ

to gain the orthogonality needed to obtain bounds in terms of the pair

(f, g)

. Therefore, we are reduced to showing the following family of error estimates:

\begin{matrix} \sum_{λ} ∥ [\nabla_{x} \underset{̲}{A} λ ≲ ∙, {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) ≲ ℰ ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}}, \end{matrix}

(167)

\begin{matrix} \sum_{λ} ∥ [\underset{̲}{A} λ ≲ ∙, \nabla_{x} {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) ≲ ℰ ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}}, \end{matrix}

(168)

\begin{matrix} ∥ \sum_{λ} [[\underset{̲}{A} ∙ ≪ λ, \underset{̲}{A} λ ≲ ∙], {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) ≲ ℰ ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}}, \end{matrix}

(169)

\begin{matrix} ∥ \sum_{λ} [[\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙], {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) ≲ ℰ ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}} . \end{matrix}

(170)

\begin{matrix}  \end{matrix}

These estimates are all very similar to each other, and to estimates we have already proved in the last section, in particular 148 – 152 . To prove the first estimate 167 above, we further decompose the left hand side into frequencies and use the triangle inequality to bound: 167

(L . H . S .) ⩽ \sum_{λ, μ : λ ≲ μ} ∥ [\nabla_{x} P_{μ} (\underset{̲}{A}), {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) .

Thus, by Young's inequality, it suffices to show the following family of fixed frequency estimates:

∥ [\nabla_{x} P_{μ} (\underset{̲}{A}), {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) ≲ {(\frac{λ}{μ})}^{δ} ∥ P_{μ} (\underset{̲}{A}) ∥ {\dot{Z}}^{\frac{n - 2}{2}} \cdot ∥ (f_{λ}, g_{λ}) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}},

where we have set

δ = \frac{3}{2} - \frac{n}{n - 1}

. Notice that we have used the

{\dot{Z}}^{\frac{n - 2}{2}}

norm for the

{{\underset{̲}{A}}_{i}}

on the right hand side. This allows us to reconstruct norms through square-summing. For

λ \sim μ

this estimate is nothing but a fixed frequency version of the estimate 154 above, so it suffices to consider case

λ ≪ μ

. Using the simple inclusion

\nabla_{x} {\dot{X}}^{\frac{n - 2}{2}} \subseteq {\dot{X}}^{\frac{n - 4}{2}}

, this is a consequence of the fixed frequency embedding:

\begin{matrix} P_{μ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})})) \cdot P_{λ} (L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})})) ↪ {(\frac{λ}{μ})}^{δ} L^{1} ({\dot{H}}^{\frac{n - 6}{2}}), \end{matrix}

(171)

which follows at once from the fixed frequency estimate 53 which helps to generate the general estimate 45 . Notice that the proof of the second estimate 168 above is very similar to what we have just done. In fact, there is more room because the derivative is on the low frequency term. We leave the details to the reader.

It remains to prove the two estimates 169 – 170 . Since these follow from essentially identical reasoning, we concentrate on proving the second of these estimates. This one in fact requires a bit more work than the fist because it has more frequency overlap. Applying a trichotomy to the product, we see that it suffices to be able to show the following three estimates:

\begin{matrix} \int_{0}^{T^{*}} {(\sum_{λ} {(\sum_{μ : μ ≪ λ} ∥ [P_{μ} ([\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙]), {\tilde{F}}_{λ}] (s) ∥ {\dot{H}}^{\frac{n - 6}{2}})}^{2})}^{\frac{1}{2}} d s \end{matrix}

(172)

\begin{matrix} ≲ ∥ \underset{̲}{A} ∥^{2} {\dot{X}}^{\frac{n - 2}{2}} \cdot ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}}, \end{matrix}

(173)

\begin{matrix} \int_{0}^{T^{*}} {(\sum_{μ} {(\sum_{λ : λ ≪ μ} ∥ [P_{μ} ([\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙]), {\tilde{F}}_{λ}] (s) ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}))}^{2})}^{\frac{1}{2}} d s \end{matrix}

(174)

\begin{matrix} ≲ ∥ \underset{̲}{A} ∥^{2} {\dot{X}}^{\frac{n - 2}{2}} \cdot ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}}, \end{matrix}

(175)

\begin{matrix} \sum_{λ, μ : λ \sim μ} ∥ [P_{μ} ([\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙]), {\tilde{F}}_{λ}] ∥ L^{1} ({\dot{H}}^{\frac{n - 6}{2}}) \end{matrix}

(176)

\begin{matrix} ≲ ∥ \underset{̲}{A} ∥^{2} {\dot{X}}^{\frac{n - 2}{2}} \cdot ∥ (f, g) ∥ {\dot{H}}^{\frac{n - 4}{2}} \times {\dot{H}}^{\frac{n - 6}{2}} . \end{matrix}

(177)

\begin{matrix}  \end{matrix}

The first two estimates 173 – 175 follow from first fixing time and then proving the fixed frequency estimate:

∥ [P_{μ} ([\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙]), {\tilde{F}}_{λ}] (s) ∥ {\dot{H}}^{\frac{n - 6}{2}} ≲ {min}_{\pm} {(\frac{λ}{μ})}^{\pm δ} ∥ P_{μ} ([\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙]) (s) ∥ {\dot{B}}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})} \cdot ∥ {\tilde{F}}_{λ} (s) ∥ {\dot{B}}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})},

where

δ

is the same constant from estimate 171 . Indeed, this last line follows from the non-time integrated version of that estimate. Applying Young's inequality to this, integrating in time and applying Cauchy-Schwartz, using the parametrix bound 159a , the product embedding 153 , and the fact that for each fixed value of

λ

the multipliers

P_{∙ ≪ λ}

and

P_{λ ≲ ∙}

are bounded on the

{\dot{X}}^{s}

spaces we arrive at the desired pair of estimates.

It remains for us to prove the last estimate 177 above. After another application of the embedding 154 and a Cauchy-Schwartz, followed by the parametrix estimate 159a , we are left with showing the bound:

{(\sum_{λ, μ : λ \sim μ} ∥ [P_{μ} ([\underset{̲}{A} λ ≲ ∙, \underset{̲}{A} λ ≲ ∙]) ∥^{2} L^{2} ({\dot{B}}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}))}^{\frac{1}{2}} ≲ ∥ \underset{̲}{A} ∥^{2} {\dot{X}}^{\frac{n - 2}{2}} .

This last estimate follows from applying a further trichotomy, and then using Young's inequality after reduction to the various fixed frequency versions of the product estimate 153 which are provided by the general fixed frequency estimates 51 – 51 . We leave the details to the diligent reader. This completes the proof of our reduction of Theorem 6.2 to Proposition 7.1 . □

The final thing we will do in this section is to make one further reduction of the Strichartz estimates 130 . This involves the following proposition:

Proposition 7.2 (Existence of approximate half-wave parametrices). Let the number of dimensions be

6 ⩽ n

, and let

d + \underset{̲}{A} ∙ ≪ 1

be a connection which satisfies the conditions 126 as well as the frequency localization condition 158 for

λ = 1

. Then there exists pair of evolution operators

Φ^{\pm} (\hat{f}) (t)

from

L^{2} (R_{ξ}^{n})

L^{2} (R_{x}^{n})

such that the fixed time adjoints

(Φ^{\pm} (t))^{*}

are always supported in the region

2^{- a} ⩽ | ξ | ⩽ 2^{a}

for some fixed

1 ⩽ a

, and such that they obey the following estimates:

\begin{matrix} ∥ (P_{1} Φ^{\pm} (\hat{f}), Φ^{\pm} (\hat{f})) ∥ {\dot{X}}^{0} \times L_{x}^{2} & ≲ ∥ \hat{f} ∥ L_{ξ}^{2}, \end{matrix}

(178a)

\begin{matrix} ∥ \nabla_{x} Φ^{\pm} (\hat{f}) ∥ L_{t}^{2} (L_{x}^{\frac{2 (n - 1)}{n - 3}}) & ≲ ∥ \hat{f} ∥ L_{ξ}^{2}, \end{matrix}

(178b)

\begin{matrix} ∥ \partial_{t} P_{1} Φ^{\pm} (\hat{f}) \mp P_{1} Φ^{\pm} (2 π i | ξ | \hat{f}) ∥ {\dot{X}}^{0} & ≲ ℰ ∥ \hat{f} ∥ L_{ξ}^{2}, \end{matrix}

(178c)

\begin{matrix} ∥ Φ^{\pm} (0) ((2 π | ξ |)^{α} (Φ^{\pm} (0))^{*}) g - (- Δ)^{\frac{α}{2}} P_{1} (g) ∥ L_{x}^{2} & ≲ ℰ^{\frac{1}{2}} ∥ g ∥ L_{x}^{2}, \end{matrix}

(178d)

\begin{matrix} ∥ □_{\underset{̲}{A} ∙ ≪ 1} Φ^{\pm} (\hat{f}) ∥ L_{t}^{1} (L_{x}^{2}) & ≲ ℰ ∥ \hat{f} ∥ L_{ξ}^{2} . \end{matrix}

(178e)

\begin{matrix}  \end{matrix}

Proof that Proposition 7.2 implies Proposition 7.1 . This is a simple matter, and we explain it briefly. Notice first that it suffices to prove Proposition 7.1 on the scale

λ = 1

because everything in sight is scale invariant. We now let

(f_{1}, g_{1})

be any pair of unit frequency initial data, and we define the approximate unit frequency wave propagator:

(179) W 0 1 ( f 1 , g 1 ) ( t ) = P 1 ( 1 2 Φ + ( t ) ( Φ + ( 0 ) ) * f 1 + 1 2 Φ − ( t ) ( Φ − ( 0 ) ) * f 1 + Φ + ( t ) ( 1 4 π i | ξ | ( Φ + ( 0 ) ) * ) g 1 − Φ − ( t ) ( 1 4 π i | ξ | ( Φ − ( 0 ) ) * ) g 1 ) .

Here

P_{1}

is defined to be the cutoff on line 178d which is also chosen large enough such that

P_{1} (f_{1}, g_{1}) = (f_{1}, g_{1})

. From the boundedness of the

P_{1}

multiplier, the estimates 178a and 178c , the frequency support of the adjoints, and the dualized

L_{x}^{2} \to L_{ξ}^{2}

estimate contained in 178a , we easily have that the operator 179 obeys the estimate 159a . Next, notice that by applying 178d with

α = 0

and

α = - 1

, and using the unit frequency condition which implies the boundedness of

(- Δ)^{- \frac{1}{2}}

, we have the estimate 159b . Furthermore, by using estimate 178c in conjunction with 178d , where this time we use the indices

α = 0

and

α = 1

, and using the boundedness of

(- Δ)^{\frac{1}{2}}

at unit frequency, we have the second accuracy estimate 159c . Therefore, it remains to show that we have the error estimate 159d . By the estimate 178e and by again making use of the dual

L_{x}^{2} \to L_{ξ}^{2}

adjoint bound, we are reduced to proving (operator) commutator bounds of the type:

∥ [□_{\underset{̲}{A} ∙ ≪ 1}, P_{1}] Φ^{\pm} (\hat{h}) ∥ L_{t}^{1} (L_{x}^{2}) ≲ ℰ ∥ \hat{h} ∥ L_{ξ}^{2} .

Using the commutator estimate 39 in conjunction with the parametrix bounds 178a – 178b (this is where the extra bound on the gradient comes in), this reduces to showing the two bounds:

\begin{matrix} ∥ \nabla_{x} \underset{̲}{A} ∙ ≪ 1 ∥ L_{t}^{2} (L_{x}^{n - 1}) & ≲ ∥ \underset{̲}{A} ∙ ≪ 1 ∥ {\dot{X}}^{\frac{n - 2}{2}}, \end{matrix}

(180)

\begin{matrix} ∥ \nabla_{x} [\underset{̲}{A} ∙ ≪ 1, \underset{̲}{A} ∙ ≪ 1] ∥ L_{t}^{1} (L_{x}^{\infty}) & ≲ ∥ \underset{̲}{A} ∙ ≪ 1 ∥^{2} {\dot{X}}^{\frac{n - 2}{2}} . \end{matrix}

(181)

\begin{matrix}  \end{matrix}

The first estimate follows easily from integrating the following Besov and low frequency Besov nestings:

P_{∙ ≲ 1} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}) \subseteq {\dot{B}}_{2}^{n - 1, (2, n (\frac{n - 3}{2 (n - 1)}))} \subseteq L^{n - 1} .

The second estimate follows as easily from first distributing the derivative and then integrating the two low frequency nestings:

P_{∙ ≲ 1} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}), P_{∙ ≲ 1} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) \subseteq B_{1}^{\infty, (2, \frac{n}{2})} \subseteq L^{\infty} .

This completes the proof that Proposition 7.2 implies Proposition 7.1 . □

8 Construction of the half wave operators

We now begin construction of our approximate solutions

Φ^{\pm}

to the reduced covariant wave equation

□_{\underset{̲}{A} ∙ ≪ 1}

. This will be accomplished by integrating over a collection of gauge transformations designed to eliminate the highest order effect of troublesome term

{\underset{̲}{A}}^{α} ∙ ≪ 1 \nabla_{α}

. In order to understand what such a gauge transformation should be, we begin with a simple calculation. We consider the covariant wave equation

□_{ω A}

, where the connection

^{ω} D = d + ω A

will be determined in a moment, acting on a vector valued plane wave

e^{2 π i λ ω u_{\pm}} \hat{f}

. Here

\hat{f}

is a constant complex valued matrix in

C \otimes o (m)

, and the

ω u^{\pm}

are the standard plane wave optical functions:

\begin{matrix} ω u^{+} & = t + ω \cdot x, & ω u^{-} & = - t + ω \cdot x . \end{matrix}

\begin{matrix}  \end{matrix}

In particular,

\nabla^{α} (ω u^{\pm}) = (ω L^{\mp})^{α}

, where the

ω L^{\pm}

are the associated null hyper-surface generators:

\begin{matrix} ω L^{+} & = \nabla_{t} + ω \cdot \nabla_{x}, & ω L^{-} & = - \nabla_{t} + ω \cdot \nabla_{x} . \end{matrix}

\begin{matrix}  \end{matrix}

With these identifications, we easily have the calculation:

\begin{matrix} □_{ω A} (e^{2 π i λ ω u_{\pm}} \hat{f}) = e^{2 π i λ ω u_{\pm}} \cdot (4 π i λ [ω A (ω L^{\mp}), \hat{f}] + D_{α}^{ω A} [ω A^{α}, \hat{f}]) . \end{matrix}

(182)

Using the heuristic⁵ that terms of the form

\nabla (ω A)

and

[ω A, ω A]

are lower order, and splitting the potentials

{ω A_{α}}

into the sets

{ω A_{α}^{\pm}}

associated with the optical functions

ω u_{\pm}

(resp.), we see that in order eliminate the highest order term on the right hand side of 182 would need to assume this connection is in the backward (resp. forward)

ω

-null-gauge:

\begin{matrix} ω A^{+} (ω L^{-}) & = 0, & ω A^{-} (ω L^{+}) & = 0 . \end{matrix}

(183)

\begin{matrix}  \end{matrix}

Of course, it is not possible to assume that a given fixed connection will simultaneously be in the null-gauge for every direction

ω

. However, it is more or less clear that since these gauges are of Crönstrom type, it is always possible to transform a given connection so that it is in the null-gauge for a fixed direction. This motivates the following form of an approximate solution to

□_{\underset{̲}{A} ∙ ≪ 1}

\begin{matrix} Φ^{\pm} (\hat{f}) = \int_{R^{n}} e^{2 π i λ ω u^{\pm}} ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

(184)

where

χ_{(\frac{1}{2}, 2)}

is a smooth bump function such that

χ_{(\frac{1}{2}, 2)} \equiv 1

on the interval

[2^{- 1}, 2]

and such that

χ_{(\frac{1}{2}, 2)} \equiv 0

outside of

[4^{- 1}, 4]

(the variable width assumption of Proposition 7.2 can be achieved with similar bump functions). Here, the gauge transformation:

\begin{matrix} ω B^{\pm} = ω g_{\pm} {\underset{̲}{A}}_{∙ ≪ 1} (ω g_{\pm}^{- 1}) + ω g_{\pm} d (ω g_{\pm}^{- 1}), \end{matrix}

(185)

will be chosen so that

ω B^{\pm}

approximately satisfies 183 . It seems that there are in fact many choices of how to do this, although the naive choice of letting

ω B^{\pm}

satisfy 183 directly by solving the appropriate transport equations⁶ leads to group elements with poor regularity properties. Therefore, the procedure for arriving at the correct choice deserves some motivation.

The heart of the matter is two-fold. First and foremost, we need to come up with a construction that gives us explicit formulas so that we may perform certain standard calculations on the integral 184 . In particular, we will need to perform integration by parts with respect to the variable

ω

. Since

G

is assumed to be non-abelian, and since we will not be able to localize things to a neighborhood of any fixed point on the group⁷ , this is actually a non-trivial matter. For example, it is not possible to do this directly through a use of the exponential map because we would run into trouble with conjugate points.

Secondly, we will need to replace the transport equation which defines the naive pure null-gauge transformation, with something that has more “elliptic” features.

That such a choice is possible is, strangely enough, determined by the fact that the connection

{{\underset{̲}{A}}_{∙ ≪ 1}}

is not arbitrary, but instead evolves according to a hyperbolic equation. This is taken into account by condition 126e . This kind of structure seems to be ubiquitous in geometric wave equations, both semi and quasi-linear, and the observation that it makes the crucial difference goes back to work of Klainerman-Rodnianski on quasi-linear wave equations [5] . The particular form we will use it in here is almost identical to that of [8] , but since everything we do is non-abelian, the derivation will seem a bit different at first.

The first observation we use is that just like the Crönstrom gauge, the null-gauge allows one to recover the potentials directly from the curvature. However, since we aim to derive an (sub)-elliptic equation, we do not do this by simply integrating along null directions. Instead, we write:

\begin{matrix} ω L^{\mp} ω B_{α}^{\pm} = F^{ω B^{\pm}} (ω L^{\mp}, \partial_{α}) . \end{matrix}

(186)

Making now the approximate assumption that the

{ω B^{\pm}}

are simply a solution to the scalar wave equation

□ = \nabla_{α} \nabla^{α}

, which we write as:

\begin{matrix} □ = ω L^{\pm} ω L^{\mp} + Δ_{ω^{⊥}}, \end{matrix}

(187)

the identity 186 can be written in the integral form:

\begin{matrix} ω B_{α}^{\pm} = - ω L^{\pm} Δ_{ω^{⊥}}^{- 1} F^{ω B^{\pm}} (ω L^{\mp}, \partial_{α}) . \end{matrix}

(188)

Here

Δ_{ω^{⊥}} = Δ - \nabla_{ω}^{2}

is the Laplacean on the plane perpendicular to the

ω

direction in

R^{n}

. We would now like to make 188 our “choice” for the gauge transformed connection on the right hand side of 185 . For example, even though it was based on the approximate assumption the

{ω B^{\pm}}

satisfy the scalar wave equation, it still respects the null-gauge 183 simply by the skew-symmetry property of the curvature. Unfortunately, 188 has several undesirable features. Firstly, we would like an expression which involves the curvature of

{\underset{̲}{A} ∙ ≪ 1}

, not the curvature

F^{ω B^{\pm}}

Secondly, the sub-Laplacean on the right hand side of this expression needs to be smoothed out in some way so that its dependence on the angular variable

ω

is not so rough.

To get around the first of these problems, we simply pretend that the various differential operators on the right hand side of 188 are gauge covariant. Assuming this and then conjugating both sides of that expression by

ω g_{\pm}

, moving these group elements past the differential operators on the right, and throwing away quadratic terms from the curvature while assuming that the reduced connection

{\underset{̲}{A}}_{∙ ≪ 1}

satisfies the usual homogeneous wave equation, we are left with the approximate identities:

\begin{matrix} ω g_{\pm}^{- 1} ω B_{α}^{\pm} ω g_{\pm} & \approx - ω L^{\pm} Δ_{ω^{⊥}}^{- 1} F^{{\underset{̲}{A}}_{∙ ≪ 1}} (ω L^{\mp}, \partial_{α}), \end{matrix}

\begin{matrix} \approx ({\underset{̲}{A}}_{∙ ≪ 1})_{α} + \nabla_{α} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} {\underset{̲}{A}}_{∙ ≪ 1} (ω L^{\mp}) . \end{matrix}

\begin{matrix}  \end{matrix}

To get around the second problem, we mollify the angular variable of the second term on the right hand side of this last expression. Doing this and looking back on the definition 185 , we see that we would like our group elements to be such that:

\begin{matrix} ω g_{\pm}^{- 1} d (ω g_{\pm}) \approx - ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} {\underset{̲}{A}}_{∙ ≪ 1} (\partial_{ω}) . \end{matrix}

(189)

Here we have set:

\begin{matrix} 0 < γ ≪ δ ≪ 1, \end{matrix}

(190)

where

γ

is our small all purpose constant from line 12 above. Now the problem is, of course, that right hand side of the above formula does not in general represent a flat connection. However, as one can see immediately, its curvature is small in some sense because it is a quadratic expression. At this point, the problem now looks essentially like what happens for wave-maps⁸ (see e.g. [11] and [9] ). In particular, it is clear that the right way to define the group elements

ω g_{\pm}

so that the approximate formula 189 holds is to flatten out the right hand side of that expression as much as possible by using the potential version 3.2 of the Uhlenbeck lemma. Therefore, what we need to do is to show the fixed time estimate:

\begin{matrix} ∥ ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) ∥ L^{n} ≲ ℰ, \end{matrix}

(191)

and then assume that

ℰ

is chosen small enough to that we may use it as the constant in 22 . Because of its utility in the sequel, we will in fact prove the more general estimate:

\begin{matrix} ∥ ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} ≲ ℰ, \end{matrix}

(192)

where

p_{γ}

is a dimension dependent Lebesgue index which we set to:

\begin{matrix} p_{γ} = \frac{2 (n - 1)}{n - 3 - 2 γ} . \end{matrix}

(193)

Here

0 < γ ≪ 1

is again the all-purpose constant which we have fixed in section 2 to be small enough so that it is compatible with its use here. Notice that 192 implies the estimate 191 thanks to the embedding 43 and the fact that for

γ

sufficiently small there is plenty of room in the inequality

p_{γ} < n

Now, because the norm

{\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}

ℓ^{2}

based, by orthogonality and the

L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}})

estimate contained in the bootstrapping assumption 126d , we see that in order to conclude 192 it is enough to show the fixed frequency estimate (note that there are no high frequencies here):

∥ \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} (\underset{̲}{A} ∙ ≪ 1)_{μ} (\partial_{ω}) ∥ L^{p_{γ}} ≲ μ^{n (\frac{1}{2} - \frac{1}{p_{γ}})} ∥ (\underset{̲}{A} ∙ ≪ 1)_{μ} ∥ L^{2} .

Decomposing the spatial frequency variable into fixed dyadic angular sectors spread from the direction

ω

P_{μ} = \sum_{θ} ω Π_{θ} P_{μ}

, this estimate further reduces (after dyadic summing) to being able being able to prove that:

\begin{matrix} ∥ ω Π_{θ} \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} (\underset{̲}{A} ∙ ≪ 1)_{μ} (\partial_{ω}) ∥ L^{p_{γ}} ≲ θ^{γ} μ^{n (\frac{1}{2} - \frac{1}{p_{γ}})} ∥ (\underset{̲}{A} ∙ ≪ 1)_{μ} ∥ L^{2} . \end{matrix}

(194)

We are now almost at the point where we can apply the angular Bernstein inequality 56 directly, because in the current localized setting we have the symbol bounds:

\begin{matrix} ω Π_{θ} \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} S_{| τ | ≲ | ξ |} P_{μ} \approx θ^{- 2} P_{μ}, \end{matrix}

(195)

where we are enforcing the heuristic notation introduced on line 58 . However, since Bernstein only nets us a savings of:

θ^{(n - 1) (\frac{1}{2} - \frac{1}{p_{γ}})} = θ^{1 + γ},

in this context, we need to be a bit more careful in order to gain an extra power of

θ

. This is provided by the fact that the potentials

{\underset{̲}{A} ∙ ≪ 1}

are in the Coulomb gauge. Notice that if say,

\frac{1}{10} < θ

there is nothing to worry about and we have estimate 194 without any problem. On the other-hand, if it is the case that

θ < \frac{1}{10}

, then we can use the fact that

ω Π_{θ} \nabla_{ω}^{- 1}

is elliptic (in terms of symbol bounds) in conjunction with the gauge condition

d^{*} \underset{̲}{A} ∙ ≪ 1 = 0

to write:

\begin{matrix} ω Π_{θ} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) = \nabla_{ω}^{- 1} ω Π_{θ} / d^{*} / \underset{̲}{A} ∙ ≪ 1 \approx θ ω Π_{θ} / \underset{̲}{A} ∙ ≪ 1 . \end{matrix}

(196)

Here

{/ \underset{̲}{A} ∙ ≪ 1}

the induced connection (angular portion) on the hyperplane

ℋ_{ω^{⊥}}

perpendicular to

ω

, and

/ d^{*}

is the associated divergence. We note here that this identity will turn out to be very useful and will be used many times throughout the sequel. With these extra savings in mind, an application of Bernstein now directly yields the desired estimate 194 .

We have now constructed the infinitesimal group elements

ω g_{\pm}

in equations 185 , which is explicitly defined by the formulas 31 in Lemma 3.2 applied to the connection:

\begin{matrix} ω {\underset{̲}{A}}^{\pm} = - ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} {\underset{̲}{A}}_{∙ ≪ 1} (\partial_{ω}) . \end{matrix}

(197)

This has the pleasant effect that we will never need to explicitly refer to the connection

{ω B^{\pm}}

in line 185 . We can calculate the conjugated right hand side of that expression to be:

\begin{matrix} ω g_{\pm}^{- 1} ω B^{\pm} ω g_{\pm} = {\underset{̲}{A}}_{∙ ≪ 1} -^{ω} C^{\pm}, \end{matrix}

(198)

where we have set:

\begin{matrix} ω g_{\pm}^{- 1} d (ω g_{\pm}) =^{ω} C^{\pm} . \end{matrix}

(199)

Using the formulas 31 , we have the following expressions for the spatial components

{ω {\underset{̲}{C}}^{\pm}}

\begin{matrix} (ω {\underset{̲}{C}}^{\pm})^{d f} & = d^{*} Δ^{- 1} [ω {\underset{̲}{C}}^{\pm}, ω {\underset{̲}{C}}^{\pm}], \end{matrix}

(200a)

\begin{matrix} (ω {\underset{̲}{C}}^{\pm})^{c f} & = ω {\underset{̲}{A}}^{\pm} - \nabla_{x} Δ^{- 1} [ω {\underset{̲}{A}}^{\pm}, ω {\underset{̲}{C}}^{\pm}] . \end{matrix}

(200b)

\begin{matrix}  \end{matrix}

In order to compute a formula for the temporal potential

^{ω} C_{0}^{\pm}

, we simply use the fact that

F^{^{ω} C^{\pm}} = 0

and the formula 200b which together imply (by computing

d^{*} E^{^{ω} C^{\pm}}

\begin{matrix} ^{ω} C_{0}^{\pm} = ω A_{0}^{\pm} - \nabla_{t} Δ^{- 1} [ω {\underset{̲}{A}}^{\pm}, ω {\underset{̲}{C}}^{\pm}] - d^{*} Δ^{- 1} [^{ω} C_{0}^{\pm}, ω {\underset{̲}{C}}^{\pm}], \end{matrix}

(201)

where we have:

ω A_{0}^{\pm} = - ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{t} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} {\underset{̲}{A}}_{∙ ≪ 1} (\partial_{ω}) .

We remark here that the importance of the system of equations 200a – 201 is that they give the following decomposition of the infinitesimal gauge transformation

{^{ω} C^{\pm}}

\begin{matrix} ^{ω} C^{\pm} = - \nabla_{t, x} ω {\bar{Π}}^{(\frac{1}{2} - δ)} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) + {Q u a d r a t i c E r r o r} . \end{matrix}

(202)

The linear term in the above expression is enough to kill off the worst error term when differentiating the parametrix 184 . It should be noted that this linear term is precisely what one gets more directly in the abelian case studied in [8] . We should also point out here that the quadratic error on the right hand side of 202 above is much more delicate than the quadratic error resulting form the cancellation involving the linear term in this expression. In order to control this, we will need the full force of the orthogonality properties of our parametrix, which are contained in the bootstrapping assumption 126d , as well as some rather technical function spaces and multilinear estimates which we will develop in Section 11 .

To close out this section, we apply the truncated covariant wave operator

□_{\underset{̲}{A} ∙ ≪ 1}

to the parametrix 184 and record the various error terms which result. We gather this together in the following proposition:

Proposition 8.1 (Error terms for the differentiated parametrix). Consider the parametrix

Φ^{\pm} (\hat{f})

defined by the formula 184 , with infinitesimal gauge transformations given by equations 200a – 201 . Then one has the identity:

\begin{matrix} □_{\underset{̲}{A} ∙ ≪ 1} Φ^{\pm} (\hat{f}) \end{matrix}

(203)

\begin{matrix} = & 4 π i \int_{R^{n}} e^{2 π i λ ω u^{\pm}} [\underset{̲}{A} ∙ ≪ 1 (ω L^{\mp}) -^{ω} C^{\pm} (ω L^{\mp}), ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm}] χ_{(2^{- 1}, 2)} (λ) λ^{n} d λ d ω \end{matrix}

\begin{matrix} - \int_{R^{n}} e^{2 π i λ ω u^{\pm}} [D_{α}^{\underset{̲}{A} ∙ ≪ 1} {(^{ω} C^{\pm})}^{α}, ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm}] χ_{(2^{- 1}, 2)} (λ) λ^{n - 1} d λ d ω \end{matrix}

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u^{\pm}} [{\underset{̲}{A}}^{α} ∙ ≪ 1 - (^{ω} C^{\pm})^{α}, [(\underset{̲}{A} ∙ ≪ 1)_{α} -^{ω} C_{α}^{\pm}, ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm}]] χ_{(2^{- 1}, 2)} (λ) λ^{n - 1} d λ d ω . \end{matrix}

\begin{matrix}  \end{matrix}

Remark 8.2. The worst error term in the expression 203 is of course the “derivative fall on high” term which is the first on the right hand side. However, using the structure equation 126e , this takes the form:

\begin{matrix} \underset{̲}{A} ∙ ≪ 1 (ω L^{\mp}) -^{ω} C^{\pm} (ω L^{\mp}), \end{matrix}

(204)

\begin{matrix} = & \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) + ω Π^{(\frac{1}{2} - δ)} ω L^{\mp} ω L^{\pm} Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) + {Q u a d r a t i c E r r o r}, \end{matrix}

\begin{matrix} = & (I - ω Π^{(\frac{1}{2} - δ)}) \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) + {Q u a d r a t i c E r r o r} . \end{matrix}

\begin{matrix}  \end{matrix}

The key observation now is that since the operator

(I - ω Π^{(\frac{1}{2} - δ)})

cuts off on such a small angular sector with respect to the spatial frequency, an application of Bernstein's inequality gains enough extra spatial derivatives to put this term in the mixed Lebesgue space

L^{2} (L^{n - 1})

. Furthermore, the quadratic error term which is left over involves enough bilinear interactions to go in

L^{1} (L^{\infty})

. So in this sense, as we have mentions before, the problem reduces to something which is reminiscent of wave-maps. Of course, there is a somewhat heavy price to pay for this “renormalization”, which is that it must take place under an integral sign. Finally, it is worth pointing out that this top order cancellation is completely analogous to what happens in the abelian case [8] .

Proof of the error identity 203 . The proof is a simple consequence of using gauge transformations in conjunction with the identity 182 . Applying the truncated covariant wave operator, and differentiating under the integral sign, we see that:

203

\begin{matrix} □_{\underset{̲}{A} ∙ ≪ 1} Φ_{f}^{\pm}, \end{matrix}

\begin{matrix} = & \int_{R^{n}} □_{\underset{̲}{A} ∙ ≪ 1} (e^{2 π i λ ω u^{\pm}} ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm}) χ_{(2^{- 1}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

\begin{matrix} = & \int_{R^{n}} ω g_{\pm}^{- 1} □_{ω B^{\pm}} (e^{2 π i λ ω u^{\pm}} \hat{f} (λ ω)) ω g_{\pm} χ_{(2^{- 1}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

\begin{matrix} = & \int_{R^{n}} e^{2 π i λ ω u_{\pm}} ω g_{\pm}^{- 1} (4 π i λ [ω B^{\pm} (ω L^{\mp}), \hat{f}] + D_{α}^{ω B^{\pm}} [ω {B^{\pm}}^{α}, \hat{f}]) ω g_{\pm} χ_{(2^{- 1}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

\begin{matrix} = & 4 π i \int_{R^{n}} e^{2 π i λ ω u_{\pm}} [ω g_{\pm}^{- 1} ω B^{\pm} (ω L^{\mp}) ω g_{\pm}, ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}] λ^{n - 1} χ_{(2^{- 1}, 2)} (λ) d λ d ω \end{matrix}

(205)

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u_{\pm}} D_{α}^{\underset{̲}{A} ∙ ≪ 1} [ω g_{\pm}^{- 1} ω {B^{\pm}}^{α} ω g_{\pm}, ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}] λ^{n - 1} χ_{(2^{- 1}, 2)} (λ) d λ d ω, \end{matrix}

\begin{matrix} = & 4 π i \int_{R^{n}} e^{2 π i λ ω u_{\pm}} [\underset{̲}{A} ∙ ≪ 1 (ω L^{\mp}) -^{ω} C^{\pm} (ω L^{\mp}), ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}] λ^{n - 1} χ_{(2^{- 1}, 2)} (λ) d λ d ω \end{matrix}

(206)

\begin{matrix} - \int_{R^{n}} e^{2 π i λ ω u_{\pm}} [\nabla_{α} (^{ω} C^{\pm})^{α}, ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}] λ^{n - 1} χ_{(2^{- 1}, 2)} (λ) d λ d ω \end{matrix}

(207)

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u_{\pm}} [(\underset{̲}{A} ∙ ≪ 1)_{α} -^{ω} C_{α}^{\pm}, \nabla^{α} (ω g_{\pm}^{- 1} \hat{f} ω g_{\pm})] λ^{n - 1} χ_{(2^{- 1}, 2)} (λ) d λ d ω \end{matrix}

(208)

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u_{\pm}} [(\underset{̲}{A} ∙ ≪ 1)^{α}, [(\underset{̲}{A} ∙ ≪ 1)_{α} -^{ω} C_{α}^{\pm}, ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}]] λ^{n - 1} χ_{(2^{- 1}, 2)} (λ) d λ d ω \end{matrix}

\begin{matrix} = & (L . H . S .) . \end{matrix}

\begin{matrix}  \end{matrix}

Notice that the equality on the last line follows from:

\nabla_{α} (ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}) = [ω g_{\pm}^{- 1} \hat{f} ω g_{\pm},^{ω} C_{α}^{\pm}],

which is a consequence of line 199 above, followed by the Jacobi identity:

\begin{matrix} [(\underset{̲}{A} ∙ ≪ 1)^{α} - (^{ω} C^{\pm})^{α}, [ω g_{\pm}^{- 1} \hat{f} ω g_{\pm},^{ω} C_{α}^{\pm}]], \end{matrix}

\begin{matrix} = - & [^{ω} C_{α}^{\pm}, [(\underset{̲}{A} ∙ ≪ 1)^{α} - (^{ω} C^{\pm})^{α}, ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}]] \end{matrix}

\begin{matrix} - [ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}, [^{ω} C_{α}^{\pm}, (\underset{̲}{A} ∙ ≪ 1)^{α} - (^{ω} C^{\pm})^{α}]], \end{matrix}

\begin{matrix} = - & [^{ω} C_{α}^{\pm}, [(\underset{̲}{A} ∙ ≪ 1)^{α} - (^{ω} C^{\pm})^{α}, ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}]] \end{matrix}

\begin{matrix} - [[(\underset{̲}{A} ∙ ≪ 1)_{α}, (^{ω} C^{\pm})^{α}], ω g_{\pm}^{- 1} \hat{f} ω g_{\pm}]] . \end{matrix}

\begin{matrix}  \end{matrix}

This completes the proof of 203 . □

⁵ For those who are familiar with this kind of problem, this is precisely a reduction to the famous $L o w \times H i g h$ frequency interaction ${\underset{̲}{A}}^{α} ∙ ≪ 1 \nabla_{α} Φ_{1}$ .

⁶ This would end up being the usual a frequency based Hadamard parametrix for the operator $□_{\underset{̲}{A} ∙ ≪ 1}$ .

⁷ This is an artifact of the critical nature of the problem.

Specifically, the group elements have the heuristic form

ω g = e x p (\nabla^{- 1} ω A)

. Since we do not have

L^{\infty}

control on

\nabla^{- 1} ω A

we cannot localize its image.

⁸ It is very much our philosophy here that this problem is essentially equivalent to wave-maps after a microlocalization.

Of course, as the reader will see, this microlocalization is quite costly and introduces many objects that are not present in the original wave-maps problem.

9 Fixed Time $L^{2}$ Estimates for the Parametrix

We now begin our proof of the estimates 178 for the integral operator 184 introduced in the last section. Here we cover bounds which are of non-differentiated energy type. Specifically, we will show the undifferentiated

L^{\infty} (L^{2})

estimate contained in 178a , as well as the multiplier-approximation bound 178d . Both of these will follow from the same set of estimates. At a heuristic level, they are not much more involved that a standard

T T^{*}

argument followed by some integration by parts, although the details turn out to be a bit involved. Things will be computed more or less directly by an appeal to the explicit equations 200a – 201 , taking a little bit of care to use them properly. This will be done by considering them as “path lifting” formulas from Minkowski space

ℳ^{n}

to the compact group

G

This allows us to employ an integral form of the intermediate value theorem from elementary calculus which is valid in the context of Lie groups. It turns out that this identity can be differentiated as many times as necessary with respect to the angular frequency variable, although this fact is provided through a surprisingly delicate bootstrapping argument. Here the unitarity of the group is needed in a crucial way to keep everything from collapsing. Once the bootstrapping is complete, the estimates themselves will be proved using a “trace-Bernstein” type inequality that we construct by hand using various multipliers. Once the integration by parts portion of things is taken care of, we will close the

L^{2}

estimate by showing that a “non-smooth” remainder kernel has small amplitudes after integration in the angular frequency variable. This involves some fairly technical bilinear estimates because the necessary othogonality arguments are difficult to pass through Hodge systems. The details of these procedures are as follows.

Throughout this section we will replace the specific cutoff function

χ_{(\frac{1}{2}, 2)}

appearing in the definition of parametrix 184 with an arbitrary smooth scalar bump function

χ (ξ)

that we may assume to be supported in the frequency annulus

{4^{- 1} < | ξ | < 4}

At fixed time

t_{0}

, we define the operator

T (\hat{f}) = Φ (\hat{f}) (t_{0})

, where we have suppressed the

\pm

notation because it will be irrelevant for what we do here. Our first goal is the prove the bound:

\begin{matrix} ∥ T (\hat{f}) ∥ L^{2} ≲ ∥ \hat{f} ∥ L^{2} . \end{matrix}

(209)

Squaring this, it suffices to show that (here

f

has no relation to

\hat{f}

and simply represents a function of the physical-space variables):

\begin{matrix} ∥ T T^{*} (f) ∥ L^{2} ≲ ∥ f ∥ L^{2}, \end{matrix}

(210)

where the adjoint

T^{*}

is taken with respect to the Killing form 13 . A quick calculation of the kernel of this operator shows that:

\begin{matrix} K^{T T^{*}} (x, y) = \int_{R^{n}} e^{2 π i (x - y) \cdot ξ} ω g^{- 1} (x) ω g (y) [∙] ω g^{- 1} (y) ω g (x) χ (ξ) d ξ, \end{matrix}

(211)

where we use the

[∙]

notation to emphasize the fact that this operator acts via conjugation. Our task is now to show the estimates:

\begin{matrix} ∥ K^{T T^{*}} ∥ L_{y}^{\infty} (L_{x}^{1}), ∥ K^{T T^{*}} ∥ L_{x}^{\infty} (L_{y}^{1}) ≲ 1 . \end{matrix}

(212)

Since

K^{T T^{*}}

is essentially symmetric in

(x, y)

, we may concentrate on the first such estimate.

To proceed, we first decompose the product physical space

R n \times R n

into the dyadic regions:

\begin{matrix} D_{σ} = {| x - y | \sim σ | σ = 2^{i}, i \in N} . \end{matrix}

(213)

We then decompose the kernel

T T^{*}

kernel into the dyadic sum:

K^{T T^{*}} = \sum_{σ} χ_{D_{σ}} K^{T T^{*}} = \sum_{σ} K_{σ}^{T T^{*}} .

By dyadic summing, to show 212 it suffices to be able to show the single estimate:

\begin{matrix} ∥ K_{σ}^{T T^{*}} ∥ L_{y}^{\infty} (L_{x}^{1}) ≲ σ^{- γ}, \end{matrix}

(214)

where

0 < γ ≪ 1

now represents a small savings in physical space decay. Now 214 would be easy to show if we had the absolute decay estimate:

| K_{σ}^{T T^{*}} (x, y) | ≲ | x - y |^{- (n + γ)},

and this is almost true. Unfortunately, there is a regularity problem due to the degeneracy of the sub-Laplacean

Δ_{ω^{⊥}}

used in the connection 200 which provides the group elements

ω g

. This forces us to write the kernel

K_{σ}^{T T^{*}}

as a sum of two terms:

\begin{matrix} K_{σ}^{T T^{*}} = {\tilde{K}}_{σ}^{T T^{*}} + ℛ_{σ}^{T T^{*}} . \end{matrix}

(215)

We will then prove that both:

\begin{matrix} | {\tilde{K}}_{σ}^{T T^{*}} (x, y) | & ≲ | x - y |^{- (n + γ)}, \end{matrix}

(216)

\begin{matrix} ∥ ℛ_{σ}^{T T^{*}} ∥ L_{y}^{\infty} (L_{x}^{1}) & ≲ σ^{- γ} . \end{matrix}

(217)

\begin{matrix}  \end{matrix}

To define the splitting 215 , we factor the group elements

ω g

into a product of smooth and small parts. This is completely analogous to the procedure used in [8] , but since things are non-abelian (and hence non-linear) here, the estimates required are quite a bit more involved. What we will do is construct another gauge transformation

\tilde{ω g}

, which is based on a further smoothing of the connection 197 .

This will produce a group element which can be treated as a standard symbol. To this end, we define the scale mollified connection:

\begin{matrix} \tilde{ω {\underset{̲}{A}}^{(σ)}} = - ω {\bar{Π}}_{σ^{- 1 + γ} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(218)

where

γ

is, again, the small dimensional constant from line 190 . Again, we have dropped the

\pm

notation because it is irrelevant. Following the proof of 191 , and using the fact that the multipliers

ω {\bar{Π}}_{σ^{- 1 + γ} < ∙}

are bounded on frequency localized Lebesgue spaces, we may apply Lemma 3.2 to the connection

{\tilde{ω {\underset{̲}{A}}^{(σ)}}}

This produces a group element

\tilde{ω g}

, which is defined by the infinitesimal generator:

\begin{matrix} {\tilde{ω g}}^{- 1} d (\tilde{ω g}) = \tilde{ω \underset{̲}{C}} . \end{matrix}

(219)

Furthermore, this generator is itself defined via the Hodge system:

\begin{matrix} (\tilde{ω \underset{̲}{C}})^{d f} & = d^{*} Δ^{- 1} [\tilde{ω \underset{̲}{C}}, \tilde{ω \underset{̲}{C}}], \end{matrix}

(220a)

\begin{matrix} (\tilde{ω \underset{̲}{C}})^{c f} & = \tilde{ω {\underset{̲}{A}}^{(σ)}} - \nabla_{x} Δ^{- 1} [\tilde{ω {\underset{̲}{A}}^{(σ)}}, \tilde{ω \underset{̲}{C}}] . \end{matrix}

(220b)

\begin{matrix}  \end{matrix}

Using this new group element

\tilde{ω g}

, we define the remainder group element

ω h

via the product:

\begin{matrix} ω g = ω h \tilde{ω g} . \end{matrix}

(221)

To compute the infinitesimal generator of

ω h

, we first use the identity:

\begin{matrix} d (ω h) & = d (ω g) {\tilde{ω g}}^{- 1} + g d ({\tilde{ω g}}^{- 1}), \end{matrix}

\begin{matrix} = ω h \tilde{ω g} (ω \underset{̲}{C} - \tilde{ω \underset{̲}{C}}) {\tilde{ω g}}^{- 1} . \end{matrix}

(222)

\begin{matrix}  \end{matrix}

This leads us to define the difference connection:

\begin{matrix} \tilde{\tilde{ω \underset{̲}{C}}} = ω \underset{̲}{C} - \tilde{ω \underset{̲}{C}} . \end{matrix}

(223)

A quick calculation using the systems 200 and 220 shows that this new connection can be pinned down via the Hodge system:

\begin{matrix} (\tilde{\tilde{ω \underset{̲}{C}}})^{d f} & = d^{*} Δ^{- 1} ([\tilde{ω \underset{̲}{C}}, \tilde{\tilde{ω \underset{̲}{C}}}] + [\tilde{\tilde{ω \underset{̲}{C}}}, \tilde{ω \underset{̲}{C}}]), \end{matrix}

(224a)

\begin{matrix} (\tilde{\tilde{ω \underset{̲}{C}}})^{c f} & = \tilde{ω \underset{̲}{A}} - \tilde{ω {\underset{̲}{A}}^{(σ)}} - \nabla_{x} Δ^{- 1} ([\tilde{ω \underset{̲}{A}} - \tilde{ω {\underset{̲}{A}}^{(σ)}}, \tilde{ω \underset{̲}{C}}] + [\tilde{ω {\underset{̲}{A}}^{(σ)}}, \tilde{\tilde{ω \underset{̲}{C}}}]), \end{matrix}

(224b)

\begin{matrix}  \end{matrix}

where a simple computation shows that:

\begin{matrix} ω \underset{̲}{A} - \tilde{ω {\underset{̲}{A}}^{(σ)}} = - ω {\bar{Π}}_{∙ ⩽ σ^{- 1 + γ}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(225)

We now define the decomposition 215 along the following decompositions of the group element products in the kernel 211 :

\begin{matrix} ω g^{- 1} (x) ω g (y) & = {\tilde{ω g}}^{- 1} (x) \tilde{ω g} (y) + {\tilde{ω g}}^{- 1} (x) (ω h^{- 1} (x) ω h (y) - I) \tilde{ω g} (y), \end{matrix}

(226)

\begin{matrix} ω g^{- 1} (y) ω g (x) & = {\tilde{ω g}}^{- 1} (y) \tilde{ω g} (x) + {\tilde{ω g}}^{- 1} (y) (ω h^{- 1} (y) ω h (x) - I) \tilde{ω g} (x) . \end{matrix}

(227)

\begin{matrix}  \end{matrix}

Accordingly, we define:

\begin{matrix} {\tilde{K}}^{T T^{*}} (x, y) = \int_{R^{n}} e^{2 π i (x - y) \cdot ξ} {\tilde{ω g}}^{- 1} (x) \tilde{ω g} (y) [∙] {\tilde{ω g}}^{- 1} (y) \tilde{ω g} (x) χ (ξ) d ξ, \end{matrix}

(228)

and then define

ℛ_{σ}^{T T^{*}}

according to the formula 215 . The idea now is that while one can only perform integration by parts in the kernel 228 above, the group element

ω h^{- 1} (x) ω h (y)

and its inverse, which must be contained as at least one factor in the remainder, are so close to the identity matrix that the resulting difference expression can be estimated without use of the oscillations which take place under the integral sign.

We now begin our proof of the estimate 216 . To do this, we simply integrate by parts as may times as necessary with respect to the variable

ξ

in order to pick up the needed point-wise decay. Doing this, we see that in order to draw our conclusion, it suffices to show the following symbol bounds for

1 ⩽ k

\begin{matrix} χ_{D_{σ}} ∥ \nabla_{ξ}^{k} ({\tilde{ω g}}^{- 1} (x) \tilde{ω g} (y)) ∥ ≲ ℰ \cdot σ^{k (1 - γ)}, \end{matrix}

(229)

\begin{matrix} χ_{D_{σ}} ∥ \nabla_{ξ}^{k} ({\tilde{ω g}}^{- 1} (y) \tilde{ω g} (x)) ∥ ≲ ℰ \cdot σ^{k (1 - γ)} . \end{matrix}

(230)

\begin{matrix}  \end{matrix}

In fact, we shall prove the following more general bounds, which contain 229 – 230 as a special case, and which will be useful in the sequel:

Proposition 9.1 (Symbol bounds for the smoothed amplitudes

{\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (s, y)

and

{\tilde{ω g}}^{- 1} (s, y) \tilde{ω g} (t, x)

). Let the group elements

\tilde{ω g}

be defined infinitesimally by the Hodge system 220 , where the parameter

σ^{- 1 + γ}

is replaced by

M^{- 1}

, where

M

lies in the range:

\begin{matrix} (| t - s | + | x - y |)^{\frac{1}{2}} ⩽ M ⩽ | t - s | + | x - y | . \end{matrix}

(231)

Then for any integer

1 ⩽ k

, one has the following symbol bounds assuming that the bootstrapping constant

ℰ

from line 126d is chosen sufficiently small (with respect to each fixed

k

\begin{matrix} ∥ \nabla_{ξ}^{k} ({\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (s, y)) ∥ ≲ ℰ \cdot M^{k}, \end{matrix}

(232)

\begin{matrix} ∥ \nabla_{ξ}^{k} ({\tilde{ω g}}^{- 1} (s, y) \tilde{ω g} (t, x)) ∥ ≲ ℰ \cdot M^{k} . \end{matrix}

(233)

\begin{matrix}  \end{matrix}

Here the

\nabla_{ξ}^{k}

notation is shorthand for all

k^{t h}

order partial derivatives involving the variable

ξ

, and

∥ \cdot ∥

is the standard matrix vector-norm from line 15 . The implicit constants on the right hand side depend on

k

, but are uniform in the parameter

M

for each fixed

k

Proof of the estimates 232 – 233 . It suffices for us to prove the first bound 232 , as the second follows from virtually identical reasoning. The goal is to reduce this via an ODE bootstrapping type argument to an associated estimate involving the connection

{\tilde{^{ω} C}}

. This associated estimate will then be proved by another bootstrapping argument in certain mixed Lebesgue-Besov spaces naturally associated with the ODE problem from the first step. The goal of the second bootstrapping will be to reduce things to proving the Besov estimates for the connection

{\underset{̲}{A} ∙ ≪ 1}

which appears as the linear term on the right hand side of the Hodge system 220a .

Before proceeding, we first make a preliminary reduction on the product

{\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (s, y)

We would like be set up as to only have to handle products which involve the same space or same time variables. This is easily accomplished via the product decomposition:

\begin{matrix} {\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (s, y) = {\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (t, y) \cdot {\tilde{ω g}}^{- 1} (t, y) \tilde{ω g} (s, y) . \end{matrix}

(234)

It is clear that if we can produce the bounds 232 for each of the terms on the right hand side of 234 separately, then by the product rule for derivatives we have the estimate 232 for the full term. Since they require slightly different arguments, we will proceed separately for each of these two factors.

Our first task is to prove the bound 232 for the spatial product

{\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (t, y)

This will be done inductively with respect to the value of

k

. Since we will proceed via a bootstrapping type procedure, we first assume that we can prove the desired bounds over small intervals and then try to use this knowledge to extend things to longer intervals. To do this, we differentiate the product

{\tilde{ω g}}^{- 1} (t, ℓ) \tilde{ω g} (t, y)

, where

[y, ℓ]

is some shorter line segment inside of

[y, x]

, with respect to the operators

(M^{- 1} \nabla_{ξ})^{k}

. This yields the equation:

(235) ( M − 1 ∇ ξ ) k ( ω g ~ − 1 ( ℓ ) ω g ~ ( y ) ) = ∑ i = 0 k − 1 ( M − 1 ∇ ξ ) k − i ( ω g ~ − 1 ( ℓ ) ω g ~ ( x 1 ) ) ⋅ ( M − 1 ∇ ξ ) i ( ω g ~ − 1 ( x 1 ) ω g ~ ( y ) ) + ( ω g ~ − 1 ( ℓ ) ω g ~ ( x 1 ) ) ⋅ ( M − 1 ∇ ξ ) k ( ω g ~ − 1 ( x 1 ) ω g ~ ( y ) ) .

In the above identity, we have dropped the dependence on time as it no longer has any bearing on how we proceed. Also

[x_{1}, ℓ]

denotes an even smaller interval embedded in the overall bootstrapping line segment

[y, ℓ]

. We will let this smaller segment go to zero. Before doing this, we collect the last term on the right hand side of 235 onto the left, apply the matrix norm 15 and the reverse triangle inequality, and use the isometric identity 16 to arrive at the bound:

(236) | ∥ ( M − 1 ∇ ξ ) k ( ω g ~ − 1 ( ℓ ) ω g ~ ( y ) ) ∥ − ∥ ( M − 1 ∇ ξ ) k ( ω g ~ − 1 ( x 1 ) ω g ~ ( y ) ) ∥ | ⩽ ∑ i = 0 k − 1 ∥ ( M − 1 ∇ ξ ) k − i ( ω g ~ − 1 ( ℓ ) ω g ~ ( x 1 ) ) ∥ ⋅ ∥ ( M − 1 ∇ ξ ) i ( ω g ~ − 1 ( x 1 ) ω g ~ ( y ) ) ∥ .

We now divide both sides of this last expression by the small interval length

| x_{1} - ℓ |

and let the resulting expression go the limit

x_{1} \to ℓ

. To compute this, we only need to handle the expressions:

\begin{matrix} {lim}_{x_{1} \to ℓ} | x_{1} - ℓ |^{- 1} \cdot ∥ (M^{- 1} \nabla_{ξ})^{k - i} ({\tilde{ω g}}^{- 1} (ℓ) \tilde{ω g} (x_{1})) ∥, \end{matrix}

(237)

where we have the important restriction

1 ⩽ k - i

. We do this by using the fact that the gauge equation 219 gives us an explicit realization of the product

{\tilde{ω g}}^{- 1} (ℓ) \tilde{ω g} (x_{1})

as an integral over the interval

[x_{1}, ℓ]

\begin{matrix} {\tilde{ω g}}^{- 1} (ℓ) \tilde{ω g} (x_{1}) = \int_{x_{1}}^{ℓ} {\tilde{ω g}}^{- 1} (x_{1}) \tilde{ω g} (s) {\tilde{ω \underset{̲}{C}}}_{α (ℓ)} (s) d s + I . \end{matrix}

(238)

Here the

α (ℓ)

index denotes the component of the connection

{ω \underset{̲}{C}}

in the direction of the line segment

[y, x]

. Plugging this last expression into the limit 237 and using the fundamental theorem of calculus on the resulting identity we arrive at the simple equation: 237

\begin{matrix} = ∥ (M^{- 1} \nabla_{ξ})^{k - i} ({\tilde{ω \underset{̲}{C}}}_{α (ℓ)} (ℓ)) ∥ . \end{matrix}

(239)

Notice that the identity matrix on line 238 drops out because of the condition

1 ⩽ k - i

, and that all terms where the derivatives fall on the group elements are zero because when

x_{1} = ℓ

these are again just derivatives of the identity matrix

I

. Now, substituting 239 into the limiting version of 236 we have the differential inequality:

(240) | ∥ ( M − 1 ∇ ξ ) k ( ω g ~ − 1 ( ℓ ) ω g ~ ( y ) ) ∥ ′ | ⩽ ∑ i = 0 k − 1 ∥ ( M − 1 ∇ ξ ) k − i ( ω C ̲ ~ α ( ℓ ) ( ℓ ) ) ∥ ⋅ ∥ ( M − 1 ∇ ξ ) i ( ω g ~ − 1 ( ℓ ) ω g ~ ( y ) ) ∥ .

Assuming now that we have proved the inductive bound:

{sup}_{0 ⩽ i ⩽ k - 1} ∥ (M^{- 1} \nabla_{ξ})^{i} ({\tilde{ω g}}^{- 1} (ℓ) \tilde{ω g} (y)) ∥ ≲ 1,

which is easy when

k - 1 = 0

on account of the compactness of the group

O (m)

, we see that by integrating the expression

∥ (M^{- 1} \nabla_{ξ})^{k} ({\tilde{ω g}}^{- 1} (ℓ) \tilde{ω g} (y)) ∥^{'}

the proof of 229 at the

k^{t h}

step boils down to being able to establish the line integral estimate:

\begin{matrix} \sum_{i = 0}^{k - 1} \int_{y}^{x} ∥ (M^{- 1} \nabla_{ξ})^{k - i} ({\tilde{ω \underset{̲}{C}}}_{α (ℓ)} (ℓ)) ∥ d ℓ ≲ ℰ . \end{matrix}

(241)

The reason this bound will be possible is that we have taken care to make sure that there is always at least one copy of the operator

(M^{- 1} \nabla_{ξ})

in each of the above integrals, and it is the presence of the extra factor

M^{- 1}

in conjunction with the range restriction 231 that will be enough to provide the needed integrability. In fact, using the condition that

M^{- 1} ⩽ | x - y |^{- \frac{1}{2}}

and the Cauchy-Schwartz inequality, we see that it suffices to be able to prove the bound:

\begin{matrix} \sum_{i = 0}^{k - 1} \int_{y}^{x} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} ({\tilde{ω \underset{̲}{C}}}_{α (ℓ)} (ℓ)) ∥^{2} d ℓ ≲ ℰ^{2} . \end{matrix}

(242)

This last integral can now be bounded in terms of energy type estimates once one applies the

L^{\infty} \to L^{2}

trace theorem to it. However, because of the various angular degeneracies involved in the potentials

{\nabla_{ξ} \tilde{ω \underset{̲}{C}}}

, it will be necessary for us to use a more refined “trace-Bernstein” type inequality. Furthermore, since the connection

{\tilde{ω \underset{̲}{C}}}

is only defined implicitly via the Hodge system 220 , it will be necessary for us to prove estimate 242 via a bootstrapping argument in mixed Lebesgue spaces.

What we will do is to show the following somewhat more restrictive estimate which yields 242 as a consequence:

Lemma 9.2. Let the connection

{\tilde{ω \underset{̲}{C}}}

be defined via the Hodge system 220 :

\begin{matrix} (\tilde{ω \underset{̲}{C}})^{d f} & = d^{*} Δ^{- 1} [\tilde{ω \underset{̲}{C}}, \tilde{ω \underset{̲}{C}}], \end{matrix}

(243a)

\begin{matrix} (\tilde{ω \underset{̲}{C}})^{c f} & = \tilde{ω {\underset{̲}{A}}^{(M)}} - \nabla_{x} Δ^{- 1} [\tilde{ω {\underset{̲}{A}}^{(M)}}, \tilde{ω \underset{̲}{C}}] . \end{matrix}

(243b)

\begin{matrix}  \end{matrix}

where we have set:

\begin{matrix} \tilde{ω {\underset{̲}{A}}^{(M)}} = - \nabla_{x} ω {\bar{Π}}_{M^{- 1} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) . \end{matrix}

(244)

Furthermore, the parameter

M^{- 1}

which lies in the range 231 (although this is not essential). Then the following mixed Lebesgue space estimates of Besov type hold:

\begin{matrix} \sum_{i = 0}^{k - 1} \sum_{μ} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} P_{μ} (\tilde{ω \underset{̲}{C}}) ∥ L_{ℓ^{⊥}}^{\infty} (L_{ℓ}^{2}) ≲ ℰ . \end{matrix}

(245)

Proof of estimate 245 . Things will be a bit easier if we prove the following more restrictive estimate:

\begin{matrix} \sum_{i = 0}^{k - 1} \sum_{μ} μ^{- γ} (1 + μ)^{n} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} P_{μ} (\tilde{ω \underset{̲}{C}}) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) ≲ ℰ . \end{matrix}

(246)

That 245 is a consequence of 246 is a simple matter applying the Minkowski inequality for mixed Lebesgue spaces and the fact that the weights in 246 are clearly more restrictive. Now, the proof of this second estimate is essentially no more complicated than using the Bernstein inequality in the hyperplane plane

R_{ℓ^{⊥}}^{n - 1}

to turn things into the energy estimate contained in the bootstrapping norm 126d .

To see this, we begin our proof of 246 by first establishing this bound for the reduced Coulomb potentials

{\tilde{ω {\underset{̲}{A}}^{(M)}}}

We are now trying to prove that:

\begin{matrix} \sum_{j = 0, 1} \sum_{i = 0}^{k - 1} \sum_{μ} μ^{- γ} (1 + μ)^{n} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ}^{j} P_{μ} (\tilde{ω {\underset{̲}{A}}^{(M)}}) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) ≲ ℰ . \end{matrix}

(247)

For each fixed frequency in the above sum, we decompose things into all frequencies corresponding to the

R_{ℓ^{⊥}}^{n - 1}

plane, as well as all possible dyadic angular sectors spread from the

ω

(fixed) direction:

P_{μ} = \sum_{θ, λ : λ ⩽ μ} ω Π_{θ} Q_{λ} P_{μ},

where

Q_{λ}

is an

(n - 1)

dimensional fixed frequency multiplier which is defined in analogy with

P_{λ}

. Freezing all frequencies, our goal will be to show the following estimate:

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ}^{j} ω Π_{θ} Q_{λ} P_{μ} (\tilde{ω {\underset{̲}{A}}^{(M)}}) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) ≲ θ^{γ} {(\frac{λ}{μ})}^{γ} μ^{2 γ} \cdot ℰ . \end{matrix}

(248)

By adding in the weights

μ^{- γ} (1 + μ)^{n}

, using the fact that the potentials

{\tilde{ω {\underset{̲}{A}}^{(M)}}}

are truncated to frequencies

μ ≪ 1

, and dyadic summing, the fixed frequency estimate 248 implies 247 with room to spare. To deal with all of the

ξ

derivatives, notice that we have the following heuristic multipliers bounds:

\begin{matrix} (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ}^{j} ω Π_{θ} Q_{λ} P_{μ} (\tilde{ω {\underset{̲}{A}}^{(M)}}) \approx θ^{- 2} ω Π_{θ} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Q_{λ} P_{μ} (\underset{̲}{A} ∙ ≪ 1), \end{matrix}

(249)

where we are enforcing the notation introduced on line 58 . That is, the left hand side of the above identity satisfies all mixed Lebesgue space bounds as the right hand side with the same constants. Notice that this bound uses the extra Coulomb savings introduced on line 196 above to kill off one power of

θ^{- 1}

from the degenerate Laplacean

Δ_{ω^{⊥}}

. The other power of

θ^{- 1}

on the right hand side of 249 comes from the operator

\nabla_{ξ}

which has no smoothing factor of

M^{- 1}

. This is precisely what one pays for passing from the

L^{1}

integral 241 to the more manageable

L^{2}

integral 242 . Finally, it is important to point out that although we have not emphasized it, the multipliers

Q_{λ}

depend on

ω

, but the fact that

λ ≪ θ

implies that the multiplier product on the left hand side of 249 is zero prevents the derivatives of

Q_{λ}

with respect to

ξ

from costing more than derivatives of

ω Π_{θ}

(alternatively, we could have applied the

Q_{λ}

multipliers on the outside of the

\nabla_{ξ}^{k}

operators, because differentiation will not change the support of the various multipliers).

Now, to use the Bernstein inequality on the

R_{ℓ^{⊥}}^{n - 1}

plane, we simply note that one has the multiplier identity:

\begin{matrix} ω Π_{θ} Q_{λ} P_{μ} =^{ω | | ℓ^{⊥}} B_{(μ θ)} ω Π_{θ} Q_{λ} P_{μ}, \end{matrix}

(250)

where

^{ω | | ℓ^{⊥}} B_{(μ θ)}

is a (smooth symbol) block type cutoff in the

R_{ℓ^{⊥}}^{n - 1}

frequency plane of dimensions

1 \times (μ θ) \times \dots \times (μ θ)

which has its long side centered along the projection⁹ of the unit vector

ω

onto the

R_{ℓ^{⊥}}^{n - 1}

(frequency) plane. The crucial fact about the geometry of the multiplier 250 is that is has support contained in a box of size

λ \times (μ θ) \times \dots \times (μ θ)

in the

R_{ξ}^{n - 1}

(frequency) plane. Using now the identities 249 and 250 , as well as the

n - 1

dimensional Bernstein inequality, we see that we may estimate:

(251) ∥ ( M − 1 ∇ ξ ) i ∇ ξ j ω Π θ Q λ P μ ( ω A ̲ ( M ) ~ ) ∥ L ℓ 2 ( L ℓ ⊥ ∞ ) ≲ θ − 2 ⋅ λ 1 2 ( μ θ ) n − 2 2 ∥ P μ ( A ̲ ∙ ≪ 1 ) ∥ L 2 .

To deal with the weights on the right hand side, we use the truncation condition that

μ^{\frac{1}{2} - δ} ⩽ θ

, as well as the fact that

λ ⩽ μ

to conclude the bound:

θ^{- 2} \cdot λ^{\frac{1}{2}} (μ θ)^{\frac{n - 2}{2}} ⩽ μ^{\frac{n - 2}{2}} \cdot θ^{γ} {(\frac{λ}{μ})}^{γ} μ^{2 γ} .

Substituting this into the right hand side of estimate 251 and using the

L^{\infty} ({\dot{H}}^{\frac{n - 2}{2}})

bound contained in the bootstrapping estimate 126d , we have achieved the desired result 248 .

It is now our task to use 247 and the Hodge system 243 to pass to the more general estimate 245 . In order to do this, it will be necessary for us to first prove some critical estimates for the potentials

{\tilde{ω \underset{̲}{C}}}

. These will then be used as a reference point in certain bilinear estimates involving the space used to define estimate 245 .

While we're at it, this will also give us a chance to prove some estimates which will be used many times in the sequel. What we will show is that:

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{k} ω \underset{̲}{C} ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} & ≲ ℰ, \end{matrix}

(252)

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{k} \nabla_{t} ω \underset{̲}{C} ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} & ≲ ℰ, \end{matrix}

(253)

\begin{matrix}  \end{matrix}

where

p_{γ}

is exponent defined on line 193 above. Both of the bounds 252 – 253 will easily follow via our general Besov embedding 45 once we have established them for the linear term on the right hand side of the Hodge system 243 . That is, we fist establish that:

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{k} \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} & ≲ ℰ, \end{matrix}

(254)

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{k} \nabla_{t} \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} & ≲ ℰ, \end{matrix}

(255)

\begin{matrix}  \end{matrix}

These follow from immediately from the steps used to prove 192 above, and the following heuristic identity which follows our convention established on line 58 :

\begin{matrix} \nabla_{ξ}^{k} (ω Π_{θ} P_{μ} \tilde{ω {\underset{̲}{A}}^{(M)}}) & \approx θ^{- k} ω Π_{θ} ω {\bar{Π}}_{M^{- 1} < ∙} P_{μ} \nabla_{x} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(256)

\begin{matrix} \nabla_{ξ}^{k} (ω Π_{θ} P_{μ} \nabla_{t} \tilde{ω {\underset{̲}{A}}^{(M)}}) & \approx μ θ^{- k} ω Π_{θ} ω {\bar{Π}}_{M^{- 1} < ∙} P_{μ} \nabla_{x} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(257)

\begin{matrix}  \end{matrix}

Notice that the space-time frequency localization 126c allows us to trade the

\nabla_{t}

with the factor of

μ

on the second line above.

We now prove the estimates 252 – 253 by proceeding inductively on the value of

k

. If

k = 0

the first estimate 252 holds because one can solve the system 243 via Picard iteration in the space

{\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}

thanks to the bilinear embedding 45 which furnishes the embedding:

\begin{matrix} \nabla_{x} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} ↪ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} . \end{matrix}

(258)

The key thing to point out here is that for

γ

sufficiently small, and in dimensions

6 ⩽ n

we have the bound

p_{γ} < n

, which is all that is needed to satisfy the gap condition 47 in this case. The other conditions of Lemma 4.1 are also easily seen to be satisfied for this set of indices.

To establish 252 for

0 < k

, we simply differentiate the system 243

k

times with respect to the operator

(M^{- 1} \nabla_{ξ})

. Doing this yields the linearized set of equations:

\begin{matrix} (M^{- 1} \nabla_{ξ})^{k} (\tilde{ω \underset{̲}{C}})^{d f} & = \sum_{j = 0}^{k} d^{*} Δ^{- 1} [(M^{- 1} \nabla_{ξ})^{k - j} \tilde{ω \underset{̲}{C}}, (M^{- 1} \nabla_{ξ})^{j} \tilde{ω \underset{̲}{C}}], \end{matrix}

(259)

\begin{matrix} (M^{- 1} \nabla_{ξ})^{k} (\tilde{ω \underset{̲}{C}})^{c f} & = (M^{- 1} \nabla_{ξ})^{k} \tilde{ω {\underset{̲}{A}}^{(M)}} - \end{matrix}

(260)

\begin{matrix} \sum_{j = 0}^{k} \nabla_{x} Δ^{- 1} [(M^{- 1} \nabla_{ξ})^{k - j} \tilde{ω {\underset{̲}{A}}^{(M)}}, (M^{- 1} \nabla_{ξ})^{j} \tilde{ω \underset{̲}{C}}], \end{matrix}

(261)

\begin{matrix}  \end{matrix}

which can again be solved in the Besov space

{\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}

by using the already established estimate 254 for the linear term, in conjunction with the (inductive) hypothesis that estimate 252 holds for

k - 1

, and absorbing the highest derivative (involving

(M^{- 1} \nabla_{ξ})

falling on

\tilde{ω \underset{̲}{C}}

) term to the left hand side. All of this is permissible by referring to the embedding 258 .

To prove the second estimate 253 above, we first apply the time derivative

\nabla_{t}

to both sides of the system 259 – 261 above. The resulting system of equations, which we will not write down, can easily be solved in the derivative critical Besov space

{\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})}

by again using an induction on

k

, the already established estimate 255 for the linear term, and the following bilinear Besov estimate which is again a special case of 45 :

\begin{matrix} \nabla_{x} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} ↪ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} . \end{matrix}

(262)

Notice that 262 is permissible because for

γ

sufficiently small, we have the condition

p_{γ} < \frac{2 n}{3}

in dimensions

6 ⩽ n

which is necessary to get around the gap condition 47 . The other conditions of 45 are easily satisfied for this choice of indices.

Armed with estimates 247 and 252 , we now move back to the proof of estimate 245 . We set the norm in that latter bound equal to:

∥ A ∥ N_{1}^{- γ, 2, \infty} = \sum_{μ} μ^{- γ} (1 + μ)^{n} ∥ P_{μ} (A) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) .

By differentiating the system 243 with respect to the operators

(M^{- 1} \nabla_{ξ})^{k} \nabla_{ξ}

, we see that the claim will now follow once we can prove the bilinear Riesz operator bound:

\begin{matrix} \nabla_{x} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot N_{1}^{- γ, 2, \infty} ↪ N_{1}^{- γ, 2, \infty} . \end{matrix}

(263)

We now let

A

and

C

be any two elements of the two spaces on the left hand side of 263 . By applying the trichotomy, we see that it suffices to be able to prove the three estimates:

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} ≪ μ_{2} λ \sim μ_{2}} λ^{- γ} (1 + λ)^{n} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) & ≲ \end{matrix}

(264)

\begin{matrix} ∥ A ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot ∥ C ∥ N_{1}^{- γ, 2, \infty}, \end{matrix}

(265)

\begin{matrix} \sum_{λ, μ_{i} : μ_{2} ≪ μ_{1} λ \sim μ_{1}} λ^{- γ} (1 + λ)^{n} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) & ≲ \end{matrix}

(266)

\begin{matrix} ∥ A ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot ∥ C ∥ N_{1}^{- γ, 2, \infty}, \end{matrix}

(267)

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} \sim μ_{2} λ ≲ μ_{1}, μ_{2}} λ^{- γ} (1 + λ)^{n} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) & ≲ \end{matrix}

(268)

\begin{matrix} ∥ A ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot ∥ C ∥ N_{1}^{- γ, 2, \infty} . \end{matrix}

(269)

\begin{matrix}  \end{matrix}

The proofs of 265 – 267 are very simple, and follow from essentially the same principle. First of all, we use the fact that the kernel of the fixed frequency operator

λ \cdot \nabla_{x} Δ^{- 1} P_{λ}

is in

L^{1}

with norm independent of

λ

. Thus, it is bounded on all mixed Lebesgue spaces. This, used in conjunction with the estimate:

\begin{matrix} \sum_{μ_{1} : μ_{1} ≲ λ} λ^{- 1} ∥ P_{μ_{1}} A ∥ L^{\infty} ≲ ∥ A ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}, \end{matrix}

(270)

which follows easily from Bernstein's inequality and dyadic summing, is enough for us to conclude the first estimate 265 . To conclude the second estimate, 267 , we simply employ the fixed frequency version of 270 and then estimate:

\begin{matrix} \sum_{λ, μ_{2} : μ_{2} ≲ λ} λ^{- γ} ∥ P_{μ_{2}} C ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) & = \sum_{λ, μ_{2} : μ_{2} ≲ λ} {(\frac{μ_{2}}{λ})}^{γ} μ_{2}^{- γ} ∥ P_{μ_{2}} C ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} = \sum_{μ_{2}} μ_{2}^{- γ} ∥ P_{μ_{2}} C ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) \cdot \sum_{λ : μ_{2} ≲ λ} {(\frac{μ_{2}}{λ})}^{γ}, \end{matrix}

\begin{matrix} ≲ ∥ C ∥ N_{1}^{- γ, 2, \infty} . \end{matrix}

\begin{matrix}  \end{matrix}

We now move to the last estimate 269 . This is only slightly more complicated than what we have already done. Here we will only bother to estimate things for

λ ≲ 1

The case where

1 ≪ λ

is much easier due to the extra smoothing in the norms we are working with and is left to the reader. As a first step, we freeze all frequencies and decompose

P_{λ} = \sum_{σ : σ ⩽ λ} Q_{σ} P_{λ}

, where

Q_{σ}

is again the fixed frequency cutoff in the

R_{ℓ^{⊥}}^{n - 1}

frequency plane. Using Bernstein, this allows us to estimate:

\begin{matrix} λ^{- γ} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}), \end{matrix}

(271)

\begin{matrix} ≲ & \sum_{σ : σ ⩽ λ} λ^{- 1 - γ} ∥ Q_{σ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & \sum_{σ : σ ⩽ λ} λ^{- 1 - γ} σ^{\frac{n - 1}{p_{γ}}} ∥ P_{μ_{1}} A \cdot P_{μ_{2}} C ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{p_{γ}}), \end{matrix}

\begin{matrix} ≲ & λ^{\frac{n - 1}{p_{γ}} - 1 - γ} ∥ P_{μ_{1}} A \cdot P_{μ_{2}} C ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{p_{γ}}) . \end{matrix}

\begin{matrix}  \end{matrix}

By using Hölders inequality and rearranging weights, and using the index bound:

1 + 2 γ < \frac{n - 1}{p_{γ}},

which follows on account of the fact that

γ ≪ 1

and we are in

n ⩽ 6

dimensions, we see that we have the fixed frequency estimate: 271

\begin{matrix} ≲ {(\frac{λ}{μ_{1}})}^{γ} μ_{1}^{\frac{n - 1}{p_{γ}} - 1} ∥ P_{μ_{1}} A ∥ L_{ℓ}^{\infty} (L_{ℓ^{⊥}}^{p_{γ}}) \cdot μ_{2}^{- γ} ∥ P_{μ_{2}} C ∥ L_{ℓ}^{2} (L_{ℓ^{⊥}}^{\infty}) . \end{matrix}

(272)

We are done once we deal with the first factor on the right hand side of the above inequality. To do this, we run a multiplier decomposition

P_{μ_{1}} = \sum_{σ : σ ⩽ μ_{1}} {\tilde{Q}}_{σ} P_{μ_{1}}

, where this time

{\tilde{Q}}_{σ}

is a fixed frequency cutoff on the

R_{ℓ}

line. Using Minkowski's integral inequality and Bernstein on the real line, we estimate:

\begin{matrix} μ_{1}^{\frac{n - 1}{p_{γ}} - 1} ∥ P_{μ_{1}} A ∥ L_{ℓ}^{\infty} (L_{ℓ^{⊥}}^{p_{γ}}) & ≲ \sum_{σ : σ ⩽ μ_{1}} μ_{1}^{\frac{n - 1}{p_{γ}} - 1} ∥ {\tilde{Q}}_{σ} P_{μ_{1}} A ∥ L_{ℓ^{⊥}}^{p_{γ}} (L_{ℓ}^{\infty}), \end{matrix}

\begin{matrix} ≲ μ_{1}^{\frac{n}{p_{γ}} - 1} ∥ P_{μ_{1}} A ∥ L^{p_{γ}}, \end{matrix}

\begin{matrix} = ∥ P_{μ_{1}} A ∥ {\dot{B}}_{2}^{p_{γ}, (2, \frac{n - 2}{2})} . \end{matrix}

\begin{matrix}  \end{matrix}

Plugging this last bound into the right hand side of 272 and summing over all

λ ≲ μ_{1}, μ_{2}

yields the bound 269 as was to be shown. This completes the proof of the bilinear estimate 263 and hence the proof of 245 . □

To wrap things up here, we need to prove the symbol bounds 232 – 233 for the second factor in the product 234 . By repeating the steps which started on line 235 and culminated in the differential inequality 240 for this term, we arrive at the temporal differential inequality:

(273) | ∥ ( M − 1 ∇ ξ ) k ( ω g ~ − 1 ( ℓ ) ω g ~ ( s ) ) ∥ ′ | ⩽ ∑ i = 0 k − 1 ∥ ( M − 1 ∇ ξ ) k − i ( ω C ~ 0 ( ℓ ) ) ∥ ⋅ ∥ ( M − 1 ∇ ξ ) i ( ω g ~ − 1 ( ℓ ) ω g ~ ( s ) ) ∥ ,

where this time

ℓ

denotes a single variable which lies in the range

s ⩽ ℓ ⩽ t

, and

{\tilde{^{ω} C}}_{0}

is the temporal potential which is defined via the equation:

{\tilde{ω g}}^{- 1} \nabla_{t} (\tilde{ω g}) = {\tilde{^{ω} C}}_{0} .

Via integration in time of the quantity

∥ (M^{- 1} \nabla_{ξ})^{k} ({\tilde{ω g}}^{- 1} (ℓ) \tilde{ω g} (s)) ∥^{'}

, and keeping in mind the derivation of the temporal potential equation 201 above, we see that to prove the estimates 232 for the product

{\tilde{ω g}}^{- 1} (t) \tilde{ω g} (s)

it suffices to show (the same estimate works to establish 233 ):

Lemma 9.3. Let the temporal potential

{\tilde{C}}_{0}

be defined via the elliptic equation:

\begin{matrix} {\tilde{^{ω} C}}_{0} = \tilde{ω A_{0}^{(M)}} - \nabla_{t} Δ^{- 1} [\tilde{ω {\underset{̲}{A}}^{(M)}}, \tilde{ω \underset{̲}{C}}] - d^{*} Δ^{- 1} [{\tilde{^{ω} C}}_{0}, \tilde{ω \underset{̲}{C}}], \end{matrix}

(274)

where the spatial connection

{\tilde{ω \underset{̲}{C}}}

is defined via the Hodge system 243 , and where we have set:

\begin{matrix} \tilde{ω A_{0}^{(M)}} = - \nabla_{t} ω {\bar{Π}}_{M^{- 1} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(275)

The parameter

M^{- 1}

which lies in the range 231 (this is essential).

Then the following mixed Lebesgue space estimates of Besov type hold:

\begin{matrix} \sum_{i = 1}^{k} \sum_{μ} ∥ (M^{- 1} \nabla_{ξ})^{i} P_{μ} ({\tilde{^{ω} C}}_{0}) ∥ L_{x}^{\infty} (L_{t}^{1} [s, t]) ≲ ℰ . \end{matrix}

(276)

Proof of the estimate 277 . As with the proof of 245 above, it will be convenient to prove the somewhat more restrictive estimate:

\begin{matrix} \sum_{i = 1}^{k} \sum_{μ} μ^{- γ} (1 + μ)^{n} ∥ (M^{- 1} \nabla_{ξ})^{i} P_{μ} ({\tilde{^{ω} C}}_{0}) ∥ L_{t}^{1} [s, t] (L_{x}^{\infty}) ≲ ℰ . \end{matrix}

(277)

This will again be done by essentially proving that this estimate is true for the potentials

{\tilde{ω A^{(M)}}}

contained in the right hand side of 274 , and then transferring that knowledge to

{\tilde{^{ω} C}}_{0}

through that elliptic equation. A little care needs to be taken in this regard due to the effect of bad

H i g h \times H i g h \Rightarrow L o w

frequency interactions coming from the

Δ^{- 1}

in the second term on the right hand side of 274 which sits by itself because the time derivative must be distributed. This all needs to be tempered against the fact that we need to recover enough in the low frequencies to apply

L^{2} (L^{q})

Strichartz estimates to integrate over the line segment

[s, t]

. The Lebesgue exponent which is important in this regard is the following:

\begin{matrix} q_{γ} = \frac{2 (n - 1)}{n - 5 - 2 γ} . \end{matrix}

(278)

The significance of

q_{γ}

is that it is the smallest Lebesgue exponent such that one can recover an extra angular weight of

θ^{- 1}

via running Bernstein from the Strichartz endpoint and still have an extra factor of

θ^{γ}

to spare for dyadic summing.

We proceed with our proof of 277 by first establishing a fixed time estimate.

Notice that the Besov norm

{\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)}

embeds into the spatial norm on the left hand side of 277 . Thus, our first step will be to establish the following fixed time estimate for

1 ⩽ k

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{k} {\tilde{^{ω} C}}_{0} (t_{0}) ∥ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)} ≲ \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω A^{(M)}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)}, \end{matrix}

(279)

where

{\tilde{ω A^{(M)}}}

is the full space-time connection defined on lines 244 and 275 .

The important thing about estimate 279 is that we retain at least one copy of the operator

(M^{- 1} \nabla_{ξ})

on the right hand side so that we may pass to an

L_{t}^{2}

integral via Cauchy-Schwartz. Our proof of 279 require that the bootstrapping constant

ℰ

from line 126d is sufficiently small. Based on previous work, our task here is largely finished. Our first step here will be to differentiate the equation 277 as many times as necessary with respect to the operators

(M^{- 1} \nabla_{ξ})

. Doing this and distributing the time derivative in the second term on the right hand side yields the equation:

\begin{matrix} (M^{- 1} \nabla_{ξ})^{k} {\tilde{^{ω} C}}_{0} & = (M^{- 1} \nabla_{ξ})^{k} \tilde{ω A_{0}^{(M)}} \end{matrix}

(280)

\begin{matrix} - \sum_{i = 0}^{k} (Δ^{- 1} [(M^{- 1} \nabla_{ξ})^{k - i} \nabla_{t} \tilde{ω {\underset{̲}{A}}^{(M)}}, (M^{- 1} \nabla_{ξ})^{i} \tilde{ω \underset{̲}{C}}] \end{matrix}

\begin{matrix} + Δ^{- 1} [(M^{- 1} \nabla_{ξ})^{k - i} \tilde{ω {\underset{̲}{A}}^{(M)}}, (M^{- 1} \nabla_{ξ})^{i} \nabla_{t} \tilde{ω \underset{̲}{C}}]) \end{matrix}

\begin{matrix} - \sum_{i = 0}^{k} d^{*} Δ [(M^{- 1} \nabla_{ξ})^{k - i} {\tilde{^{ω} C}}_{0}, (M^{- 1} \nabla_{ξ})^{i} \tilde{ω \underset{̲}{C}}], \end{matrix}

\begin{matrix} = T_{1} + T_{2} + T_{3} . \end{matrix}

\begin{matrix}  \end{matrix}

Our second step is to prove the intermediate estimate:

(281) ∥ ( M − 1 ∇ ξ ) k ω C ~ 0 ( t 0 ) ∥ B ˙ 1 , 10 n ∞ , ( 2 , n 2 − γ ) ≲ ∥ ( M − 1 ∇ ξ ) k ω A ( M ) ~ ( t 0 ) ∥ B ˙ 2 , 10 n q γ , ( 2 , n 2 − 2 γ ) + ∑ i = 1 k ∥ ( M − 1 ∇ ξ ) i ω C ̲ ~ ( t 0 ) ∥ B ˙ 2 , 10 n q γ , ( 2 , n 2 − γ ) + ∑ i = 1 k ∥ ( M − 1 ∇ ξ ) i ∇ t ω C ̲ ~ ( t 0 ) ∥ B ˙ 2 , 10 n q γ , ( 2 , n − 2 2 − γ ) .

This in turn is a consequence of the three estimates:

\begin{matrix} ∥ T_{1} ∥ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)} & ≲ ∥ (M^{- 1} \nabla_{ξ})^{k} \tilde{ω A^{(M)}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)}, \end{matrix}

(282)

\begin{matrix} ∥ T_{2} ∥ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)} & ≲ ∥ (M^{- 1} \nabla_{ξ})^{k} \tilde{ω A^{(M)}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} \end{matrix}

(283)

\begin{matrix} + \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} \end{matrix}

(284)

\begin{matrix} + \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{t} \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)}, \end{matrix}

(285)

\begin{matrix} ∥ T_{3} ∥ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)} & ≲ ∥ (M^{- 1} \nabla_{ξ})^{k} {\tilde{^{ω} C}}_{0} (t_{0}) ∥ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)} \cdot ∥ \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \end{matrix}

(286)

\begin{matrix} + \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} . \end{matrix}

(287)

\begin{matrix}  \end{matrix}

Notice that these all combine to give 281 because the estimate 252 in conjunction with the assumption that

ℰ

is sufficiently small allows one to absorb the first term on the right hand side of 287 into the left hand side. The proof of the first estimate, 282 above is a trivial consequence of the Besov nesting 42 , and the fact that we allow for an extra power of

μ^{γ}

to sum over the low frequencies to turn the

ℓ^{2}

sum into and

ℓ^{1}

sum.

The proof of 285 is the most involved, and is why we have been forced to work with the exponent

q_{γ}

. There are several cases to consider, depending on whether or not the time derivative falls on the term containing at least one copy of the operator

(M^{- 1} \nabla_{ξ})

. An inspection of the structure of the

T_{2}

term shows that these can all be taken into account through an application of the already established estimates 252 – 253 and 254 – 255 , and application of the bilinear embeddings:

\begin{matrix} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)} & ↪ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)}, \end{matrix}

(288)

\begin{matrix} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} \cdot {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} & ↪ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)} . \end{matrix}

(289)

\begin{matrix}  \end{matrix}

A quick calculation shows that one has the needed gap bound (by condition 47 ):

2 + γ < n (\frac{1}{p_{γ}} + \frac{1}{q_{γ}}),

for

γ

sufficiently small when the dimension satisfies the bound

6 ⩽ n

. For example, when

n = 6

we have that

p_{γ} = \frac{10}{3} + ε

and

q_{γ} = 10 + ε

where

ε \to 0

γ \to 0

. Notice that there is not a whole lot of room in this. The other condition in 46 – 50 are easily verified in the above estimates. Notice that for the term in

T_{2}

where all the

(M^{- 1} \nabla_{ξ})

derivatives, as well as the time derivative

\nabla_{t}

fall on the linear potentials

{\tilde{ω {\underset{̲}{A}}^{(M)}}}

, we use 252 and the first embedding 288 above, together with the easy bound (which follows from the truncation 126c ):

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{k} \nabla_{t} \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)} ≲ ∥ (M^{- 1} \nabla_{ξ})^{k} \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)}, \end{matrix}

(290)

to conclude 285 for that portion of things.

To conclude our second main step in the proof of 279 , we need to establish the estimate 287 . To tie things down, we first need to know that

{\tilde{^{ω} C}}_{0}

satisfies a critical estimate similar to 252 . This is:

\begin{matrix} ∥ (M^{- 1} \nabla_{ξ})^{i} {\tilde{^{ω} C}}_{0} ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} ≲ ℰ . \end{matrix}

(291)

This in turn is provided through applying the already established estimates 252 – 253 and 255 to the equation 274 with the help of the bilinear estimate 258 and the following embedding which follows as yet another special case of our general bound 45 :

\begin{matrix} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} \cdot {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} ↪ {\dot{B}}_{1, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} . \end{matrix}

(292)

Armed with the estimate 291 , we can proceed to prove 287 by applying the following bilinear estimates to the various terms contained in

T_{3}

\begin{matrix} \nabla_{x} Δ : {\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)} \cdot {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} & ↪ {\dot{B}}_{1, 10 n}^{\infty, (2, \frac{n}{2} - γ)}, \end{matrix}

(293)

\begin{matrix} \nabla_{x} Δ : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} & ↪ {\dot{B}}_{1, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} . \end{matrix}

(294)

\begin{matrix}  \end{matrix}

We use the second embedding 294 in conjunction with the nesting 42 to derive the second term on the right hand side of 287 . We note that in estimates 293 – 294 , it is a simple matter to check the validity of the conditions 46 – 50 . We leave this as an exercise for the reader.

To complete this portion of the proof, we need to establish the implication 281

\Rightarrow

279 . This will be done once we can show that (keeping in mind the bounds of the form 290 ):

\begin{matrix} \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} & ≲ \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω A^{(M)}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)}, \end{matrix}

(295)

\begin{matrix} \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{t} \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)} & ≲ \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{t} \tilde{ω A^{(M)}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)} \end{matrix}

(296)

\begin{matrix} + \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω A^{(M)}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} + \sum_{i = 1}^{k} ∥ (M^{- 1} \nabla_{ξ})^{i} \tilde{ω \underset{̲}{C}} (t_{0}) ∥ {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} . \end{matrix}

(297)

\begin{matrix}  \end{matrix}

The estimate 295 is a simple consequence of applying the embedding 294 to the differentiated Hodge system 259 – 261 , while using the already established critical estimates 252 and 254 to tie things down. To prove the second estimate 297 above, we apply the time derivative

\nabla_{t}

to the system 259 – 261 , and then employ the embeddings:

\begin{matrix} \nabla_{x} Δ : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})} \cdot {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - γ)} & ↪ {\dot{B}}_{1, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)}, \end{matrix}

(298)

\begin{matrix} \nabla_{x} Δ : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)} & ↪ {\dot{B}}_{1, 10 n}^{q_{γ}, (2, \frac{n - 2}{2} - γ)} . \end{matrix}

(299)

\begin{matrix}  \end{matrix}

Notice that these estimates have the same small amount of room as 288 – 289 above when measuring the gap condition 47 for this set of exponents. Using 298 – 299 in conjunction with the already established estimates 252 – 253 and 254 – 255 , we may conclude 297 when the bootstrapping constant

ℰ

is sufficiently small.

We have now established the estimate 279 . Integrating this in time, and applying a Cauchy-Schwartz with respect to the time integration and using the condition

| t - s |^{\frac{1}{2}} ⩽ M

, we have the estimate: 277

(L . H . S .) ≲ \sum_{i = 0}^{k - 1} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} \tilde{ω A^{(M)}} ∥ L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)}) .

Therefore, to conclude the estimate 276 , we simply need to prove the bound:

\begin{matrix} \sum_{i = 0}^{k - 1} ∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} \tilde{ω A^{(M)}} ∥ L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)}) ≲ ℰ . \end{matrix}

(300)

At a heuristic level, this estimate is true because there is enough room in the norm

L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)})

verses the bootstrapping norm 126d to save precisely

\frac{1}{2} - 2 γ

derivatives. This, used in conjunction with the truncation condition coming form the operator

ω {\bar{Π}}^{(\frac{1}{2} - δ)}

, is enough to absorb the extra angular factor

θ^{- 1}

produced by the unsmoothed derivative

\nabla_{ξ}

. All throughout the calculation, the exponent

q_{γ}

is high enough that the intrinsic angular singularity contained in the potentials

{\tilde{ω A^{(M)}}}

can be recovered by an application of Bernstein's inequality to the endpoint Strichartz spatial exponent

L^{\frac{2 (n - 1)}{n - 3}}

. We spell out briefly the details of this procedure as follows:

Freezing now the frequency and the number of

(M^{- 1} \nabla_{ξ})

derivatives on the right hand side of 300 , and using the bootstrapping condition 126d , we see that it suffices to show the bound (note that 300 is already in square function form):

∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} P_{μ} (\tilde{ω A^{(M)}}) ∥ L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)}) ≲ ∥ P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ L^{2} ({\dot{B}}_{2, 10 n}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) .

In fact, after a further localization in the angle, we will show that:

∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} ω Π_{θ} P_{μ} (\tilde{ω A^{(M)}}) ∥ L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n}{2} - 2 γ)}) ≲ θ^{γ} ∥ P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ L^{2} ({\dot{B}}_{2, 10 n}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) .

By an application of the Bernstein inequality, and recalling the definition 278 of the exponent

q_{γ}

, this last estimate is a consequence of being able to show that:

(301) ∥ ( M − 1 ∇ ξ ) i ∇ ξ ω Π θ P μ ( ω A ( M ) ~ ) ∥ L t 2 ( B ˙ 2 , 10 n 2 ( n − 1 ) n − 3 , ( 2 , n 2 − 2 γ ) ) ≲ θ − 1 ∥ P μ ( A ̲ ∙ ≪ 1 ) ∥ L 2 ( B ˙ 2 , 10 n 2 ( n − 1 ) n − 3 , ( 2 , n − 1 2 ) ) .

Using now the heuristic operator bound 195 in conjunction with the Coulomb savings 196 and the heuristic symbol type bounds 256 , we have the following heuristic identity which follows our strict convention 58 :

(M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} ω Π_{θ} P_{μ} (\tilde{ω A^{(M)}}) \approx θ^{- 2} P_{μ} (\underset{̲}{A} ∙ ≪ 1) .

Plugging this last bound into the left hand side of 301 , and rearranging the Besov weights, we have the estimate:

∥ (M^{- 1} \nabla_{ξ})^{i} \nabla_{ξ} ω Π_{θ} P_{μ} (\tilde{ω A^{(M)}}) ∥ L_{t}^{2} ({\dot{B}}_{2, 10 n}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n}{2} - 2 γ)}) ≲ (\frac{μ^{\frac{1}{2} - 2 γ}}{θ}) θ^{- 1} ∥ P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ L^{2} ({\dot{B}}_{2, 10 n}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) .

The truncation condition that

μ^{\frac{1}{2} - δ} ⩽ θ

(note that

μ ≲ 1

) now guarantees that we have the bound:

(\frac{μ^{\frac{1}{2} - 2 γ}}{θ}) ≲ 1,

when

γ

is sufficiently small compared to

δ

. This completes the proof of 301 , and hence 300 , which in turn finishes our proof of estimate 277 . □

Having now established the symbol bounds 232 – 233 separately for each of the two terms in the product 234 . By using the Leibniz rule for derivatives, these together imply the bounds 232 – 233 for the full product on the left hand side of 234 .

This completes our proof of Proposition 9.1 . □

We now proceed to prove the second main estimate 217 for remainder kernel in the splitting 215 . This involves a sum of kernels, each of which according to the identities 226 – 227 has at least one copy of the terms

ω h^{- 1} (x) ω h (y) - I

and

ω h (x) ω h^{- 1} (y) - I

. Therefore, without loss of generality, we may assume that we are trying to prove the estimate:

\begin{matrix} \int_{R_{x}^{n}} χ_{D_{σ}} (x) ∥ \int_{R_{ξ}^{n}} {e^{2 π i (x - y) \cdot ξ}}^{ω} G (x, y) χ (ξ) d ξ ∥ d x ≲ σ^{- γ} . \end{matrix}

(302)

where we have set:

^{ω} G (x, y) = {\tilde{ω g}}^{- 1} (x) (ω h^{- 1} (x) ω h (y) - I) \tilde{ω g} (y) [∙] ω g^{- 1} (y) ω g (x) .

We note here that the corresponding estimates for the other terms in

ℛ_{σ}^{T T^{*}}

are similar and are left to the reader.

To prove 302 , we use following angular cutoff functions to split:

^{ω} G = {χ_{| cos (θ_{ξ, x - y}) | ⩾ | x - y |^{- 1 + γ}}}^{ω} G + {χ_{| cos (θ_{ξ, x - y}) | < | x - y |^{- 1 + γ}}}^{ω} G .

Therefore, using the triangle and Minkowski inequalities, we see that it suffices to prove the pair of bounds:

\begin{matrix} \int_{R_{x}^{n}} χ_{D_{σ}} (x) ∥ \int_{R_{ξ}^{n}} e^{2 π i (x - y) \cdot ξ} {χ_{| cos (θ_{ξ, x - y}) | ⩾ | x - y |^{- 1 + γ}}}^{ω} G (x, y) χ (ξ) d ξ ∥ d x & ≲ σ^{- γ}, \end{matrix}

(303)

\begin{matrix} \int_{R_{x}^{n}} χ_{D_{σ}} (x) ∥ \int_{R_{ξ}^{n}} e^{2 π i (x - y) \cdot ξ} {χ_{| cos (θ_{ξ, x - y}) | < | x - y |^{- 1 + γ}}}^{ω} G (x, y) χ (ξ) d ξ ∥ d x & ≲ σ^{- γ} . \end{matrix}

(304)

\begin{matrix}  \end{matrix}

The proof of the first estimate, 303 , is a simple matter of integrating by parts as many times as necessary with respect to the weighted radial derivative

\frac{1}{2 π i | x - y | cos (θ_{ξ, x - y})} \partial_{| ξ |}

, taking account of the fact that

^{ω} G

is independent of the variable

| ξ |

. Assuming that

| x - y | \sim σ

is sufficiently large, we will eventually have that:

\begin{matrix} | (\frac{1}{2 π i | x - y | cos (θ_{ξ, x - y})} \partial_{| ξ |})^{k} χ (ξ) | ≲ σ^{- n - γ}, \end{matrix}

(305)

at which point we may stop the integration by parts and put absolute value signs around the remaining integral. The right hand side of 303 will then follow as a direct consequence of 305 and the simple bounds:

\begin{matrix} \int_{R_{x}^{n}} χ_{D_{σ}} (x) d x & ≲ σ^{n}, \end{matrix}

\begin{matrix} {sup}_{x, ω} ∥^{ω} G (x, y) ∥ & ≲ 1 . \end{matrix}

\begin{matrix}  \end{matrix}

To conclude the proof of estimate 302 , we need to show the second estimate 304 above. At this point, we have stripped things down to where oscillations under the integral sign are no longer of any use, so we simply strive to estimate the absolute value of the integrand. Here the smallness of the function

^{ω} G (x, y)

is essential. To make use of this, we rearrange the order in the absolute integral and use Hölders inequality to bound:

304 (306) ( L . H . S . ) ≲ ∫ S n − 1 sup x ∈ D σ ∥ ω G ( x , y ) ∥ d ω ⋅ sup ω ∫ R x n χ | cos ( θ ξ , x − y ) | < | x − y | − 1 + γ ( x ) χ D σ ( x ) d x .

To bound the second integral on the right hand side of the above product, we translate by the vector

y

and then apply a rotation to reduce the bound we wish to show to the following:

\begin{matrix} \int_{| x | \sim σ} χ_{| cos (θ_{(1, 0), x}) | < | x |^{- 1 + γ}} (x) d x ≲ σ^{n - 1 + γ} . \end{matrix}

(307)

The validity of 307 follows trivially from the fact that if we split

x = (x_{1}, x^{'})

, we have the bounds

| x_{1} | ≲ σ^{γ}

and

| x^{'} | ≲ σ

over the range of integration thanks to the angular cutoff and the identity:

cos (θ_{(1, 0), x}) = \frac{x_{1}}{| x |} .

Thus, keeping in mind the bound 307 , we see from estimate 306 that the proof of 304 follows from a Cauchy-Schwartz on the sphere

S^{n - 1}

and the following integrated bounds:

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{x \in D_{σ}} ∥^{ω} G (x, y) ∥^{2} d ω)}^{\frac{1}{2}} ≲ σ^{1 - n - 2 γ} . \end{matrix}

(308)

Due to its use in the next section, we will in fact show the following more general set of estimates which includes 308 as a special case:

Proposition 9.4 (Estimates for integrated remainder group elements ). Let the group elements

ω h

be defined infinitesimally via the equations 222 – 223 and the Hodge system 224 , where the parameter

σ^{- 1 + γ}

is replaced by

M^{- 1}

. Then upon integration, one has the following bounds:

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{| x - y | \sim N} ∥ ω h^{- 1} (t, x) ω h (s, y) - I ∥^{2} d ω)}^{\frac{1}{2}} & ≲ ℰ (1 + | t - s | + N) \cdot M^{- n - δ}, \end{matrix}

(309)

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{| x - y | \sim N} ∥ ω h (t, x) ω h^{- 1} (s, y) - I ∥^{2} d ω)}^{\frac{1}{2}} & ≲ ℰ (1 + | t - s | + N) \cdot M^{- n - δ}, \end{matrix}

(310)

\begin{matrix}  \end{matrix}

where

ℰ

is the bootstrapping constant from line 126d . The above estimates are uniform in the value of

M

when it is sufficiently large.

Proof of the estimates 309 – 310 . As will become apparent to the reader, it suffices to show the first bound 309 , as the second follows from essentially identical reasoning.

Our first step here is to disentangle the products, and to work exclusively with either spatially separated or temporally separated products. This is accomplished via the following simple algebraic identity:

(311) ω h − 1 ( t , x ) ω h ( s , y ) − I = ω h − 1 ( t , x ) ω h ( s , x ) − I + ω h − 1 ( t , x ) ω h ( s , x ) ⋅ ( ω h − 1 ( s , x ) ω h ( s , y ) − I ) .

Working now, for the moment, with the second term in this last expression substituted into the estimate 309 we expand:

ω h^{- 1} (s, x) ω h (s, y) - I = \int_{x}^{y} ω h^{- 1} (s, x) \partial_{ℓ} (ω h (s, ℓ)) d ℓ .

Integrating this last expression in

L_{ω}^{2}

we are reduced to proving the following:

Lemma 9.5. Let the (spatial) connection

{\tilde{\tilde{ω \underset{̲}{C}}}}

be defined via the Hodge system:

\begin{matrix} (\tilde{\tilde{ω \underset{̲}{C}}})^{d f} & = d^{*} Δ^{- 1} ([\tilde{ω \underset{̲}{C}}, \tilde{\tilde{ω \underset{̲}{C}}}] + [\tilde{\tilde{ω \underset{̲}{C}}}, \tilde{ω \underset{̲}{C}}]), \end{matrix}

(312a)

\begin{matrix} (\tilde{\tilde{ω \underset{̲}{C}}})^{c f} & = \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} - \nabla_{x} Δ^{- 1} ([\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}}] + [\tilde{ω {\underset{̲}{A}}^{(M)}}, \tilde{\tilde{ω \underset{̲}{C}}}]), \end{matrix}

(312b)

\begin{matrix}  \end{matrix}

where we have set:

\begin{matrix} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} = - \nabla_{x} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(313)

and where the spatial connections

{\tilde{ω {\underset{̲}{A}}^{(M)}}}

and

{\tilde{ω \underset{̲}{C}}}

are defined on the lines 243 and 244 above. Then one has the following integrated estimate uniform in the parameter

M

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{x} ∥ \tilde{\tilde{ω \underset{̲}{C}}} (x) ∥^{2} d ω)}^{\frac{1}{2}} ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(314)

Proof of the estimate 314 . Our strategy here is similar to the previous lemmas. We first prove things for the linear term in 312b , and then use the critical embeddings 254 and 252 to transfer things to the connection

{\tilde{\tilde{ω \underset{̲}{C}}}}

via the Hodge system 312 .

Our first step then is to show that:

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{x} ∥ \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} (x) ∥^{2} d ω)}^{\frac{1}{2}} ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(315)

In fact, we will show the following somewhat stronger estimate which will easily imply 315 , and which is more robust with respect to Hodge systems:

\begin{matrix} {(\int_{S^{n - 1}} ∥ \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} ∥^{2} {\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)} d ω)}^{\frac{1}{2}} ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(316)

This last estimate is a simple matter of using Bernstein's inequality and orthogonality which will net us the factor

M^{1 - n}

, followed by the condition that

μ^{\frac{1}{2} - δ} ≲ M^{- 1 - δ}

at each fixed frequency thanks to the

ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)}

multiplier which nets us the remaining powers of

M^{- 1}

. The implementation is as follows: We first decompose things into the sum over all frequencies

μ ≲ 1

and angles

θ ≲ 1

\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} = \sum_{θ, μ : μ ≲ 1} ω Π_{θ} P_{μ} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} .

Keeping in mind the spatial frequency truncation of

\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}

, and by the square sum definition of the Besov norms, the triangle inequality, and dyadic summing, we see that it suffices to show the following fixed frequency estimate:

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{x} ∥ ω Π_{θ} P_{μ} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} (x) ∥^{2} d ω)}^{\frac{1}{2}} ≲ μ^{γ} θ^{γ} \cdot ℰ \cdot M^{- n - δ} . \end{matrix}

(317)

For each fixed

ω

, we use Bernstein's inequality and the equivalence:

ω Π_{θ} P_{μ} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} \approx θ^{- 1} ω Π_{θ} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} P_{μ} (\underset{̲}{A} ∙ ≪ 1),

to compute that:

\begin{matrix} {sup}_{x} ∥ ω Π_{θ} P_{μ} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} (x) ∥, \end{matrix}

\begin{matrix} ≲ & θ^{- 1} μ \cdot θ^{\frac{n - 1}{2}} \cdot ∥ ω Π_{θ} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ {\dot{H}}_{x}^{\frac{n - 2}{2}}, \end{matrix}

\begin{matrix} ≲ & μ^{γ} θ^{γ} \cdot M^{- \frac{n + 1}{2} - δ} ∥ ω {\bar{Π}}_{∙ ⩽ M^{- 1}} P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ {\dot{H}}_{x}^{\frac{n - 2}{2}} . \end{matrix}

\begin{matrix}  \end{matrix}

Notice that this last line follows from the truncation condition

μ^{1 - 2 δ} ≲ θ^{2}

as well as the small constant bounds

γ ≪ δ

. The proof of 317 is now a result of the following simple calculation involving Plancherel:

\begin{matrix} {(\int_{S^{n - 1}} ∥ ω {\bar{Π}}_{∙ ⩽ M^{- 1}} P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥^{2} {\dot{H}}_{x}^{\frac{n - 2}{2}} d ω)}^{\frac{1}{2}}, \end{matrix}

(318)

\begin{matrix} ≲ & {(\int_{R n_{ξ}} \int_{S^{n - 1}} ∥ (b_{∙ ⩽ M^{- 1}}^{ω} + b_{∙ ⩽ M^{- 1}}^{- ω}) | ξ |^{\frac{n - 2}{2}} p_{μ} \hat{\underset{̲}{A} ∙ ≪ 1} (ξ) ∥^{2} d ω d ξ)}^{\frac{1}{2}}, \end{matrix}

\begin{matrix} ≲ & M^{- \frac{n - 1}{2}} ∥ P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ {\dot{H}}_{x}^{\frac{n - 2}{2}}, \end{matrix}

\begin{matrix} ≲ & ℰ \cdot M^{- \frac{n - 1}{2}} . \end{matrix}

\begin{matrix}  \end{matrix}

To finish the proof of 314 , we simply need to pass the estimate 316 onto the set of spatial potentials

{\tilde{\tilde{ω \underset{̲}{C}}}}

. To do this, we set up auxiliary spaces

L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})

and

L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)})

. From the estimates 254 and 252 we immediately have that:

\begin{matrix} ∥ \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) & ≲ ℰ, \end{matrix}

(319)

\begin{matrix} ∥ \tilde{ω \underset{̲}{C}} ∥ L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) & ≲ ℰ, \end{matrix}

(320)

\begin{matrix}  \end{matrix}

where the index

p_{γ}

is the exponent from the line 193 above. The desired result now follows from the bilinear estimate:

\begin{matrix} \nabla_{x} Δ^{- 1} : L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) \cdot L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) . \end{matrix}

(321)

This is a simple consequence of the condition

p_{γ} < n

which allows us to fulfill the condition 47 of the general embedding 45 . The result follows from integrating this bound in

L_{ω}^{2}

. □

We now turn our attention to proving the bound 309 for the temporally separated product which is the first term on the right hand side of equation 311 above. Expand the integrand here an the derivative of another integral over time line, we have that:

ω h^{- 1} (t, x) ω h (s, x) - I = \int_{s}^{t} ω h^{- 1} (t, x) \partial_{t} (ω h (ℓ, x)) d ℓ .

After integrating in

L_{ω}^{2}

the right hand side of this last expression, we see that we are reduced to proving that:

Lemma 9.6. Let the quantity

{\tilde{\tilde{^{ω} C}}}_{0}

be defined implicitly via the elliptic equation:

(322) ω C ~ ~ 0 = ω A 0 ( M ) ~ ~ − ∇ t Δ − 1 ( [ ω A ̲ ( M ) ~ ~ , ω C ̲ ~ ] + [ ω A ̲ ( M ) ~ , ω C ̲ ~ ~ ] ) − d * Δ − 1 ( [ ω C ~ ~ 0 , ω C ̲ ~ ] + [ ω C ~ 0 , ω C ̲ ~ ~ ] ) ,

where we have set:

\begin{matrix} \tilde{\tilde{ω A_{0}^{(M)}}} = - \nabla_{t} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(323)

and where the connections

{\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}}

and

{\tilde{\tilde{ω \underset{̲}{C}}}}

are as in Lemma 9.5 , and where the quantity

\tilde{ω A_{0}^{(M)}}

is defined on line 275 above. Then one has the following integrated estimate uniform in the parameter

M

\begin{matrix} {(\int_{S^{n - 1}} {sup}_{x} ∥ {\tilde{\tilde{^{ω} C}}}_{0} (x) ∥^{2} d ω)}^{\frac{1}{2}} ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(324)

Proof of the estimate 324 . As in the proof of the previous Lemma, our goal here is to first prove the

L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (n, \frac{n}{2} - γ)})

improvement of this claim for the terms on the right hand side of the equation 322 which do not involve the variable

{\tilde{\tilde{^{ω} C}}}_{0}

. The desired bound can then be achieved via iteration or bootstrapping using the bilinear estimate 321 and the estimate 320 to deal with the term involving

{\tilde{\tilde{^{ω} C}}}_{0}

on the right hand side of 322 . Therefore, we are trying to show the following three estimates:

\begin{matrix} {(\int_{S^{n - 1}} ∥ \tilde{\tilde{ω A_{0}^{(M)}}} ∥^{2} {\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)} d ω)}^{\frac{1}{2}} & ≲ ℰ \cdot M^{- n - δ}, \end{matrix}

(325)

\begin{matrix} {(\int_{S^{n - 1}} ∥ d^{*} Δ^{- 1} [{\tilde{^{ω} C}}_{0}, \tilde{\tilde{ω \underset{̲}{C}}}] ∥^{2} {\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)} d ω)}^{\frac{1}{2}} & ≲ ℰ \cdot M^{- n - δ}, \end{matrix}

(326)

\begin{matrix} {(\int_{S^{n - 1}} ∥ \frac{\nabla_{t}}{Δ} ([\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}}] + [\tilde{ω {\underset{̲}{A}}^{(M)}}, \tilde{\tilde{ω \underset{̲}{C}}}]) ∥^{2} {\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)} d ω)}^{\frac{1}{2}} & ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(327)

\begin{matrix}  \end{matrix}

The proof of the first estimate 325 is essentially identical to that of 316 above, once one takes into account the truncation condition 126c . The proof of the second estimate 326 follows from the

L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)})

bound proved for the potentials

{\tilde{\tilde{ω \underset{̲}{C}}}}

proved in the previous lemma, the bilinear estimate 321 , and the following:

∥ \tilde{^{ω} C_{0}} ∥ L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) ≲ ℰ,

which is a direct consequence of 291 above.

Therefore, it remains for us to prove the last bound 327 . Unfortunately, this does not follow directly from the procedure we have been using so far. The trouble is that the time derivative

\nabla_{t}

will in general not cancel with the Laplacean, and it is not possible to prove a bilinear estimate which is morally of the form

{\dot{B}}_{2}^{p_{γ}, (2, \frac{n - 4}{2})} \cdot {\dot{B}}_{2}^{\infty, (2, \frac{n}{2})} \subseteq Δ {\dot{B}}_{2}^{\infty, (2, \frac{n}{2})}

due to bad

H i g h \times H i g h

frequency interactions in dimension

n = 6

. The only way around this seems to be to do something which is quite a bit more involved. The way we will prove 327 is in a series of steps designed to reduce things to a term which, in some sense, represents the central difficulty.

This last term will be dealt with using a scale of non-isotropic spaces which are similar to the ones employed in the proof of Lemma 9.2 above. The argument we will present here is largely ad-hoc, and there are many variations. Furthermore, we will proceed by proving certain estimates which may be cut out at this stage of the overall paper but will turn out to be useful in the sequel.

The first step we make here is to recall that, although we have been suppressing it, there is additional polarity information in the definition of the connections

d + ω A

(see 197 ). This comes from the choice of null vector-field

ω L^{\pm}

. For convenience, we will use here an implicitly defined notation which we call

ω \underset{̲}{L}

, to denote the opposite vector-field for any given choice of polarization. That is, we always have the formula:

\begin{matrix} □ = ω \underset{̲}{L} ω L + Δ_{ω^{⊥}} . \end{matrix}

(328)

Now, for a given choice of polarization, we can always write

\pm \nabla_{t} = ω L^{\pm} - ω \cdot \nabla_{x}

Therefore, modulo proving estimates which are identical to those Lemma 9.5 above, and distributing the

ω \underset{̲}{L}

derivative, we that the proof of 327 can be reduced to the proof of the following three bilinear estimates:

\begin{matrix} ∥ Δ^{- 1} [ω \underset{̲}{L} \tilde{ω {\underset{̲}{A}}^{(M)}}, \tilde{\tilde{ω \underset{̲}{C}}}] ∥ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) & ≲ ℰ \cdot M^{- n - δ}, \end{matrix}

(329)

\begin{matrix} ∥ Δ^{- 1} [\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, ω \underset{̲}{L} \tilde{ω \underset{̲}{C}}] ∥ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) & ≲ ℰ \cdot M^{- n - δ}, \end{matrix}

(330)

\begin{matrix} ∥ Δ^{- 1} ([ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}}] + [\tilde{ω {\underset{̲}{A}}^{(M)}}, ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}]) ∥ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) & ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(331)

\begin{matrix}  \end{matrix}

Our first step is to prove the estimates 329 – 330 . To do this, we introduce the auxiliary index:

\begin{matrix} r_{γ} = \frac{2 n (n - 1)}{(n - 2) (n + 1) - 3 γ n} . \end{matrix}

(332)

We now show that one has the following improvements over the estimates 255 , 253 :

\begin{matrix} ∥ ω \underset{̲}{L} \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{r_{γ}, (2, \frac{n - 4}{2})}) & ≲ ℰ, \end{matrix}

(333)

\begin{matrix} ∥ ω \underset{̲}{L} \tilde{ω \underset{̲}{C}} ∥ L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{r_{γ}, (2, \frac{n - 4}{2})}) & ≲ ℰ . \end{matrix}

(334)

\begin{matrix}  \end{matrix}

With the help of 333 – 334 , the proof of the estimates 329 – 330 follows from the

L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)})

bound shown in the previous Lemma, and the following bilinear embedding:

\begin{matrix} Δ^{- 1} : L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) \cdot L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{r_{γ}, (2, \frac{n - 4}{2})}) & ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) . \end{matrix}

(335)

\begin{matrix}  \end{matrix}

Notice that the validity of this last estimate follows from the condition 47 , because for

0 < γ ≪ 1

we have the index bounds:

2 + γ < \frac{n}{r_{γ}},

which follows easily from the definitions of

r_{γ}

. To prove 333 – 334 we proceed by first showing 333 , and then use the Hodge system 243 to show that 333 334

\Rightarrow

The details follow.

We are now trying to show 333 . We use the identity 328 and the definition 244 and the structure equation 126e to compute that:

(336) ω L ̲ ω A ̲ ( M ) ~ = ∇ x ω Π ¯ M − 1 < ∙ ω Π ¯ ( 1 2 − δ ) A ̲ ∙ ≪ 1 ( ∂ ω ) − ∇ x ω Π ¯ M − 1 < ∙ ω Π ¯ ( 1 2 − δ ) Δ ω ⊥ − 1 P ~ ( [ B , H ] ) ( ∂ ω ) .

The estimate 333 for the first term on the right hand side of this last expression is a trivial consequence the

{\dot{H}}^{\frac{n - 4}{2}}

bound (at fixed time) for that term which is provided through the energy type norm contained in the bootstrapping assumption 126d , and the Besov nesting 42 . Therefore, we strive to bound the second term on the right hand side of 336 above. To do this, we first decompose things into a sum over all possible angles spread from the

ω

direction and write:

\nabla_{x} ω {\bar{Π}}_{M^{- 1} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) = \sum_{θ} \nabla_{x} ω Π_{θ} ω {\bar{Π}}_{M^{- 1} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) .

For each angularly localized piece in this last expression, we may make use of the Coulomb savings 196 to show the following heuristic multiplier bound (again making use of our convention explained below 58 above):

\nabla_{x} ω Π_{θ} ω {\bar{Π}}_{M^{- 1} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} P_{μ} \tilde{P} ([B, H]) (\partial_{ω}) \approx (μ θ)^{- 1} ω Π_{θ} P_{μ} P_{∙ ≪ 1} ([B, H]) .

Therefore, dropping the small frequency multiplier, our goal is to show the following fixed angle estimate:

∥ ω Π_{θ} | D_{x} |^{- 1} ([B, H]) ∥ {\dot{B}}_{2}^{r_{γ}, (2, \frac{n - 4}{2})} ≲ θ^{1 + γ} \cdot ℰ .

Using Bernstein's inequality on each fixed dyadic block in the Besov nesting 42 , and making use of a small numerical calculation which we leave to the reader, one finds that this last estimate is a consequence of the following non-localized Besov space estimate:

∥ | D_{x} |^{- 1} ([B, H]) ∥ {\dot{B}}_{2}^{\frac{2 n}{n + 2 - γ}, (2, \frac{n - 4}{2})} ≲ ℰ .

This last bound is now a consequence of the bootstrapping structure estimate 126f and the following bilinear embedding:

| D_{x} |^{- 1} : {\dot{B}}_{2}^{2, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2}^{2, (2, \frac{n - 4}{2})} ↪ {\dot{B}}_{2}^{\frac{2 n}{n + 2 - γ}, (2, \frac{n - 4}{2})} .

Notice that the reason we are forced to work with the relatively high space

L^{\frac{2 n}{n + 2 - γ}}

is because of

L o w \times H i g h

frequency interactions. This is why we are forced to work in the less aesthetic space

L^{r_{γ}}

above instead of

L^{2}

. This completes the proof of 333 .

Our next step is to establish the implication 333 334

\Rightarrow

. This follows immediately from differentiation of the Hodge system 243 with respect to the

ω \underset{̲}{L}

vector-field, and then using the following bilinear estimate to bootstrap:

\nabla_{x} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{r_{γ}, (2, \frac{n - 4}{2})} \cdot {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} ↪ {\dot{B}}_{2, 10 n}^{r_{γ}, (2, \frac{n - 4}{2})} .

We leave it to the reader to check that the various conditions of estimate 45 are satisfied in this case.

It remains for us to show the bound 331 . To make this a bit easier, we employ the skew symmetry of the Lie brackets in that expression to write it as:

Δ^{- 1} ([ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}}] + [\tilde{ω {\underset{̲}{A}}^{(M)}}, ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}]) = Δ^{- 1} ([ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}} - \tilde{ω {\underset{̲}{A}}^{(M)}}] + [\tilde{ω {\underset{̲}{A}}^{(M)}}, ω \underset{̲}{L} (\tilde{\tilde{ω \underset{̲}{C}}} - \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}})]) .

From this we see that the proof of 331 will follow once we can establish the three separate estimates:

\begin{matrix} ∥ Δ^{- 1} ([ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}} - \tilde{ω {\underset{̲}{A}}^{(M)}}]) ∥ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) ≲ ℰ \cdot M^{- n - δ}, \end{matrix}

(337)

\begin{matrix} ∥ Δ^{- 1} ([\tilde{ω {\underset{̲}{A}}^{(M)}}, ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}]) ∥ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) ≲ ℰ \cdot M^{- n - δ}, \end{matrix}

(338)

\begin{matrix} ∥ Δ^{- 1} ([\tilde{ω {\underset{̲}{A}}^{(M)}}, ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}]) ∥ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(339)

\begin{matrix}  \end{matrix}

To prove the first estimate 337 above, we make use of the fact that

\tilde{ω \underset{̲}{C}} - \tilde{ω {\underset{̲}{A}}^{(M)}}

obeys a better bound than either term in that expression does individually:

\begin{matrix} ∥ \tilde{ω \underset{̲}{C}} - \tilde{ω {\underset{̲}{A}}^{(M)}} ∥ {\dot{B}}_{2, 10 n}^{s_{γ}, (2, \frac{n - 2}{2})} ≲ ℰ, \end{matrix}

(340)

where we have set the index

s_{γ}

to be:

s_{γ} = \frac{n p_{γ}}{n + p_{γ}} + γ .

The proof of 340 follows immediately from the entirely quadratic structure of the terms in the expression

\tilde{ω \underset{̲}{C}} - \tilde{ω {\underset{̲}{A}}^{(M)}}

, in conjunction with following bilinear estimate whose proof is a simple consequence of the definition of the

p_{γ}

indices, 193 , and the general embedding 45 :

\nabla_{x} Δ^{- 1} : {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} \cdot {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} ↪ {\dot{B}}_{2, 10 n}^{s_{γ}, (2, \frac{n - 2}{2})} .

Furthermore, by taking the

ω \underset{̲}{L}

derivative of the potentials in the estimate 317 , and making use of the truncation condition 126c , we easily have the following:

\begin{matrix} ∥ ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} ∥ {\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n - 2}{2} - γ)} ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(341)

The proof of 337 now follows from combining estimates 340 and 341 in to the following bilinear embedding whose validity follows easily from 45 and the condition

2 + γ < \frac{n}{s_{γ}}

(say for

n = 6

or higher):

\begin{matrix} Δ^{- 1} : L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n - 2}{2} - γ)}) \cdot L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{s_{γ}, (2, \frac{n - 2}{2})}) ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) . \end{matrix}

(342)

We have now come to the point where the current techniques reach an impasse.

Notice that while the terms

ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}

and

ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}

do seem to have a better structure at first glance via the equation 336 , it is surprisingly difficult to pass this into integrated estimates of the form 317 . This is because while the linear term on the right hand side of 336 is quite nice, the only saving grace of the quadratic term in that expression is that it can go in lower spatial

L^{p}

space, which is not particularly useful when half of the needed savings in the estimate 317 comes form orthogonality (meaning that anything below

L^{2}

gets wasted). A way to get rid of this problem is to employ non-isotropic spaces. Specifically, we define the norm:

∥ A ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} = \sum_{μ} μ^{- \frac{1}{2} - γ} (1 + μ)^{10 n} ∥ P_{μ} (A) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) .

Our goal is now to show the following estimate which represents a more manageable form of the differentiated version of 316 :

\begin{matrix} ∥ ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} ∥ L_{ω}^{2} (^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}) ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(343)

Having done this, our next goal will be to pass on estimates of this form on to the non-linear potential

\tilde{\tilde{ω \underset{̲}{C}}}

. For reasons which will become apparent in a moment, it is more convenient to state this estimate for the following sum of spaces:

\begin{matrix} ∥ ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}} ∥ L_{ω}^{2} (^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}) + L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{n, (2, \frac{n - 2}{2} - γ)}) ≲ ℰ \cdot M^{- n - δ} . \end{matrix}

(344)

Once this is accomplished, the proof of 338 – 338 will follow from the two bilinear estimates:

\begin{matrix} Δ^{- 1} : L_{ω}^{\infty} (P_{∙ ≪ 1} Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}}) \cdot L_{ω}^{2} (^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}) & ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}), \end{matrix}

(345)

\begin{matrix} Δ^{- 1} : L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) \cdot L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{n, (2, \frac{n - 4}{2})}) & ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) . \end{matrix}

(346)

\begin{matrix}  \end{matrix}

Here the space in the first term in the product on the left hand side of 345 above is given by the norm:

∥ A ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}} = ∥ Δ_{ω^{⊥}}^{\frac{1}{2}} A ∥ {\dot{H}}^{\frac{n - 4}{2}} .

That the set of potentials

{\tilde{ω {\underset{̲}{A}}^{(M)}}}

is in this space with norm

≲ ℰ

follows from the explicit formula 244 and the Coulomb gauge savings 196 . Having now outlined the general strategy, we move to the proofs of the individual estimates.

To prove 343 we use the spatial truncation condition 158 , the triangle inequality, and dyadic summing to reduce things to the following single frequency estimate:

\begin{matrix} {(\int_{S^{n - 1}} ∥ P_{μ} (ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}) ∥^{2} L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) d ω)}^{\frac{1}{2}} ≲ μ^{\frac{1}{2} + 2 γ} ℰ \cdot M^{- n - δ} . \end{matrix}

(347)

Now freeze

ω

and run a Littlewood-Paley decomposition in the

R_{ω^{⊥}}^{n - 1}

frequency plane:

\begin{matrix} P_{μ} (ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}) & = \sum_{λ : λ ≲ M^{- 1} μ} \nabla_{x} ω \underset{̲}{L} ω L Δ_{ω^{⊥}}^{- 1} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Q_{λ} P_{μ} (\underset{̲}{A} ∙ ≪ 1) (\partial_{ω}), \end{matrix}

\begin{matrix} \approx M^{- 1} \sum_{λ : λ ≲ M^{- 1} μ} μ^{3} λ^{- 2} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Q_{λ} P_{μ} (\underset{̲}{A} ∙ ≪ 1), \end{matrix}

(348)

\begin{matrix}  \end{matrix}

where the last line follows from the truncation condition 126c and our heuristic multiplier convention. Notice that the sum restriction in these formulas comes because of the presence of the cutoff

ω {\bar{Π}}_{∙ ⩽ M^{- 1}}

. The extra

M^{- 1}

factor comes from this same angular cutoff and the Coulomb gauge savings 196 . Working now with the right hand side of 348 , we use Bernstein's inequality and dyadic summing to compute that:

\begin{matrix} ∥ P_{μ} (ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & M^{- 1} \sum_{λ : λ ≲ M^{- 1} μ} μ^{3} λ^{- 2} ∥ Q_{λ} ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & M^{- \frac{n - 3}{2}} μ^{\frac{3}{2}} ∥ ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} P_{μ} (\underset{̲}{A} ∙ ≪ 1) ∥ {\dot{H}}_{x}^{\frac{n - 2}{2}}, \end{matrix}

\begin{matrix} ≲ & M^{- \frac{n + 1}{2} - δ} μ^{\frac{1}{2} + 2 γ} ∥ ω {\bar{Π}}_{∙ ⩽ M^{- 1}} ω {\bar{Π}}^{(\frac{1}{2} - δ)} \underset{̲}{A} ∙ ≪ 1 ∥ {\dot{H}}_{x}^{\frac{n - 2}{2}} . \end{matrix}

\begin{matrix}  \end{matrix}

Integrating now this last line in

L_{ω}^{2}

, and using the orthogonality computation which began on line 318 above we have achieved 347 as was to be shown.

Our goal is now to pass the estimate 343 on to potentials

{ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}}

modulo terms which are in the more regular space

L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{n, (2, \frac{n - 2}{2} - γ)})

. To do this, we differentiate the system 312 with respect to the vector-field

ω \underset{̲}{L}

, and write it heuristically as:

ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}} = ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}} + \nabla_{x} Δ^{- 1} ([ω \underset{̲}{L} \tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, \tilde{ω \underset{̲}{C}}] + [\tilde{\tilde{ω {\underset{̲}{A}}^{(M)}}}, ω \underset{̲}{L} \tilde{ω \underset{̲}{C}}] + [ω \underset{̲}{L} \tilde{ω {\underset{̲}{A}}^{(M)}}, \tilde{\tilde{ω \underset{̲}{C}}}] + [\tilde{ω {\underset{̲}{A}}^{(M)}}, ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}] + [ω \underset{̲}{L} \tilde{ω \underset{̲}{C}}, \tilde{\tilde{ω \underset{̲}{C}}}] + [\tilde{ω \underset{̲}{C}}, ω \underset{̲}{L} \tilde{\tilde{ω \underset{̲}{C}}}]) .

Therefore, the desired bound will follow from a bootstrapping argument the estimates 343 , 319 – 320 , and 333 – 334 with the help of the following three bilinear estimates:

\begin{matrix} \nabla_{x} Δ^{- 1} : L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{\infty, (2, \frac{n}{2} - γ)}) \cdot L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{r_{γ}, (2, \frac{n - 4}{2})}) & ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{n, (2, \frac{n - 2}{2} - γ)}), \end{matrix}

(349)

\begin{matrix} \nabla_{x} Δ^{- 1} : L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{n, (2, \frac{n - 2}{2} - γ)}) \cdot L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) & ↪ L_{ω}^{2} ({\dot{B}}_{2, 10 n}^{n, (2, \frac{n - 2}{2} - γ)}), \end{matrix}

(350)

\begin{matrix} \nabla_{x} Δ^{- 1} : L_{ω}^{2} (^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}) \cdot L_{ω}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}) & ↪ L_{ω}^{2} (^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}) . \end{matrix}

(351)

\begin{matrix}  \end{matrix}

The estimates 349 – 350 are again an integrated form of the general Besov embedding 45 , and we leave it to the reader to check that the indices

p_{γ}, r_{γ}

are in the right range to satisfy the conditions 47 – 50 . It remains for us to prove the inclusion 351 .

We do this now. Let

A

and

C

be two test matrices. By performing a trichotomy, we see that it suffices to prove the following three frequency localized summation estimates for fixed values of

ω

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} ≪ μ_{2} λ \sim μ_{2}} λ^{- \frac{1}{2} - γ} (1 + λ)^{10 n} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) & ≲ \end{matrix}

(352)

\begin{matrix} ∥ A ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} \cdot ∥ C ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}, \end{matrix}

(353)

\begin{matrix} \sum_{λ, μ_{i} : μ_{2} ≪ μ_{1} λ \sim μ_{1}} λ^{- \frac{1}{2} - γ} (1 + λ)^{10 n} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) & ≲ \end{matrix}

(354)

\begin{matrix} ∥ A ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} \cdot ∥ C ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})}, \end{matrix}

(355)

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} \sim μ_{2} λ ≲ μ_{1}, μ_{2}} λ^{- \frac{1}{2} - γ} (1 + λ)^{10 n} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) & ≲ \end{matrix}

(356)

\begin{matrix} ∥ A ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} \cdot ∥ C ∥ {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})} . \end{matrix}

(357)

\begin{matrix}  \end{matrix}

The proof of 353 – 357 is essentially identical to the proof of the three estimates 265 – 269 we have shown earlier, although the proof of the last estimate 357 requires a slightly more delicate argument due to the presence of additional low frequency weights. We leave 353 – 355 to the reader. To show the last estimate above, we follow the proof of 269 which begins on line 271 , although we do so without throwing away the

P_{λ}

multiplier so soon. This leaves us with the fixed frequency estimate, which we expand out into all frequencies in the

ω^{| |}

variable, calling the appropriate multipliers

{\tilde{Q}}_{σ}

\begin{matrix} λ^{- \frac{1}{2} - γ} ∥ \nabla_{x} Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & λ^{\frac{n - 1}{p_{γ}} - \frac{3}{2} - γ} ∥ P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{p_{γ}}), \end{matrix}

\begin{matrix} ≲ & λ^{\frac{n - 1}{p_{γ}} - \frac{3}{2} - γ} \sum_{σ ≲ λ} ∥ {\tilde{Q}}_{σ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{p_{γ}}) . \end{matrix}

(358)

\begin{matrix}  \end{matrix}

Now we use the fact that the multiplier

{\tilde{Q}}_{σ}

only acts in the

ω^{| |}

variable. In that variable its action can be written in terms of a kernel

K^{{\tilde{Q}}_{σ}}

which has uniformly bounded

L_{ω^{| |}}^{1}

norm (in terms of the parameter

σ

) and has amplitude

\sim σ

Therefore, via Young's and then Hölder's inequality, and a little dyadic summing, this allows us to bound:

358

\begin{matrix} (L . H . S .) & ≲ λ^{\frac{n - 1}{p_{γ}} - \frac{3}{2} - γ} \sum_{σ ≲ λ} ∥ (| K^{{\tilde{Q}}_{σ}} | * ∥ (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{⊥}}^{p_{γ}}) ∥ L_{ω^{| |}}^{2}, \end{matrix}

\begin{matrix} ≲ λ^{\frac{n - 1}{p_{γ}} - \frac{3}{2} - γ} \sum_{σ ≲ λ} σ^{\frac{1}{p_{γ}}} ∥ P_{μ_{1}} A \cdot P_{μ_{2}} C ∥ L_{ω^{| |}}^{\frac{2 p_{γ}}{2 + p γ}} (L_{ω^{⊥}}^{p_{γ}}), \end{matrix}

\begin{matrix} ≲ λ^{\frac{n}{p_{γ}} - \frac{3}{2} - γ} ∥ P_{μ_{1}} A ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) \cdot ∥ P_{μ_{2}} C ∥ L^{p_{γ}}, \end{matrix}

\begin{matrix} ≲ {(\frac{λ}{μ_{1}})}^{\frac{n}{p_{γ}} - \frac{3}{2} - γ} μ_{1}^{- \frac{1}{2} - γ} ∥ P_{μ_{1}} A ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}) \cdot μ_{2}^{\frac{n}{p_{γ}} - 1} ∥ P_{μ_{2}} C ∥ L^{p_{γ}} . \end{matrix}

\begin{matrix}  \end{matrix}

This last line and the condition

0 < \frac{n}{p_{γ}} - \frac{3}{2} - γ

allows us to safely make the sum on the left hand side of 357 and then proceed via Cauchy-Schwartz to arrive at the desired bound. This completes our proof of the bilinear estimate 351 .

The last thing we need to do here is to prove the two final estimates 345 and 346 .

The second of these is of course simply an integrated version of the general estimate 45 . Therefore we concentrate on proving the first. To do this, we proceed as we did in the proof of estimate 351 and run a trichotomy on a product of test matrices

A \cdot C

. This leaves us with establishing the three estimates (forgetting about the extra high frequency weights which are not central):

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} ≪ μ_{2} λ \sim μ_{2}} λ^{- γ} ∥ Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{\infty} & ≲ ∥ A ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}} \cdot ∥ C ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}, \end{matrix}

(359)

\begin{matrix} \sum_{λ, μ_{i} : μ_{2} ≪ μ_{1} λ \sim μ_{1}} λ^{- γ} ∥ Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{\infty} & ≲ ∥ A ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}} \cdot ∥ C ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty}, \end{matrix}

(360)

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} \sim μ_{2} λ ≲ μ_{1}, μ_{2}} λ^{- γ} ∥ Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{\infty} & ≲ ∥ A ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}} \cdot ∥ C ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} . \end{matrix}

(361)

\begin{matrix}  \end{matrix}

The proofs of the two

L o w \times H i g h

interaction estimates, 359 – 360 , are both similar and very simple. They follow from the pair of

L^{\infty}

estimates:

\begin{matrix} ∥ P_{μ_{1}} (A) ∥ L^{\infty} & ≲ μ_{1} ∥ P_{μ_{1}} (A) ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}}, \end{matrix}

(362)

\begin{matrix} ∥ P_{μ_{2}} (C) ∥ L^{\infty} & ≲ μ_{2}^{1 + γ} ∥ P_{μ_{2}} (C) ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} . \end{matrix}

(363)

\begin{matrix}  \end{matrix}

The proof of 362 follows easily from the kind of angular decomposition and Bernstein inequality tricks used to prove estimate 192 above. To prove the second estimate 363 , we let

{\tilde{Q}}_{σ}

again denote a family of frequency cutoffs in the

R_{ω^{| |}}

variable and we compute via Bernstein:

\begin{matrix} ∥ P_{μ_{2}} (C) ∥ L^{\infty} & ≲ \sum_{σ ≲ μ_{1}} ∥ {\tilde{Q}}_{σ} P_{μ_{2}} (C) ∥ L_{ω^{⊥}}^{\infty} (L_{ω^{| |}}^{\infty}), \end{matrix}

\begin{matrix} ≲ \sum_{σ ≲ μ_{1}} σ^{\frac{1}{2}} ∥ P_{μ_{2}} (C) ∥ L_{ω^{⊥}}^{\infty} (L_{ω^{| |}}^{2}), \end{matrix}

\begin{matrix} ≲ μ_{2}^{1 + γ} ∥ P_{μ_{2}} (C) ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} . \end{matrix}

\begin{matrix}  \end{matrix}

Using now 362 – 363 we have the pair of fixed frequency bounds:

\begin{matrix} λ^{- γ} ∥ Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{\infty} \end{matrix}

\begin{matrix} ≲ & (\frac{μ_{1}}{μ_{2}}) ∥ P_{μ_{1}} (A) ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}} \cdot ∥ P_{μ_{2}} (C) ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} & μ_{1} & ≪ μ_{2}, \end{matrix}

\begin{matrix}  \end{matrix}

and:

\begin{matrix} λ^{- γ} ∥ Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{\infty} \end{matrix}

\begin{matrix} ≲ & {(\frac{μ_{2}}{μ_{1}})}^{1 + γ} ∥ P_{μ_{1}} (A) ∥ Δ_{ω^{⊥}}^{- \frac{1}{2}} {\dot{H}}^{\frac{n - 4}{2}} \cdot ∥ P_{μ_{2}} (C) ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} & μ_{2} & ≪ μ_{1} . \end{matrix}

\begin{matrix}  \end{matrix}

These may easily be summed over the respective ranges on the left hand side of 359 – 360 to yield the desired bounds.

Our final task here is to establish the

H i g h \times H i g h

frequency interaction estimate 361 . This is where the non-isotropic spaces really shine. In what follows, we let

Q_{σ_{1}}

denote a frequency cutoff in the

R_{ω^{⊥}}^{n - 1}

frequency plane, and

{\tilde{Q}}_{σ_{2}}

a cutoff in the orthogonal direction. We compute that:

\begin{matrix} λ^{- γ} ∥ Δ^{- 1} P_{λ} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L^{\infty}, \end{matrix}

\begin{matrix} ≲ & λ^{- 2 - γ} \sum_{σ_{i} : σ_{i} ≲ λ} ∥ {\tilde{Q}}_{σ_{2}} Q_{σ_{1}} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{⊥}}^{\infty} (L_{ω^{| |}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & λ^{- 1 - γ} \sum_{σ_{1} : σ_{1} ≲ λ} ∥ Q_{σ_{1}} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{⊥}}^{\infty} (L_{ω^{| |}}^{1}), \end{matrix}

\begin{matrix} ≲ & λ^{- 1 - γ} \sum_{σ_{1} : σ_{1} ≲ λ} ∥ Q_{σ_{1}} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{1} (L_{ω^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & λ^{- 1 - γ} \sum_{σ_{1} : σ_{1} ≲ λ} σ_{1}^{\frac{n - 3}{2}} ∥ Q_{σ_{1}} (P_{μ_{1}} A \cdot P_{μ_{2}} C) ∥ L_{ω^{| |}}^{1} (L_{ω^{⊥}}^{\frac{2 (n - 1)}{n - 3}}), \end{matrix}

\begin{matrix} ≲ & λ^{\frac{n - 5}{2} - γ} ∥ P_{μ_{1}} (A) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\frac{2 (n - 1)}{n - 3}}) \cdot ∥ P_{μ_{2}} (C) ∥ L_{ω^{| |}}^{2} (L_{ω^{⊥}}^{\infty}), \end{matrix}

\begin{matrix} ≲ & {(\frac{λ}{μ_{1}})}^{\frac{n - 5}{2} - γ} ∥ Δ_{ω^{⊥}}^{\frac{1}{2}} P_{μ_{1}} (A) ∥ {\dot{H}}_{x}^{\frac{n - 4}{2}} \cdot ∥ P_{μ_{2}} (C) ∥^{ω} N_{1, 10 n}^{- \frac{1}{2} - γ, 2, \infty} . \end{matrix}

\begin{matrix}  \end{matrix}

Notice that the last line above follows from the

{\dot{H}}^{1}

Sobolev embedding in the

R_{ω^{⊥}}^{n - 1}

plane. This estimate can now be safely summed using the condition that

6 ⩽ n

to sum over the lower dyadics, and then using Cauchy-Schwartz to sum over the frequency localized pieces. This completes our proof of the bilinear estimate 351 , and hence our proof of the integrated bound 324 . □

Having now established the proof of both the integrated bounds 314 – 324 , we have proved the integrated group element bounds 309 – 310 . This ends our proof of Proposition 9.4 . □

9.1 Proof of the Accuracy estimate 178d

We will now give a short proof of the multiplier equivalence bound 178d . This will follow almost directly from the estimates we have already shown. We compute the kernel of the operator

Φ (0) ((2 π | ξ |)^{α} (Φ (0))^{*}) - (- Δ)^{\frac{α}{2}} P_{1}

to be (again suppressing

\pm

notations):

(364) K α ( x , y ) = ∫ R n e 2 π i ( x − y ) ⋅ ξ ω g − 1 ( x ) ω g ( y ) [ ∙ ] ω g − 1 ( y ) ω g ( x ) χ α ( ξ ) d ξ − ∫ R n e 2 π i ( x − y ) ⋅ ξ [ ∙ ] χ α ( ξ ) d ξ ,

where

χ^{α} (ξ) = (2 π | ξ |)^{α} χ_{(- \frac{1}{2}, 2)} (ξ)

. Notice that this cutoff function satisfies the general requirements of the generic bump function

χ

used throughout this section.

In particular, there exist constants

C_{k}

which depend only on

α

and the original

χ_{(- \frac{1}{2}, 2)}

such that:

\begin{matrix} \int_{R^{n}} | \nabla_{ξ}^{k} χ^{α} (ξ) | d ξ ⩽ C_{k} . \end{matrix}

(365)

We now decompose the kernel

K^{α} = \sum_{σ} K_{σ}^{α}

according to the dyadic physical space decomposition 213 . For each fixed value of the small constant

ℰ

on line 178d we write this sum in terms of two pieces, a “close” part and a “far” part:

\begin{matrix} K^{α} & = K_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}^{α} + K_{ℰ^{- \frac{1}{2 (n + 1)}} < ∙}^{α}, \end{matrix}

(366)

\begin{matrix} = \sum_{σ : σ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}} K_{σ}^{α} + \sum_{σ : ℰ^{- \frac{1}{2 (n + 1)}} < σ} K_{σ}^{α} . \end{matrix}

\begin{matrix}  \end{matrix}

To estimate the near portion of things, we do a little algebraic manipulation and write the kernel as:

K_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}^{α} = χ_{D_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}} (\int_{R n} e^{2 π i (x - y) \cdot ξ} (ω g^{- 1} (x) ω g (y) - I) [∙] ω g^{- 1} (y) ω g (x) χ^{α} (ξ) d ξ + \int_{R n} e^{2 π i (x - y) \cdot ξ} [∙] (ω g^{- 1} (y) ω g (x) - I) χ^{α} (ξ) d ξ) .

By a direct application of the pair of integrated bounds 309 – 310 (with

M \sim 1

) this last expression gives us the absolute kernel bound:

| K_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}^{α} (x, y) | ≲ ℰ \cdot (1 + | x - y |) χ_{D_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}} (| x - y |) .

By integrating the right hand side of this last inequality we easily arrive at the pair of Schur-test bounds:

\begin{matrix} ∥ K_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}^{α} ∥ L_{y}^{\infty} (L_{x}^{1}), ∥ K_{∙ ⩽ ℰ^{- \frac{1}{2 (n + 1)}}}^{α} ∥ L_{x}^{\infty} (L_{y}^{1}) ≲ ℰ^{\frac{1}{2}} . \end{matrix}

(367)

To estimate the second kernel on the right hand side of 366 , we do things separately for each term in the sum 364 . For the second term, which does not contain the group elements, a simple application of the estimate 365 and integration by parts shows that one has the absolute bounds:

\begin{matrix} | χ_{D_{ℰ^{- \frac{1}{2 (n + 1)}} < ∙}} (| x - y |) \int_{R^{n}} e^{2 π i (x - y) \cdot ξ} [∙] χ^{α} (ξ) d ξ |, \end{matrix}

(368)

\begin{matrix} ≲ & χ_{D_{ℰ^{- \frac{1}{2 (n + 1)}} < ∙}} (| x - y |) \cdot (1 + | x - y |)^{- 2 (n + 1)}, \end{matrix}

\begin{matrix} ≲ & ℰ^{\frac{1}{2}} \cdot (1 + | x - y |)^{- (n + 1)} . \end{matrix}

\begin{matrix}  \end{matrix}

This easily yields Schur-test bounds of the form 367 . Therefore, it remains to prove these bounds for the first integral expression on the right hand side 364 after it has been cut off in the far region

ℰ^{- \frac{1}{2 (n + 1)}} < | x - y |

. This follows at once from writing this kernel as a sum over various dyadic regions, and using the symbol bounds 229 – 230 as well as the reduction to the integrated estimates 309 – 310 . The key thing to notice is that there are only two places where we do not pick up the factor of

ℰ

in the resulting estimates. The first is in the main integration by parts argument when the derivatives

\nabla_{ξ}^{k}

all fall on the cutoff function

χ^{α}

. In that case we can simply use the compactness of the group elements and proceed in a way that is analogous to the computation which started on line 368 above. The second place is where we estimate the integral 303 . In that case we can easily upgrade the bound 305 to have the factor

σ^{- 2 (n + 1)}

on the right hand side. We are then essentially in the same situation as was reached starting on line 368 above. This completes our proof of the general multiplier approximation estimate 178d .

10 The Dispersive Estimate

In this section, we complete our proof of the non-microlocalized version of the Strichartz estimates contained in 178a . Using the abstract machinery of [4] , these will follow once we can show that the parametrix 184 satisfies a dispersive estimate.

If at fixed time

t

we write that operator as:

T (t) (\hat{f}) = Φ (t) (\hat{f}),

where we have suppressed the

\pm

notation, then we seek to prove the bound (where

f

has nothing to do with the original

\hat{f}

, but just represents a function of the physical space variables):

\begin{matrix} ∥ T (t) T^{*} (s) f ∥ L_{x}^{\infty} ≲ (1 + | t - s |)^{- \frac{n - 1}{2}} ∥ f ∥ L_{x}^{1} . \end{matrix}

(369)

Now, a calculation similar that used to produce 211 shows that the kernel of the above operator can be computed to be:

(370) K T T * ( t , s ; x , y ) = ∫ R n e 2 π i ( ( t − s ) | ξ | + ( x − y ) ⋅ ξ ) ω g − 1 ( t , x ) ω g ( s , y ) [ ∙ ] ω g − 1 ( s , y ) ω g ( t , x ) χ ( ξ ) d ξ .

Therefore, as is usually the case, we see that it suffices to show the fixed time uniform bound:

\begin{matrix} ∥ K^{T T^{*}} (t, s; \cdot, \cdot) ∥ L_{x, y}^{\infty} ≲ (1 + | t - s |)^{- \frac{n - 1}{2}} . \end{matrix}

(371)

The proof of 371 turns out to be a straightforward consequence of the bounds established in the previous section. The strategy we follow here is almost identical.

We first decompose the

K^{T T *}

kernel into a sum of two pieces:

K_{σ}^{T T^{*}} = {\tilde{K}}^{T T^{*}} + ℛ^{T T^{*}},

for which we'll show the bound 371 individually. The

{\tilde{K}}^{T T^{*}}

kernel will be smooth enough that we can use a standard stationary phase computation on it.

The remainder kernel

ℛ^{T T^{*}}

will be small in absolute value without using any sophisticated integration by parts (although, as in the previous section, there will be some use for oscillations in this term also). As in the previous section, the definition of

{\tilde{K}}^{T T^{*}}

will depend on a physical space scale, in this case the value of

(1 + | t - s | + | x - y |)

. This will again be effected by the choice of an auxiliary gauge transformation

\tilde{ω g}

. This time we define

\tilde{ω g}

to be the transformation into the Coulomb gauge of the smoothed out potential:

\begin{matrix} \tilde{ω {\underset{̲}{A}}^{(M)}} = - ω {\bar{Π}}_{M^{- 1} < ∙} ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{x} ω L Δ_{ω^{⊥}}^{- 1} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}), \end{matrix}

(372)

where we define the scale

M

to be such that:

M = (1 + | t - s | + | x - y |)^{\frac{1}{2}} .

As before, we use the splitting 226 – 227 to compute:

(373) K ~ T T * ( t , s ; x , y ) = ∫ R n e 2 π i ( ( t − s ) | ξ | + ( x − y ) ⋅ ξ ) ω g ~ − 1 ( t , x ) ω g ~ ( s , y ) [ ∙ ] ω g ~ − 1 ( s , y ) ω g ~ ( t , x ) χ ( ξ ) d ξ .

Our first step here is to notice that it suffices to show 371 for the kernel 373 under the condition that

| x - y | > \frac{1}{2} (1 + | t - s |)

, for if this were not the case then we could simply integrate by parts as many times as necessary with respect to the variable

λ = | ξ |

in the expression 373 and easily achieve 371 . Therefore, we will now show that:

\begin{matrix} ∥ {\tilde{K}}^{T T^{*}} (t, s; x, y) ∥ ≲ | x - y |^{- \frac{n - 1}{2}} . \end{matrix}

(374)

We now factor the phase in 373 as:

e^{2 π i ((t - s) | ξ | + (x - y) \cdot ξ)} = e^{2 π i (t - s) λ} e^{2 π i λ | x - y | cos (Θ_{x - y, ω})},

where we are using the frequency polar coordinates

ξ = λ ω

. Integrating first on the sphere

S^{n - 1}

, we see that to conclude 374 it is enough to show that:

(375) ∥ ∫ S n − 1 e 2 π i λ | x − y | cos ( Θ x − y , ω ) ω g ~ − 1 ( t , x ) ω g ~ ( s , y ) [ ∙ ] ω g ~ − 1 ( s , y ) ω g ~ ( t , x ) d ω ∥ ≲ | x − y | − n − 1 2 .

This last estimate will follow easily from the Morse lemma and the already established symbol bounds 232 – 233 . To implement this, we first cut off the above integral into small neighborhoods of stationary points of the phase and a remainder. We do this with the smooth partition of unity:

1 = χ_{| 1 - cos (Θ_{x - y, ω}) | < \frac{1}{8}} + χ_{| 1 + cos (Θ_{x - y, ω}) | < \frac{1}{8}} + \tilde{χ} .

The cutoff

\tilde{χ}

cuts off on the region where

cos (Θ_{x - y, ω})

is bounded away from

\pm 1

, and there we have the gradient estimate:

c < | \nabla_{ω} cos (Θ_{x - y, ω}) |,

for a sufficiently small constant

c

. Using this, and integrating by parts

n - 1

times while using the symbol bounds 232 – 233 , we easily have that:

∥ \int_{S^{n - 1}} e^{2 π i λ | x - y | cos (Θ_{x - y, ω})} {\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (s, y) [∙] {\tilde{ω g}}^{- 1} (s, y) \tilde{ω g} (t, x) \tilde{χ} (ω) d ω ∥ ≲ λ^{1 - n} \cdot | x - y |^{- \frac{n - 1}{2}} .

This proves 375 because we may assume that

\frac{1}{4} < λ

. Our goal is now to prove the localized estimate:

∥ \int_{S^{n - 1}} e^{2 π i λ | x - y | cos (Θ_{x - y, ω})} {\tilde{ω g}}^{- 1} (t, x) \tilde{ω g} (s, y) [∙] {\tilde{ω g}}^{- 1} (s, y) \tilde{ω g} (t, x) {\tilde{χ}}_{| 1 - cos (Θ_{x - y, ω}) | < \frac{1}{8}} (ω) d ω ∥ ≲ | x - y |^{- \frac{n - 1}{2}} .

It will become clear that the corresponding estimate for the region where

| 1 + cos (Θ_{x - y, ω}) | < \frac{1}{8}

follows from identical calculations.

Now, the angular function

cos (Θ_{x - y, ω})

has a single non-degenerate critical point in a neighborhood of the unit vector

(x - y) / | x - y |

with index

n - 1

. Therefore, by the Morse lemma there exists a diffeomorphism

θ = φ (ω)

in a neighborhood of this point such that:

1 - cos (Θ_{x - y, ω}) = θ_{1}^{2} + \dots + θ_{n - 1}^{2} .

By making this change of variables, we see that we are trying to prove that:

| | | \int_{R^{n - 1}} {e^{2 π i λ | x - y | | θ |^{2}}}^{φ^{- 1} (θ)} {\tilde{g}}^{- 1} (t, x)^{φ^{- 1} (θ)} \tilde{g} (s, y) {[∙]}^{φ^{- 1} (θ)} {\tilde{g}}^{- 1} (s, y)^{φ^{- 1} (θ)} \tilde{g} (t, x) χ (θ) J_{φ^{- 1}} (θ) d θ | | | ≲ | x - y |^{- \frac{n - 1}{2}} .

Here

J_{φ^{- 1}}

denotes the Jacobian matrix of

φ^{- 1}

, and

χ

is some smooth function which is supported where

| θ | ⩽ 1

. Making now the simple change of variables

\sqrt{λ | x - y |} θ = θ^{'}

, it suffices to be able to show that:

(376) | | | ∫ R n − 1 e 2 π i | θ ′ | 2 φ ~ ( θ ′ ) g ~ − 1 ( t , x ) φ ~ ( θ ′ ) g ~ ( s , y ) [ ∙ ] φ ~ ( θ ′ ) g ~ − 1 ( s , y ) φ ~ ( θ ′ ) g ~ ( t , x ) J ~ ( θ ′ ) d θ ′ | | | ≲ 1 .

Here

\tilde{J} (θ^{'})

denotes a smooth function with (large) compact support and uniform gradient bounds:

| \nabla_{θ^{'}}^{k} \tilde{J} | ≲ 1 .

Furthermore, the function

\tilde{φ} (θ^{'})

obeys the gradient bounds:

| \nabla_{θ^{'}}^{k} \tilde{φ} | ≲ | x - y |^{- \frac{k}{2}} .

Combining this last estimate with the symbol bounds 232 – 233 and the truncation condition

M = | x - y |^{\frac{1}{2}}

, we have the uniform gradient estimates:

\begin{matrix} ∥ \nabla_{θ^{'}}^{k} (^{\tilde{φ}} {\tilde{g}}^{- 1} (t, x)^{\tilde{φ}} \tilde{g} (s, y)) ∥ ≲ 1, \end{matrix}

\begin{matrix} ∥ \nabla_{θ^{'}}^{k} (^{\tilde{φ}} {\tilde{g}}^{- 1} (s, y)^{\tilde{φ}} \tilde{g} (t, x)) ∥ ≲ 1 . \end{matrix}

\begin{matrix}  \end{matrix}

Using these bounds, we can prove the bound 376 by treating the quantity on the left hand side as a Fresnel-type integral and performing

n

integrations by parts in the region where

1 < | θ^{'} |

To complete our proof of 371 we need to show that:

\begin{matrix} ∥ ℛ^{T T^{*}} (t, s; \cdot, \cdot) ∥ L_{x, y}^{\infty} ≲ (1 + | t - s |)^{- \frac{n - 1}{2}}, \end{matrix}

(377)

where

ℛ^{T T^{*}}

is the kernel which is defined by subtracting 373 from 370 . Using the splitting 226 – 227 we see that this has at least one factor involving the expressions

ω h^{- 1} (x) ω h (y) - I

ω h (x) ω h^{- 1} (y) - I

under the integral sign. There are several such combination, but we will choose to estimate only one such term and leave the others to reader as they can be treated analogously. Therefore, we may without loss of generality assume that we are trying to prove the bound:

(378) ∥ ∫ 0 ∞ ∫ S n − 1 e 2 π i λ ( ( t − s ) + ( x − y ) ⋅ ω ) ω G ( t , x ; s , y ) χ ( λ ) λ n − 1 d λ d ω ∥ ≲ ( 1 + | t − s | ) − n − 1 2 ,

where we have set:

^{ω} G (t, x; s, y) = {\tilde{ω g}}^{- 1} (t, x) (ω h^{- 1} (t, x) ω h (s, y) - I) \tilde{ω g} (s, y) [∙] ω g^{- 1} (s, y) ω g (t, x) .

As in the proof of 371 above for the smoothed out kernel

{\tilde{K}}^{T T^{*}}

, we may without loss of generality assume that we trying to prove 378 in the region where

| x - y | > \frac{1}{2} (1 + | t - s |)

because otherwise we may integrate as many times as necessary with respect to the radial frequency variable to pick up the desired decay.

To proceed further, we will first decompose the range of frequency integration into a small set and a remainder where we can again integrate by parts with respect to

λ

. This is accomplished by using the angular partition of unity:

1 = χ_{| \frac{t - s}{| x - y |} + cos (Θ_{x - y, ω}) | > | x - y |^{γ - 1}} + χ_{| \frac{t - s}{| x - y |} + cos (Θ_{x - y, ω}) | ⩽ | x - y |^{γ - 1}} .

To deal with the bound 378 for the first cutoff function above, we need to show that:

∥ {\int_{S^{n - 1}}}^{ω} G (t, x; s, y) d ω \cdot \int_{0}^{\infty} e^{2 π i λ ((t - s) + (x - y) \cdot ω)} χ_{| \frac{t - s}{| x - y |} + cos (Θ_{x - y, ω}) | > | x - y |^{γ - 1}} χ (λ) λ^{n - 1} d λ ∥ ≲ | x - y |^{- \frac{n - 1}{2}} .

This bound follows easily from radial integration by parts in the inner integral, followed by the simple compactness estimate:

\int_{S^{n - 1}} ∥^{ω} G (t, x; s, y) ∥ d ω ≲ 1,

which is of course uniform in the variables

(t, x; s, y)

To wrap things up here, we need to show the absolute estimate:

\int_{R n} ∥^{ω} G (t, x; s, y) ∥ χ_{| \frac{t - s}{| x - y |} + cos (Θ_{x - y, ω}) | ⩽ | x - y |^{γ - 1}} χ (ξ) d ξ ≲ | x - y |^{- \frac{n - 1}{2}} .

After a Cauchy-Schwartz, this will follow once we can establish that both:

\begin{matrix} {(\int_{R n} χ_{| \frac{t - s}{| x - y |} + cos (Θ_{x - y, ω}) | ⩽ | x - y |^{γ - 1}} χ (ξ) d ξ)}^{\frac{1}{2}} & ≲ | x - y |^{\frac{1}{2} (γ - 1)}, \end{matrix}

(379)

\begin{matrix} {(\int_{S^{n - 1}} ∥^{ω} G (t, x; s, y) ∥^{2} d ω)}^{\frac{1}{2}} & ≲ | x - y |^{- \frac{1}{2} (n - 2 + γ)} . \end{matrix}

(380)

\begin{matrix}  \end{matrix}

The first estimate, 379 follows from elementary bounds. Notice first that after a rotation, it suffices to assume that the vector

x - y

lies along the

(1, 0)

direction.

Then the cutoff function is supported in the region where:

\frac{ξ_{1}}{| ξ |} = - \frac{t - s}{| x - y |} + O (| x - y |^{γ - 1}),

which is a conical set about the

ξ_{1}

-axis of volume no greater than a constant times

| x - y |^{γ - 1}

in the region where

| ξ | ≲ 1

. The second estimate 380 above we have already shown. It is a special case of the bound 309 which was proved in the previous section. This completes our proof of 377 , and hence our demonstration of the dispersive estimate 371 .

11 The Decomposable Function Spaces: Proof of the Square-Sum and Differentiated Strichartz Estimates

We now introduce a piece of machinery which will be of central importance for the remainder of the paper. This is a suitable reinterpretation of the important “decomposable function” criterion from the work [8] . In our context, we set up the general situation as follows: Suppose we are given an

M (m \times m)

valued Fourier integral operator:

\begin{matrix} Φ (\hat{f}) (t, x) = \int_{R n} e^{2 π i ψ (t, x; ξ)} e^{2 π i x \cdot ξ} g_{1} (t, x; ξ) \hat{f} (ξ) g_{2} (t, x; ξ) d ξ, \end{matrix}

(381)

where the

g_{i}

are arbitrary matrix valued functions, such that this operator satisfies certain mixed Lebesgue space mapping properties (uniform in

y_{0}

\begin{matrix} ∥ Φ_{y_{0}} (\hat{f}) ∥ L^{q_{1}} (L^{r_{1}}) ≲ ∥ \hat{f} ∥ L^{2}, \end{matrix}

(382)

where

Φ_{y_{0}}

is the same operator as 381 but with phase

ψ

replaced by

ψ_{y_{0}} = ψ (t, x - y_{0}; ξ)

. Suppose now that we are given a matrix valued function

C (t, x; ω)

which only depends on the angular variable

ω = ξ / | ξ |

in frequency. We would like to prove estimates for the coupled operator (we only discuss left multiplication here, the case of right multiplication is analogous):

\begin{matrix} \tilde{Φ} (\hat{f}) (t, x) = \int_{R n} e^{2 π i ψ (t, x; ξ)} e^{2 π i x \cdot ξ} C (t, x; ω) \cdot g_{1} (t, x; ξ) \hat{f} (ξ) g_{2} (t, x; ξ) d ξ . \end{matrix}

(383)

These should be done in a way that the decay properties of the function

C (t, x; ω)

can be used to improve the range of the estimates 382 . A robust way for doing this has been worked out in the paper of Rodnianski–Tao [8] . The answer is to fix an angular scale, say

θ

, and then to form the norm (“classical” decomposable norm):

\begin{matrix} ∥ C ∥^{2} D_{θ}^{c l} (L_{t}^{q_{2}} (L_{x}^{r_{2}})) = \sum_{k = 0}^{10 n} θ^{- n + 1} \int_{S_{ω}^{n - 1}} ∥ (θ \nabla_{ξ})^{k} C ∥^{2} L_{t}^{q_{2}} (L_{x}^{r_{2}}) d ω . \end{matrix}

(384)

By decomposing the frequency variable in 383 into angular sectors of size

\sim θ

, a straightforward computation then shows that one has the estimate:

\begin{matrix} ∥ \tilde{Φ} (\hat{f}) ∥ L^{q} (L^{r}) ≲ ∥ C ∥ D_{θ}^{c l} (L_{t}^{q_{2}} (L_{x}^{r_{2}})) \cdot ∥ \hat{f} ∥ L^{2}, \end{matrix}

(385)

whenever estimate 382 holds with

\frac{1}{q} = \frac{1}{q_{1}} + \frac{1}{q_{2}}

and

\frac{1}{r} = \frac{1}{r_{1}} + \frac{1}{r_{2}}

There are two problems which occur when trying to apply 384 in the present context. The first is that this norm is for a single scale, which causes problems in products where many different scales interact with each other. The other problem, which is conceptually much more serious, is that the estimate 384 contains the highly singular factor of

θ^{- \frac{n - 1}{2}}

, which needs to be eliminated with a delicate orthogonality argument, the kind which is not preserved in this problem for a variety of reasons (non-linear Hodge systems, a covariant wave equation that does not commute with angular cutoffs, etc). However, with only a slight reworking the basic idea behind 384 can be shown to be surprisingly robust. First of all, for a fixed scale we replace 384 with a square function norm which has the same effect, and which will be very easy to verify in the present context. Since we will be using multiple scales in a moment, we introduce the solid angular cutoff functions

{\bar{b^{φ}}}_{θ} (ω)

(not to be confused with the hollow multipliers

b_{θ}^{ω} (ξ)

introduced in Section 4 ), such that:

\begin{matrix} {\bar{b^{φ}}}_{θ} (ω) \equiv 1, \end{matrix}

(386)

when

ω \in Γ_{φ}

, for the angular sector

Γ_{φ}

which we interpret as a cap in a finitely overlapping collection on the sphere

S_{ω}^{n - 1} = \cup_{φ} Γ_{φ}

. Here the scale is determined by the condition

| Γ_{φ} | \sim θ

. On this scale, we replace 384 with the norm:

\begin{matrix} ∥ C ∥ D_{θ} (L_{t}^{q_{2}} (L_{x}^{r_{2}})) = ∥ {(\sum_{k = 0}^{10 n} \sum_{φ} {sup}_{ω} ∥ {\bar{b^{φ}}}_{θ} (θ \nabla_{ξ})^{k} C ∥^{2} L_{x}^{r_{2}})}^{\frac{1}{2}} ∥_{L_{t}^{q_{2}}} . \end{matrix}

(387)

It is not difficult to see that by decomposing the integral on the right hand side of 384 into fine and course scales, and applying Hölder's on the fine (continuous) scales, that the Rodnianski-Tao norm 384 with the time integral on the outside is bounded by the norm 387 . Furthermore, it is easy to see from the proof given in [8] that having the time integral on the outside does not effect the bound 385 so long as the index

q_{1}

implicitly appearing in this bound is such that

2 ⩽ q_{1}

. This allows one to use Minkowski's inequality to pull the square sum on the parametrix through the time integral. For us this index condition will always hold because we are working with Strichartz type norms. We leave it to the reader to work out the details of these claims.

We now form an

ℓ^{1}

Banach space based on incorporating the norms 387 over all dyadic angular scales

θ ≲ 1

. The elements of this space we denote by

{C} = {C^{(θ)}}

, and we define its norm

ℓ^{1} (D_{θ})

norm as:

\begin{matrix} ∥ {C} ∥ ℓ^{1} (D_{θ} (L_{t}^{q_{2}} (L_{x}^{r_{2}}))) = \sum_{θ} ∥ C^{(θ)} ∥ D_{θ} (L_{t}^{q_{2}} (L_{x}^{r_{2}})) . \end{matrix}

(388)

There is also the forgetful map from the space

ℓ^{1} (D_{θ})

to functions which define as:

\begin{matrix} {C} ⇝ C = \sum_{θ} C^{(θ)}, \end{matrix}

(389)

and we will in practice abusively identify

{C}

with

C

via the map 389 . The main point is that given any function

C

, there may be a variety of ways which we embed

C

in the space

ℓ^{1} (D_{θ})

, and it is up to the structure of the application to decide how this should be done. Of course, given the square function norms 116 we are working with, our choice here is somewhat canonical.

Now, if we consider the

C

in 389 as embedded in the integral 383 , we easily have the estimate:

\begin{matrix} ∥ \tilde{Φ} (\hat{f}) ∥ L^{q} (L^{r}) ≲ ∥ {C} ∥ ℓ^{1} (D_{θ} (L_{t}^{q_{2}} (L_{x}^{r_{2}}))) \cdot ∥ \hat{f} ∥ L^{2} . \end{matrix}

(390)

We also form spatial Besov versions of the norm 390 , which we denote as

ℓ^{1} D_{θ} (L^{q} ({\dot{B}}_{2}^{r, (2, s)}))

This leads us to the basic notation of this section:

Definition 11.1. For a given matrix valued function, we say it is in the decomposable space

D (L^{q} ({\dot{B}}_{2}^{r, (2, s)}))

if the following norm is finite:

\begin{matrix} ∥ C ∥ D (L^{q} ({\dot{B}}_{2}^{r, (2, s)})) = {inf}_{C = \sum_{θ} C^{(θ)}} {\sum_{θ} ∥ C^{(θ)} ∥ D_{θ} (L^{q} ({\dot{B}}_{2}^{r, (2, s)}))} . \end{matrix}

(391)

We also define the low frequency analog of these spaces, which we denote by

D (L^{q} ({\dot{B}}_{2, 10 n}^{r, (2, s)}))

, similarly.

We remark here that it is easy to see that the norm 391 leads to a Banach space. This will be important in a moment. Also, it is easy to show that the various Besov-Lebesgue space inclusions 42 – 44 hold for these spaces if we define

D (L^{p})

analogously to 391 . This is a simple consequence of the fact that the Littlewood-Paley theory commutes with the derivatives

\nabla_{ξ}^{k}

. We now show that this space satisfies the expected range of bilinear Riesz operator estimates:

Lemma 11.2 (A decomposable Besov calculus). Let the indices

0 ⩽ σ

1 ⩽ q_{i}, r_{i} ⩽ \infty

, and

s_{i}

be given. Then one has the following family of bilinear estimates:

\begin{matrix} | D_{x} |^{- σ} : D (L_{t}^{q_{1}} ({\dot{B}}_{2}^{r_{1}, (2, s_{1})})) \cdot D (L_{t}^{q_{2}} ({\dot{B}}_{2}^{r_{2}, (2, s_{2})})) ↪ D (L_{t}^{q_{3}} ({\dot{B}}_{1}^{r_{3}, (2, s_{3})})), \end{matrix}

(392)

where the various indices satisfy the conditions:

\begin{matrix} s_{3} & = s_{1} + s_{2} + σ - \frac{n}{2}, \end{matrix}

(393)

\begin{matrix} σ + \frac{n}{2} - s_{3} & < n (\frac{1}{r_{1}} + \frac{1}{r_{2}}), \end{matrix}

(394)

\begin{matrix} s_{1} & < \frac{n}{2} + min {n (\frac{1}{r_{2}} - \frac{1}{r_{3}}), 0}, \end{matrix}

(395)

\begin{matrix} s_{2} & < \frac{n}{2} + min {(\frac{1}{r_{1}} - \frac{1}{r_{3}}), 0}, \end{matrix}

(396)

\begin{matrix} \frac{1}{q_{3}} & = \frac{1}{q_{1}} + \frac{1}{q_{2}}, \end{matrix}

(397)

\begin{matrix} \frac{1}{r_{3}} & ⩽ \frac{1}{r_{1}} + \frac{1}{r_{2}} . \end{matrix}

(398)

\begin{matrix}  \end{matrix}

Proof of estimate 392 . The proof of 392 is largely a triviality given that it is true for the norms

L_{t}^{q_{1}} ({\dot{B}}_{2}^{r_{1}, (2, s_{1})})

without the decomposable structure. First of all, notice that from the definition 11.1 it suffices to establish things with the norms

D (L_{t}^{q_{i}} ({\dot{B}}_{2}^{r_{i}, (2, s_{i})}))

replaced by their vector generalizations

ℓ^{1} D_{θ} (L_{t}^{q_{i}} ({\dot{B}}_{2}^{r_{i}, (2, s_{i})}))

This follows at once from working with two test matrices

A

and

C

and decomposing them into sums:

\begin{matrix} A & = \sum_{θ} A^{(θ)}, & C & = \sum_{θ} C^{(θ)}, \end{matrix}

\begin{matrix}  \end{matrix}

where the

{A}

and

{C}

collections have norms no greater than twice that of

A

and

C

respectively.

Suppose now that we are given two test elements

{A^{(θ)}}

and

{C^{(θ)}}

. The we write their product under the map 389 as:

\begin{matrix} A \cdot C & = \sum_{θ_{1}, θ_{2}} A^{(θ_{1})} \cdot C^{(θ_{2})}, \end{matrix}

(399)

\begin{matrix} = \sum_{θ} (\sum_{θ_{1} : θ < θ_{1}} A^{(θ_{1})} \cdot C^{(θ)} + \sum_{θ_{2} : θ ⩽ θ_{2}} A^{(θ)} \cdot C^{(θ_{2})}), \end{matrix}

\begin{matrix} = \sum_{θ} T_{1}^{(θ)} + T_{2}^{(θ)} . \end{matrix}

\begin{matrix}  \end{matrix}

Freezing the scale

θ

, we will prove the following two estimates:

\begin{matrix} ∥ | D_{x} |^{- σ} T_{1}^{(θ)} ∥ D_{θ} (L_{t}^{q_{3}} ({\dot{B}}_{1}^{r_{3}, (2, s_{3})})) ≲ ∥ {A} ∥ ℓ^{1} D_{θ} (L_{t}^{q_{1}} {\dot{B}}_{2}^{r_{1}, (2, s_{1})}) \cdot ∥ C^{(θ)} ∥ D_{θ} (L_{t}^{q_{2}} {\dot{B}}_{2}^{r_{2}, (2, s_{2})}), \end{matrix}

(400)

\begin{matrix} ∥ | D_{x} |^{- σ} T_{2}^{(θ)} ∥ D_{θ} (L_{t}^{q_{3}} ({\dot{B}}_{1}^{r_{3}, (2, s_{3})})) ≲ ∥ A^{(θ)} ∥ D_{θ} (L_{t}^{q_{1}} {\dot{B}}_{2}^{r_{1}, (2, s_{1})}) \cdot ∥ {C} ∥ ℓ^{1} D_{θ} (L_{t}^{q_{2}} {\dot{B}}_{2}^{r_{2}, (2, s_{2})}) . \end{matrix}

(401)

\begin{matrix}  \end{matrix}

We will only concentrate on 400 , as the second estimate above follows from virtually identical reasoning. Expanding out the sum in

T_{1}^{(θ)}

, it suffices to show:

(402) ∥ | D x | − σ ( A ( θ 1 ) ⋅ C ( θ ) ) ∥ D θ ( L t q 3 ( B ˙ 1 r 3 , ( 2 , s 3 ) ) ) ≲ ∥ A ( θ 1 ) ∥ D θ 1 ( L t q 1 B ˙ 2 r 1 , ( 2 , s 1 ) ) ⋅ ∥ C ( θ ) ∥ D θ ( L t q 2 B ˙ 2 r 2 , ( 2 , s 2 ) ) ,

where we have the condition

θ ⩽ θ_{1}

. We now compute the norm on the right hand side of this last equation. For the remainder of the proof we fix the time variable.

This can then be dealt with at the end by integrating in time and using Hölders inequality because all of the action in the norms 387 takes place under the time integral. To proceed, we first fix the angular sector

Γ_{φ}

and the number of

(θ \nabla_{ξ})

derivatives to compute that:

\begin{matrix} {sup}_{ω} ∥ {\bar{b^{φ}}}_{θ} (θ \nabla_{ξ})^{k} | D_{x} |^{- σ} (A^{(θ_{1})} \cdot C^{(θ)}) ∥ {\dot{B}}_{1}^{r_{3}, (2, s_{3})}, \end{matrix}

\begin{matrix} ≲ & \sum_{i = 0}^{k} \sum_{φ_{1} : Γ_{φ_{1}} \subseteq 10 Γ_{φ}} {sup}_{ω} ∥ | D_{x} |^{- σ} ({\bar{b^{φ_{1}}}}_{θ} (θ \nabla_{ξ})^{k - i} A^{(θ_{1})} \cdot {\bar{b^{φ}}}_{θ} (θ \nabla_{ξ})^{i} C^{(θ)}) ∥ {\dot{B}}_{2}^{r_{3}, (2, s_{3})}, \end{matrix}

\begin{matrix} ≲ & \sum_{i = 0}^{k} \sum_{φ_{1} : Γ_{φ_{1}} \subseteq 10 Γ_{φ}} {sup}_{ω} ∥ {\bar{b^{φ_{1}}}}_{θ} (θ \nabla_{ξ})^{k - i} A^{(θ_{1})} ∥ {\dot{B}}_{2}^{r_{1}, (2, s_{1})} \cdot {sup}_{ω} ∥ {\bar{b^{φ}}}_{θ} (θ \nabla_{ξ})^{i} C^{(θ)} ∥ {\dot{B}}_{2}^{r_{2}, (2, s_{2})} . \end{matrix}

\begin{matrix}  \end{matrix}

Square summing this last expression over angular sectors, and adding over all

0 ⩽ k ⩽ 10 n

we arrive at the estimate:

∥ | D_{x} |^{- σ} (A^{(θ_{1})} \cdot C^{(θ)}) ∥ D_{θ} ({\dot{B}}_{1}^{r_{3}, (2, s_{3})}) ≲ {sup}_{φ} \sum_{k = 0}^{10 n} {sup}_{ω} ∥ {\bar{b^{φ}}}_{θ} (θ \nabla_{ξ})^{k} A^{(θ_{1})} ∥ {\dot{B}}_{2}^{r_{1}, (2, s_{1})} \cdot ∥ C^{(θ)} ∥ D_{θ} ({\dot{B}}_{2}^{r_{2}, (2, s_{2})}) .

We can now conclude 402 on account of the condition

θ ⩽ θ_{1}

which implies the trivial bound:

\begin{matrix} {sup}_{φ} \sum_{k = 0}^{10 n} {sup}_{ω} ∥ {\bar{b^{φ}}}_{θ} (θ \nabla_{ξ})^{k} A^{(θ_{1})} ∥ {\dot{B}}_{2}^{r_{1}, (2, s_{1})}, \end{matrix}

\begin{matrix} ≲ & {sup}_{φ^{'}} \sum_{k = 0}^{10 n} {sup}_{ω} ∥ {\bar{b^{φ^{'}}}}_{θ_{1}} (θ_{1} \nabla_{ξ})^{k} A^{(θ_{1})} ∥ {\dot{B}}_{2}^{r_{1}, (2, s_{1})}, \end{matrix}

\begin{matrix} ≲ & ∥ A^{(θ_{1})} ∥ D_{θ_{1}} ({\dot{B}}_{2}^{r_{1}, (2, s_{1})}) . \end{matrix}

\begin{matrix}  \end{matrix}

This completes our proof of the estimate 392 . □

We now establish the link which relates the norms 391 to the

{\dot{X}}^{s}

norms we have proved for the parametrix

Φ

Lemma 11.3 (Core decomposable estimates for the potentials

{ω A^{\pm}}

and

{^{ω} C^{\pm}}

Let the sets of potentials

{ω A^{\pm}}

and

{^{ω} C^{\pm}}

be defined as on lines 197 , 200 , and 201 above. Then one has the following family of decomposable bounds:

\begin{matrix} ∥ ω A^{\pm} ∥ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ≲ ℰ, & ∥ ω A^{\pm} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) & ≲ ℰ, \end{matrix}

(403)

\begin{matrix} ∥ \nabla_{t} ω {\underset{̲}{A}}^{\pm} ∥ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})})) & ≲ ℰ, & ∥ \nabla_{t} ω {\underset{̲}{A}}^{\pm} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 3}{2})})) & ≲ ℰ, \end{matrix}

(404)

\begin{matrix} ∥^{ω} C^{\pm} ∥ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ≲ ℰ, & ∥^{ω} C^{\pm} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) & ≲ ℰ, \end{matrix}

(405)

\begin{matrix} ∥ \nabla_{t} ω {\underset{̲}{C}}^{\pm} ∥ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})})) & ≲ ℰ, & ∥ \nabla_{t} ω {\underset{̲}{C}}^{\pm} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 3}{2})})) & ≲ ℰ, \end{matrix}

(406)

\begin{matrix}  \end{matrix}

where

p_{γ}

and

q_{γ}

are the dimensional constants from lines 193 and 278 above. Furthermore, one has the following improved null-differentiated space-time bounds:

\begin{matrix} ∥ (ω L^{\mp} ω {\underset{̲}{A}}^{\pm}, \nabla_{t} Δ^{- \frac{1}{2}} ω L^{\mp} ω {\underset{̲}{A}}^{\pm}) ∥ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) & ≲ ℰ, \end{matrix}

(407)

\begin{matrix} ∥ (ω L^{\mp} ω {\underset{̲}{C}}^{\pm}, \nabla_{t} Δ^{- \frac{1}{2}} ω L^{\mp} ω {\underset{̲}{C}}^{\pm}) ∥ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) & ≲ ℰ . \end{matrix}

(408)

\begin{matrix}  \end{matrix}

In all of these estimates, the small constant

ℰ

is the same as on lines 126d and 126f above.

Proof of the estimates 403 – 408 . With the current setup, the proof of these bounds is very simple and repeats many of things we have already done in previous sections.

Starting with the estimates 403 – 404 , we see that using the truncation condition 126c it suffices to prove the first collection, as the time differentiated versions will follow from these with little fuss. We now follow essentially the same steps used to prove estimates 192 and 300 . The only difference here is that we incorporate the square function norms contained in the

{\dot{X}}^{\frac{n - 2}{2}}

spaces. In what follows, we will in fact only prove the space-time estimate which is the second bound on the right hand side of 403 above. The first bound on this line follows from similar reasoning and is left to the reader. The first step is to define the scale decomposition (we now ignore

\pm

notation for the remainder of the proof ):

ω A = \sum_{θ} ω A^{(θ)} = \sum_{θ} ω Π_{θ} ω A .

Our goal is now to prove the following fixed time bounds which can easily be summed over and then integrated to achieve the desired goal:

∥ ω Π_{θ} ω A (t) ∥ D_{θ} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})}) ≲ θ^{γ} ∥ \underset{̲}{A} ∙ ≪ 1 (t) ∥ S {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} .

By using the square function structure contained in the definition of the various Besov and decomposable Besov norms and taking into account the low frequency truncation of the potentials

{ω A}

and

{\underset{̲}{A} ∙ ≪ 1}

, the proof of this last estimate reduces to the fixed frequency bound:

∥ ω Π_{θ} P_{μ} ω A (t) ∥ D_{θ} ({\dot{B}}_{2}^{q_{γ}, (2, \frac{n - 1}{2})}) ≲ θ^{γ} ∥ P_{μ} \underset{̲}{A} ∙ ≪ 1 (t) ∥ S {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} .

Expanding now the decomposable norm on the left hand side of this last inequality, we see that the proof reduces to showing the square function bounds:

(409) ∑ k = 0 10 n ∑ φ sup ω ∥ b φ ¯ θ ( ω ) ( θ ∇ ξ ) k ω Π θ P μ ω A ( t ) ∥ 2 B ˙ 2 q γ , ( 2 , n − 1 2 ) ≲ θ 2 γ ∑ φ : ω 0 ∈ Γ φ ∥ ω 0 Π ~ θ P μ A ̲ ∙ ≪ 1 ( t ) ∥ 2 B ˙ 2 2 ( n − 1 ) n − 3 , ( 2 , n − 1 2 ) ,

where

{\tilde{ω Π}}_{θ}

is a fixed thickening of the multiplier

ω Π_{θ}

such that one has the general quasi-idempotence bound:

\begin{matrix} {sup}_{ω} ∥ {\bar{b^{φ}}}_{θ} (ω) {\tilde{\tilde{ω Π}}}_{θ} A ∥ L^{q} & ≲ ∥ {\tilde{^{ω_{0}} Π}}_{θ} A ∥ L^{q}, & ω_{0} \in Γ_{φ}, \end{matrix}

(410)

\begin{matrix}  \end{matrix}

where

{\tilde{\tilde{ω Π}}}_{θ}

is any multiplier with frequency support contained in the frequency support of

ω Π_{θ}

whose convolution kernel satisfies comparable

L^{1}

bounds. Here the statement that

ω_{0} \in Γ_{φ}

is taken to mean that

ω_{0}

is in the center of the cap

Γ_{φ}

, the very same notion we used in the definition of the square function norms 116 above.

Using now the general bound 410 as well as the heuristic multiplier identity:

(θ \nabla_{ξ})^{k} ω Π_{θ} P_{μ} ω {\bar{Π}}^{(\frac{1}{2} - δ)} \nabla_{t, x} ω L Δ_{ω^{⊥}}^{- 1} {\underset{̲}{A}}_{∙ ≪ 1} (\partial_{ω}) (t) \approx θ^{- 1} ω Π_{θ} P_{μ} \underset{̲}{A} ∙ ≪ 1 (t),

we have the bound: 409

(L . H . S) ≲ \sum_{φ : ω_{0} \in Γ_{φ}} θ^{- 2} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{μ} \underset{̲}{A} ∙ ≪ 1 (t) ∥^{2} {\dot{B}}_{2}^{q_{γ}, (2, \frac{n - 1}{2})} .

The estimate 409 now follows from the Bernstein nested-Besov inclusion:

{\tilde{^{ω_{0}} Π}}_{θ} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) \subseteq θ^{1 + γ} {\tilde{^{ω_{0}} Π}}_{θ} ({\dot{B}}_{2}^{q_{γ}, (2, \frac{n - 1}{2})}) .

Our next goal is to pass the estimates 403 – 404 on to the non-linear set of potentials

(^{ω} C_{0}, {ω \underset{̲}{C}})

. Since it is a-priori not clear that these functions have finite

D (L^{q} ({\dot{B}}_{2}^{r, (2, s)}))

norms, we construct the bounds from scratch by running a contraction mapping argument in these spaces on the Picard iterates of the systems 200 and 201 . To guarantee convergence of the resulting sequences, we make use specific instances of the general embedding 392 . Our general strategy here is the following. We first establish the non-time differentiated estimates 405 for the spatial potentials

{ω \underset{̲}{C}}

. Then, assuming the non-time differentiated versions of the improved estimates 407 – 408 (whose proof relies only on the previously established bounds) we prove the time-differentiated estimates 406 . Having established these, we then prove the estimates 405 for the temporal potential

^{ω} C_{0}

. Our next order of business is to prove the non-time differentiated versions of the improved null-differentiated bounds 407 – 408 . Finally, armed with all of this, we show the version of the estimates 407 – 408 which contain the extra time derivatives. In what follows, we will only list out the various bilinear estimates which yield the desired bounds. Since these are almost identical to many of the estimates we have dealt with in the past sections, we leave the verification of the numerology to the reader.

To prove the non-time differentiated versions of 405 for the collection

{ω \underset{̲}{C}}

we use the pair of bounds:

\begin{matrix} \nabla_{x} Δ^{- 1} : D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ↪ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})), \end{matrix}

(411)

\begin{matrix} \nabla_{x} Δ^{- 1} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ↪ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) . \end{matrix}

(412)

\begin{matrix}  \end{matrix}

To establish the first bound on line 406 we first differentiate the Hodge system 200 with respect to time and then apply the embedding:

\begin{matrix} \nabla_{x} Δ^{- 1} : D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) ↪ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})})) . \end{matrix}

(413)

To prove the time integrated bound which is the second on line 406 we decompose the vector-field

\nabla_{t}

into

\pm ω \underset{̲}{L} \mp ω \cdot \nabla_{x}

just as we did starting on line 328 above. Then, modulo estimates of the form 412 , and assuming that we have shown the non-time differentiated versions of 407 – 408 we may reduce things to the embedding:

\begin{matrix} \nabla_{x} Δ^{- 1} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) ↪ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) . \end{matrix}

(414)

Our next step is to prove the estimates 405 for the temporal potential

^{ω} C_{0}

. By an inspection of the elliptic equation 201 , we see that modulo embeddings of the form 411 – 412 and the bounds we have already shown, we only need to establish things for the term

\nabla_{t} Δ^{- 1} ([ω \underset{̲}{A}, ω \underset{̲}{C}])

. Again expanding the time derivative as

\pm ω \underset{̲}{L} \mp ω \cdot \nabla_{x}

and distributing the

ω \underset{̲}{L}

derivative via the Leibniz rule, we are reduced knowing the following (which just represent another form of the embeddings 413 – 414 ):

\begin{matrix} Δ^{- 1} : D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ↪ D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) . \end{matrix}

(415)

\begin{matrix} Δ^{- 1} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ↪ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 1}{2})})) . \end{matrix}

(416)

\begin{matrix}  \end{matrix}

Finally, we wish to show the improved bounds 407 – 408 . We work recursively here. First, we assume that the non-time differentiated versions of these estimates are valid. By the truncation condition 126c , we see that the proof of the estimate 407 with the extra operator

\partial_{t} Δ^{- \frac{1}{2}}

follows from the proof of this estimate without that operator. Thus, our aim is to establish the estimate 408 in the presence of the extra

\partial_{t} Δ^{- \frac{1}{2}}

derivatives. Applying this operator to the

ω \underset{̲}{L}

differentiated Hodge system 200 , we see that things can be handled with the help of the two bilinear inclusions:

\begin{matrix} \nabla_{x} Δ^{- \frac{3}{2}} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 5}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) & ↪ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) . \end{matrix}

(417)

\begin{matrix} \nabla_{x} Δ^{- \frac{3}{2}} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) \cdot D (L_{t}^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 4}{2})})) & ↪ D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) . \end{matrix}

(418)

\begin{matrix}  \end{matrix}

Notice that the numerology in these last two estimates is a bit tight in

H i g h \times H i g h

frequency regime. In particular, the condition 394 only has room of about

1 / 10

n = 6

dimensions. The next item on the stack for us is the estimates 407 – 408 without the extra derivative

\nabla_{t} Δ^{- 1}

. Assuming for the moment that this is true for 407 , we see that the proof of 408 in this case follows easily from

ω \underset{̲}{L}

differentiating the Hodge system 200 and applying a less singular version of the estimate 417 .

Therefore, we are now at the point where everything has been reduced to the proof of the first estimate 407 . To do this we apply the following instance of the identity 336 :

\begin{matrix} ω \underset{̲}{L} ω \underset{̲}{A} = \nabla_{x} ω {\bar{Π}}^{(\frac{1}{2} - δ)} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) - \nabla_{x} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) . \end{matrix}

(419)

The estimate 407 for the first term on the right hand side of 419 is very simple and left to the reader. It follows from steps similar to the proof we gave above of the estimates 403 . Notice that there are no singular angular factors here so there is a lot of room in this estimate if one takes into account the extra Coulomb savings 196 .

We are now trying to prove 407 for the second term on the right hand side of 419 which we decompose into angular scales as:

\begin{matrix} \nabla_{x} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) = \sum_{θ} \nabla_{x} ω Π_{θ} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) . \end{matrix}

(420)

By the definition of the norm 391 and using some dyadic summing, we see that it suffices to bound and then sum over the fixed time-fixed frequency expressions:

(421) ∥ ∇ x ω Π θ ω Π ¯ ( 1 2 − δ ) Δ ω ⊥ − 1 P ~ ( [ B , H ] ) ( ∂ ω ) ( t ) ∥ D ( B ˙ 2 , 10 n p γ , ( 2 , n − 3 2 ) ) , = ( ∑ k = 0 10 n ∑ φ sup ω ∥ b φ ¯ θ ( θ ∇ ξ ) k ∇ x ω Π θ ω Π ¯ ( 1 2 − δ ) Δ ω ⊥ − 1 P ~ ( [ B , H ] ) ( ∂ ω ) ( t ) ∥ 2 B ˙ 2 , 10 n p γ , ( 2 , n − 3 2 ) ) 1 2 .

For each fixed value of

θ

, and for each fixed spatial frequency

μ

we have the following heuristic multiplier bound one the Coulomb savings are taken into account:

(θ \nabla_{ξ})^{k} \nabla_{x} ω Π_{θ} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} P_{μ} \tilde{P} ([B, H]) (\partial_{ω}) (t) \approx (μ θ)^{- 1} ω Π_{θ} P_{μ} P_{∙ ≪ 1} ([B, H]) (t) .

Taking this into account, and using the same multiplier reductions used to prove 409 above, we have the inequality:

421

\begin{matrix} (L . H . S .) & ≲ θ^{- 1} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{∙ ≪ 1} ([B, H]) (t) ∥^{2} {\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 5}{2})})}^{\frac{1}{2}}, \end{matrix}

\begin{matrix} ≲ θ^{γ} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} ([B, H]) (t) ∥^{2} {\dot{B}}_{2}^{2, (2, \frac{n - 5}{2})})}^{\frac{1}{2}}, \end{matrix}

\begin{matrix} ≲ θ^{γ} ∥ ([B, H]) (t) ∥ {\dot{B}}_{2}^{2, (2, \frac{n - 5}{2})} . \end{matrix}

(422)

\begin{matrix}  \end{matrix}

This last set of inequalities results from the localized Besov inclusion:

{\tilde{^{ω_{0}} Π}}_{θ} ({\dot{B}}_{2}^{2, (2, \frac{n - 5}{2})}) \subseteq θ^{1 + γ} {\tilde{^{ω_{0}} Π}}_{θ} ({\dot{B}}_{2}^{p_{γ}, (2, \frac{n - 5}{2})}),

an orthogonality argument. Integrating the bound 422

L^{2}

in time, and using some dyadic summing, we see that our proof of 407 is reduced to showing the following:

∥ [B, H] ∥ L_{t}^{2} ({\dot{H}}^{\frac{n - 5}{2}}) ≲ ℰ .

Keeping in mind the bootstrapping estimates 126f , we see that this last line is simply a more singular version of the embedding 136 shown above. In the

L o w \times H i g h

case the proof follows from 137 . In the

H i g h \times L o w

case there is even more room and one can again use something similar to 137 . In the

H i g h \times H i g h

case we use the embedding:

P_{λ} (L^{2} ({\dot{B}}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})})) \cdot P_{λ} (L^{\infty} ({\dot{H}}^{\frac{n - 4}{2}})) ↪ {(\frac{μ}{λ})}^{σ} P_{μ} (L^{2} ({\dot{H}}^{\frac{n - 5}{2}})),

where

σ = n (\frac{n - 2}{n - 1}) - \frac{5}{2}

. This last bound follows from the general frequency localized embedding 52 . Note that in dimensions

6 ⩽ n

we have the necessary condition

0 < σ

. This completes our proof of the estimate 407 and therefore our demonstration of Lemma 11.3 . □

11.1 Proof of the Square Sum Strichartz Estimates

We now come to what is perhaps the linchpin of our argument. These are the square sum structure estimates contained in 178a . With the current machinery in hand, these will be quite easy to establish. At the heart of things is whether the angular multipliers

ω Π_{θ}

“commute with the dynamics” of the covariant wave operator

□_{\underset{̲}{A} ∙ ≪ 1}

. At a quick first glance using Duhamel's principle, this seems to be connected with whether one can control the commutator

[ω Π_{θ}, □_{\underset{̲}{A} ∙ ≪ 1}]

. Unfortunately, it is not too difficult to see that one runs into serious difficulties as soon as

θ ≪ 1

This is not the end of the story however, because it turns out that modulo a very nice error term, one can control the commutator with the “integrated” form the equations

[ω Π_{θ}, Φ]

. This shows one of the deep advantages to working with the parametrix as opposed to dealing directly with the equations themselves¹⁰ . We proceed as follows.

Our first step is to fix a scale

θ

and run a cap decomposition

S^{n - 1} = \cup_{φ} Γ_{φ}

. The next thing we do is to decompose the parametrix

Φ (\hat{f})

into a sum of three pieces:

\begin{matrix} Φ (\hat{f}) & = \int_{R^{n}} e^{2 π i λ ω u} ω g_{∙ ≪ θ}^{- 1} \hat{f} (λ ω) ω g_{∙ ≪ θ} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω \end{matrix}

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u} ω g_{∙ ≪ θ}^{- 1} \hat{f} (λ ω) ω g_{θ ≲ ∙} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω \end{matrix}

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u} ω g_{θ ≲ ∙}^{- 1} \hat{f} (λ ω) ω g χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

\begin{matrix} = I_{1} + I_{2} + I_{3} . \end{matrix}

\begin{matrix}  \end{matrix}

Here:

ω g = ω g_{∙ ≪ θ} + ω g_{θ ≲ ∙} = P_{∙ ≪ θ} (ω g) + P_{θ ≲ ∙} (ω g),

is a low-high frequency decomposition of the group element

ω g

. We define the decomposition for

ω g^{- 1}

similarly. Our goal is now to prove the following three estimates:

\begin{matrix} \sum_{φ : ω_{0} \in Γ_{φ}} ∥^{ω_{0}} Π_{θ} P_{1} (I_{1}) ∥^{2} L^{2} (L^{\frac{2 (n - 1)}{n - 3}}) & ≲ ∥ \hat{f} ∥^{2} L^{2}, \end{matrix}

(423)

\begin{matrix} \sum_{φ : ω_{0} \in Γ_{φ}} ∥^{ω_{0}} Π_{θ} P_{1} (I_{2}) ∥^{2} L^{2} (L^{\frac{2 (n - 1)}{n - 3}}) & ≲ ∥ \hat{f} ∥^{2} L^{2}, \end{matrix}

(424)

\begin{matrix} \sum_{φ : ω_{0} \in Γ_{φ}} ∥^{ω_{0}} Π_{θ} P_{1} (I_{3}) ∥^{2} L^{2} (L^{\frac{2 (n - 1)}{n - 3}}) & ≲ ∥ \hat{f} ∥^{2} L^{2} . \end{matrix}

(425)

\begin{matrix}  \end{matrix}

The proof of the first bound 423 follows easily from the plain endpoint Strichartz estimate we have already established. To see this, first notice that for a fixed angle one has the identity:

^{ω_{0}} Π_{θ} P_{1} \int_{R^{n}} e^{2 π i λ ω u} ω g_{∙ ≪ θ}^{- 1} \hat{f} (λ ω) ω g_{∙ ≪ θ} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω =^{ω_{0}} Π_{θ} P_{1} \int_{R^{n}} e^{2 π i λ ω u} ω g_{∙ ≪ θ}^{- 1} {\bar{b^{φ^{'}}}}_{θ} (ω) \hat{f} (λ ω) ω g_{∙ ≪ θ} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω,

where

Γ_{φ} \subset \frac{1}{2} Γ_{φ^{'}}

for some fixed thickening

Γ_{φ^{'}}

of the spherical cap that

ω_{0} \in Γ_{φ}

That this is the case follows easily from the fact that the Fourier transform of the function:

e^{2 π i x \cdot ξ} ω g_{∙ ≪ θ}^{- 1} (x) \hat{f} (λ ω) ω g_{∙ ≪ θ} (x),

is a tempered distribution with support contained in an

O (c θ)

neighborhood of the point

ξ

for some small constant

c

, uniform in the value of

θ

. Using now the boundedness of the multiplier

^{ω_{0}} Π_{θ} P_{1}

, we only need to establish that the truncated parametrix

I_{1}

obeys the endpoint Strichartz estimate. We reduce this claim further by writing the integral in the form:

I_{1} = \int_{R n} \int_{R n} K^{P_{∙ ≪ θ}} (w) K^{P_{∙ ≪ θ}} (y) \int_{R^{n}} e^{2 π i λ ω u} ω g_{w}^{- 1} \hat{f} (λ ω) ω g_{y} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω d w d y,

where

ω g_{w}^{- 1} (x) = ω g^{- 1} (x - w)

and

ω g_{y} (x) = ω g (x - y)

denote the translated group elements. Using the fact that the convolution kernel

K^{P_{∙ ≪ θ}}

has

O (1)

L^{1}

norm uniform in the value of

θ

, we are left with establishing the

L^{2}

and dispersive estimates of the previous sections for more general kernels of the form:

\begin{matrix} Φ_{g_{1}, g_{2}} (\hat{f}) = \int_{R^{n}} e^{2 π i λ ω u} ω g_{1}^{- 1} \hat{f} (λ ω) ω g_{2} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

(426)

where

ω g_{1}

and

ω g_{2}

are unrelated group elements which are generated from Hodge systems and connections of the form 197 – 201 , which satisfy the general requirements 126 and 158 for

λ = 1

. This indeed turns out to be the case, and the key observation is that by using the identity 14 , all of the

T T^{*}

arguments go through just as they did in previous sections.

It remains for us to prove the bounds 424 – 425 . These are essentially identical to each other so we concentrate on the proof of the first of these, leaving the other one to the reader. By an application of Bernstein's inequality and orthogonality, we see that it suffices for us to show the estimate:

\begin{matrix} ∥ θ I_{2} ∥ L^{2} (L^{2}) ≲ ∥ \hat{f} ∥ L^{2} . \end{matrix}

(427)

At a heuristic level, this estimate is true because one has the identity

θ ω g_{θ ≲ ∙} \approx \nabla_{x} ω g = g ω \underset{̲}{C}

. And we see that in this case things would follow easily from the

D (L^{2} (L^{\infty}))

contained in the estimates 405 . To implement this in a rigorous way, we derive the following elliptic equation for

ω g_{θ ≲ ∙}

based on the formulas 199 :

\begin{matrix} ω g_{θ ≲ ∙} & = \nabla^{i} Δ^{- 1} P_{θ ≲ ∙} (ω g ω {\underset{̲}{C}}_{i}), \end{matrix}

\begin{matrix} = \sum_{λ : θ ≲ λ} \nabla^{i} Δ^{- 1} P_{λ} (ω g ω {\underset{̲}{C}}_{i}) . \end{matrix}

\begin{matrix}  \end{matrix}

If we denote the (vector) kernel of operator

\nabla_{x} Δ^{- 1} P_{λ}

K_{λ}^{\nabla Δ^{- 1}}

, then we have the uniform

L^{1}

bounds:

∥ K_{λ}^{\nabla Δ^{- 1}} ∥ L^{1} ≲ λ^{- 1} .

Using this and taking into account the previous reductions used in the proof of estimate 423 above we easily arrive at the bound:

\begin{matrix} ∥ θ I_{2} ∥ L^{2} (L^{2}) & ≲ \sum_{λ : θ ≲ λ} (\frac{θ}{λ}) {sup}_{w, y} ∥ {\tilde{I}}_{w, y} ∥ L^{2} (L^{2}), \end{matrix}

\begin{matrix} ≲ {sup}_{w, y} ∥ {\tilde{I}}_{w, y} ∥ L^{2} (L^{2}), \end{matrix}

\begin{matrix}  \end{matrix}

where

{\tilde{I}}_{w, y}

is the family of translated kernels:

\begin{matrix} {\tilde{I}}_{w, y} = \int_{R^{n}} e^{2 π i λ ω u} ω g_{w}^{- 1} \hat{f} (λ ω) ω g_{y} ω {\underset{̲}{C}}_{y} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω d w d y, \end{matrix}

(428)

where we have also now set

ω {\underset{̲}{C}}_{y} (x) = ω \underset{̲}{C} (x - y)

. Using the decomposable estimate 390 , we now have that:

∥ {\tilde{I}}_{w, y} ∥ L^{2} (L^{2}) ≲ ∥ ω {\underset{̲}{C}}_{y} ∥ D (L^{2} (L^{\infty})) \cdot ∥ I_{w, y} ∥ L^{\infty} (L^{2}),

where the integral

I_{w, y}

is the same as

{\tilde{I}}_{w, y}

but with the matrix

ω {\underset{̲}{C}}_{y}

removed.

Using now the nesting:

\begin{matrix} D (L^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) \subseteq D (L^{2} (L^{\infty})), \end{matrix}

(429)

the estimate 405 , and the remarks made above about general kernels of the form 426 , we have the pair of estimates:

\begin{matrix} ∥ ω {\underset{̲}{C}}_{y} ∥ D (L^{2} (L^{\infty})) & ≲ ℰ, & ∥ I_{w, y} ∥ L^{\infty} (L^{2}) & ≲ ∥ \hat{f} ∥ L^{2}, \end{matrix}

\begin{matrix}  \end{matrix}

uniform in the values of

w, y

. This is enough to prove the estimate 424 . This completes our proof of the square sum Strichartz estimates contained in 178a .

¹⁰ This also seems to have far reaching philosophical consequences for how one should proceed in lower dimensions. Specifically, it seems to suggest that the correct “covariant” $X^{s, θ}$ space should be defined in terms of the parametrix and not in terms of the symbol of the covariant equation.

11.2 Proof of the Differentiated Strichartz Estimates 178b – 178c

To wrap things up for this overall section, we prove the estimates 178b – 178c .

This will follows easily from the general list of decomposable estimates contained in Lemma 11.3 . In what follows, we will only bother to prove the time differentiated estimate 178c . The proof of the gradient estimate 178b follows from identical reasoning and is left to the reader (in fact, one only need apply the plain Strichartz estimates shown in previous sections followed by a

D (L^{\infty} (L^{\infty}))

estimate for the spatial potentials

{ω \underset{̲}{C}}

). Time differentiating the parametrix

Φ^{\pm} (\hat{f})

we see that:

\begin{matrix} \nabla_{t} Φ^{\pm} (\hat{f}) & = \int_{R^{n}} (\pm 2 π i | ξ |) e^{2 π i λ ω u^{\pm}} ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm} χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω \end{matrix}

\begin{matrix} + \int_{R^{n}} e^{2 π i λ ω u^{\pm}} [ω g_{\pm}^{- 1} \hat{f} (λ ω) ω g_{\pm},^{ω} C_{0}^{\pm}] χ_{(\frac{1}{2}, 2)} (λ) λ^{n - 1} d λ d ω, \end{matrix}

\begin{matrix} = Φ^{\pm} (\pm 2 π i | ξ | \hat{f}) + \tilde{I} . \end{matrix}

\begin{matrix}  \end{matrix}

Therefore, our task is to show the pair of estimates:

\begin{matrix} ∥ P_{1} {\tilde{I}}_{1} ∥ L^{2} (S L^{\frac{2 (n - 1)}{n - 3}}) & ≲ ℰ \cdot ∥ \hat{f} ∥ L^{2}, \end{matrix}

(430)

\begin{matrix} ∥ P_{1} {\tilde{I}}_{1} ∥ L^{\infty} (L^{2}) & ≲ ℰ \cdot ∥ \hat{f} ∥ L^{2} . \end{matrix}

(431)

\begin{matrix}  \end{matrix}

The estimate 430 follows from essentially identical reasoning to that employed in the proof of estimates 424 – 425 above. The main point is to drop to

L^{2} (L^{2})

via Bernstein, and then use the

D (L^{2} (L^{\infty}))

estimate for the potential

^{ω} C_{0}

contained on line 405 above. The proof of the second estimate 431 above follows easily from the

D (L^{\infty} (L^{\infty}))

estimate for

^{ω} C_{0}

contained on line 405 above. Specifically, one has the nesting:

D (L^{\infty} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 2}{2})})) \subseteq D (L^{\infty} (L^{\infty})) .

This completes our demonstration of 178b – 178c and ends this section.

12 Completion of the proof: Controlling the $L^{1} (L^{2})$ Norm of the Differentiated Parametrix

Our final task here is to prove the estimate 178e which guarantees that our parametrix is a good approximation the covariant wave equation

□_{\underset{̲}{A} ∙ ≪ 1}

. This essentially boils down to applying the estimates 403 – 408 to the various error terms listed on the right hand side of equation 203 above. We will prove the desired estimates for each of these terms separately.

$∙$ Decomposing the term $\underset{̲}{A} ∙ ≪ 1 (ω L^{\mp}) -^{ω} C^{\pm} (ω L^{\mp})$

This represents the worst error term which comes out of our approximation, as well as the main “renormalization” which the parametrix creates. In what follows we will eliminate the

\pm

notation on favor of the

ω \underset{̲}{L}

notation introduced on line 328 above. Using this convention, a short computation involving the formulas 200 – 201 and the structure equation 126e yields the identity:

\begin{matrix} \underset{̲}{A} ∙ ≪ 1 (ω \underset{̲}{L}) -^{ω} C (ω \underset{̲}{L}) \end{matrix}

\begin{matrix} = & (I - ω {\bar{Π}}^{(\frac{1}{2} - δ)}) \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) + ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) \end{matrix}

\begin{matrix} + ω \underset{̲}{L} Δ^{- 1} ([ω \underset{̲}{A}, ω \underset{̲}{C}]) - d^{*} Δ^{- 1} [ω \underset{̲}{C},^{ω} C (ω \underset{̲}{L})], \end{matrix}

\begin{matrix} = & T_{1} + T_{2} + T_{3} + T_{4} . \end{matrix}

(432)

\begin{matrix}  \end{matrix}

Our goal is prove the following four estimates:

\begin{matrix} ∥ T_{1} ∥ D (L^{2} (L^{n - 1})) & ≲ ℰ, & ∥ T_{2} ∥ D (L^{1} (L^{\infty})) & ≲ ℰ, \end{matrix}

(433)

\begin{matrix} ∥ T_{3} ∥ D (L^{1} (L^{\infty})) & ≲ ℰ, & ∥ T_{4} ∥ D (L^{1} (L^{\infty})) & ≲ ℰ . \end{matrix}

(434)

\begin{matrix}  \end{matrix}

To prove the first estimate on line 433 , we see from the decomposable version of the Besov nesting 44 that is suffices to prove the following:

∥ (I - ω {\bar{Π}}^{(\frac{1}{2} - δ)}) \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) ∥ D (L^{2} ({\dot{B}}_{2}^{n - 1, (2, \frac{n (n - 3)}{2 (n - 1)})})) ≲ ℰ .

By the square sum nature of the Besov and decomposable norms, and keeping in mind the Besov version of the endpoint Strichartz estimate contained in the bootstrapping estimate 126d , we see that it suffices to prove this estimate at fixed frequency. Thus, we are trying to prove that:

\begin{matrix} ∥ (I - ω {\bar{Π}}^{(\frac{1}{2} - δ)}) P_{μ} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) ∥ D (L^{2} (L^{n - 1})) ≲ ∥ P_{μ} \underset{̲}{A} ∙ ≪ 1 ∥ L^{2} (S {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) . \end{matrix}

(435)

Decomposing the term on the left hand side of this expression into all dyadic angular regions spread from the direction

ω

this is further reduced to showing that:

∥ ω Π_{θ} (I - ω {\bar{Π}}^{(\frac{1}{2} - δ)}) P_{μ} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) ∥ D_{θ} (L^{2} (L^{n - 1})) ≲ θ^{γ} ∥ P_{μ} \underset{̲}{A} ∙ ≪ 1 ∥ L^{2} (S {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}) .

Notice that we are only trying to show this for values

θ ≲ μ^{\frac{1}{2} - δ}

. Further computing the term on the left hand side of this last expression, and applying the heuristic multiplier bound (also using the Coulomb savings 196 ):

(θ \nabla_{ξ}^{k}) ω Π_{θ} (I - ω {\bar{Π}}^{(\frac{1}{2} - δ)}) P_{μ} \underset{̲}{A} ∙ ≪ 1 (\partial_{ω}) \approx θ ω Π_{θ} P_{μ} \underset{̲}{A} ∙ ≪ 1 .

Plugging this into the definition of the norm

D_{θ} (L^{2} (L^{n - 1}))

, using the multiplier-sum reductions employed in the proof of the inequality 409 , and reverting back to Besov notation we have the inequality sequence involving Bernstein's inequality 56 and a simple index manipulation:

435

\begin{matrix} (L . H . S .) & ≲ θ {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{μ} \underset{̲}{A} ∙ ≪ 1 ∥^{2} L^{2} ({\dot{B}}_{2}^{n - 1, (2, \frac{n (n - 3)}{2 (n - 1)})}))}^{\frac{1}{2}}, \end{matrix}

\begin{matrix} ≲ θ^{\frac{n - 3}{2}} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{μ} \underset{̲}{A} ∙ ≪ 1 ∥^{2} L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n (n - 3)}{2 (n - 1)})}))}^{\frac{1}{2}}, \end{matrix}

\begin{matrix} ≲ θ^{\frac{n - 3}{2}} μ^{\frac{- n - 1}{2 (n - 1)}} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{μ} \underset{̲}{A} ∙ ≪ 1 ∥^{2} L^{2} ({\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})}))}^{\frac{1}{2}} . \end{matrix}

\begin{matrix}  \end{matrix}

Estimate 435 now follows from the fact that:

θ^{\frac{n - 3}{2}} μ^{\frac{- n - 1}{2 (n - 1)}} ≲ θ^{γ},

which is a consequence of the truncation condition

θ ≲ μ^{\frac{1}{2} - δ}

and the fact that

6 ⩽ n

, and the fact that we have chosen

δ, γ

according to 190 . This ends our proof of the first estimate on line 433 .

Our next step is to prove the second estimate on line 433 above. We will show the somewhat more regular estimate:

\begin{matrix} ∥ ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) ∥ D (L^{1} ({\dot{B}}_{1}^{\infty, (n, \frac{n}{2})})) ≲ ℰ . \end{matrix}

(436)

Decomposing the term inside the norm on the left hand side of this last inequality into dyadic angular scales, applying the definition of the fixed scale decomposable norms

D_{θ} (L^{1} ({\dot{B}}_{1}^{\infty, (n, \frac{n}{2})}))

, using the (fixed time) fixed frequency heuristic multiplier bound (which again takes into account the savings 196 ):

(θ \nabla_{ξ})^{k} ω Π_{θ} ω {\bar{Π}}^{(\frac{1}{2} - δ)} Δ_{ω^{⊥}}^{- 1} P_{λ} \tilde{P} ([B, H]) (\partial_{ω}) \approx θ^{- 1} λ^{- 2} ω Π_{θ} P_{λ} ([B, H]),

expanding the resulting expression into a trichotomy, applying the multiplier square sum reduction used previously in the proof of estimate 409 above, and keeping in mind the bootstrapping structure estimates 126f , we see that the estimate 436 reduces to the demonstration of the following three fixed time bounds:

\begin{matrix} \sum_{λ, μ_{i} : μ_{1} ≪ μ_{2} \sim λ} λ^{- 2} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{λ} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥^{2} L^{\infty})}^{\frac{1}{2}} \end{matrix}

(437)

\begin{matrix} ≲ θ^{1 + γ} ∥ B (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} \cdot ∥ H (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}, \end{matrix}

(438)

\begin{matrix} \sum_{λ, μ_{i} : μ_{2} ≪ μ_{1} \sim λ} λ^{- 2} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{λ} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥^{2} L^{\infty})}^{\frac{1}{2}} \end{matrix}

(439)

\begin{matrix} ≲ θ^{1 + γ} ∥ B (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} \cdot ∥ H (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}, \end{matrix}

(440)

\begin{matrix} \sum_{λ, μ_{i} : λ ≲ μ_{1} \sim μ_{2}} λ^{- 2} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{λ} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥^{2} L^{\infty})}^{\frac{1}{2}} \end{matrix}

(441)

\begin{matrix} ≲ θ^{1 + γ} ∥ B (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} \cdot ∥ H (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})} . \end{matrix}

(442)

\begin{matrix}  \end{matrix}

We begin with the proof of the first estimate 438 . This is the most singular of the three. Fixing all of the spatial frequencies on the left hand side of this bound, we see that by an application of Young's inequality, it suffices to prove the following refinement:

(443) λ − 2 ( ∑ φ : ω 0 ∈ Γ φ ∥ ω 0 Π ~ θ P λ ( [ P μ 1 ( B ) ( t ) , P μ 2 ( H ) ( t ) ] ) ∥ 2 L ∞ ) 1 2 ≲ ( μ 1 μ 2 ) γ θ 1 + γ ∥ P μ 1 ( B ) ( t ) ∥ B ˙ 2 2 ( n − 1 ) n − 3 , ( 2 , n − 1 2 ) ⋅ ∥ P μ 2 ( H ) ( t ) ∥ B ˙ 2 2 ( n − 1 ) n − 3 , ( 2 , n − 3 2 ) .

This bound is scale invariant, so we may assume that

1 = λ \sim μ_{2}

. To aid in the demonstration, we introduce the auxiliary index:

{\tilde{r}}_{γ} = \frac{2 n (n - 1)}{n^{2} - 2 n - 1 - 2 (n - 1) γ} .

Notice that this has been chosen precisely so that one has the identity:

γ = \frac{1}{2} + n (\frac{n - 3}{2 (n - 1)} - \frac{1}{{\tilde{r}}_{γ}}),

so that ultimately we can make a reference to the fixed frequency bound 53 . The problem here is that we have

2 < {\tilde{r}}_{γ}

(in any dimension), so we are going to run into orthogonality issues in the square-sum on the left hand side of 443 . This will end up costing some extra powers of

θ^{- 1}

, but luckily the Bernstein inequality will more than make up for this. Applying Bernstein to each term in the sum on the left hand side of 443 we arrive at the bound: 443

\begin{matrix} (L . H . S .) ≲ θ^{{\frac{n - 1}{\tilde{r}}}_{γ}} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{1} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥^{2} L^{{\tilde{r}}_{γ}})}^{\frac{1}{2}} . \end{matrix}

(444)

To get rid of the square-sum on the right hand side of this last expression, we introduce the following map from

L^{p} (R n)

ℓ^{2} (L^{p} (R n))

T^{θ} (A) = ({\tilde{^{ω_{1}} Π}}_{θ} P_{1} (A), \dots, {\tilde{^{ω_{N}} Π}}_{θ} P_{1} (A)),

where

(ω_{1}, \dots, ω_{N})

is some ordering of the

Γ_{φ}

spherical cap “base-points”. Notice that there are

N \sim θ^{1 - n}

of these. By orthogonality, and using the uniform boundedness of the multipliers

ω Π_{θ} P_{1}

L^{\infty}

we have the pair of estimates:

\begin{matrix} ∥ T^{θ} (A) ∥ ℓ^{2} (L^{2}) & ≲ ∥ P_{1} (A) ∥ L^{2}, \end{matrix}

\begin{matrix} ∥ T^{θ} (A) ∥ ℓ^{2} (L^{\infty}) & ≲ θ^{\frac{1 - n}{2}} ∥ P_{1} (A) ∥ L^{\infty} . \end{matrix}

\begin{matrix}  \end{matrix}

By interpolating these to bounds in the pair of spaces

(ℓ^{2} (L^{2}), ℓ^{2} (L^{\infty}))

and

(L^{2}, L^{\infty})

(see [1] ), we have the bound:

∥ T^{θ} (A) ∥ ℓ^{2} (L^{{\tilde{r}}_{γ}}) ≲ θ^{(1 - n) (\frac{1}{2} - \frac{1}{{\tilde{r}}_{γ}})} ∥ P_{1} (A) ∥ L^{{\tilde{r}}_{γ}} .

Plugging this last estimate into the right hand side of 444 above, and finally applying generic fixed frequency estimate 53 we have that:

443

\begin{matrix} (L . H . S .), \end{matrix}

\begin{matrix} ≲ & θ^{(n - 1) (\frac{2}{r_{γ}} - \frac{1}{2})} ∥ P_{1} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥ L^{{\tilde{r}}_{γ}}, \end{matrix}

\begin{matrix} ≲ & {(\frac{μ_{1}}{μ_{2}})}^{γ} θ^{(n - 1) (\frac{2}{r_{γ}} - \frac{1}{2})} ∥ P_{μ_{1}} (B) (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} \cdot ∥ P_{μ_{2}} (H) (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})} . \end{matrix}

\begin{matrix}  \end{matrix}

The estimate 443 now follows from the bound:

θ^{(n - 1) (\frac{2}{r_{γ}} - \frac{1}{2})} ≲ θ^{1 + γ},

which holds in dimensions

6 ⩽ n

. We leave the verification of this to the reader.

This ends our demonstration of the

L o w \times H i g h

frequency estimate 438 . Notice that the second estimate 440 is simply a less singular version of this. In fact, repeating the above procedure, we see that in that case there is an extra factor of

(\frac{μ_{2}}{μ_{1}})

in the analog of the fixed frequency bound 443 .

We have now reduced the proof of the second estimate on line 433 to the

H i g h \times H i g h

interaction bound 442 . By applying the

L^{\infty} \to L^{2}

version of Bernstein, using orthogonality, and then applying the general bound 52 , we have the fixed frequency estimate:

\begin{matrix} λ^{- 2} {(\sum_{φ : ω_{0} \in Γ_{φ}} ∥ {\tilde{^{ω_{0}} Π}}_{θ} P_{λ} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥^{2} L^{\infty})}^{\frac{1}{2}}, \end{matrix}

\begin{matrix} ≲ & θ^{\frac{n - 1}{2}} λ^{\frac{n - 4}{2}} ∥ P_{λ} ([P_{μ_{1}} (B) (t), P_{μ_{2}} (H) (t)]) ∥ L^{2}, \end{matrix}

\begin{matrix} ≲ & {(\frac{λ}{μ_{1}})}^{σ} θ^{\frac{n - 1}{2}} ∥ P_{μ_{1}} (B) (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 1}{2})} \cdot ∥ P_{μ_{2}} (H) (t) ∥ {\dot{B}}_{2}^{\frac{2 (n - 1)}{n - 3}, (2, \frac{n - 3}{2})}, \end{matrix}

\begin{matrix}  \end{matrix}

where

0 < σ = n (\frac{n - 3}{n - 1}) - 2

. By summing this last line and then applying Cauchy-Schwartz, we easily arrive at the bound 442 .

To finish this subsection, we only need to prove the two estimates on line 434 above. To show the first estimate involving the

T_{3}

term, we simply expand the

ω \underset{̲}{L}

derivative into the product via the Leibniz rule, and then use the decomposable bounds 403 and 405 and 407 – 408 in conjunction with the following instance of the bilinear decomposable estimate 392 :

Δ^{- 1} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{p_{γ}, (2, \frac{n - 3}{2})})) \cdot D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) ↪ D (L_{t}^{1} ({\dot{B}}_{1}^{\infty, (2, \frac{n}{2})})) .

To show the second bound on line 434 , we again use the estimates 403 and 405 , this time in conjunction with:

\nabla_{x} Δ^{- 1} : D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) \cdot D (L_{t}^{2} ({\dot{B}}_{2, 10 n}^{q_{γ}, (2, \frac{n - 1}{2})})) ↪ D (L_{t}^{1} ({\dot{B}}_{1}^{\infty, (2, \frac{n}{2})})) .

This completes our decomposable estimates for the error term

\underset{̲}{A} ∙ ≪ 1 (ω L^{\mp}) -^{ω} C^{\pm} (ω L^{\mp})

$∙$ Decomposing the term $D_{α}^{\underset{̲}{A} ∙ ≪ 1} {(^{ω} C^{\pm})}^{α}$

Again dropping the

\pm

notation and using the equations 200 – 201 and the identity 328 as well as the structure equation 126e , we can write this as:

\begin{matrix} D_{α}^{\underset{̲}{A} ∙ ≪ 1} (^{ω} C)^{α} & = - ω {\bar{Π}}^{(\frac{1}{2} - δ)} ω L Δ_{ω^{⊥}}^{- 1} \tilde{P} ([B, H]) (\partial_{ω}) \end{matrix}

\begin{matrix} + (\pm ω \underset{̲}{L} \mp ω \cdot \nabla_{x}) \nabla_{t} Δ^{- 1} [ω \underset{̲}{A}, ω \underset{̲}{C}] + \nabla_{t} d^{*} Δ^{- 1} [^{ω} C_{0}, ω \underset{̲}{C}] \end{matrix}

\begin{matrix} - [ω \underset{̲}{A}, ω \underset{̲}{C}] + [\underset{̲}{A} ∙ ≪ 1, ω \underset{̲}{C}], \end{matrix}

\begin{matrix} = {\tilde{T}}_{2} + {\tilde{T}}_{3} + {\tilde{T}}_{4} + {\tilde{T}}_{5} + {\tilde{T}}_{6} . \end{matrix}

\begin{matrix}  \end{matrix}

We will show that all of these terms obey the estimate:

\begin{matrix} ∥ {\tilde{T}}_{k} ∥ D (L^{1} (L^{\infty})) & ≲ ℰ, & 2 ⩽ k ⩽ 6 . \end{matrix}

(445)

\begin{matrix}  \end{matrix}

Notice that, for the most part, the terms

{\tilde{T}}_{k}

represent less singular versions of the

T_{k}

on line 432 above. In fact, they can all be treated using similar embeddings by simply wasting one derivative. Specifically, the estimate 445 for the first term

{\tilde{T}}_{2}

follows directly from 436 above once one takes into account the presence of the truncation 126c inherent in the projection

\tilde{P}

. To prove the estimate 445 for the portion of term

{\tilde{T}}_{3}

containing the

ω \underset{̲}{L}

derivative, we use the same embedding employed in the proof of the estimate for

T_{3}

on line 434 above. This follows because one can distribute the time derivative and simply waste smoothness in the estimates 404 , 406 , and 407 – 408 . Specifically, by taking advantage of the low frequency behavior of these estimates, we have the bounds:

\begin{matrix} ∥ \nabla_{t} ω \underset{̲}{A} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 9 n}^{q_{γ}, (2, \frac{n - 1}{2})})) & ≲ ℰ, & ∥ \nabla_{t} ω \underset{̲}{C} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 9 n}^{q_{γ}, (2, \frac{n - 1}{2})})) & ≲ ℰ, \end{matrix}

(446)

\begin{matrix} ∥ \nabla_{t} ω \underset{̲}{L} ω \underset{̲}{A} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 9 n}^{p_{γ}, (2, \frac{n - 3}{2})})) & ≲ ℰ, & ∥ \nabla_{t} ω \underset{̲}{L} ω \underset{̲}{C} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 9 n}^{p_{γ}, (2, \frac{n - 3}{2})})) & ≲ ℰ . \end{matrix}

(447)

\begin{matrix}  \end{matrix}

Using a similar strategy, we can prove the estimate 445 for the portion of

{\tilde{T}}_{3}

containing the

ω \cdot \nabla_{x}

derivative (notice that the functions

ω_{i}

are trivially decomposable) as well as the term

{\tilde{T}}_{4}

in the same way as we showed 434 for the term

T_{4}

above. All we need to do is to show the estimate:

∥ {\nabla_{t}}^{ω} C_{0} ∥ D (L_{t}^{2} ({\dot{B}}_{2, 9 n}^{q_{γ}, (2, \frac{n - 1}{2})})) ≲ ℰ .

This follows in the same way we proved the undifferentiated estimate 405 for

^{ω} C_{0}

above, but instead of using the undifferentiated versions of 403 , 405 , and 407 – 408 , we simply use 446 – 447 . Finally, notice that the proof of the estimate 445 for the terms

{\tilde{T}}_{5}

and

{\tilde{T}}_{6}

above follows by simply multiplying (decompose twice!) the

D (L^{2} (L^{\infty}))

estimate which is implied by the bounds 403 and 405 above. This completes our decomposition of the second error term on the right hand side of 203 above.

$∙$ Decomposing the term $[{\underset{̲}{A}}^{α} ∙ ≪ 1 - (^{ω} C^{\pm})^{α}, [(\underset{̲}{A} ∙ ≪ 1)_{α} -^{ω} C_{α}^{\pm}, ∙]]$

Here we again use the norm

D (L^{1} (L^{\infty}))

, which we can achieve as a product of

D (L^{2} (L^{\infty}))

estimates, again making an appeal to 403 and 405 above.

This completes our proof of the approximation estimate 178e and thus, at last, the proof of Proposition 7.2 which allows us to close the bootstrapping begun in Proposition 6.1 . FP. References

Jöran Bergh, Jörgen Löfström Interpolation spaces. An introduction. Grundlehren der Mathematischen Wissenschaften, No. 223. Springer-Verlag, Berlin-New York, 1976.
P. Bizoń, Z. Tabor, On blowup of Yang-Mills fields. Phys. Rev. D (3) 64 (2001), no. 12, 121701, 4 pp.
Thierry Cazenave, Jalal Shatah, Shadi A. Tahvildar-Zadeh Harmonic maps of the hyperbolic space and development of singularities in wave maps and Yang-Mills fields. Ann. Inst. H. Poincar Phys. Thor. 68 (1998), no. 3, 315–349.
Markus Keel, Terence Tao Endpoint Strichartz estimates. Amer. J. Math. 120 (1998), no. 5, 955–980.
Sergiu Klainerman, Igor Rodnianski Improved local well-posedness for quasilinear wave equations in dimension three. Duke Math. J. 117 (2003), no. 1, 1–124.
Sergiu Klainerman, Igor Rodnianski On the global regularity of wave maps in the critical Sobolev norm. Internat. Math. Res. Notices 2001, no. 13, 655–677.
Andrea Nahmod, Atanas Stefanov, Karen Uhlenbeck, On the well-posedness of the wave map problem in high dimensions. Comm. Anal. Geom. 11 (2003), no. 1, 49–83.
Igor Rodnianski, Terence Tao Global regularity for the Maxwell-Klein-Gordon equation with small critical Sobolev norm in high dimensions. Comm. Math. Phys. 251 (2004), no. 2, 377–426.
Jalal Shatah, Michael Struwe The Cauchy problem for wave maps. Int. Math. Res. Not. 2002, no. 11, 555–571.
Hart F. Smith, Daniel Tataru Sharp local well-posedness results for the nonlinear wave equation. to appear in Annals of Mathematics.
Terence Tao Global regularity of wave maps. I. Small critical Sobolev norm in high dimension. Internat. Math. Res. Notices 2001, no. 6, 299–328.
Michael E. Taylor Tools for PDE. Pseudodifferential operators, paradifferential operators, and layer potentials. Mathematical Surveys and Monographs, 81. American Mathematical Society, Providence, RI, 2000.
Karen K. Uhlenbeck Connections with $L^{p}$ bounds on curvature. Comm. Math. Phys. 83 (1982), no. 1, 31–42.

Joachim Krieger

Jacob Sterbenz