## Continuity of maps solutions of optimal transportation problems

### November 27, 2006

Abstract
In this paper we investigate the continuity of maps solutions of optimal transportation problems. These maps are expressed through the gradient of a potential for which we establish ${C}^{1}$  and ${C}^{1,\alpha }$  regularity. Our results hold assuming a condition on the cost function (condition A3 below), that was the one used for ${C}^{2}$  a priori estimates in [?. The optimal potential will solve a Monge-Ampere equation of the form $det\left(M\left(x,\nabla \phi \right)+{D}^{2}\phi \right)=f$  where $M$  depends on the cost function. One of the interesting outcome is that under the condition A3, the regularity obtained is better than the one obtained in the case of the ’usual’ Monge-Ampere equation $det{D}^{2}\phi =f$  , in particular we will obtain here ${C}^{1,\alpha }$  regularity for $\phi$  under the condition $f\in {L}^{p},p>n$  .

1 Introduction

In this work we present some results about regularity of optimal maps arising in problems of optimal transportation. Given a cost function $\left(x,y\right)\to c\left(x,y\right)$  going from ${\mathbb{R}}^{n}×{\mathbb{R}}^{n}$  to $\mathbb{R}$  , (or from $\Omega ×{\Omega }^{\prime }$  to $\mathbb{R}$  for $\Omega ,{\Omega }^{\prime }$  some domains of ${\mathbb{R}}^{n}$  ) and ${\rho }_{0},{\rho }_{1}$  two positive measures of equal mass, the problem of optimal transportation consists in finding a map $T$  that pushes forward ${\rho }_{0}$  onto ${\rho }_{1}$  (hereafter ${T}_{#}{\rho }_{0}={\rho }_{1}$  ) in the sense that
 $\begin{array}{ccc}\forall f\in {C}_{b}^{0}\left({\mathbb{R}}^{n}\right),\int f\circ Td{\rho }_{0}=\int fd{\rho }_{1},& & \end{array}$
minimizing the transportation cost
 $\begin{array}{ccc}C=\int c\left(x,T\left(x\right)\right)d{\rho }_{0}\left(x\right).& & \end{array}$
The reader can refer to [? for references about optimal transportation, and conditions needed for the solvability of the problem. The problem we are concerned here is the regularity of optimal maps.
We recall briefly how optimal maps are expressed through a c-convex potential: given a lower semi-continuous function $\phi :\Omega \subset {\mathbb{R}}^{n}\to \mathbb{R}$  , we define its c-transform by
 $\begin{array}{ccc}{\phi }^{c}\left(y\right)={sup}_{x\in \Omega }\left\{-c\left(x,y\right)-\phi \left(x\right)\right\}.& & \end{array}$ (1)
A function is said c-convex if it is the c-transform of another function, moreover in this case ${\phi }^{cc}=\phi$  . Assuming for simplicity that $\Omega$  is bounded, note that for a ${C}^{2}$  cost $c$  , ${\phi }^{c}$  will be locally semi-convex and Lipschitz. Through Monge-Kantorovitch duality, the problem of optimal transportation becomes a problem of linear programming. For instance, the Monge-Kantorovitch problem associated to the optimal transport of ${\rho }_{0}$  onto ${\rho }_{1}$  under the cost $c$  is to find an optimal pair of potentials $\left(\phi ,\psi \right)$  that realizes
 $\begin{array}{ccc}{inf}_{\phi \left(x\right)+\psi \left(y\right)\ge -c\left(x,y\right)}\int \phi \left(x\right)d{\rho }_{0}\left(x\right)+\int \psi \left(y\right)d{\rho }_{1}\left(y\right).& & \end{array}$
We can assume that for the optimal pair we have $\psi ={\phi }^{c}$  . For $d{\rho }_{0}$  almost every $x$  , there will be a unique ${y}_{x}$  such that $\phi \left(x\right)+{\phi }^{c}\left({y}_{x}\right)=-c\left(x,{y}_{x}\right)$  .
The map $x\to {y}_{x}$  will then yield the optimal map for the transportation problem. Such map will be denoted ${G}_{\phi }$  . Formally, for a given $x$  at which $\phi$  is differentiable, ${G}_{\phi }\left(x\right)$  is defined as the unique $y$  such that $-{\nabla }_{x}c\left(x,y\right)=\nabla \phi \left(x\right)$  (this makes sense under condition A1 below). In the case of the optimal transportation on a Riemannian manifold with distance $d\left(\cdot ,\cdot \right)$  and with $c={d}^{2}/2$  , the map ${G}_{\phi }$  will be $x\to {exp}_{x}\left(\nabla \phi \right)$  (see [?).
In this setting, for a ${C}^{2}$  smooth c-convex potential $\phi$  such that
 $\begin{array}{ccc}{G}_{\phi #}{\rho }_{0}={\rho }_{1},& & \end{array}$
the corresponding Monge-Ampere equation will be
 $\begin{array}{ccc}det\left({D}_{xx}^{2}c\left(x,{G}_{\phi }\left(x\right)\right)+{D}^{2}\phi \right)=|det{D}_{xy}^{2}c|\frac{{\rho }_{0}\left(x\right)}{{\rho }_{1}\left({G}_{\phi }\left(x\right)\right)}.& & \end{array}$ (2)
(See [? for a derivation of this equation, or [?, [?.)

1.1 Conditions on the cost function

In this paper we will make the following assumptions on the cost function, that are the same as in [? (the assumption A3 is equivalent, although in a different form).
• A1 For all $\left(x,p\right)\in {\mathbb{R}}^{n}×{\mathbb{R}}^{n}$  , there exists a unique $y\in {\mathbb{R}}^{n}$  such that $-{\nabla }_{x}c\left(x,y\right)=p$  . Such $y$  will then be denoted ${T}_{x}\left(p\right)$  .
• A2 The cost function $c$  satisfies $det{D}_{xy}^{2}c\ne 0$  for all $x,y\in {\mathbb{R}}^{n}×{\mathbb{R}}^{n}$  .
• A3 We have $c\in {C}^{4}\left({\mathbb{R}}^{n}×{\mathbb{R}}^{n}\right)$  and there exists ${C}_{0}>0$  such that for all unit vectors $\nu ,\xi$  with $\xi \perp \nu$  ,  $\begin{array}{ccc}{D}_{{p}_{\nu }{p}_{\nu }}^{2}\left[p\to {D}_{\xi \xi }^{2}c\left(x,{T}_{x}\left(p\right)\right)\right]\le -{C}_{0},& & \end{array}$
where ${D}_{{p}_{\nu }{p}_{\nu }}^{2}$  denotes the pure second derivative with respect to $p$  in the direction $\nu$  , and ${D}_{\xi \xi }^{2}$  the pure second derivative with respect to $x$  in the direction $\xi$  .
Remark. The map $\left(x,p\right)\to {T}_{x}\left(p\right)$  is the ’c-analog’ of the exponential map on a Riemannian manifold, in the case where $c={d}^{2}/2$  , it coincides.
Remark. The condition A3 will be analyzed in greater detail at the beginning of the proof of our main result, and a geometric interpretation of this condition will be given.

1.2 Some definitions

Definition 1.1 For $\phi$  a semi-convex function, the sub-differential of $\phi$  at $x$  , that we denote $\partial \phi \left(x\right)$  , is the set
 $\begin{array}{ccc}\partial \phi \left(x\right)=\left\{p\in {\mathbb{R}}^{n},\phi \left(y\right)\ge \phi \left(x\right)+p\cdot \left(y-x\right)+o\left(|x-y|\right)\right\}.& & \end{array}$
If $\phi$  is c-convex, the c-sub-differential of $\phi$  at $x$  , that we denote ${\partial }^{c}\phi$  , is the set
 $\begin{array}{ccc}{\partial }^{c}\phi \left(x\right)=\left\{p\in {\mathbb{R}}^{n},{\phi }^{c}\left({T}_{p}\left(x\right)\right)+\phi \left(x\right)=-c\left(x,{T}_{p}\left(x\right)\right)\right\}.& & \end{array}$
From ( 1 ), we see immediately that ${\partial }^{c}\phi \subset \partial \phi$  . The concern is that this inclusion might be strict, except in the case where $\phi$  is differentiable (and hence ${C}^{1}$  by semi-convexity) at $x$  , see [? for a discussion about this subject.
Definition 1.2 Let $p\to {T}_{x}\left(p\right)$  be the mapping defined by assumption A1. The point $x$  being fixed, a c-segment with respect to $x$  is the image by ${T}_{x}$  of a segment of ${\mathbb{R}}^{n}$  .
If for ${v}_{0},{v}_{1}\in {\mathbb{R}}^{n}$  we have ${T}_{x}\left({v}_{i}\right)={y}_{i},i=0,1$  , the c-segment centered at $x$  joining ${y}_{0}$  to ${y}_{1}$  will denoted $\left\{{y}_{\theta },\theta \in \left[0,1\right]\right\}$  where ${y}_{\theta }={T}_{x}\left(\theta {v}_{1}+\left(1-\theta \right){y}_{0}\right)$  .
Definition 1.3 Let $A,B\subset {\mathbb{R}}^{n}$  . We say that $B$  is c-convex with respect to $A$  if for all ${y}_{0},{y}_{1}\in B,x\in A$  , the c-segment with respect to $x$  going from ${y}_{0}$  to ${y}_{1}$  is contained in $B$  .

1.3 Statement of the results

We are now ready to state our main result; hereafter $dVol$  denotes the Lebesgue measure of ${\mathbb{R}}^{n}$  , and ${B}_{r}$  (resp. ${B}_{r}\left(x\right)$  ) denotes a ball of radius $r$  (resp. centered at $x$  ).
Theorem 1.4 Let $c$  be a cost function that satisfies assumptions A1, A2, A3. Let ${\rho }_{0},{\rho }_{1}$  be two non-negative functions. Let $\phi$  be a ${C}^{1}$  c-convex potential, that solves
 $\begin{array}{ccc}det\left({D}_{xx}^{2}c\left(x,{G}_{\phi }\left(x\right)\right)+{D}^{2}\phi \right)=|det{D}_{xy}^{2}c|\frac{{\rho }_{0}\left(x\right)}{{\rho }_{1}\left({G}_{\phi }\left(x\right)\right)}& & \end{array}$ (3)
in the sense that ${G}_{\phi #}{\rho }_{0}dVol={\rho }_{1}dVol$  . Assume that ${\rho }_{1}\ge \lambda >0$  and that ${\rho }_{0}$  satisfies
 $\begin{array}{ccc}{\int }_{{B}_{\epsilon }}{\rho }_{0}\le C{\epsilon }^{n\left(1-\frac{1}{p}\right)}& & \end{array}$ (4)
for some $p\in \right]n,+\infty \right]$  and for all $\epsilon >0$  . Then for $\beta =\frac{\alpha }{4n-2+\alpha }$  , $\alpha =1-\frac{n}{p}$  , we have
 $\begin{array}{ccc}\parallel \phi {\parallel }_{{C}^{1,\beta }}\le {C}^{\prime },& & \end{array}$
and ${C}^{\prime }$  depends only on $C$  in ( 4 ) and on the constant ${C}_{0}>0$  in condition A3. If we only know that
 $\begin{array}{ccc}{\rho }_{0}\left({B}_{\epsilon }\right)=f\left(\epsilon \right){\epsilon }^{n\left(1-\frac{1}{n}\right)}& & \end{array}$ (5)
with ${lim}_{\epsilon \to 0}f=0$  then the modulus of continuity of $\nabla \phi$  is controlled by the modulus of continuity of $f$  at 0.
Remark 1. Equation ( 3 ) with ${\rho }_{1}=dVol$  can also be formulated as
 $\begin{array}{ccc}Jac\left({G}_{\phi }\right)={\rho }_{0},& & \end{array}$
with $Jac\left(\cdot \right)$  the Jacobian determinant, and therefore is the analog of the Monge-Ampere equation $det{D}^{2}\phi ={\rho }_{0}$  .
Remark 2. Conditions ( 4 ) (resp. ( 5 )) are implied by ${\rho }_{0}\in {L}^{p}$  (resp.
${\rho }_{0}\in {L}^{n}$  ). Note also that we obtain a continuity result even in the critical case ${\rho }_{0}\in {L}^{n}$  which is somehow surprising.
Remark 3. Here condition A3 is not invariant under affine transformations that preserve the volume. This non-invariance might explain why we obtain better regularity in this case. Indeed, to obtain ${C}^{1,\alpha }$  regularity for the Monge-Ampere equation $det{D}^{2}u=f$  , the condition on $f$  is $f$  bounded away from 0 and $\infty$  (note that affine transformations do not preserve ${L}^{p}$  norms for $p<\infty$  ). Moreover, some pathological situations known as Pogorelov’s counterexamples can happen where, with a ${C}^{\infty }$  right hand side, the solution is not strictly convex, and hence not ${C}^{2}$  (see [?, [?).
This Theorem is established assuming ${C}^{1}$  regularity for the following reason: we obtain the result by estimating the size of ${\partial }^{c}\phi \left({B}_{\epsilon }\right)$  for some balls ${B}_{\epsilon }$  . If we don’t have the ${C}^{1}$  regularity assumption, the sub-differential of $\phi$  can contain more than one point, but we don’t know if for all $p\in \partial \phi \left(x\right)$  , $p\in {\partial }^{c}\phi \left(x\right)$  (see [? for a discussion on the subject). We then use Theorem  1.4 as an a-priori estimate that leads to the following corollary:
Corollary 1.5 Let $\Omega$  and ${\Omega }^{\prime }$  be bounded open subsets of ${\mathbb{R}}^{n}$  with ${\Omega }^{\prime }$  c-convex with respect to $\Omega$  . Let $\phi :\Omega \to {\mathbb{R}}^{n}$  be a c-convex potential such that ${G}_{\phi #}{\rho }_{0}dVol={\rho }_{1}dVol$  for ${\rho }_{0},{\rho }_{1}$  two non-negative functions on $\Omega$  and ${\Omega }^{\prime }$  . Assume that ${\rho }_{1}$  is bounded away from $0$  . Then under the assumption ( 5 ), $\phi$  is ${C}^{1}$  , and hence the Theorem  1.4 applies.
Proof. In [?, it has been proved that under assumptions A1-3, if the densities ${\rho }_{0},{\rho }_{1}$  are ${C}^{2}$  smooth, bounded away from 0, and if ${\Omega }^{\prime }$  is c-convex with respect to $\Omega$  , the potential $\phi$  is ${C}^{3}$  smooth in $\Omega$  . Hence by a standard regularization procedure, we can find smooth positive sequences ${\rho }_{0}^{\epsilon },{\rho }_{1}^{\epsilon }$  that converge to ${\rho }_{0},{\rho }_{1}$  in ${L}^{1}$  . The a priori bound of Theorem  1.4 apply to the sequence ${\phi }^{\epsilon }$  , and ${\phi }^{\epsilon }$  converges uniformly to $\phi$  .
$\square$

2 Proof of the results

We begin by giving some heuristic arguments that explain how we obtain our results, and in particular why the results obtained here are better than those for the ’usual’ Monge-Ampere equation $det{D}^{2}\phi =\frac{{\rho }_{0}}{{\rho }_{1}\left(\nabla \phi \right)}$  (see [? for examples of non-smooth solutions to this equation).

Geometric interpretation of condition A3 and sketch of the proof.

We explain the geometrical meaning of condition A3, and how it will imply ${C}^{1}$  regularity for $\phi$  : assume that for a c-convex function $\phi$  the functions $-\phi \left(\cdot \right)-c\left(\cdot ,{y}_{0}\right)$  and $-\phi \left(\cdot \right)-c\left(\cdot ,{y}_{1}\right)$  both reach a local maximum at $x=0$  (in other words, assume that $\phi$  is not ${C}^{1}$  at 0). Consider ${y}_{\theta }$  the c-segment centered at $x=0$  joining ${y}_{0}$  to ${y}_{1}$  , i.e. ${y}_{\theta }={T}_{x=0}\left(\theta {v}_{1}+\left(1-\theta \right){v}_{0}\right)$  with ${v}_{0},{v}_{1}\in {\mathbb{R}}^{n}$  such that ${T}_{x=0}{v}_{i}={y}_{i},i=1,2$  . Then, as we will see in Lemma  2.1 , condition A3 implies that $-\phi -c\left(x,{y}_{\theta }\right)$  will also have a local maximum at 0, and moreover that ${D}^{2}\phi \left(0\right)+{D}_{xx}^{2}c\left(0,{y}_{\theta }\right)$  (in some generalized sense, since $\phi$  is not ${C}^{2}$  at 0) will be bounded by below by $\alpha I$  , with $\alpha >0$  for $\theta$  away from $0,1$  . Then, by estimating all supporting functions to $\phi$  on a small ball centered at 0, we will find that ${G}_{\phi }\left({B}_{\epsilon }\left(0\right)\right)$  contains ${B}_{C\epsilon }\left({y}_{\theta }\right)$  for some $C>0$  , and for all ${y}_{\theta }$  , where $\theta$  is in $\left[1/4,3/4\right]$  . This will contradict the bound on Jacobian determinant of ${G}_{\phi }$  .
We now enter into the rigorous proof of Theorem  1.4 .

2.1 Geometric interpretation of condition A3

The core of the proof is the following lemma, which a somehow geometrical translation of assumption A3. Actually, I believe that this lemma is indeed equivalent to assumption A3 for a smooth cost function.
Lemma 2.1 For ${y}_{0},{y}_{1}\in {\mathbb{R}}^{n}$  , let $\left({y}_{\theta }{\right)}_{\theta \in \left[0,1\right]}$  be the c-segment with respect to $x=0$  joining ${y}_{0}$  to ${y}_{1}$  , in the sense that if ${T}_{x}\left({z}_{0}\right)={y}_{0},{T}_{x}\left({z}_{1}\right)={y}_{1}$  , we have ${y}_{\theta }={T}_{x}\left(\theta {z}_{1}+\left(1-\theta \right){z}_{0}\right)$  .
Let $\overline{\phi }=max\left\{-c\left(x,{y}_{0}\right)+c\left(0,{y}_{0}\right),-c\left(x,{y}_{1}\right)+c\left(0,{y}_{1}\right)\right\}$  . Then for all $\epsilon >0$  , for all $\theta \in \left[\epsilon ,1-\epsilon \right]$  , for all $|x|\le C\epsilon$  , we have
 $\begin{array}{ccc}\overline{\phi }\left(x\right)\ge -c\left(x,{y}_{\theta }\right)+c\left(0,{y}_{\theta }\right)+{\delta }_{0}|{y}_{1}-{y}_{0}{|}^{2}\theta \left(1-\theta \right)|x{|}^{2}-\gamma |x{|}^{3},& & \end{array}$
where ${\delta }_{0}>0$  depends on ${C}_{0}>0$  in assumption A3, $\gamma$  depends on $\parallel c\left(\cdot ,\cdot \right){\parallel }_{{C}^{3}}$  , and $C$  is bounded away from 0 for $|{y}_{0}|,|{y}_{1}|$  bounded.
Proof of Lemma  2.1 . Rotating the coordinate, and subtracting an affine function, we can assume the following:
 $\begin{array}{ccc}& & {f}_{0}\left(x\right)=-c\left(x,{y}_{0}\right)+c\left(0,{y}_{0}\right)=a{x}_{1}-{D}_{xx}^{2}c\left(0,{y}_{0}\right).x.x/2+o\left({x}^{2}\right),\end{array}$
 $\begin{array}{ccc}& & {f}_{1}\left(x\right)=-c\left(x,{y}_{1}\right)+c\left(0,{y}_{1}\right)=b{x}_{1}-{D}_{xx}^{2}c\left(0,{y}_{1}\right).x.x/2+o\left({x}^{2}\right),\end{array}$
 $\begin{array}{ccc}& & -c\left(x,{y}_{\theta }\right)+c\left(0,{y}_{\theta }\right)=\left[\theta b+\left(1-\theta \right)a\right]{x}_{1}-{D}_{xx}^{2}c\left(0,{y}_{\theta }\right).x.x/2+o\left({x}^{2}\right),\end{array}$
with $a  and where ${x}_{i}$  is the coordinate of $x$  in the direction ${e}_{i}$  . Using the general fact that $max\left\{{f}_{0},{f}_{1}\right\}\ge \theta {f}_{1}+\left(1-\theta \right){f}_{0}$  for $0\le \theta \le 1$  , we have
 $\begin{array}{ccc}\overline{\phi }\left(x\right)\ge \left(\theta b+\left(1-\theta \right)a\right){x}_{1}-\left(\theta {D}_{xx}^{2}c\left(0,{y}_{1}\right)+\left(1-\theta \right){D}_{xx}^{2}c\left(0,{y}_{0}\right)\right).x.x/2+o\left({x}^{2}\right).& & \end{array}$
Then we use the assumption A3:
Lemma 2.2 Under assumption A3,
 $\begin{array}{ccc}-{D}_{xx}^{2}c\left(0,{y}_{\theta }\right).x.x\le -\left(\left(1-\theta \right){D}_{xx}^{2}c\left(0,{y}_{0}\right)+\theta {D}_{xx}^{2}c\left(0,{y}_{1}\right)\right).x.x-\delta |x{|}^{2}+\Delta |{x}_{1}||x|,& & \end{array}$
$\delta ={\delta }_{0}|{y}_{1}-{y}_{0}{|}^{2}\theta \left(1-\theta \right)$  , ${\delta }_{0}$  depends on ${C}_{0}$  in assumption A3, and $\Delta ={\Delta }_{0}|{y}_{1}-{y}_{0}{|}^{2}\theta \left(1-\theta \right)$  , ${\Delta }_{0}$  depends on $c\left(\cdot ,\cdot {\right)}_{{C}^{4}}$  .
Proof. Let $f:\mathbb{R}\to \mathbb{R}$  be convex, with ${f}^{\prime \prime }\ge \alpha >0$  . Then we have
 $\begin{array}{ccc}\theta f\left({y}_{0}\right)+\left(1-\theta \right)f\left({y}_{1}\right)\ge f\left(\theta {y}_{0}+\left(1-\theta \right){y}_{1}\right)+C\theta \left(1-\theta \right)|{y}_{1}-{y}_{0}{|}^{2},& & \end{array}$
where $C>0$  depends on $\alpha >0$  . Then note that the convexity assumption A3 concerns only $-{D}_{\xi \xi }c$  for $\xi \perp {e}_{1}$  . For the ${e}_{1}$  direction, we note that for a ${C}^{2}$  function $f$  , we have
 $\begin{array}{ccc}|\theta f\left({y}_{0}\right)+\left(1-\theta \right)f\left({y}_{1}\right)-f\left(\theta {y}_{0}+\left(1-\theta \right){y}_{1}\right)|\le C\theta \left(1-\theta \right)|{y}_{1}-{y}_{0}{|}^{2},& & \end{array}$
where $C$  depends on $\parallel f{\parallel }_{{C}^{2}}$  .
$\square$  Using the previous lemma, we now have
 $\begin{array}{ccc}\overline{\phi }& \ge & \left(\theta b+\left(1-\theta \right)a\right){x}_{1}-{D}_{xx}^{2}c\left(0,{y}_{\theta }\right).x.x/2\end{array}$ (6)
 $\begin{array}{ccc}& +& \delta |x{|}^{2}-\Delta |{x}_{1}||x|+o\left(|x{|}^{2}\right).\end{array}$
We need to eliminate the term $-\Delta |{x}_{1}||x|$  . In order to do so, notice that
 $\begin{array}{ccc}|{D}_{xx}^{2}c\left(0,{y}_{\theta }\right).x.x/2-{D}_{xx}^{2}c\left(0,{y}_{{\theta }^{\prime }}\right).x.x/2|\le C|\theta -{\theta }^{\prime }||x{|}^{2}.& & \end{array}$
Then in ( 6 ), we can write
 $\begin{array}{ccc}\delta |x{|}^{2}-\Delta |x||{x}_{1}|& =& |{y}_{1}-{y}_{0}{|}^{2}\theta \left(1-\theta \right)\left({\delta }_{0}|x{|}^{2}-{\Delta }_{0}|x||{x}_{1}|\right)\end{array}$
 $\begin{array}{ccc}& \ge & |{y}_{1}-{y}_{0}{|}^{2}\theta \left(1-\theta \right)\left({\delta }_{0}|x{|}^{2}/2-{\Delta }_{0}^{2}/\left(2{\delta }_{0}\right)|{x}_{1}{|}^{2}\right).\end{array}$
Hence we have, setting $\delta :=\delta /2$  ,
 $\begin{array}{ccc}\overline{\phi }\ge & & \left({\theta }^{\prime }b+\left(1-{\theta }^{\prime }\right)a\right){x}_{1}-{D}_{xx}^{2}c\left(0,{y}_{\theta }^{\prime }\right).x.x/2\end{array}$
 $\begin{array}{ccc}& & +\delta |x{|}^{2}+\left(\left(b-a\right)\left(\theta -{\theta }^{\prime }\right)-{C}_{0}{x}_{1}\right){x}_{1}-C|\theta -{\theta }^{\prime }||x{|}^{2}.\end{array}$
where ${C}_{0}=\theta \left(1-\theta \right)|{y}_{1}-{y}_{0}{|}^{2}{\Delta }_{0}^{2}/\left(2{\delta }_{0}\right)$  . Taking ${\theta }^{\prime }\in \left[\epsilon ,1-\epsilon \right]$  , $\theta ={\theta }^{\prime }+{x}_{1}{C}_{0}/\left(b-a\right)$  , and restricting to $|{x}_{1}|\le \left(b-a\right){C}_{0}^{-1}\epsilon$  , we can find
 $\begin{array}{ccc}\forall \theta \in \left[\epsilon ,1-\epsilon \right],\overline{\phi }\ge \left(\theta b+\left(1-\theta \right)a\right){x}_{1}-{D}_{xx}^{2}c\left(0,{y}_{\theta }\right).x.x/2+\delta |x{|}^{2}+o\left(|x{|}^{2}\right).& & \end{array}$
Noticing that all the terms $o\left(|x{|}^{2}\right)$  are in fact bounded by $C|{D}^{3}c\left(\cdot ,\cdot \right)||x{|}^{3}$  , and that $\theta b+\left(1-\theta \right)a=-{\nabla }_{x}c\left(0,{y}_{\theta }\right)$  , we conclude the lemma.
$\square$

2.2 Bounds on the modulus of continuity of $\nabla \phi$

Here we suppose that there exists ${x}_{0}$  and ${x}_{1}$  close such that $\nabla \phi \left({x}_{1}\right)-\nabla \phi \left({x}_{0}\right)$  is large compared to ${x}_{1}-{x}_{0}$  . If this does not happen, then $\phi$  is ${C}^{1,1}$  .
We can assume that $\phi \left({x}_{0}\right)=\phi \left({x}_{1}\right)$  . The supporting functions $-c\left(x,{y}_{0}\right)+c\left({x}_{0},{y}_{0}\right)+\phi \left({x}_{0}\right)$  and $-c\left(x,{y}_{1}\right)+c\left({x}_{1},{y}_{1}\right)+\phi \left({x}_{1}\right)$  will cross somewhere (say at ${x}_{m}$  ) on the segment $\left[{x}_{0},{x}_{1}\right]$  . We might suppose that at this point they are equal to 0.
Lemma 2.3 Under the assumptions made above, we have $\phi \le {C}_{1}|{x}_{1}-{x}_{0}||{y}_{1}-{y}_{0}|$  in the segment $\left[{x}_{0},{x}_{1}\right]$  .
Proof. We have
 $\begin{array}{ccc}H& =& \phi \left({x}_{0}\right)=-{\nabla }_{x}c\left({x}_{m},{y}_{0}\right)\cdot \left({x}_{0}-{x}_{m}\right)+O\left(|{x}_{0}-{x}_{m}{|}^{2}\right)\end{array}$
 $\begin{array}{ccc}& =& \phi \left({x}_{1}\right)=-{\nabla }_{x}c\left({x}_{m},{y}_{1}\right)\cdot \left({x}_{1}-{x}_{m}\right)+O\left(|{x}_{1}-{x}_{m}{|}^{2}\right).\end{array}$
By semi-convexity, on $\left[{x}_{0},{x}_{1}\right]$  we have $\phi \le H+C|{x}_{1}-{x}_{0}{|}^{2}$  . Then we assume that $-{\nabla }_{x}c\left({x}_{m},{y}_{0}\right)\cdot \left({x}_{0}-{x}_{m}\right)$  and $-{\nabla }_{x}c\left({x}_{m},{y}_{1}\right)\cdot \left({x}_{1}-{x}_{m}\right)$  are both positive, otherwise we are done. Then we have
 $\begin{array}{ccc}2H& \le & -{\nabla }_{x}c\left({x}_{m},{y}_{0}\right)\cdot \left({x}_{0}-{x}_{1}\right)-{\nabla }_{x}c\left({x}_{m},{y}_{1}\right)\cdot \left({x}_{1}-{x}_{0}\right)+C|x-{x}_{1}{|}^{2}\end{array}$
 $\begin{array}{ccc}& \le & C|{x}_{1}-{x}_{0}||{y}_{1}-{y}_{0}|+C|x-{x}_{1}{|}^{2},\end{array}$
where $C$  depends on $|{D}_{xy}c|,|{D}_{xx}c|$  . Recall that we assume that $|{x}_{1}-{x}_{0}|$  is small compared to $|{y}_{1}-{y}_{0}|$  , otherwise there is nothing to prove; this means that $|{x}_{1}-{x}_{0}{|}^{2}$  is small compared to $|{x}_{1}-{x}_{0}||{y}_{1}-{y}_{0}|$  , and we conclude.
$\square$  We use again Lemma  2.1 (centered at ${x}_{m}$  ) that will yield
 $\begin{array}{ccc}\phi \left(x\right)& \ge & max\left\{-c\left(x,{y}_{0}\right)+c\left({x}_{m},{y}_{0}\right),-c\left(x,{y}_{1}\right)+c\left({x}_{m},{y}_{1}\right)\right\}\end{array}$
 $\begin{array}{ccc}& \ge & -c\left(x,{y}_{\theta }\right)+c\left({x}_{m},{y}_{\theta }\right)+{\delta }_{0}\theta \left(1-\theta \right)|{y}_{0}-{y}_{1}{|}^{2}|x-{x}_{m}{|}^{2}-\gamma |x-{x}_{m}{|}^{3}.\end{array}$
for all $\theta \in \left[\epsilon ,1-\epsilon \right]$  , $|x-{x}_{m}|\le C\epsilon$  , and with ${y}_{\theta }$  the c-segment from ${x}_{m}$  joining ${y}_{0}$  to ${y}_{1}$  . Note that $\epsilon$  is small but fixed once for all.
We want to find supporting functions to $\phi$  on a ball of suitable radius.
For that we consider a function of the form
 $\begin{array}{ccc}-c\left(x,y\right)+c\left({x}_{m},y\right)+\phi \left({x}_{m}\right).& & \end{array}$
Of course, this function coincides with $\phi$  at ${x}_{m}$  . We then have
 $\begin{array}{ccc}& & -c\left(x,y\right)+c\left({x}_{m},y\right)+\phi \left({x}_{m}\right)\end{array}$
 $\begin{array}{ccc}& \le & -c\left(x,{y}_{\theta }\right)+c\left({x}_{m},{y}_{\theta }\right)+{C}_{2}|y-{y}_{\theta }||x-{x}_{m}|+{C}_{1}|{x}_{1}-{x}_{0}||{y}_{1}-{y}_{0}|,\end{array}$
where ${C}_{2}$  depends on $|{D}_{xy}^{2}c|$  , and we have used Lemma  2.3 to estimate $\phi \left({x}_{m}\right)$  . We want this to be bounded by $-c\left(x,{y}_{\theta }\right)+c\left({x}_{m},{y}_{\theta }\right)+{\delta }_{0}\theta \left(1-\theta \right)|{y}_{0}-{y}_{1}{|}^{2}|x-{x}_{m}{|}^{2}-\gamma |x-{x}_{m}{|}^{3}.$  First we restrict $\theta$  to $\left[1/4,3/4\right]$  , then we want
 $\begin{array}{ccc}{\delta }_{0}|{y}_{0}-{y}_{1}{|}^{2}|x-{x}_{m}{|}^{2}-\gamma |x-{x}_{m}{|}^{3}\ge {C}_{2}|y-{y}_{\theta }||x-{x}_{m}|+{C}_{1}|{x}_{1}-{x}_{0}||{y}_{1}-{y}_{0}|.& & \end{array}$
We choose $|y-{y}_{\theta }|\le {C}_{3}|x-{x}_{m}||{y}_{1}-{y}_{0}{|}^{2}$  for ${C}_{3}$  small enough (for example ${C}_{3}={\delta }_{0}/4$  ) , and the above inequality will be satisfied for
 $\begin{array}{ccc}|x-{x}_{m}{|}^{2}={C}_{4}\frac{|{x}_{1}-{x}_{0}|}{|{y}_{1}-{y}_{0}|},& & \end{array}$
if for this value of $|x-{x}_{m}|$  , we have indeed $|x-{x}_{m}|\le {\delta }_{0}/\left(2\gamma \right)|{y}_{0}-{y}_{1}{|}^{2}$  . If not then it means that $|{y}_{1}-{y}_{0}{|}^{5}\le C|{x}_{1}-{x}_{0}|$  and we are done.
Now we assume that this is not the case, and therefore the ratio $\frac{|{x}_{1}-{x}_{0}|}{|{y}_{1}-{y}_{0}|}$  is small. Hence we consider a ball of radius $\nu =C{\left(\frac{|{x}_{1}-{x}_{0}|}{|{y}_{1}-{y}_{0}|}\right)}^{1/2}$  centered at ${x}_{m}$  .
We denote $\mu ={C}_{3}\nu |{y}_{1}-{y}_{0}{|}^{2}$  . We denote ${N}_{\mu }\left(S\right)$  the $\mu$  neighborhood of a set $S$  . The functions $-c\left(x,y\right)+c\left({x}_{m},y\right)+\phi \left({x}_{m}\right)$  , for $y\in {N}_{\mu }\left\{{y}_{\theta },\theta \in \left[1/4,3/4\right]\right\}$  will be equal to $\phi$  at ${x}_{m}$  , and will be below $\phi$  on the the boundary of the ball ${B}_{\nu }\left({x}_{m}\right)$  . Hence they are supporting functions to $\phi$  at some point in ${B}_{\nu }\left({x}_{m}\right)$  .
The volume of all such $y$  is comparable to $\left[{y}_{1}-{y}_{0}\right]{\mu }^{n-1}\sim \left[{y}_{1}-{y}_{0}\right]{\nu }^{n-1}|{y}_{1}-{y}_{0}{|}^{2\left(n-1\right)},$  while the ball around ${x}_{m}$  has a volume comparable to ${\nu }^{n}$  .
If the Jacobian determinant of the mapping ${G}_{\phi }$  is bounded, we get that $|{y}_{1}-{y}_{0}{|}^{2n-1}\le C\nu$  . This implies $|{y}_{1}-{y}_{0}{|}^{2n-1}\le C{\left(\frac{|{x}_{1}-{x}_{0}|}{|{y}_{1}-{y}_{0}|}\right)}^{1/2},$  thus we conclude, using $-{\nabla }_{x}c\left(x,{y}_{i}\right)=\nabla \phi \left({x}_{i}\right),i=0,1$  , that
 $\begin{array}{ccc}|\nabla \phi \left({x}_{1}\right)-\nabla \phi \left({x}_{0}\right)|\le C|{x}_{1}-{x}_{0}{|}^{\frac{1}{4n-1}}.& & \end{array}$
We can refine the argument: Let $F$  be defined by
 $\begin{array}{ccc}F\left(V\right)=sup\left\{dVol\left({G}_{\phi }\left(B\right)\right),B\text{a ball of volume}V\right\}.& & \end{array}$
We have
 $\begin{array}{ccc}F\left(V\right)\le \left(inf{\rho }_{1}{\right)}^{-1}{sup}_{|B|=V}{\int }_{B}{\rho }_{0}.& & \end{array}$
Then we have $F\left(|{B}_{\nu }|\right)\ge dVol\left({N}_{\mu }\left\{{y}_{\theta },\theta \in \left[1/4,3/4\right]\right\}\right)$  , which gives
 $\begin{array}{ccc}F\left(\frac{|{x}_{1}-{x}_{0}{|}^{n/2}}{|{y}_{1}-{y}_{0}{|}^{n/2}}\right)\ge C|{x}_{1}-{x}_{0}{|}^{\left(n-1\right)/2}|{y}_{1}-{y}_{0}{|}^{\left(3n-1\right)/2}.& & \end{array}$ (7)
Assume that $F\left(V\right)\le C{V}^{\kappa }$  for some $\kappa \in \mathbb{R}$  . Note that ${\rho }_{0}\in {L}^{p}$  implies that $F\left(V\right)=O\left({V}^{1-1/p}\right)$  , hence we may write $\kappa =1-1/p$  for some $p\in \right]1,+\infty \right]$  .
Then we find
 $\begin{array}{ccc}|{y}_{1}-{y}_{0}{|}^{2n-1+\frac{1}{2}\left(1-\frac{n}{p}\right)}\le C|{x}_{1}-{x}_{0}{|}^{\frac{1}{2}\left(1-\frac{n}{p}\right)}.& & \end{array}$
We see first that we need $p>n$  , then we get, setting $\alpha =1-n/p$  ,
 $\begin{array}{ccc}|{y}_{1}-{y}_{0}|\le |{x}_{1}-{x}_{0}{|}^{\frac{\alpha }{4n-2+\alpha }}.& & \end{array}$
If we only know that $F\left(V\right)=o\left({V}^{1-1/n}\right)$  , (which is true if ${\rho }_{0}\in {L}^{n}$  ), we write $F\left(V\right)={\left[f\left({V}^{2/n}\right)\right]}^{2n-1}{V}^{1-1/n}$  , for some non-decreasing $f$  , with ${lim}_{V\to 0}f=0$  . We then have, as $|{x}_{1}-{x}_{0}|$  goes to 0, $\frac{|{x}_{1}-{x}_{0}|}{|{y}_{1}-{y}_{0}|}$  that goes also to 0 (otherwise there is nothing to prove). Using the special form of $F$  in ( 7 ), we get
 $\begin{array}{ccc}{f}^{2n-1}\left(\frac{|{x}_{1}-{x}_{0}|}{|{y}_{1}-{y}_{0}|}\right)\ge |{y}_{1}-{y}_{0}{|}^{2n-1},& & \end{array}$
hence we get that $|{y}_{1}-{y}_{0}|$  goes to 0 when $|{x}_{1}-{x}_{0}|$  goes to 0. Let $g$  be the function such that $|{y}_{1}-{y}_{0}|\le g\left(|{x}_{1}-{x}_{0}|\right)$  ( $g$  is the modulus of continuity of ${G}_{\phi }$  ), then $g$  satisfies
 $\begin{array}{ccc}f\left(\frac{u}{g\left(u\right)}\right)\ge g\left(u\right).& & \end{array}$
This yields a uniform control on the modulus of continuity of ${G}_{\phi }$  : indeed, for $v>0$  , if $g>v$  , then $f\left(u/v\right)\ge v$  , and $u\ge v{f}^{-1}\left(v\right)$  . The function $v\to v{f}^{-1}\left(v\right)$  is non-decreasing and goes to $0$  when $v$  goes to 0. Considering $\omega$  its inverse, $\omega$  is the modulus of continuity of ${G}_{\phi }$  . Finally we have $\nabla \phi \left(x\right)=-{\nabla }_{x}c\left(x,{G}_{\phi }\left(x\right)\right)$  , and the continuity of $\nabla \phi$  is asserted.
$\square$  Remark. The power $\beta =\frac{\alpha }{4n-2+\alpha }$  is not optimal for example if $n=1,p=+\infty$  , for which the ${C}^{1,1}$  regularity is trivial, but note that in order to obtain this bound, we had to assume that $|{y}_{1}-{y}_{0}|\ge |{x}_{1}-{x}_{0}{|}^{1/5}$  , and, before, that $|{x}_{1}-{x}_{0}|=o\left(|{y}_{1}-{y}_{0}|\right)$  . Hence the conclusion should be: either $\phi$  is ${C}^{1,1}$  , or $\phi$  is ${C}^{1,1/5}$  or $\phi$  is ${C}^{1,\beta }$  . Note that $\beta \le 1/7$  for $n\ge 2$  .
Gregoire Loeper EPFL, SB-IMA 1015 Lausanne, Switzerland gregoire.loeper@epfl.ch

$\text{1}$  Laboratoire J.A.Dieudonné, Université de Nice-Sophia-Antipolis, Parc Valrose, 06108 NICE Cedex 2.