1. Introduction
We consider anisotropic mean curvature flow, a geometric evolution equation used to model microstructure in complex materials. The prototypical application is in multiphase grain growth for polycrystals. As noted in [Reference Garcke, Nestler and Stoth28], isotropic models, such as mean curvature flow, fail to capture phenomenological features such as the dendritic growth of phases (see also [Reference Chen15]). In chemical kinetics, phase separation experiences anisotropy due to the underlying lattice orientation of the solid host material [Reference Bazant5]. Similarly, many materials even display anisotropic surface tensions which are not smooth with respect to the interface orientation (see also [Reference Angenent and Gurtin3, Reference Taylor53, Reference Taylor, Cahn and Handwerker54]): here, one can even consider the household setting of salt (NaCl) and air. At the same time, the study of interface evolutions poses serious numerical and mathematical challenges and a large amount of insight has been gained by modeling such systems in terms of phase-field models, where one replaces interfaces by continuous order parameters (see, e.g., [Reference Chen15]).
In this paper, we prove convergence of solutions of the anisotropic scalar Allen–Cahn equation, a phase-field model, to anisotropic mean curvature flow using variational methods. This may be considered as a first step to proving convergence in the physically relevant vectorial setting. Our approach generally sheds light onto anisotropic mean curvature flow and further enables us to prove a weak–strong uniqueness result for the interface evolution.
Anisotropic mean curvature flow prescribes the evolution of an oriented hypersurface with the surface velocity determined by a weighted mean curvature. Fixing a surface tension $\sigma \;:\; {\mathbb{R}}^{d}\to{\mathbb{R}}_{\geq 0}$ and a mobility $\mu \;:\;{\mathbb{R}}^{d}\to{\mathbb{R}}_{\geq 0}$ (where one can think of extending from the sphere by one-homogeneity), we say that a time-parametrized collection of sets $\{A(t)\}_{t\in [0,T]}$ evolves by anisotropic mean curvature flow if the boundary $\Gamma (t)\;:\!=\; \partial A(t)$ is smooth and satisfies
where $\nu$ is the outer unit normal of $A$ , $V$ is the surface normal velocity, and $H_{\sigma }\;:\!=\; \mathrm{div}_{\Gamma (t)}(D\sigma (\nu ))$ is the anisotropic mean curvature with respect to $\sigma$ . We note that $D\sigma (\nu )$ is typically referred to as the Cahn–Hoffman vector field, a generalization of the surface normal. In the case that $\sigma = \mu = |\cdot |$ are given by the Euclidean norm, one recovers the usual (isotropic) mean curvature flow $V=-H$ .
Following the approach of Luckhaus and Sturzenhecker [Reference Luckhaus and Sturzenhecker47], one way to encode the motion (1.1) is through the characteristic functions $\chi (t) \;:\!=\; \chi _{A(t)}$ . Here $A(t)$ are naturally given by sets of finite perimeter instead of smooth open sets, allowing for distributional (or $BV$ ) weak solutions to (1.1). In the isotropic setting, solutions were derived via a minimizing movements scheme (see also [Reference Almgren, Taylor and Wang1]) for the perimeter functional. This was a natural approach as mean curvature flow can formally be viewed as the gradient flow of the perimeter functional with an appropriate metric [Reference Hensel and Laux36, Reference Serfaty51].
To carry this analogy to our setting, the curvature flow (1.1) seeks to minimize an anisotropic surface energy $E\;:\; L_{\mathrm{loc}}^{2}(\Omega )\to [0,\infty ]$ defined by
Here, $\nu =-\frac{\nabla \chi }{|\nabla \chi |}$ is the measure-theoretic outer unit normal, and $c_{0}$ is a positive constant quantifying surface energy. Formally speaking (see Subsection 2.2), anisotropic mean curvature flow is a gradient flow of the anisotropic perimeter $E$ with respect to the weighted $L^2$ -surface metric
To construct solutions of interface evolutions, such as (1.1), one can approximate via diffuse-interface models. The idea is that the sharp interface $\Gamma (t)$ , which captures a jump discontinuity of $u =\chi$ , is replaced by a diffuse interface $u_{\varepsilon }$ forming a continuous transition between values close to $1$ and values close to $0$ . Diffuse-interface or phase-field models are often used in practice and especially for numerics, where tracking of the interface is reduced to a reaction-diffusion equation. Herein, we consider a phase-field approximation of the curvature flow (1.1) given by the anisotropic Allen–Cahn equation
where $\Omega$ is the $d$ -dimensional torus, the function $W\;:\;{\mathbb{R}} \to{\mathbb{R}}_{\geq 0}$ is a double-well potential with its wells at $0$ and $1$ , and the anisotropic surface tension and mobility are encoded by the functions
and
For the isotropic case $\sigma = \mu = |\cdot |$ , we simply obtain $f=|\cdot |^{2}$ and $g \equiv 1$ .
In contrast to the approach taken in [Reference Elliott and Schätzle21], we have introduced a regularization (1.6) of the mobility $\mu$ allowing us to take advantage of the gradient flow structure for the anisotropic Allen–Cahn equation. We define the anisotropic Cahn–Hilliard energy $E_{\varepsilon }\;:\; L^{2}(\Omega )\to [0,\infty ]$ by
and introduce a weighted $L^{2}$ -metric given by
for a given point $u\in \mathrm{dom}(E_{\varepsilon })$ . Note that we omit the dependence on $\varepsilon$ in the notation $(\cdot,\cdot )_{u}$ for convenience. A formal calculation shows that the Allen–Cahn equation (1.4) is equivalent to
where $\nabla _{u_{\varepsilon }(t)}$ is the gradient on $\left (L^{2}(\Omega ), (\cdot,\cdot )_{u_{\varepsilon }(t)}\right )$ . The above equation encapsulates (1.4) as a gradient flow, and this structure will be exploited for the construction of solutions to (1.4) in Section 3 as well as for the sharp-interface limit in Section 4.
A first indication that (1.4) approximates (1.1) is the $\Gamma$ -convergence of the associated energies (see [Reference Braides10, Reference Maso19]). With
it was shown by Bouchitté [Reference Bouchitté9] that $E_{\varepsilon }\overset{\Gamma }{\longrightarrow }E$ as $\varepsilon \searrow 0$ with respect to the strong $L^{1}$ -topology on the underlying space. Likewise in the spirit of Luckhaus and Modica [Reference Luckhaus and Modica46], Cicalese et al. [Reference Cicalese, Nagase and Pisante18] verified an anisotropic Gibbs–Thomson relation for the energies connecting the first variation of (1.7) to the limiting minimal surface’s anisotropic curvature.
Early results on anisotropic mean curvature flow in the special case $\mu =\sigma$ are due to Chen, Giga, and Goto [Reference Chen, Giga and Goto17], who proved the existence and uniqueness of viscosity solutions (up to fattening) for smooth surface tensions $\sigma$ . Almgren, Taylor, and Wang [Reference Almgren, Taylor and Wang1] introduced a time discretization in the form of a minimizing movements scheme including crystalline surface tensions, yielding the so-called flat-flow solutions for anisotropic mean curvature flow. They also proved a short-time existence result for strong solutions if the surface tension $\sigma$ is smooth. Bellettini and Paolini [Reference Bellettini and Paolini7] provide a thorough introduction of the anisotropic mean curvature flow equation for a Finsler metric $\sigma$ , i.e., with the surface tension possibly depending on the position in space, and argue formally that the time discretization, the level-set equation proposed in [Reference Chen, Giga and Goto17], and the anisotropic Allen–Cahn equation lead to solutions to anisotropic mean curvature flow.
Allowing for sufficiently regular convex surface tensions $\sigma$ and arbitrary mobilities $\mu$ , Elliott and Schätzle [Reference Elliott and Schätzle21] proved that, in the sharp-interface limit, solutions to the anisotropic Allen–Cahn equation converge to anisotropic mean curvature flow in the sense of the viscosity formulation. Unlike in the present work, Elliott and Schätzle used a discontinuous, nonregularized version of $g$ and, therefore, resorted to viscosity solutions of the phase-field equation. In [Reference Chambolle and Novaga14], Chambolle and Novaga prove consistency of the minimizing movements scheme [Reference Almgren, Taylor and Wang1] and the MBO thresholding scheme for the energy (1.2) with anisotropic mean curvature flow using viscosity solutions. Similarly, using viscosity solutions and distributional solutions with an energy convergence hypothesis, Chambolle et al. [Reference Chambolle, De Gennaro and Morini11] prove convergence of the minimizing movements scheme [Reference Almgren, Taylor and Wang1] for a translationally dependent energy to inhomogeneous anisotropic mean curvature flow.
In the case of non-smooth (or crystalline) surface tensions, the Cahn–Hoffman vector field is effectively described in terms of a differential inclusion $\nu _{\sigma } \in \partial \sigma (\nu )$ and selection of the appropriate curvature can make the problem nonlocal. Recently, a variety of work has been invested in understanding crystalline curvature flow. Giga and Giga were among the first to develop a robust solution concept in the planar setting [Reference Giga and Giga29]. For crystalline surface tensions, existence and uniqueness was proven in dimension $d=3$ by Giga and Požár in [Reference Giga and Požár32], and the result was ultimately extended to arbitrary dimension in [Reference Giga and Požár33] by the same authors. Chambolle, Morini, and Ponsiglione [Reference Chambolle, Morini and Ponsiglione13] introduced a novel definition of supersolutions, subsolutions, and weak solutions to anisotropic mean curvature flow that is also based on level set techniques and is particularly useful for crystalline surface tensions. They presented an existence and uniqueness result up to fattening and a comparison principle. The same authors together with Novaga [Reference Chambolle, Morini, Novaga and Ponsiglione12] extended this result from the special case $\mu =\sigma$ to arbitrary mobilities $\mu$ .
Many of the above results are qualitative, but in the spirit of Chen [Reference Chen16], given the power of viscosity methods, quantitative rates of convergence have been derived and we refer the interested reader to [Reference Bellettini and Novaga6, Reference Giga, Ohtsuka and Schätzle31] and references therein.
In contrast to the above approaches, we will apply relative entropy methods to identify the limit of (1.4). A related approach regarding a localized energy excess was introduced by the first author with Otto in [Reference Laux and Otto41] where they proved convergence of the MBO thresholding scheme to multi-phase mean curvature flow. A nontrivial modification of this idea – based on controlling the tilt excess of the sharp or diffuse interface with regard to a smooth approximation – has been used to prove convergence of the vectorial Allen–Cahn equation to multiphase mean curvature flow [Reference Laux and Simon42] and derive an associated rate of convergence [Reference Fischer and Marveggio25] along with optimal quantitative convergence rates for the Allen–Cahn equation to mean curvature flow in the two-phase setting [Reference Fischer, Laux and Simon24]. A key feature of viscosity type solutions is the associated comparison principle, which automatically provides uniqueness of solutions up to the issue of fattening. However, in the multiphase case, fundamentally different tools are needed to address uniqueness: The relative entropy method [Reference Fischer, Hensel, Laux and Simon23] has been used to prove weak–strong uniqueness results for a variety of geometric evolution equations including planar multiphase flows (see, e.g., [Reference Fischer and Hensel22, Reference Hensel and Laux35, Reference Hensel and Laux36, Reference Laux40]).
In this paper, we prove convergence of the anisotropic Allen–Cahn equation (1.4) to anisotropic mean curvature flow (1.1) for arbitrary Lipschitz mobilities $\mu$ and $C^2$ surface tensions $\sigma$ under an energy convergence hypothesis, as is often used in application [Reference Kim, Mellet and Wu38, Reference Laux and Otto41, Reference Laux and Stinson43, Reference Luckhaus and Sturzenhecker47]. This result provides a complete proof for the result first announced in [Reference Laux39]. Further, we prove weak–strong uniqueness of the associated distributional solution concept for anisotropic mean curvature flow: If a smooth solution (which we will endow with the structure of a calibrated evolution) and a $BV$ solution share the same initial data, it follows that both solutions coincide for all times in their common interval of existence. We summarize these results here and refer to Theorems 4.1 and 5.2 for precise details.
Theorem 1.1. Let $\mu$ be a Lipschitz mobility, $\sigma$ a $C^{2}$ uniformly convex surface tension, and $\Omega$ the $d$ -dimensional torus. Then the following holds:
-
• Any sequence of weak solutions $u_\varepsilon$ of the anisotropic Allen–Cahn equation ( 1.4 ) with well-prepared initial conditions has a subsequence converging to some limit $u=\chi$ with $\chi \;:\;\Omega \times (0,T)\to \{0,1\}$ as $\varepsilon \to 0.$ Under an energy convergence hypothesis, $u$ is a weak solution of anisotropic mean curvature flow.
-
• Let $\sigma$ and $\mu$ be smooth. If ${\mathscr{A}(t)}_{t \in [0,T]}$ is a strong solution of anisotropic mean curvature flow ( 1.1 ) and $ \chi \;:\;\Omega \times (0,T)\to \{0,1\}$ is a distributional solution of anisotropic mean curvature flow with the same initial condition, i.e., $\chi (\cdot, 0) = \chi _{\mathscr{A}(0)},$ then $\chi \equiv \chi _{\mathscr{A}}$ for all $t \in [0,T].$
We remark that convergence of the anisotropic Allen–Cahn equation is well-studied, but as far as the authors are aware, this has exclusively been done from the perspective of viscosity solutions. Our result considers this from the distributional setting and may ultimately be amenable to tackling the multiphase setting that is most relevant to physical applications. Further, our uniqueness result for distributional solutions shows that it may be possible to obtain quantitative convergence rates in the spirit of Fischer et al. [Reference Fischer, Laux and Simon24] for the anisotropic Allen–Cahn equation.
The structure of the paper is as follows: In Section 2, we introduce the admissible class of anisotropies and, based on the closely connected notions of anisotropic surface energy and anisotropic mean curvature, derive the anisotropic mean curvature flow equation as a formal gradient flow. Here, we further introduce our notion of a distributional solution to anisotropic mean curvature flow. As a preparation for the convergence result, Section 3 discusses the anisotropic Allen–Cahn equation (1.4) and establishes the existence – via a time discretization – and regularity of weak solutions. Section 4 is devoted to the proof of the conditional convergence theorem. Finally, Section 5 covers the weak–strong uniqueness theorem.
Notation
Throughout the paper, let $d\geq 2$ be the ambient dimension. We consider the equations with periodic boundary conditions, i.e., for the domain we will always choose the flat torus $\Omega \;:\!=\;{\mathbb{R}}^{d}/\mathbb{Z}^{d}$ .
The $i$ -th unit vector will be denoted by $e_{i}$ , and the identity matrix in dimension $d$ will be written as $I_{d}\in{\mathbb{R}}^{d\times d}$ . For the scalar products of vectors $a, b \in{\mathbb{R}}^{d}$ and of matrices $A, B \in{\mathbb{R}}^{d \times d}$ , we write $a \cdot b \;:\!=\; \sum _{i=1}^{d}a_{i}b_{i}$ and $A\;:\;B\;:\!=\; \sum _{i,j=1}^{d}(A)_{ij}(B)_{ij}$ , respectively. The tensor product of $a, b \in{\mathbb{R}}^{d}$ is a matrix $a \otimes b \in{\mathbb{R}}^{d \times d}$ , which is given by $(a\otimes b)_{ij}=a_{i}b_{j}$ .
The symbol $\nabla$ is reserved for derivatives with respect to the space variable $x \in \Omega$ . For a vector field $X\;:\; \Omega \to{\mathbb{R}}^{d}$ , the derivative $\nabla X$ is a $(d\times d)$ -matrix defined by $(\nabla X)_{ij}=\partial _{j}X_{i}$ . Derivatives with respect to variables $p \in{\mathbb{R}}^{d}$ will be denoted by $D = D_{p}$ . In contrast to $\nabla$ , $D\sigma (\nu )$ will denote the column vector.
For a set $A \subset \mathbb{R}^d$ , $\chi _A$ is the characteristic function taking the value $1$ on $A$ and $0$ in the complement.
By a standard abuse of notation, we refer to the distributional derivative in space of a function $\chi \in BV(\Omega ;\;\{0,1\})$ by $\nabla \chi$ . In the case that $\chi \in L^1((0,T);\;BV(\Omega ))$ , we write $\nabla \chi \;:\!=\; \nabla \chi (\cdot,t )\otimes \mathcal{L}^1_{(0,T)}$ in the notation of Young measures.
The divergence of a vector field $X\;:\; \Omega \to{\mathbb{R}}^{d}$ is $\mathrm{div} X = \sum _{i=1}^{d} \partial _{i}X_{i}$ . The divergence of a matrix field $A\;:\; \Omega \to{\mathbb{R}}^{d \times d}$ is vector-valued, $(\mathrm{div} A)_{i}=\sum _{j=1}^{d}\partial _{j}(A)_{ij}$ .
For a vector $a \in{\mathbb{R}}^{d}$ we define the differential operator $a \cdot \nabla \;:\!=\;\sum _{i=1}^{d}a_{i}\partial _{i}$ . In particular, for a vector field $X$ , we have $(a\cdot \nabla )X=(\nabla X)a$ .
2. Anisotropies and anisotropic mean curvature
In this section, we develop the necessary mathematical preliminaries for the rest of the paper. We introduce the notion of admissible anisotropies in Subsection 2.1 and provide a calculation clarifying the formal view of anisotropic mean curvature flow as gradient flow in Subsection 2.2. Finally, we show in the smooth setting how the anisotropic curvature can be reinterpreted via integration by parts, allowing us to introduce a distributional solution for anisotropic mean curvature flow in Subsection 2.3.
2.1 Admissible anisotropies
In this subsection, we introduce a suitable class of anisotropic surface tensions and mobilities and discuss some elementary facts about admissible anisotropic surface tensions.
Definition 2.1. An admissible pair of anisotropies $(\sigma, \mu )$ is comprised of an admissible surface tension $\sigma \;:\; {\mathbb{R}}^{d} \to{\mathbb{R}}_{\geq 0}$ that satisfies
-
(S1) positive $1$ -homogeneity, i.e., $\sigma (\lambda p)=\lambda \sigma (p)$ for all $p\in{\mathbb{R}}^{d}$ , $\lambda \gt 0$ ,
-
(S2) positive definiteness, i.e., $\sigma (p)=0$ if and only if $p=0$ ,
-
(S3) smoothness, i.e., $\sigma \in C^{2}({\mathbb{R}}^{d}\setminus \{0\})$ ,
-
(S4) uniform convexity, i.e., $\sigma ^{2}\;:\;{\mathbb{R}}^{d}\to{\mathbb{R}}_{\geq 0}$ is strongly convex in the sense that there exists a constant $c\gt 0$ satisfying
\begin{equation*} \sigma ^{2}((1-t)p+tq)\leq (1-t)\sigma ^{2}(p)+t\sigma ^{2}(q)-c\frac {t(1-t)}{2}|p-q|^{2} \end{equation*}for all $p, q \in{\mathbb{R}}^{d}$ , $t \in (0,1)$ ,
and an admissible mobility $\mu \;:\;{\mathbb{R}}^{d} \to{\mathbb{R}}_{\geq 0}$ that satisfies
-
(M1) positive $1$ -homogeneity, i.e., $\mu (\lambda p)=\lambda \mu (p)$ for all $p\in{\mathbb{R}}^{d}$ , $\lambda \gt 0$ ,
-
(M2) positive definiteness, i.e., $\mu (p)=0$ if and only if $p=0$ ,
-
(M3) regularity, i.e., $\mu \in C^{0,1}({\mathbb{R}}^{d})$ .
The polar norm of $\sigma$ is
Example 2.2. A useful class of admissible surface tensions, which is suitable to approximate arbitrary norms, is introduced by Barrett, Garcke, and N FC;rnberg in [Reference Barrett, Garcke and Nürnberg4, (1.12)] as follows: Let $q \in [1, \infty )$ , $L \in{\mathbb{N}}$ , and let $G_{1}, G_{2}, \ldots, G_{L} \in{\mathbb{R}}^{d \times d}$ be symmetric and positive definite. We define
One can show that $\sigma$ is an admissible surface tension in the above sense (see also [Reference Giga30, Remark 1.7.5] and [Reference Bellettini and Paolini7, Example 4.6]).
For convenience, let us collect several useful properties of surface tensions.
Lemma 2.3. Let $\sigma$ be an admissible surface tension as in Definition 2.1 . Then
-
(i) $\sigma{^{\circ }}$ is an admissible surface tension and $\sigma ^{\circ \circ }=\sigma$ ,
-
(ii) $p\cdot q \leq \sigma (p)\sigma{^{\circ }}(q)$ for all $p, q \in{\mathbb{R}}^{d}$ ,
-
(iii) we have $0 \lt \min _{S^{d-1}}\sigma \leq \max _{S^{d-1}}\sigma \lt \infty$ , and
\begin{equation*} (\!\min _{S^{d-1}}\sigma )|p| \leq \sigma (p)\leq (\!\max _{S^{d-1}}\sigma )|p| \qquad \text {for all } p \in {\mathbb {R}}^{d}, \end{equation*} -
(iv) $D\sigma$ is positively $0$ -homogeneous and $D^{2}\sigma$ is positively $(\!-\!1)$ -homogeneous, i.e.,
\begin{equation*} D\sigma (\lambda p) = D\sigma (p) \qquad \text {and} \qquad D^{2}\sigma (\lambda p)=\frac {1}{\lambda }D^{2}\sigma (p) \qquad \text {for all }\lambda \gt 0,\, p \in {\mathbb {R}}^{d}\setminus \{0\}, \end{equation*} -
(v) $p\cdot D\sigma (p)=\sigma (p)$ for all $p \in{\mathbb{R}}^{d}\setminus \{0\}$ ,
-
(vi) $\sigma{^{\circ }}(D\sigma (p))=1$ for all $p \in{\mathbb{R}}^{d}\setminus \{0\}$ ,
-
(vii) $\sigma{^{\circ }}(q)D\sigma\!\left (D\sigma{^{\circ }}(q)\right )=q$ for all $q \in{\mathbb{R}}^{d}\setminus \{0\}$ .
Proof. (ii) can be shown directly from the definition of $\sigma{^{\circ }}$ and the positive $1$ -homogeneity of $\sigma$ .
(iii) and (iv) follow from the positive $1$ -homogeneity and continuity resp. smoothness of $\sigma$ .
(v) is a result of (iv) and the fundamental theorem of calculus, cf. [Reference Giga30, Section 1.7.2]:
Giga [Reference Giga30, Section 1.7.2] also provides the proofs for (vi), (vii), and (i) except the uniform convexity of $\sigma{^{\circ }}$ and the duality statement $\sigma ^{\circ \circ }=\sigma$ . A proof for $\sigma ^{\circ \circ }=\sigma$ can be found in [Reference Rockafellar50, Theorem 15.1 and Corollary 15.1.1]. The uniform convexity of $\sigma ^{\circ }$ can be proved in a similar way to [Reference Lindenstrauss and Tzafriri45, Proposition 1.e.2].
A useful trick with regard to the Euclidean metric $|\cdot |$ is to control quadratic errors for unit vectors via the inequality
with equality if and only if $|p^{\prime }|=1$ . In order to introduce a tilt excess functional for anisotropic mean curvature flow, we are interested in an anisotropic counterpart to the above inequality.
The suitable anisotropic inequality bounds squared distances $|p-p^{\prime }|^{2}$ by a term of the form $\sigma (p)-|p^{\prime }|D\sigma (p^{\prime })\cdot p$ . However, since the mapping $p^{\prime } \mapsto |p^{\prime }|D\sigma (p^{\prime })$ is, in general, not continuously differentiable at $p^{\prime }=0$ , we will use a truncated version instead. To this end, we fix a cutoff function $\psi \in C^{\infty }([0,\infty ))$ satisfying
The following lemma contains two estimates featuring the truncated version.
Lemma 2.4. Let $\sigma$ be an admissible surface tension. There exist constants $c_{\sigma }, C_{\sigma }\gt 0$ depending only on $\sigma$ such that
-
(i) we have
(2.3) \begin{equation} \sigma (p)-|p^{\prime }|\psi (|p^{\prime }|)D\sigma (p^{\prime })\cdot p \geq c_{\sigma }\left |p-p^{\prime }\right |^{2} \end{equation}for all $p, p^{\prime } \in{\mathbb{R}}^{d}$ such that $|p|=1$ and $\left |p^{\prime }\right |\leq 1$ , and -
(ii) we have
(2.4) \begin{equation} \sigma (p)-|p^{\prime }|\psi (|p^{\prime }|)D\sigma (p^{\prime })\cdot p\leq C_{\sigma }\left (\left |p-p^{\prime }\right |^{2}+\left (1-\left |p^{\prime }\right |\right )\right ) \end{equation}for all $p, p^{\prime } \in{\mathbb{R}}^{d}$ such that $|p|=1$ and $\left |p^{\prime }\right |\leq 1$ .
Proof. Variants of (i) and (ii) were provided by Dziuk [Reference Dziuk20, Proposition 2.2] and Laux [Reference Laux39, Lemma 3.3 and Section 4.2], respectively. The proof is included here for the reader’s convenience.
For inequality (i), we consider two cases with respect to $p^{\prime }$ : First, if $D\sigma (p^{\prime })\cdot p\lt \frac{\sigma (p)}{2}$ or $p^{\prime }=0$ , we use the estimate $|p-p^{\prime }|^{2}\leq 4$ to obtain
Second, if $p^{\prime }\neq 0$ and $D\sigma (p^{\prime })\cdot p\geq \frac{\sigma (p)}{2}$ , we have $\left |p+\frac{p^{\prime }}{|p^{\prime }|}\right |\geq \frac{\min _{S^{d-1}}\sigma }{\max _{S^{d-1}}|D\sigma |}\gt 0$ since
The uniform convexity assumption (S4) can equivalently be stated as follows (see [Reference Giga30, Remark 1.7.5]): There exists a constant $\underline{\sigma }\gt 0$ such that
for all $p^{*}\neq 0$ . We use a second-order Taylor expansion around $\frac{p^{\prime }}{|p^{\prime }|}$ and write the remainder in terms of an intermediate point $p^{*}=(1-t)p+t\frac{p^{\prime }}{|p^{\prime }|}$ with $t \in [0,1]$ . Together with the convexity property (2.6), it follows that
Furthermore, the assumption $D\sigma (p^{\prime })\cdot p \geq \frac{\sigma (p)}{2}$ allows us to compute
Finally, a combination of (2.7) and (2.8) together with an application of Young’s inequality yields
which completes the proof of (i) in the second case.
For inequality (ii), we distinguish two cases again. First, in the easier case $\left |p-p^{\prime }\right |\geq 1$ , we can estimate
Second, if $|p-p^{\prime }|\lt 1$ , then
and, therefore,
for all $t \in [0,1]$ . As in the proof of inequality (i) above, we introduce a second-order Taylor expansion around $\frac{p^{\prime }}{|p^{\prime }|}$ with an intermediate point $p^{*}=(1-t)p+t\frac{p^{\prime }}{|p^{\prime }|}$ , where $t \in [0,1]$ . Using this expansion, Lemma 2.3(ii), (vi), and Young’s inequality, we obtain
from which (ii) follows in the case $|p-p^{\prime }|\lt 1$ .
The following fact on the duality of $\sigma$ and $\sigma{^{\circ }}$ will be used at a later point.
Lemma 2.5. Let $B \in L^{1}(\Omega )^d$ , and let $\sigma$ be an admissible surface tension. Then
where the supremum is taken over all $\eta \in C^{1}(\Omega )^{d}$ such that $\sigma{^{\circ }}(\eta )\leq 1$ .
Proof. For $\eta \in C^{1}(\Omega )^{d}$ such that $\sigma{^{\circ }}(\eta )\leq 1$ , we have that
by Lemma 2.3. To see the other bound, whenever $B \neq 0$ note that $\sigma (B) = B\cdot D\sigma (B)$ and that $\sigma{^{\circ }}(D\sigma (B)) = 1$ by Lemma 2.3. As $\{\nu \;:\; \sigma ^{\circ }(\nu )\leq 1\}$ is a convex set, if $\eta$ is given by a smooth mollification of $D\sigma (B)$ , the inequality $\sigma ^{\circ }(\eta )\leq 1$ still holds. Approximating $D\sigma (B)$ by such $\eta$ shows that $\int _{\Omega }\sigma (B)dx\leq \sup _{\eta }\int _{\Omega }B\cdot \eta \, dx$ , concluding the proof.
2.2 Anisotropic mean curvature and surface energy
In the remainder of this section, we assume that $(\sigma,\mu )$ is an admissible pair of anisotropies according to Definition 2.1.
Similarly to [Reference Bellettini and Paolini7], we introduce the $\sigma$ -mean curvature of a hypersurface $\Gamma$ as the surface divergence of the Cahn–Hoffman vector of the outer unit normal $\nu$ , i.e., $H_{\sigma }=\mathrm{div}_{\Gamma }(D\sigma (\nu ))$ . For the notion of the Cahn–Hoffman vector, see [Reference Giga30, Section 1.3]. The surface divergence is given by $\mathrm{div}_{\Gamma }X \;:\!=\; \mathrm{div} X - \nu \cdot \nabla X\, \nu$ . Observe that, for every $C^{1}$ -extension of the normal $\nu$ to the whole space and every $x \in \Gamma$ , we have
where the last step uses the fact that $D^{2}\sigma (p)p=\frac{d}{ds}\Big |_{s=0}D\sigma (\mathrm{e}^{s}p)=\frac{d}{ds}\Big |_{s=0}D\sigma (p)=0$ for all $p \in{\mathbb{R}}^{d}\setminus \{0\}$ by the positive $0$ -homogeneity of $D\sigma$ , Lemma 2.3(iv).
Furthermore, we define the anisotropic surface energy $E$ as in (1.2). The relation between the anisotropic mean curvature and anisotropic surface energy becomes clear from the following theorem, which deals with the first variation and direction of steepest descent of the functional $E$ :
Theorem 2.6. Let $A \subset{\mathbb{R}}^{d}$ be a bounded open set with $C^{2}$ -boundary, and let $\nu$ denote the outer unit normal on $\partial A$ . Given a compactly supported vector field $B \in C_{c}^{1}({\mathbb{R}}^{d})^{d}$ , we define a one-parameter family of diffeomorphisms
For $|t| \lt \|\nabla B\|_{L^{\infty }}^{-1}$ , let $A(t)\;:\!=\; \Psi _{t}(A)$ and $u(x,t)\;:\!=\;\chi _{\Omega (t)}(x)$ . Then
Moreover, we define $\|B\|_{*}\;:\!=\;\left ({c_{0}}\int _{\partial A}\sigma{^{\circ }}(\!-\!{\mathrm{sgn}}(H_{\sigma })B)^{2}\frac{\sigma (\nu )^{2}}{\mu (\nu )}d{\mathcal{H}^{d-1}} \right )^{\frac{1}{2}}$ , so that $\|\cdot \|_{*}$ is bounded from below and above by the $L^{2}$ -norm for vector-valued functions $\partial A \to{\mathbb{R}}^{d}$ . If we denote the right-hand side of ( 2.12 ) by the dual pairing $(dE, B)$ , then a solution of the minimization problem
is given by a positive multiple of $-H_{\sigma }\frac{\mu (\nu )}{\sigma (\nu )}D\sigma (\nu )$ .
Proof. The identity for the first variation (2.12) was stated by Almgren, Taylor, and Wang [Reference Almgren, Taylor and Wang1, Section 2.2] and proved by Bellettini and Paolini in the more general context of Finsler geometry [Reference Bellettini and Paolini7, Theorem 5.1]. The proof of the steepest descent statement follows the idea of [Reference Bellettini and Paolini7, Proposition 5.1] but incorporates the (arbitrary) mobility $\mu$ :
Let $B\in L^{2}(\partial A)^{d}$ , then by Lemma 2.3(ii) and the Cauchy–Schwarz inequality it follows that
and Lemma 2.3(v), (vi) yield that the first inequality is an equality if $B=-\lambda{\mathrm{sgn}}(H_{\sigma })D\sigma (\nu )$ for some nonnegative function $\lambda \;:\; \partial A \to{\mathbb{R}}_{\geq 0}$ . Therefore, if $B$ is a constant positive multiple of $-H_{\sigma }\frac{\mu (\nu )}{\sigma (\nu )}D\sigma (\nu )$ , then both inequalities are equalities. The statement follows by the positive $1$ -homogeneity of $\|\cdot \|_{*}$ .
This theorem helps us to justify the gradient flow structure for anisotropic mean curvature flow as introduced earlier via (1.2) and (1.3): Considering that tangential components of the velocity $B$ do not contribute to the variation of $A$ , let us restrict ourselves to velocities of the form $B =-\lambda{\mathrm{sgn}}(H_{\sigma })D\sigma (\nu )$ , where $\lambda \in C(\partial A;\;{\mathbb{R}}_{\geq 0})$ . The normal component of such a vector field $B$ is $V=B\cdot \nu = -\lambda{\mathrm{sgn}}(H_{\sigma })\sigma (\nu )$ , and the metric term becomes
which is in accordance with (1.3). The velocity of steepest descent which was given in Theorem 2.6 satisfies
by Lemma 2.3(v), which (formally) verifies the gradient flow structure of (1.1).
2.3 Distributional solutions to anisotropic mean curvature flow
We introduce a distributional formulation for (1.1) that was proposed in [Reference Laux39]. The idea behind this formulation is to encode the $\sigma$ -mean curvature by a $(d\times d)$ -matrix via an integration by parts as follows:
Remark 2.7. Let $A \subset{\mathbb{R}}^{d}$ be a bounded open set with $C^{2}$ -boundary. For every vector field $B\in C^{1}({\mathbb{R}}^{d})^{d}$ , we have
To see this, we extend the normal $\nu$ to a vector field $\nu \in C^{1}({\mathbb{R}}^{d})^{d}$ , e.g., as a truncation of the normalized gradient $\frac{\nabla \mathrm{sdist}}{|\nabla \mathrm{sdist}|}$ of the signed distance function. There exists an open neighborhood $U$ of $\partial A$ where $\nu \neq 0$ , and in this neighborhood we can decompose the vector field $B$ as
The restriction of $B_{1}$ to $\partial A$ is tangential since, by Lemma 2.3(v), we have $B_{1}\cdot \nu = 0$ . By the divergence theorem on the closed surface $\partial A$ , we obtain
In this computation, the third equality follows by adding zero and using that $\nu \cdot \nabla \nu \,B_{1} =B_{1}\cdot \nabla \left (\frac{1}{2}|\nu |^{2}\right )=0$ . For the fourth equality observe that $(\nabla B_{1}){^{T}} \nu + B_{1}){^{T}} \nu + (\nabla \nu ){^{T}} B_{1}=\nabla (B_{1}\cdot \nu )=0$ since $B_{1}$ is tangential. holds true because the second fundamental form
is symmetric for all $x \in \partial A$ (see [Reference Giga30, Section 1.3]).
On the other hand, we have
which yields
From (2.15) and (2.16) we obtain (2.14).
Using the integration by parts (2.14), we arrive at a distributional formulation for anisotropic mean curvature which resembles the isotropic version due to Luckhaus and Sturzenhecker [Reference Luckhaus and Sturzenhecker47]. Following [Reference Laux39], our definition for $BV$ solutions to anisotropic mean curvature flow also includes an optimal energy dissipation inequality, which alludes to the gradient flow structure of the problem.
Definition 2.8 (Distributional solutions to anisotropic mean curvature flow). Let $\chi _{0} \in BV(\Omega ;\;\{0,1\})$ . A distributional (or $BV$ ) solution to $V=-\mu (\nu ) H_{\sigma }$ with initial condition $\chi _{0}$ is a function $\chi \in BV\left (\Omega \times (0,T);\left \{0,1\right \}\right )\cap C^{0,\frac{1}{2}}\left ([0,T];\;\;L^{1}(\Omega )\right )\cap L^{\infty }\left (0,T;\;BV(\Omega )\right )$ such that
-
(i) there exists a $\big |\nabla \chi \big |$ -measurable function $V\;:\; \Omega \times (0,T) \to{\mathbb{R}}$ such that
(2.17) \begin{equation} {\int _{0}^{T}\int _{\Omega }} V^{2}{\big |\nabla \chi \big |} \lt \infty, \end{equation}which is the normal velocity in the sense that for all $\zeta \in C^{1}(\Omega \times [0,T])$ and ${T^{\prime }}{T^{\prime }} \in (0,T]$ :(2.18) \begin{align} \int _{\Omega } \zeta (x,{T^{\prime }})\chi (x,{T^{\prime }})dx&-\int _{\Omega } \zeta (x,0)\chi _{0}(x)dx \nonumber \\[5pt] &=\int _{0}^{{T^{\prime }}}\int _{\Omega } \partial _{t}\zeta (x,t)\chi (x,t)dx dt + \int _{0}^{{T^{\prime }}}\int _{\Omega } \zeta (x,t)V(x,t){\big |\nabla \chi \big |}, \end{align} -
(ii) the relation $V=-\mu (\nu ) H_{\sigma }$ is satisfied in a distributional sense, i.e., for all $B \in C^{1}(\Omega \times [0,T])^{d}$ , we have
(2.19) \begin{equation} -{\int _{0}^{T}\int _{\Omega }} \frac{1}{\mu (\nu )}VB\cdot \nu{\big |\nabla \chi \big |} ={\int _{0}^{T}\int _{\Omega }} \nabla B \;:\; \left (\sigma (\nu )I_{d}-\nu \otimes D\sigma (\nu ) \right ){\big |\nabla \chi \big |}, \end{equation}and -
(iii) the function $\chi$ satisfies the optimal energy dissipation inequality
(2.20) \begin{equation} \int _{\Omega }\sigma (\nu )|\nabla \chi ({T^{\prime }})|+\int _{0}^{{T^{\prime }}}\int _{\Omega }\frac{1}{\mu (\nu )}V^{2}{\big |\nabla \chi \big |} \leq \int _{\Omega }\sigma (\nu )|\nabla \chi _0| \end{equation}for every ${T^{\prime }} \in [0,T]$ .
A prime example of anisotropic mean curvature flow is the following self-similar solution, which generalizes the evolution of shrinking spheres by (isotropic) mean curvature flow. This example of motion by anisotropic mean curvature flow is derived in [Reference Giga30, Section 1.7.2].
Example 2.9 (The Wulff shape). Let $\sigma$ be an admissible surface tension, and suppose that $\mu =\sigma$ . The Wulff shape associated with $\sigma$ is the set
Then
-
• the $\sigma$ -mean curvature of $\partial \mathscr{W}_{\sigma }$ is $H_{\sigma }\equiv d-1$ , and
-
• the family $\{\Gamma (t) \}_{t\in [0,\frac{1}{2(d-1)})}$ defined by $\Gamma (t)=\sqrt{1-2(d-1)t}\ \partial \mathscr{W}_{\sigma }$ evolves by anisotropic mean curvature flow $V=-\sigma (\nu )H_{\sigma }$ .
In Theorem 4.1, we will prove convergence of the anisotropic Allen–Cahn equation (1.4) to a distributional solution of anisotropic mean curvature flow in the sense of Definition 2.8, but first we must introduce an appropriate notion of solution for (1.4).
3. The anisotropic Allen–Cahn equation
The goal of this chapter is to construct solutions to the anisotropic Allen–Cahn equation (1.4) and to establish spatial $H^{2}$ -regularity for these solutions.
From now on, our assumptions on the double-well potential $W$ are
-
(W1) $W(0)=W(1)=0$ and $W(s)\gt 0$ for all $s \in{\mathbb{R}}\setminus \{0, 1\}$ ,
-
(W2) $W \in C^{1}({\mathbb{R}})$ ,
-
(W3) $W$ is twice differentiable at $0$ and $1$ , with $W^{\prime \prime }(0),\, W^{\prime \prime }(1)\gt 0$ ,
-
(W4) there exists $\lambda \gt 0$ such that $W$ is $\lambda$ -convex (see Remark 3.1), and
-
(W5) $W$ is nonincreasing on $(\!-\!\infty, 0]$ and nondecreasing on $[1,\infty )$ .
While (W1) and (W2) are common assumptions on double-well potentials, (W3) is motivated by the desirable property of exponential convergence of solutions to (1.4) to the wells far away from the diffuse interface and is not necessary for the main results of existence and sharp-interface limit. This assumption was used in a similar way by Sternberg [Reference Sternberg52]. A reference which makes use of the exponential convergence is [Reference Fischer, Laux and Simon24], where Fischer, Simon, and the first author derive a convergence rate for the sharp-interface limit in the isotropic case.
Assumption (W4) is needed in the proof of existence of solutions to guarantee the convergence of the approximate energies. Assumption (W5) is dispensable if one only considers solutions $u_{\varepsilon }$ to (1.4) that satisfy $0\leq u_{\varepsilon }\leq 1$ almost everywhere.
Up to a linear scaling, a possible choice for $W$ is the standard double-well potential $W(s)=\frac{9}{16}\left (1-s^{2}\right )^{2}$ . Here, the prefactor $\frac{9}{16}$ is chosen such that ${c_{0}}\;:\!=\;\int _{-1}^{1}\sqrt{W(s)}ds=1$ .
Remark 3.1. Let $\mathcal{X}$ be a Hilbert space and $\lambda \gt 0$ . A function $F\;:\;\mathcal{X}\to [0,\infty ]$ is called $\lambda$ -convex if the following equivalent conditions are satisfied:
-
(i) For all $x_{0}\in \mathcal{X}$ , the mapping $x \mapsto F(x)+\frac{\lambda }{2}\left \|x-x_{0}\right \|_{\mathcal{X}}^{2}$ is convex,
-
(ii) for all $x, y \in \mathcal{X}$ and $\mu \in (0,1)$ , we have
\begin{equation*} F\left ((1-\mu )x+\mu y\right )\leq (1-\mu )F(x)+\mu F(y) + \frac {\lambda }{2}\mu (1-\mu )\left \|y-x\right \|_{\mathcal {X}}^{2}. \end{equation*}
Furthermore, we will always assume that $(\sigma, \mu )$ is an admissible pair of anisotropies in accordance with Definition 2.1. The information on the surface tension $\sigma$ and mobility $\mu$ is contained in the functions $f$ , $g$ as defined in (1.5) and (1.6), respectively. We remark that there is some freedom with regard to the choice of $g$ : Since $\frac{\sigma (p)}{\mu (p)}$ is not defined for $p=0$ , adding $+1$ in the numerator as well as the denominator (cf. (1.6)) is a means of avoiding a singularity at $0$ . Our existence and conditional convergence statements rely merely on the following properties of $g$ :
Lemma 3.2. Letting $g$ be as in ( 1.6 ), the following holds:
-
(i) The function $g\;:\;{\mathbb{R}}^{d}\to{\mathbb{R}}_{\gt 0}$ is continuous,
-
(ii) we have $c_{g}\;:\!=\;\inf _{{\mathbb{R}}^{d}}g \gt 0$ , and
-
(iii)
(3.1) \begin{equation} \lim _{|p|\to \infty }\left |g(p)- \frac{\sigma (p)}{\mu (p)} \right |=0. \end{equation}
The following weak solution concept for the anisotropic Allen–Cahn equation combines an integration by parts in space (ii) with an optimal energy dissipation identity (iii), which hints at the gradient flow structure of (1.4). Prescribing the initial data as in (i) is possible because $u_{\varepsilon }(0)$ is well-defined via the embedding $H^{1}(\Omega \times (0,T))\subset C^{0,\frac{1}{2}}\left ([0,T];\;L^{2}(\Omega )\right )$ .
Definition 3.3 (Solutions to the anisotropic Allen–Cahn equation). Let $\varepsilon \gt 0$ and $u_{\varepsilon,0} \in \mathrm{dom}(E_{\varepsilon })$ . A function
is a solution to the anisotropic Allen–Cahn equation with initial data $u_{\varepsilon,0}$ if
-
(i)
(3.2) \begin{equation} u_{\varepsilon }(0) = u_{\varepsilon,0}, \end{equation} -
(ii)
(3.3) \begin{equation} {\int _{0}^{T}\int _{\Omega }} g(\!-\!\nabla u_{\varepsilon })\partial _{t}u_{\varepsilon }\varphi \, dx dt ={\int _{0}^{T}\int _{\Omega }} \frac{1}{2}\left (D f(\!-\!\nabla u_{\varepsilon }))\cdot \nabla \varphi -\frac{1}{\varepsilon ^{2}}W^{\prime }(u_{\varepsilon }) \varphi \right ) dx dt \end{equation}for all $\varphi \in C^{1}(\Omega \times [0,T])$ , and
-
(iii)
(3.4) \begin{equation} E_{\varepsilon }[u_{\varepsilon }({T^{\prime }})] + \int _{0}^{T^{\prime }}\int _{\Omega }\varepsilon g(\!-\!\nabla u_{\varepsilon })(\partial _{t}u_{\varepsilon })^{2}dx dt = E_{\varepsilon }[u_{\varepsilon,0}] \end{equation}for all ${T^{\prime }} \in (0,T]$ .
We comment on the typical behavior of the anisotropic Allen–Cahn equation (1.4).
Remark 3.4. The reaction term $-\frac{1}{\varepsilon ^{2}}W^{\prime }(u_{\varepsilon })$ forces the solution $u_{\varepsilon }$ towards the wells of $W$ , i.e., towards the set $\{0, 1\}$ . The interplay of the reaction term with the anisotropic diffusion term $-\mathrm{div}\left (Df(\!-\!\nabla u_{\varepsilon })\right )$ results in a transition layer of width $\mathcal{O}(\varepsilon )$ whose shape also depends on the direction of the approximate outer normal $-\frac{\nabla u_{\varepsilon }}{|\nabla u_{\varepsilon }|}$ . More precisely, the typical width of the transition layer is $\varepsilon \sigma\!\left (-\frac{\nabla u_{\varepsilon }}{|\nabla u_{\varepsilon }|}\right )$ .
To see this, one can choose the $1$ -dimensional stationary ansatz
with $\nu \in S^{d-1}$ being a fixed unit-length vector. Together with a monotonicity assumption on the transition, this admits the solution
where $\Theta$ is the unique solution to
With this in mind, we note that the weighted $L^{2}$ -metric (1.8) depends on $\sigma$ and $\mu$ , whereas anisotropic mean curvature is a gradient flow with respect to the metric (1.3), which depends only on $\mu$ . This can be explained by the idea that, in the anisotropic Allen–Cahn equation, the metric (1.8) has to compensate for the typical width of the transition layer varying with the orientation of the normal.
3.1 Existence of weak solutions
Theorem 3.5 (Existence). Let $\varepsilon \gt 0$ and $u_{0} \in \mathrm{dom}(E_{\varepsilon })\cap L^{\infty }(\Omega )$ . For any finite time horizon $T \in (0,\infty )$ , there exists a solution $u$ to the anisotropic Allen–Cahn equation in the sense of Definition 3.3 with initial data $u_{0}$ . Furthermore, $u$ satisfies
To prove this existence theorem, we exploit the gradient-flow structure of the equation and construct solutions via a minimizing movements scheme. To this end, we consider an approximation of the PDE (1.4) that replaces the time derivative by difference quotients: Given a time-step size $h\gt 0$ , let us look for functions $\{u_{h}^{n}\}_{n\in{\mathbb{N}}_{0}}$ that solve
In fact, equation (3.9) is the strong Euler–Lagrange equation of the variational problem
where we recall the energy (1.7) and the inner-product (1.8) for which we use the notation
in which the dependence on $\varepsilon$ is suppressed again.
Equation (3.9) does not precisely resemble an implicit Euler scheme as it features the explicit term $g(\!-\!\nabla{u_{h}^{n-1}})$ rather than the implicit term $g(\!-\!\nabla{u_{h}^{n}})$ . Instead (3.9) can be viewed an implicit-explicit splitting discretization scheme of (1.4). This is mirrored in the minimization problem (3.11) by the occurrence of the metric $(\cdot,\cdot )_{{u_{h}^{n-1}}}$ , which is taken with respect to the constant point $u_{h}^{n-1}$ and therefore does not depend on $u$ . Thanks to this choice, one can immediately prove that (3.11) admits a unique solution as soon as $h$ is small enough, using the direct method in the calculus of variations:
Lemma 3.6. Let $E_\varepsilon$ be defined as in ( 1.7 ) and assumptions (W1) – (W5) hold. It follows that:
-
(i) The energy $E_{\varepsilon }$ is $\frac{\lambda }{2\varepsilon }$ -convex on $L^{2}(\Omega )$ ,
-
(ii) if $h \lt \frac{2\varepsilon ^{2}c_{g}}{\lambda }$ , then $u \mapsto E_{\varepsilon }[u]+\frac{1}{2h}\left \|u-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}$ is strongly convex,
-
(iii) for all $h \gt 0$ , the mapping $u \mapsto E_{\varepsilon }[u]+\frac{1}{2h}\left \|u-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}$ is lower semicontinuous on $L^{2}(\Omega )$ ,
-
(iii) for $h \lt \frac{2\varepsilon ^{2}c_{g}}{\lambda }$ and ${u_{h}^{n-1}} \in \mathrm{dom}(E_{\varepsilon })$ , there exists a unique minimizer ${u_{h}^{n}} \in \mathrm{dom}(E_{\varepsilon })$ for ( 3.11 ), and $\|{u_{h}^{n}}\|_{L^{\infty }}\leq \max \left \{\|{u_{h}^{n-1}}\|_{L^{\infty }},1 \right \}$ .
Proof.
-
(i) Let $u, v \in L^{2}(\Omega )$ and $\mu \in [0,1]$ . We will use the equivalent formulation (ii) in Remark 3.1, so that we have to show that
\begin{equation*} E_{\varepsilon }[(1-\mu )u+\mu v] \leq (1-\mu ) E_{\varepsilon }[u]+ \mu E_{\varepsilon }[v]+\frac {\lambda }{4\varepsilon }\mu (1-\mu )\|v-u\|_{L^{2}}^{2} \end{equation*}for all $u,v \in L^{2}(\Omega )$ and $\mu \in (0,1)$ . We can assume without loss of generality that $u, v \in \mathrm{dom}(E_{\varepsilon })$ . Exploiting the convexity of $f$ as well as the $\lambda$ -convexity of $W$ (W4), we obtain(3.12) \begin{align} E_{\varepsilon }[(1-\mu )u+\mu v] &= \frac{1}{2}\int _{\Omega }\left (\varepsilon f(\!-(1-\mu )\nabla u - \mu \nabla v)+\frac{1}{\varepsilon }W((1-\mu )u+\mu v)\right )dx \nonumber \\[5pt] &\leq \frac{1}{2}\int _{\Omega }\left (\varepsilon (1-\mu ) f(\!-\!\nabla u) +\varepsilon \mu f(\!-\!\nabla v) \right )dx \nonumber \\[5pt] &\quad + \frac{1}{2}\int _{\Omega }\left (\frac{1-\mu }{\varepsilon }W(u)+\frac{\mu }{\varepsilon }W(v)+\frac{\lambda }{2\varepsilon }\mu (1-\mu )|v-u|^{2}\right )dx \nonumber \\[5pt] &\leq (1-\mu )E_{\varepsilon }[u]+\mu E_{\varepsilon }[v] + \frac{\lambda }{4\varepsilon }\mu (1-\mu )\left \|v-u\right \|_{L^{2}}^{2}. \end{align} -
(ii) This follows from (i) and quantifying the strong convexity of the metric term: Given $u, v \in L^{2}(\Omega )$ and $\mu \in (0,1)$ , we compute
(3.13) \begin{align} \frac{1}{2h}\left \|((1-\mu )u\vphantom{+\mu v)-{u_{h}^{n-1}}}\right .& \left .\vphantom{((1-\mu )u}\hspace{-3pt}{+} \mu v)-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}\nonumber \\[5pt] &= \frac{1-\mu }{2h}\|u-{u_{h}^{n-1}}\|_{{u_{h}^{n-1}}}^{2}+\frac{\mu }{2h}\|v-{u_{h}^{n-1}}\|_{{u_{h}^{n-1}}}^{2}\nonumber \\[5pt] &\quad - \frac{\mu (1-\mu )}{2h}\int _{\Omega }\varepsilon g(\!-\!\nabla{u_{h}^{n-1}})|v-u|^{2}dx \\ &\leq \frac{1-\mu }{2h}\|u-{u_{h}^{n-1}}\|_{{u_{h}^{n-1}}}^{2}+\frac{\mu }{2h}\|v-{u_{h}^{n-1}}\|_{{u_{h}^{n-1}}}^{2}\nonumber \\[5pt] &\quad - \frac{\varepsilon c_{g}}{2h}\mu (1-\mu )\left \|v-u\right \|_{L^{2}}^{2}. \end{align}Combining (3.12) with (3.13) yields strong convexity for $u \mapsto E_{\varepsilon }[u]+\frac{1}{2h}\left \|u-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}$ if $\frac{\lambda }{4\varepsilon }\lt \frac{\varepsilon c_{g}}{2h},$ or, equivalently, if $h\lt \frac{2\varepsilon ^{2}c_{g}}{\lambda }$ . -
(iii) This will follow from the lower semi-continuity of the individual terms. To see that the anisotropic Dirichlet energy is lower semi-continuous, we can assume without restriction that $v_{k} \in H^{1}(\Omega )$ for all $k \in{\mathbb{N}}$ , so that it remains to prove that
\begin{equation*} \frac {\varepsilon }{2}\int _{\Omega }f(\!-\!\nabla v)dx \leq \liminf _{k \to \infty }\frac {\varepsilon }{2}\int _{\Omega }f(\!-\!\nabla v_{k})dx. \end{equation*}Without loss of generality, $v_k$ converge to $v$ weakly in $H^1(\Omega ),$ and lower semi-continuous follows from convexity of $f$ . The lower semicontinuity of the potential term $\int _{\Omega } W(u) \, dx$ and the metric term $u\mapsto \frac{1}{2h}\left \|u-{u_{h}^{n-1}} \right \|_{{u_{h}^{n-1}}}^{2}$ follow from Fatou’s lemma. -
(iv) The functional to be minimized is proper. Let $\left \{v_{k}\right \}_{k \in{\mathbb{N}}}$ be a minimizing sequence, i.e.,
\begin{equation*} \lim _{k \to \infty } \left (E_{\varepsilon }[v_{k}]+\frac {1}{2h}\left \|v_{k}-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}\right ) =\inf _{u \in L^{2}(\Omega )}\left \{E_{\varepsilon }[u]+\frac {1}{2h}\left \|u-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}\right \}. \end{equation*}Every minimizing sequence is bounded in $H^{1}(\Omega )$ since(3.14) \begin{align} \left \|v_{k}\right \|_{H^{1}}^{2}&\leq 2\left \|v_{k}-{u_{h}^{n-1}}\right \|_{L^{2}}^{2} + 2 \left \|{u_{h}^{n-1}}\right \|_{L^{2}}^{2}+ \left \|\nabla v_{k}\right \|_{L^{2}}^{2}\nonumber \\[5pt] &\leq \frac{2}{\varepsilon c_{g}}\left \|v_{k}-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2} + 2 \left \|{u_{h}^{n-1}}\right \|_{L^{2}}^{2}+ \frac{1}{(\!\min _{S^{d-1}}\sigma )^{2}}\int _{\Omega }f(\!-\!\nabla v_{k})dx\nonumber \\[5pt] &\leq 2 \left \|{u_{h}^{n-1}}\right \|_{L^{2}}^{2}+\left (\frac{4h}{\varepsilon c_{g}}+\frac{2}{\varepsilon (\!\min _{S^{d-1}}\sigma )^{2}} \right )\left (E_{\varepsilon }[v_{k}]+\frac{1}{2h}\left \|v_{k}-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}\right ). \end{align}By Rellich’s theorem, there exists a subsequence converging in $L^{2}(\Omega )$ to a limit function $v \in L^{2}(\Omega )$ . The lower semicontinuity (iii) then shows that\begin{equation*} v \in \underset {u \in L^{2}(\Omega )}{\arg \min }\left \{E_{\varepsilon }[u]+ \frac {1}{2h}\left \|u-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2} \right \}. \end{equation*}The minimizer $v$ is unique due to the strong convexity (ii). For the $L^{\infty }$ -bound, we define a truncated version of $u_{h}^{n}$ by\begin{equation*} v(x)\;:\!=\;\begin {cases} -\alpha ^{n-1}+\tfrac 12 &\text {if } {u_{h}^{n}}(x)\lt -\alpha ^{n-1}+\tfrac 12,\\[5pt] {u_{h}^{n}}(x)&\text {if } |{u_{h}^{n}}(x)-\tfrac 12|\leq \alpha ^{n-1},\\[5pt] \alpha ^{n-1}+\tfrac 12 &\text {if } {u_{h}^{n}}(x)\gt \alpha ^{n-1}+\tfrac 12, \end {cases} \end{equation*}where $\alpha ^{n-1} \;:\!=\; \max \{\|{u_{h}^{n-1}}-\tfrac 12\|_{L^{\infty }},\tfrac 12\}.$ Letting $A\;:\!=\; \{x\in \Omega \,\big |\, v(x)\neq{u_{h}^{n}}(x) \}$ , it follows from ${u_{h}^{n}} \in \mathrm{dom}(E_{\varepsilon })\subseteq H^{1}(\Omega )$ and a general fact about Sobolev spaces that $v \in H^{1}(\Omega )$ and(3.15) \begin{equation} \nabla v(x) =\begin{cases} \nabla{u_{h}^{n}}(x) &\text{for almost every } x \in \Omega \setminus A,\\[5pt] 0&\text{for almost every } x\in A.\end{cases} \end{equation}Then, using (3.15), the positive definiteness of $\sigma$ , the monotonicity assumption (W5), and the pointwise inequality $|v-{u_{h}^{n-1}}|\leq |{u_{h}^{n}}-{u_{h}^{n-1}}|$ , one obtains\begin{align*} E_{\varepsilon }[v]+\frac{1}{2h}\left \|v-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}&\leq \frac{\varepsilon }{2}\int _{\Omega }f(\!-\!\nabla{u_{h}^{n}})dx+\frac{1}{2\varepsilon }\int _{\Omega }W({u_{h}^{n}})dx\\[5pt] &\quad +\frac{\varepsilon }{2h}\int _{\Omega }g(\!-\!\nabla{u_{h}^{n-1}})|{u_{h}^{n}}-{u_{h}^{n-1}}|^{2}dx\\[5pt] &=E_{\varepsilon }[{u_{h}^{n}}]+\frac{1}{2h}\left \|{u_{h}^{n}}-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2}. \end{align*}By the uniqueness of minimizers, this implies that $v={u_{h}^{n}}$ and, therefore,\begin{equation*} \|{u_{h}^{n}}-\tfrac 12\|_{L^{\infty }}=\|v-\tfrac 12\|_{L^{\infty }}\leq \alpha ^{n-1} = \max \left \{\|{u_{h}^{n-1}}-\tfrac 12\|_{L^{\infty }},\tfrac 12 \right \}. \end{equation*}
In order to pass to the limit $h \searrow 0$ , we define one affine and two piecewise constant interpolations: For $n \in{\mathbb{N}}$ such that $t \in [(n-1)h, nh)$ let
where ${u_{h}^{0}} \in \mathrm{dom}(E)$ and $u_{h}^{n}$ are defined inductively via (3.11) for $n \in{\mathbb{N}}$ .
Before we turn to the limiting function $u$ , let us prove the following inequality, which is a consequence of the $\frac{\lambda }{2\varepsilon }$ -convexity of $E_{\varepsilon }$ and the minimizing property of the time steps $u_{h}^{n}$ $u_{h}^{n}$ (3.11):
Lemma 3.7. Let $h\lt \frac{2\varepsilon ^{2}c_{g}}{\lambda }$ and $w \in L^{2}(\Omega \times (0,T))$ . Then
for almost every $t \in (0,T)$ .
Proof. It suffices to prove the corresponding inequality involving the time steps $u_{h}^{n}$ , $n \in{\mathbb{N}}$ : Let $h\lt \frac{\varepsilon ^{2}c_{g}}{2\lambda }$ and $n \in{\mathbb{N}}$ . Then for all $v \in L^{2}(\Omega )$ , we have
Clearly, one can write the interpolations $u_{h}$ , $\overline{u}_{h}$ , and $\underline{u}_{h}$ on the left-hand side of (3.17) in terms of the time steps, choosing $n=n(t) \in{\mathbb{N}}$ such that $t \in [(n(t)-1)h, n(t)h)$ . We also observe that the piecewise affine interpolation $u_{h} \in C\left ([0,T];\;L^{2}(\Omega )\right )$ has a weak time derivative $\partial _{t}u_{h}$ given by
for almost every $t \in (0,T)$ . Therefore, (3.17) follows from (3.18) by reformulating the interpolation in terms of the time steps and using $v=w(t)$ for fixed $t \in (0,T)$ .
To prove (3.18), let $v \in L^{2}(\Omega )$ and $\delta \in (0,1)$ . Again, we use the equivalent characterization of $\frac{\lambda }{2\varepsilon }$ -convexity in Remark 3.1(ii). For computational ease, first note, that
Subtracting $(1-\delta )|u^n-u^{n-1}|^2$ from both sides of the above equality, we may directly compute
where the third step is an application of (3.11), with $\delta v + (1-\delta ){u_{h}^{n}}$ acting as a contender for the minimization problem. Taking the limit $\delta \searrow 0$ finally yields (3.18).
An application of the Arzelà–Ascoli theorem, as performed in the lemma below, shows that a subsequence of $\{u_{h} \}_{h}$ converges strongly, and the derivatives in time and space will turn out to converge weakly in $L^{2}$ . However, due to the nonlinear term $g(\!-\!\nabla u)$ in the anisotropic Allen–Cahn equation (1.4), weak convergence of the gradients is not sufficient to show that the limit function solves the equation. We will therefore use a convexity argument to also prove strong convergence of the gradients.
Lemma 3.8. Let $\{{u_{h}^{0}}\}_{h \in (0, \frac{2\varepsilon ^{2}c_{g}}{\lambda })}\subset \mathrm{dom}(E)$ such that $\limsup _{h\searrow 0}E_{\varepsilon }[{u_{h}^{0}}] \lt \infty$ and $\limsup _{h\searrow 0}\|{u_{h}^{0}}\|_{L^{\infty }(\Omega )}\lt \infty$ . Then there exist a sequence $h\searrow 0$ , denoted without relabeling, and a limit function $u \in H^{1}(\Omega \times (0,T))$ such that
-
(i)
\begin{equation*} u_{h}(t), \:\overline {u}_{h}(t), \:\underline {u}_{h}(t)\longrightarrow u(t) \qquad \text {in } L^{2}(\Omega )\text { uniformly in } t\in [0,T], \end{equation*} -
(ii)
\begin{equation*}\partial _{t}u_{h}{\rightharpoonup } \partial _{t}u \qquad \text {in } L^{2}(\Omega \times (0,T)), \end{equation*} -
(iii)
\begin{equation*} \nabla u_{h},\: \nabla \overline {u}_{h}, \:\nabla \underline {u}_{h}\longrightarrow \nabla u \qquad \text {in } L^{2}(\Omega \times (0,T))^{d} \end{equation*}
as $h\searrow 0$ , where $u_{h}$ , $\overline{u}_{h}$ , and $\underline{u}_{h}$ are constructed as above. Furthermore, the mapping $u\;:\; [0,T]\to L^{2}(\Omega )$ is $\frac{1}{2}$ -Hölder continuous and satisfies the $L^{\infty }$ -bound $\|u-\tfrac 12\|_{L^{\infty }(\Omega \times (0,T))}\leq \max \left \{\limsup _{h\searrow 0}\|{u_{h}^{0}}-\tfrac 12\|_{L^{\infty }(\Omega )},\tfrac 12 \right \}$ .
Proof.
-
(i) The first step is to prove a $H^{1}$ -bound for the minimizers $u_{h}^{n}$ , $n \in{\mathbb{N}}_{0}$ :
(3.20) \begin{align} \left \|{u_{h}^{n}}\right \|_{H^{1}(\Omega )}^{2}&=\|{u_{h}^{n}}\|_{L^{2}(\Omega )}^{2}+\left \|\nabla{u_{h}^{n}}\right \|_{L^{2}(\Omega )}^{2}\nonumber \\[5pt] &\leq \left \|{u_{h}^{n}}\right \|_{L^{\infty }(\Omega )}^{2}+ \frac{1}{\min _{S^{d-1}}\sigma ^{2}}\int _{\Omega }f(\!-\!\nabla{u_{h}^{n}})dx\nonumber \\[5pt] &\leq \left \|{u_{h}^{n}}\right \|_{L^{\infty }(\Omega )}^{2}+ \frac{2}{\varepsilon \min _{S^{d-1}}\sigma ^{2}}E_{\varepsilon }[{u_{h}^{n}}]\nonumber \\[5pt] &\leq \max \left \{\left \|{u_{h}^{0}}\right \|_{L^{\infty }(\Omega )}^{2},1\right \}+ \frac{2}{\varepsilon \min _{S^{d-1}}\sigma ^{2}}E_{\varepsilon }[{u_{h}^{0}}], \end{align}where the last inequality uses Lemma 3.6(iv) and (3.11) repeatedly.For any $t \in [0,T]$ , the function $u_{h}(t)$ is a convex combination of two minimizers $u_{h}^{n-1}$ , $u_{h}^{n-1}$ , $u_{h}^{n}$ of (3.11). Applying the triangle inequality yields
(3.21) \begin{align} \sup _{t\in [0,T]}\limsup _{h \searrow 0}\left \|u_{h}(t)\right \|_{H^{1}(\Omega )}&\leq \left (\!\max \left \{\limsup _{h\searrow 0}\left \|{u_{h}^{n}}\right \|_{L^{\infty }(\Omega )}^{2},1\right \}\vphantom{+\frac{2}{\varepsilon \min _{S^{d-1}}\sigma ^{2}}\limsup _{h\searrow 0}E_{\varepsilon }[{u_{h}^{0}}]}\right .\nonumber \\[5pt] &\hspace{8.5pt}{\left . \vphantom{\max \left \{\limsup _{h\searrow 0}\left \|{u_{h}^{n}}\right \|_{L^{\infty }(\Omega )}^{2},1\right \}} +\frac{2}{\varepsilon \min _{S^{d-1}}\sigma ^{2}}\limsup _{h\searrow 0}E_{\varepsilon }[{u_{h}^{0}}]\right )}^{\frac{1}{2}}\nonumber \\[5pt] &\lt \infty . \end{align}It follows from Rellich’s compact embedding theorem that any sequence $\{u_{h_{k}}(t)\}_{k \in{\mathbb{N}}}{\mathbb{N}}$ with $t \in [0,T]$ and $\lim _{k\to \infty }h_{k}=0$ has a subsequence converging in $L^{2}(\Omega )$ . To prove equicontinuity in time, which is the second requirement for the Arzelà–Ascoli theorem, we will – as is standard with minimizing movement approximations – show the stronger statement that $u_{h} \in C\left ([0,T];\;L^{2}(\Omega )\right )$ are $\frac{1}{2}$ -Hölder continuous and the Hölder constants are uniformly bounded as $h \searrow 0$ . Indeed, for $n \in{\mathbb{N}}$ , we use the definition of the metric $(\cdot,\cdot )_{{u_{h}^{n-1}}}$ and the minimization property (3.11) to show that(3.22) \begin{align} \left \|{u_{h}^{n}}-{u_{h}^{n-1}} \right \|_{L^{2}(\Omega )}^{2} &\leq \frac{1}{\varepsilon c_{g}}\left \|{u_{h}^{n}}-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2} \nonumber \\[5pt] &= \frac{2h}{\varepsilon c_{g}}\left (E_{\varepsilon }[{u_{h}^{n}}]+\frac{1}{2h}\left \|{u_{h}^{n}}-{u_{h}^{n-1}}\right \|_{{u_{h}^{n-1}}}^{2} - E_{\varepsilon }[{u_{h}^{n}}] \right ) \nonumber \\[5pt] &\leq \frac{2h}{\varepsilon c_{g}}\left (E_{\varepsilon }[{u_{h}^{n-1}}]-E_{\varepsilon }[{u_{h}^{n}}] \right ). \end{align}Inequality (3.22) allows us to bound the $L^{2}$ -norms of the weak time derivates $\partial _{t}u_{h}$ as(3.23) \begin{align} \|\partial _{t}u_{h}\|_{L^{2}\left (0,T;\;L^{2}(\Omega )\right )}^{2}&\leq \sum _{n=1}^{\left \lceil \frac{T}{h} \right \rceil } \int _{(n-1)h}^{nh}\|\partial _{t}u_{h}(t) \|_{L^{2}(\Omega )}^{2}dt \nonumber \\[5pt] &= \sum _{n=1}^{\left \lceil \frac{T}{h} \right \rceil } \int _{(n-1)h}^{nh} \frac{\|{u_{h}^{n}}-{u_{h}^{n-1}}\|_{L^{2}(\Omega )}^{2}}{h^{2}}dt \\ &\leq \sum _{n=1}^{\left \lceil \frac{T}{h} \right \rceil } \frac{2}{\varepsilon c_{g}}\left (E_{\varepsilon }[{u_{h}^{n-1}}]-E_{\varepsilon }[{u_{h}^{n}}] \right ) \nonumber \\[5pt] &\leq \frac{2}{\varepsilon c_{g}}E_{\varepsilon }[{u_{h}^{0}}]. \end{align}The fundamental theorem of calculus for vector-valued functions together with the Cauchy–Schwarz inequality now yields(3.24) \begin{align} \|u_{h}(t)-u_{h}(s)\|_{L^{2}(\Omega )}&\leq \int _{(s,t)}\|\partial _{t}u_{h}(r)\|_{L^{2}(\Omega )}dr \nonumber \\[5pt] &\leq \|\partial _{t}u_{h}\|_{L^{2}\left (s,t;L^{2}(\Omega )\right )}\sqrt{|t-s|}\nonumber \\[5pt] &\leq \sqrt{\frac{2E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon c_{g}}}\sqrt{|t-s|} \end{align}for all $s,t \in [0,T]$ . This is the one-dimensional case of Morrey’s inequality.By (3.24) and the assumption on the initial data, the Hölder constants are bounded as $h \searrow 0$ . Families of functions with bounded Hölder constants are equicontinuous. These equicontinuity and precompactness statements allow us to apply the Arzelà–Ascoli theorem, which states that there exists a subsequence $h \searrow 0$ and a function $u \in C\left ([0,T];\;L^{2}(\Omega )\right )$ such that $u_{h} \to u$ in $C\left ([0,T];\;L^{2}(\Omega )\right )$ . Furthermore, the Hölder continuity carries over to the limit function, i.e.,
(3.25) \begin{equation} \|u(t)-u(s)\|_{L^{2}(\Omega )}\leq \sqrt{\frac{2\limsup _{h\searrow 0}E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon 0}E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon c_{g}}\sqrt{|t-s|} \qquad \text{for all }s,t \in [0,T]. \end{equation}One can extract a further subsequence $\{u_{h_{k}} \}_{k\in{\mathbb{N}}}$ such that $h_{k}\searrow 0$ and $u_{h_{k}}(x,t)\to u(x,t)$ almost everywhere in $\Omega \times (0,T)$ . This pointwise convergence together with the triangle inequality yields\begin{align*} \|u-\tfrac 12\|_{L^{\infty }(\Omega \times (0,T))}&\leq \limsup _{k\to \infty }\|u_{h_{k}}-\tfrac 12\|_{L^{\infty }(\Omega \times (0,T))}\\[5pt] &\leq \limsup _{h\searrow 0}\|u_{h}-\tfrac 12\|_{L^{\infty }(\Omega \times (0,T))}\\[5pt] &\leq \limsup _{h\searrow 0}\sup _{n\in{\mathbb{N}}}\|{u_{h}^{n}}-\tfrac 12\|_{L^{\infty }(\Omega )}\\[5pt] &\leq \max \left \{\limsup _{h\searrow 0}\|{u_{h}^{0}}-\tfrac 12\|_{L^{\infty }(\Omega )},\tfrac 12\right \}. \end{align*}In a last step, we argue briefly that the piecewise constant interpolations $\overline{u}_{h}$ and $\underline{u}_{h}$ converge uniformly.Given $t \in [0,T]$ , we find $n \in{\mathbb{N}}$ such that $t \in [(n-1)h,nh)$ . Then
\begin{equation*} \|\overline {u}_{h}(t)-u_{h}(t)\|_{L^{2}(\Omega )}=\|u_{h}(nh)-u_{h}(t)\|_{L^{2}(\Omega )}\leq \sqrt {\frac {2E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon c_{g}}}\sqrt {nh-t}\leq \sqrt {\frac {2E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon c_{g}}}\sqrt {h}, \end{equation*}and a similar argument shows that\begin{equation*} \|\underline {u}_{h}(t)-u_{h}(t)\|_{L^{2}(\Omega )}\leq \sqrt {\frac {2E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon c_{g}}}\sqrt {h}. \end{equation*}Combining this bound with the uniform convergence $u_{h}\to u$ in $C\left ([0,T];\;L^{2}(\Omega )\right )$ , we obtain\begin{align*} \limsup _{h\searrow 0}\sup _{t\in [0,T]}\max &\left \{\|\overline{u}_{h}(t)-u(t)\|_{L^{2}(\Omega )},\|\underline{u}_{h}(t)-u(t)\|_{L^{2}(\Omega )} \right \}\\[5pt] &\leq \limsup _{h\searrow 0}\sup _{t \in [0,T]}\|u_{h}(t)-u(t)\|_{L^{2}(\Omega )}+ \lim _{h\searrow 0}\sqrt{\frac{2E_{\varepsilon }[{u_{h}^{0}}]}{\varepsilon c_{g}}}\sqrt{h} \\[5pt] &=0. \end{align*} -
(ii) It follows from (i) that $u_{h}\to u$ as $h \searrow 0$ in $L^{2}\left (0,T;\;L^{2}(\Omega )\right )\cong L^{2}(\Omega \times (0,T))$ . Furthermore, by (3.23), the time derivatives $\{\partial _{t}u_{h} \}$ are bounded in $L^{2}(\Omega \times (0,T))$ as $h \searrow 0$ . By, e.g., [Reference Ambrosio, Fusco and Pallara2, Prop. 2.5(b)], one concludes that $u$ has a weak time derivative $\partial _{t}u \in L^{2}(\Omega \times (0,T))$ , and that $\partial _{t}u_{h}\rightharpoonup \partial _{t}u$ in $L^{2}(\Omega \times (0,T))$ as $h \searrow 0$ .
-
(iii) We will prove the desired convergence result for the piecewise constant interpolation $\overline{u}_{h}$ first. Integrating (3.20) from $0$ to $T$ shows that the gradients $\{\nabla \overline{u}_{h} \}$ are bounded in $L^{2}(\Omega \times (0,T))^{d}$ as $h \searrow 0$ . Just like in (ii), it then follows from the strong convergence $\overline{u}_{h}\to u$ in $L^{2}(\Omega \times (0,T))$ that the limit function $u$ has a weak gradient $\nabla u \in L^{2}(\Omega \times (0,T))^{d}$ , and that $\nabla \overline{u}_{h}{\rightharpoonup } \nabla u$ as $h \searrow 0$ in $L^{2}(\Omega \times (0,T))^{d}$ .
The next step is to upgrade this weak convergence statement for the gradients to strong convergence in $L^{2}$ . The key ingredient for the argument will be the energy convergence
(3.26) \begin{equation} E_{\varepsilon }[\overline{u}_{h}(\!\cdot\!)]{\rightharpoonup } E_{\varepsilon }[u(\!\cdot\!)] \qquad \text{in } L^{1}(0,T). \end{equation}Let us first argue how (3.26) implies strong convergence of the gradients. Since the Cahn–Hilliard energy $E_{\varepsilon }$ is made up of a Dirichlet energy and a nonconvex term involving $W$ , we will use the convergence result (3.26) for the Cahn–Hilliard energies and a liminf inequality for the nonconvex part to derive a limsup inequality for the Dirichlet energy, which will allow us to prove the strong convergence statement.Similarly to the proof of Lemma 3.6(iii), an application of Fatou’s lemma yields
(3.27) \begin{equation} \liminf _{h\searrow 0}{\int _{0}^{T}\int _{\Omega }} W(\overline{u}_{h})dxdt \geq{\int _{0}^{T}\int _{\Omega }} W(u)dxdt. \end{equation}One can now test (3.26) with a constant test function and combine this convergence statement with (3.27), which yields(3.28) \begin{align} \limsup _{h\searrow 0}{\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla \overline{u}_{h})dx dt&=\frac{2}{\varepsilon }\lim _{h\searrow 0}\int _{0}^{T}E_{\varepsilon }[\overline{u}_{h}(t)]dt-\frac{1}{\varepsilon ^{2}}\liminf _{h\searrow 0}{\int _{0}^{T}\int _{\Omega }} W(\overline{u}_{h})dxdt \nonumber \\[5pt] &\leq \frac{2}{\varepsilon }\int _{0}^{T}E_{\varepsilon }[u(t)]dt-\frac{1}{\varepsilon ^{2}}{\int _{0}^{T}\int _{\Omega }} W(u)dxdt \nonumber \\[5pt] &={\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla u)dx dt. \end{align}The function $f$ is strongly convex, i.e., there exists a constant $c\gt 0$ such that $f(p^{\prime })\geq f(p)+Df(p)\cdot (p^{\prime }-p)+\frac{c}{2}|p^{\prime }-p|^{2}$ for all $p, p^{\prime } \in{\mathbb{R}}^{d}$ . Thus, if (3.28) holds true, then\begin{align*} \limsup _{h\searrow 0}\frac{c}{2}{\int _{0}^{T}\int _{\Omega }}& \left |\nabla \overline{u}_{h} - \nabla u\right |^{2}dx dt \\[5pt] &\leq \limsup _{h \searrow 0}\left ({\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla \overline{u}_{h})dx dt-{\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla u)dxdt \vphantom{-{\int _{0}^{T}\int _{\Omega }} Df(\!-\!\nabla u)\cdot (\nabla u - \nabla u_{h}) dxdt}\right .\\[5pt] &\hspace{8.5pt}\left . \vphantom{{\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla u_{h})dx dt- f(\!-\!\nabla u)dxdt} -{\int _{0}^{T}\int _{\Omega }} Df(\!-\!\nabla u)\cdot (\nabla u - \nabla \overline{u}_{h})dx dt\right ) \\ &\leq \limsup _{h \searrow 0}{\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla \overline{u}_{h})dx dt -{\int _{0}^{T}\int _{\Omega }} f(\!-\!\nabla u)dxdt\\[5pt] &\quad -\liminf _{h \searrow 0}{\int _{0}^{T}\int _{\Omega }} Df(\!-\!\nabla u)\cdot (\nabla u - \nabla \overline{u}_{h}) dxdt\\[5pt] &\leq 0, \end{align*}where the last step also uses the fact that $\nabla u_{h}{\rightharpoonup } \nabla u$ in $L^{2}(\Omega \times (0,T))^{d}$ . This computation implies that $\nabla \overline{u}_{h}\to \nabla u$ strongly in $L^{2}(\Omega \times (0,T))$ .In order to prove (3.26), it suffices to consider nonnegative test functions $\zeta \in L^{\infty }\left (0,T;\;{\mathbb{R}}_{\geq 0}\right )$ . We will prove a liminf inequality and a limsup inequality separately.
On the one hand, by Fatou’s lemma, the lower semicontinuity of $E_{\varepsilon }$ , and (i), we obtain
(3.29) \begin{equation} \liminf _{h\searrow 0}\int _{0}^{T}\zeta (t)E_{\varepsilon }[\overline{u}_{h}(t)]dt\geq \int _{0}^{T}\zeta (t)\liminf _{h\searrow 0}E_{\varepsilon }[\overline{u}_{h}(t)]dt\geq \int _{0}^{T}\zeta (t)E_{\varepsilon }[u(t)]dt. \end{equation}On the other hand, choosing $w(x,t)\;:\!=\;u(x,t)$ in (3.17) and integrating against $\zeta$ yields\begin{align*} \int _{0}^{T}\zeta (t)E_{\varepsilon }[u(t)]dt& \geq \int _{0}^{T}\zeta (t)E_{\varepsilon }[\overline{u}_{h}(t)]dt\\[5pt] & \quad -\int _{0}^{T}\zeta (t)\int _{\Omega }\left (\varepsilon g(\!-\!\nabla \underline{u}_{h})\partial _{t}u_{h}(u-\overline{u}_{h}) +\frac{\lambda }{4\varepsilon }|u-\overline{u}_{h}|^{2}\right )dx dt. \end{align*}In the limit $h\searrow 0$ , the second integral on the right-hand side vanishes since $\overline{u}_{h}\to u$ in $L^{2}(\Omega \times (0,T))$ and $g$ is bounded in $L^\infty$ by Lemma 3.2. Thus,(3.30) \begin{equation} \int _{0}^{T}\zeta (t)E_{\varepsilon }[u(t)]dt\geq \limsup _{h\searrow 0}\int _{0}^{T}\zeta (t)E_{\varepsilon }[\overline{u}_{h}(t)]dt. \end{equation}Combining (3.29) and (3.30) yields (3.26).We have shown that $\nabla \overline{u}_{h}\to \nabla u$ in $L^{2}(\Omega \times (0,T))$ as $h \searrow 0$ . As for $\nabla \underline{u}_{h}$ , one computes
\begin{align*} \|\nabla \underline{u}_{h}-&\nabla u\|_{L^{2}(\Omega \times (0,T))}^{2}\\[5pt] &=\int _{0}^{h}\int _{\Omega }|\nabla \underline{u}_{h}-\nabla u|^{2}dx dt + \int _{h}^{T}\int _{\Omega }|\nabla \underline{u}_{h}-\nabla u|^{2}dx dt \\[5pt] &=\int _{0}^{h}\int _{\Omega }|\nabla{u_{h}^{0}}-\nabla u|^{2}dx dt + \int _{h}^{T}\int _{\Omega }|\nabla \overline{u}_{h}(x,t-h)-\nabla u(x,t)|^{2}dx dt\\[5pt] &\leq 2h\int _{\Omega }|\nabla{u_{h}^{0}}|^{2}dx + 2 \int _{0}^{h}\int _{\Omega }|\nabla u|^{2}dxdt\\[5pt] &\quad +2\int _{0}^{T-h}\int _{\Omega }|\nabla \overline{u}_{h}(x,t)-\nabla u(x,t)|^{2}dx dt\\[5pt] &\quad + 2\int _{0}^{T-h}\int _{\Omega }|\nabla u(x,t)-\nabla u(x, t+h)|^{2}dx dt\\[5pt] &\longrightarrow 0 \qquad \text{as } h \searrow 0, \end{align*}where the last integral vanishes as $h \searrow 0$ due to the continuity of translation.Lastly, $\nabla u_{h}(x,t)$ is a convex combination of $\nabla \overline{u}_{h}(x,t)$ and $\nabla \underline{u}_{h}(x,t)$ for almost all $(x,t)\in \Omega \times (0,T)$ . Hence, it follows from $\nabla \overline{u}_{h}\to \nabla u$ and $\nabla \underline{u}_{h}\to \nabla u$ that, as $h\searrow 0$ , we have $\nabla u_{h}\to \nabla u$ in $L^{2}(\Omega \times (0,T))$ as well.
We are now ready to pass to the limit $h\searrow 0$ in (3.17), which will eventually allow us to prove Theorem 3.5. This is the content of the following lemma:
Lemma 3.9. Let $w \in L^{2}(\Omega \times (0,T))$ and $\zeta \in L^{\infty }(0,T)$ such that $\zeta \geq 0$ almost everywhere. Then
In other words, for every $w \in L^{2}(\Omega \times (0,T))$ , the inequality $E_{\varepsilon }[w(t)]\geq E_{\varepsilon }[u(t)]-\int _{\Omega }\varepsilon g(\!-\!\nabla u)\partial _{t}u(w-u)dx - \frac{\lambda }{4\varepsilon }\int _{\Omega }|w-u|^{2}dx$ holds true for almost every $t \in (0,T)$ .
Proof. First, by the weak convergence (3.26), it follows that
for any function $\zeta \in L^{\infty }(0,T)$ such that $\zeta \geq 0$ .
Second, for the term involving the time derivative, we know from Lemma 3.8(ii) that $\partial _{t}u_{h}{\rightharpoonup } \partial _{t}u$ in $L^{2}(\Omega \times (0,T))$ . Thus, to derive the convergence of the integrals, it suffices to show that $g(\!-\!\nabla \underline{u}_{h})(w-\overline{u}_{h}) \to g(\!-\!\nabla u)(w-u)$ (strongly) in $L^{2}(\Omega \times (0,T))$ .
By Lemma 3.8(iii), we know that $\nabla \overline{u}_{h},\nabla \underline{u}_{h}\to \nabla u$ in $L^{2}(\Omega \times (0,T))$ . Since $g$ is a continuous function, every subsequence of $g(\!-\!\nabla \underline{u}_{h})$ as $h\searrow 0$ has a further subsequence that converges pointwise almost everywhere to $g(\!-\!\nabla u)$ . By the uniform boundedness of $g$ and the dominated convergence theorem, it follows that
It remains to show the $L^{2}$ -convergence on the product space $\Omega \times (0,T)$ . Using the generalized dominated convergence theorem with
it follows that
i.e., by Fubini’s theorem, $g(\!-\!\nabla \underline{u}_{h})(w-\overline{u}_{h}) \to g(\!-\!\nabla u)(w-u)$ in $L^{2}(\Omega \times (0,T))$ . In total, we obtain
Third, it follows from Lemma 3.8(i) that
Integrating (3.17) against $\zeta$ , taking the limit $h \searrow 0$ , and plugging in (3.32)–(3.34) proves the desired inequality.
With the help of Lemma 3.9, it can be seen that $-\partial _{t}u(t)$ lies in the $u(t)$ -subdifferential $\partial _{u(t)} E_{\varepsilon }[u(t)]$ for almost every $t \in (0,T)$ . Thus, it seems plausible that the limiting trajectory $u(t)$ is a gradient flow for $E_{\varepsilon }$ .
Proof of Theorem 3.5. Choosing ${u_{h}^{0}}=u_{0}$ , one can construct the functions $u_{h}$ , $\overline{u}_{h}$ , and $\underline{u}_{h}$ by Lemma 3.6 and (3.16). Invoking Lemma 3.8 yields a subsequence $h \searrow 0$ as well as a function $u \in H^{1}(\Omega \times (0,T))\cap L^{\infty }(\Omega \times (0,T))$ such that $\|u-\tfrac 12\|_{L^{\infty }(\Omega \times (0,T))}\leq \max \left \{\|u_{0}-\tfrac 12\|_{L^{\infty }(\Omega )},\tfrac 12 \right \}$ , and such that the three convergence results in Lemma 3.8(i)–(iii) hold true. It remains to be shown that $u$ is a weak solution to the anisotropic Allen–Cahn equation with initial data $u_{0}$ in the sense of Definition 3.3.
As for the initial condition, it follows from the uniform convergence in Lemma 3.8(i) and the choice ${u_{h}^{0}}=u_{0}$ that
where all limits are in $L^{2}(\Omega )$ .
For the optimal energy dissipation relation (3.4) let ${T^{\prime }} \in (0,T]$ be a fixed time horizon. We define an extension of $u\;:\;\Omega \times [0,{T^{\prime }}]\to{\mathbb{R}}$ to $\Omega \times{\mathbb{R}}$ by
Then $\tilde{u} \in H_{\mathrm{loc}}^{1}(\Omega \times{\mathbb{R}})$ , with the weak time derivative being given by $\partial _{t}\tilde{u}(x,t)=\chi _{\Omega \times (0,{T^{\prime }})}\partial _{t}u(x,t)$ . Taking $a\gt 0$ , $\zeta \;:\!=\;\frac{1}{a}\chi _{(0,{T^{\prime }})}$ , and $w(x,t)\;:\!=\;\tilde{u}(x, t+a)$ in (3.31) yields
where we have replaced $u$ by $\tilde{u}$ on $\Omega \times (0,{T^{\prime }})$ several times and used the definition of $\tilde{u}$ in the last line.
Let us now take $a \searrow 0$ in the inequality above. As a consequence of the lower semicontinuity of $E_{\varepsilon }$ and the $L^{2}$ -continuity of the map $t\mapsto u(t)$ , we have
Furthermore, it follows from a general fact about Sobolev functions that
(see [Reference Gilbarg and Trudinger34, Lemma 7.23 and proof of Lemma 7.24]). In particular, the integral $\int _{0}^{{T^{\prime }}}\int _{\Omega }\frac{\lambda }{2\varepsilon }\left (\frac{\tilde{u}(t+a)-\tilde{u}(t)}{a}\right )^{2}dxdt$ is bounded as $a \searrow 0$ .
Therefore, the limiting inequality of (3.35) as $a \searrow 0$ reads
which is one inequality in the optimal energy dissipation relation (3.4).
For the converse inequality, we proceed analogously by taking $w(x,t)\;:\!=\;\tilde{u}(x,t-a)$ and $a \searrow 0$ . Using first the fact that $\frac{\tilde{u}(\cdot -a)-\tilde{u}(\!\cdot\!)}{a}{\rightharpoonup } -\partial _{t}u$ in $L^{2}(\Omega \times (0,{T^{\prime }}))$ , and second the inequality $\liminf _{a \searrow 0}\frac{1}{a}\int _{{T^{\prime }} - a}^{{T^{\prime }}}E_{\varepsilon }[u(t)]dt\geq E_{\varepsilon }[u({T^{\prime }})]$ , we obtain
The combination of (3.36) and (3.37) is precisely the optimal energy dissipation identity (3.4).
For the distributional formulation of the PDE (3.3) let $\varphi \in C^{1}(\Omega \times [0,T])$ and $s \gt 0$ . By plugging in $\zeta \equiv 1$ and $w=u+s\varphi$ into (3.31), we obtain
Dividing by $s$ and taking $s \searrow 0$ leads to
where the limit and the integrals can be interchanged in the last step because $u$ is essentially bounded.
We divide by $\varepsilon$ to obtain one inequality in (3.3). The converse inequality follows by taking $s \lt 0$ , $s \nearrow 0$ .
3.2 Regularity of weak solutions
The remainder of this section is devoted to proving a regularity result which states that solutions to the anisotropic Allen–Cahn equation have weak second derivatives in space. We apply a difference quotient method for elliptic regularity. The idea for this proof is taken from [Reference Fonseca and Leoni26]. The fact that $f$ is, in general, not twice continuously differentiable on ${\mathbb{R}}^{d}$ due to a singularity at $0$ will not pose a problem when proving the existence of second derivatives, but it prevents us from deriving an additional PDE for the second derivatives.
Theorem 3.10 (Spatial regularity of distributional solutions). Let $\varepsilon \gt 0$ . If $u$ is a solution to ( 1.4 ) in the sense of Definition 3.3 , then $u \in L^{2}\left (0,T;\;H^{2}(\Omega )\right )$ .
It suffices to prove the following
Claim 1. Let $u \in H^{1}(\Omega )$ , and suppose that $u$ is a weak solution to $-\mathrm{div}(Df(\!-\!\nabla u))=h$ for some function $h \in L^{2}(\Omega )$ . More precisely, we assume that
for all $w \in H^{1}(\Omega )$ . Then $u$ has a weak second derivative in space $\nabla ^{2}u \in L^{2}(\Omega )^{d\times d}$ and $\|\nabla ^2 u\|_{L^2(\Omega )} \leq C \|h\|_{L^2(\Omega )}$ .
Theorem 3.10 follows from this claim by choosing $h=2g(\!-\!\nabla u)\partial _{t}u+\frac{1}{\varepsilon ^{2}}W^{\prime }(u)$ and slicing in time. Then we have $h(\cdot,t) \in L^{2}(\Omega )$ for almost every $t \in (0,T)$ since $\partial _{t}u \in L^{2}(\Omega \times (0,T))$ , $u\in L^{\infty }(\Omega \times (0,T))$ , and $W \in C^{1}({\mathbb{R}})$ . The fact that (3.3) holds true for test functions $w \in L^{2}\left (0,T;\;H^{1}(\Omega )\right )$ instead of just $C^{1}(\Omega \times [0,T])$ can be shown with a density argument.
Proof of the claim. Given a function $w$ as in the claim, we define the difference quotient
for $s\neq 0$ and $l=1,\ldots, d$ . Then $D_{l}^{s}w \in L^{2}\left (0,T;\;H^{1}(\Omega )\right )$ . The weak formulation of the PDE and the substitution $y=x+se_{l}$ yield
In order to deal with the right-hand side, we will use the following identity for $i=1,2, \ldots, d$ and for almost every $x\in \Omega$ :
where the shorthand notation in the last line is to be read as
If the line segment between $\nabla u(x-se_{l})$ and $\nabla u(x)$ does not contain the origin, then $f$ is twice continuously differentiable in a neighborhood of the line segment, so that (3.39) follows immediately from the fundamental theorem of calculus. On the other hand, if $0 \in{\mathrm{conv}}\left (\left \{\nabla u(x-se_{l}),\nabla u(x,t)\right \}\right )$ , then (3.39) can be deduced by decomposing the line segment into two parts and applying the fundamental theorem of calculus on each part.
By the strong convexity of $f$ , there exists a constant $c\gt 0$ such that
for all $\xi \in{\mathbb{R}}^{d}$ , $p \in{\mathbb{R}}^{d}\setminus \{0\}$ . Using the definition of $A_{ij}^{s}$ , we obtain
for almost every $x\in \Omega$ .
One can now choose $w = D_{l}^{-s}u$ in (3.38) and use (3.39) with (3.40) as is standard (see, e.g., [Reference Gilbarg and Trudinger34, Lemma 7.23]) to conclude
As $\nabla D_{l}^{-s}u \rightharpoonup \partial _{l}\nabla u$ as $s\to 0$ , this concludes the claim.
4. Convergence of the anisotropic Allen–Cahn equation to anisotropic mean curvature flow
The goal of this section is to prove a conditional convergence result for the sharp-interface limit of the anisotropic Allen–Cahn equation. In order to conclude that the limit is a $BV$ solution to (1.1) in the sense of Definition 2.8, a critical assumption is the convergence of the time-integrated energies. A statement of the result and a sketch of the proof are included in the notes [Reference Laux39]. Here, we complete the proof and correct a mistake with respect to the phase-field equation (1.4): In [Reference Laux39], the function $g$ was defined as $g(p)=\frac{|p|}{\mu (p)}$ , thereby missing the effect of the anisotropic surface tension $\sigma$ on $g$ .
Theorem 4.1. Let $(\sigma, \mu )$ be an admissible pair of anisotropies, and let $f$ , $g$ be given by equations ( 1.5 ), ( 1.6 ), respectively. Suppose that $W$ is a double-well potential satisfying (W1) – (W5) . Let $\{u_{\varepsilon }\}_{\varepsilon \gt 0}$ be solutions to the anisotropic Allen–Cahn equation in the sense of Definition 3.3 with initial data $u_{\varepsilon,0} \in \mathrm{dom}(E_{\varepsilon })$ and such that $\|u_{\varepsilon }-\tfrac 12\|_{L^{\infty }(\Omega \times (0,T))}\leq \max \{\|u_{\varepsilon,0}-\tfrac 12\|_{L^{\infty }(\Omega )},\tfrac 12 \}$ . Assume that
-
• the initial conditions are well-prepared,
(4.1) \begin{equation} u_{\varepsilon,0}\longrightarrow u_{0} \qquad \text{in } L^{2}(\Omega ) \qquad \text{and} \qquad E_{\varepsilon }[u_{\varepsilon,0}]\longrightarrow E[u_{0}]\;=\!:\;E_{0}\lt \infty \end{equation}as $\varepsilon \searrow 0$ , and that -
• there is a uniform $L^{\infty }$ -bound on the initial conditions,
(4.2) \begin{equation} \limsup _{\varepsilon \searrow 0}\left \|u_{\varepsilon,0}\right \|_{L^{\infty }(\Omega )}\lt \infty . \end{equation}
Then there exists a subsequence $\varepsilon \searrow 0$ as well as a function
such that $u_{\varepsilon } \to u$ in $L^{2}(\Omega \times (0,T))$ as $\varepsilon \searrow 0$ .
If, furthermore,
then $u$ is a distributional solution to anisotropic mean curvature flow with initial condition $u(0)=u_{0}$ in the sense of Definition 2.8 .
Similarly to [Reference Luckhaus and Sturzenhecker47], where a minimizing movements construction for isotropic mean curvature is performed, the intuitive meaning of the assumption of energy convergence (4.3) is that no surface area is lost in the limit.
We will first argue by compactness that a limiting function $u$ exists. Assumption (4.3) then allows us to prove an equipartition of energy between the anisotropic Dirichlet energy and the nonconvex potential energy as $\varepsilon \searrow 0$ . In addition, this assumption guarantees the existence of a normal velocity $V$ in the sense of (2.18). We introduce a relative entropy functional which serves as a tilt excess. Using this tilt excess to bound the occurring error terms, we will derive the weak formulation (2.19) from the distributional formulation (3.3) and the optimal energy dissipation inequality (2.20) from (3.4).
4.1 Compactness
To prove the compactness statement for the solutions $\{u_{\varepsilon } \}_{\varepsilon \gt 0}$ to the anisotropic Allen–Cahn equation from Theorem 3.5, we will first show a $W^{1,1}$ -bound for the compositions of a suitable continuous function with $u_{\varepsilon }$ . This argument was performed, for example, by Fonseca and Tartar [Reference Fonseca and Tartar27] in the isotropic case (see also [Reference Leoni44]). However, some simplifications are possible since we assume an $L^{\infty }$ -bound on the functions $\{u_{\varepsilon } \}$ instead of prescribing growth conditions on the double-well potential $W$ .
Lemma 4.2. Let $u_\varepsilon$ be solutions of the anisotropic Allen–Cahn equation as in Theorem 3.5 . Under the assumptions of Theorem 4.1 except ( 4.3 ), there exist a subsequence $\varepsilon \searrow 0$ and a function $u=\chi \in BV\left (\Omega \times (0,T);\left \{0,1\right \}\right )\cap C^{0,\frac{1}{2}}\left ([0,T];\;L^{1}(\Omega )\right )\cap L^{\infty }\left (0,T;\;BV(\Omega )\right )$ such that
-
• $u_{\varepsilon }\to u$ in $L^{2}(\Omega \times (0,T))$ ,
-
• $\phi \circ u_{\varepsilon }{\overset{{*}}{\rightharpoonup }} \phi \circ u ={c_{0}} \chi$ in $BV(\Omega \times (0,T))$ , and
-
• $\phi \circ u_{\varepsilon }(t){\overset{{*}}{\rightharpoonup }}{c_{0}} \chi (t)$ in $BV(\Omega )$ for all $t\in [0,T]$
as $\varepsilon \searrow 0$ , where the function $\phi \;:\; {\mathbb{R}}\to{\mathbb{R}}$ is defined via
The first step of the proof is to show that
For this step, we observe that, by assumption (4.2), there exists a finite constant $R \gt 0$ such that $u_{\varepsilon }(x,t)\in [\!-\!R,R]$ for almost every $(x,t)\in \Omega \times (0,T)$ and for all sufficiently small $\varepsilon \gt 0$ . Using the continuity of $\phi$ , we first estimate
for $\varepsilon \gt 0$ small enough.
Furthermore, the chain rule for Sobolev functions is applicable to the compositions $\phi \circ u_{\varepsilon }$ even though $\phi$ need not be globally Lipschitz continuous: By virtue of its definition as a primitive function, $\phi$ is continuously differentiable. In particular, $\phi ^{\prime }\big |_{[-R,R]}$ is bounded, and we can assume without restriction that $\phi ^{\prime }$ is bounded. Thus, one can estimate
for $\varepsilon \gt 0$ sufficiently small.
Lastly, a similar argument for small $\varepsilon$ yields
where the fourth inequality uses the optimal energy dissipation identity (3.4) in the first and in the second factor.
It is known from assumption (4.1) that
where the second estimate makes use of the fact that $t\mapsto E_{\varepsilon }[u_{\varepsilon }(t)]$ is nonincreasing by (3.4). Thus, the estimates (4.6)–(4.8) suffice to prove (4.5).
In addition to (4.5), we want to derive a uniform $W^{1,1}$ -estimate for fixed times $t\in (0,T)$ , namely
Indeed, it follows by Fubini’s theorem that $\| u_{\varepsilon }(t)\|_{L^{\infty }}\leq R$ for almost every $t \in (0,T)$ if $\varepsilon \gt 0$ is small enough, and we can argue similarly as in (4.6) and (4.7) to show that
and
for a.e. $t\in (0,T)$ and all $\varepsilon \gt 0$ sufficiently small. These two observations prove (4.9).
Bounded sequences in $BV(\Omega \times (0,T))$ admit a weakly-* convergent subsequence by a version of [Reference Ambrosio, Fusco and Pallara2, Theorem 3.23]. Thus, by (4.5), we can find a subsequence $\varepsilon \searrow 0$ such that $\phi \circ u_{\varepsilon }$ converge weakly-* in $BV(\Omega \times (0,T))$ as $\varepsilon \searrow 0$ . Let us denote the limiting function by $v \in BV(\Omega \times (0,T))$ . Then, in particular, $\phi \circ u_{\varepsilon } \to v$ in $L^{1}(\Omega \times (0,T))$ , and we can assume without restriction that
and
which can be accomplished by taking a further subsequence. It follows from (4.9), (4.11), and the characterization [Reference Ambrosio, Fusco and Pallara2, Proposition 3.13] of weak-* convergence in $BV$ that $\phi \circ u_{\varepsilon }(t){\overset{{*}}{\rightharpoonup }} v(t)$ in $BV(\Omega )$ for a.e. $t\in (0,T)$ .
From now on, the limit $\varepsilon \searrow 0$ is to be understood as the limit along a subsequence $\{\varepsilon _{j}\}_{j \in{\mathbb{N}}}$ such that $\varepsilon _{j} \searrow 0$ and $\phi \circ u_{\varepsilon _{j}}{\overset{{*}}{\rightharpoonup }} v$ in $BV(\Omega \times (0,T))$ as $j\to \infty$ , and such that the pointwise convergence properties as in (4.10) and (4.11) hold true.
The following argument for the Hölder continuity $v \in C^{0,\frac{1}{2}}\left ([0,T];\;L^{1}(\Omega )\right )$ is adapted from [Reference Hensel and Laux35, Lemma 2], where Hensel and the first author deal with the Hölder continuity in the isotropic case: Let $0 \leq s \leq t \leq T$ . If we proceed as in (4.8), but only integrate from $s$ to $t$ , we find
There exists a null set $N\subset (0,T)$ such that, for all $s,t \in (0,T)\setminus N$ , we can pass to the limit $\varepsilon \searrow 0$ to obtain
This allows us to redefine $v$ on the null set $\Omega \times N$ so that $v\in C^{0,\frac{1}{2}}\left ([0,T];\;L^{1}(\Omega )\right )$ .
We identify $\phi \circ u_{\varepsilon }$ with their Hölder continuous representatives due to (4.12). If a sequence of uniformly Hölder continuous functions converges pointwise almost everywhere, it follows that the sequence converges pointwise. In particular, we can upgrade the $L^{1}(\Omega )$ -convergence for a.e. $t\in (0,T)$ in (4.11) to $L^{1}(\Omega )$ -convergence for all $t\in [0,T]$ .
Using the continuity $\phi \circ u_{\varepsilon }\in C^{0,\frac{1}{2}}\left ([0,T];\;L^{1}(\Omega )\right )$ and the lower semicontinuity of the variation [Reference Ambrosio, Fusco and Pallara2, Remark 3.5], we conclude from (4.9) that
i.e., we can also upgrade the essential boundedness of the $BV$ -norm to uniform boundedness in time. From this, one finds that $\phi \circ u_{\varepsilon }(t){\overset{{*}}{\rightharpoonup }} v(t)$ in $BV(\Omega )$ for all $t \in [0,T]$ and that $v \in L^{\infty }\left (0,T;\;BV(\Omega )\right )$ . (This statement includes the measurability of the measure-valued map $t \mapsto \nabla v(t)$ in the sense of [Reference Ambrosio, Fusco and Pallara2, Definition 2.25], which follows from the fact that we have $\sup _{t \in [0,T]}\int _{\Omega }|\nabla v(t)|\lt \infty$ and $v(t)\to v(t_{0})$ in $L^{1}(\Omega )$ as $t \to t_{0}$ , and therefore $\nabla v(t){\overset{{*}}{\rightharpoonup }} \nabla v(t_{0})$ in ${\mathcal{M}}(\Omega )$ as $t \to t_{0}$ .)
Let us turn to the convergence result for the solutions $u_{\varepsilon }$ . As the function $\sqrt{W}$ is strictly positive except at two isolated points, the primitive function $\phi$ is strictly increasing. Therefore, there exists a continuous inverse $\phi ^{-1}$ , and we obtain from (4.10) that
By (4.2) and the dominated convergence theorem, we may conclude $u_{\varepsilon }\to u$ in $L^{2}(\Omega \times (0,T))$ as $\varepsilon \searrow 0$ .
To see that the limiting function $u$ takes values in $\{0,1\}$ it suffices to use Fatou’s lemma:
Consequently, we have $W(u)=0$ , i.e., $u \in \{0,1\}$ almost everywhere in $\Omega \times (0,T)$ .
The representation $v = \phi \circ u ={c_{0}} \chi$ now follows from $\phi (0)=0$ and $\phi (1)={c_{0}}$ . Since $v \in BV\left (\Omega \times (0,T);\;\mathbb{R}\right )\cap C^{0,\frac{1}{2}}\left ([0,T];\;L^{1}(\Omega )\right )\cap L^{\infty }\left (0,T;\;BV(\Omega )\right )$ , it follows that the same holds true for $u=\frac{1}{{c_{0}}}v$ .
4.2 Equipartition of energy
The following theorem states that, asymptotically as $\varepsilon \searrow 0$ , the anisotropic Dirichlet energy $\frac{1}{2}{\int _{0}^{T}\int _{\Omega }}\varepsilon f(\!-\!\nabla u_{\varepsilon })dx dt$ and the nonconvex term $\frac{1}{2}{\int _{0}^{T}\int _{\Omega }} \frac{1}{\varepsilon }W(u_{\varepsilon })dx dt$ contribute equally to the Cahn-Hilliard energy. The equipartition of energy also holds true in a localized form. The argument relies crucially on the energy convergence assumption (4.3) and uses a trick introduced by Modica–Mortola for Gamma-convergence of the energies [Reference Modica and Mortola49] (see also [Reference Bogomol’nyi8]). Equipartition results of this kind have been used to prove conditional convergence in the static or dynamic case since [Reference Luckhaus and Modica46, Reference Modica48].
Statements (i) and (ii) can be viewed primarily as preparatory results for the equipartition statements (iii)–(v).
Theorem 4.3. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), the following convergence statements hold true in the limit $\varepsilon \searrow 0$ :
-
(i)
(4.13) \begin{equation} \frac{1}{2}\left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ){\overset{{*}}{\rightharpoonup }}{c_{0}} \sigma (\nu ){\big |\nabla \chi \big |} \qquad \text{in }{\mathcal{M}}(\Omega \times [0,T]), \end{equation} -
(ii)
(4.14) \begin{equation} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon })){\overset{{*}}{\rightharpoonup }}{c_{0}} \sigma (\nu ){\big |\nabla \chi \big |} \qquad \text{in }{\mathcal{M}}(\Omega \times [0,T]), \end{equation} -
(iii)
(4.15) \begin{equation} \sqrt{\varepsilon f(\!-\!\nabla u_{\varepsilon })}-\sqrt{\frac{1}{\varepsilon }W(u_{\varepsilon })} \longrightarrow 0 \qquad \text{in } L^{2}(\Omega \times (0,T)), \end{equation} -
(iv)
(4.16) \begin{equation} \frac{1}{2}\left (\varepsilon f(\!-\!\nabla u_{\varepsilon })-\frac{1}{\varepsilon }W(u_{\varepsilon }) \right )\longrightarrow 0 \qquad \text{in } L^{1}(\Omega \times (0,T)), \end{equation} -
(v)
(4.17) \begin{equation} \varepsilon f(\!-\!\nabla u_{\varepsilon }){\overset{{*}}{\rightharpoonup }}{c_{0}} \chi \big | \quad \text{and} \quad \frac{1}{\varepsilon }W(u_{\varepsilon }){\overset{{*}}{\rightharpoonup }}{c_{0}} \sigma (\nu ){\big |\nabla \chi \big |} \qquad \text{in }{\mathcal{M}}(\Omega \times [0,T]). \end{equation}
Proof.
-
(i) It suffices to show that
\begin{equation*}\frac {1}{2}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac {1}{\varepsilon }W(u_{\varepsilon }) \right ) \zeta dx dt \to {c_{0}} {\int _{0}^{T}\int _{\Omega }} \sigma (\nu )\zeta {\big |\nabla \chi \big |} \end{equation*}for all $\zeta \in C^{1}(\Omega \times [0,T])$ with $0\leq \zeta \leq 1$ . By a density and linearity argument, the same convergence then holds true for all $\zeta \in C(\Omega \times [0,T])$ .If $\zeta \in C^{1}(\Omega \times [0,T])$ and $0\leq \zeta \leq 1$ , we can use Young’s inequality and the chain rule for Sobolev functions to estimate
\begin{align*} \frac{1}{2}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) \zeta dx dt &\geq{\int _{0}^{T}\int _{\Omega }} \sqrt{f(\!-\!\nabla u_{\varepsilon })W(u_{\varepsilon }) } \,\zeta dx dt\\[5pt] &={\int _{0}^{T}\int _{\Omega }} \sigma (-\phi ^{\prime }(u_{\varepsilon })\nabla u_{\varepsilon }) \zeta dx dt\\[5pt] &={\int _{0}^{T}\int _{\Omega }} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon }))\zeta dx dt. \end{align*}By Lemma 2.5, we have that(4.18) \begin{equation} {\int _{0}^{T}\int _{\Omega }} \sigma (B)\,dxdt=\sup _{\eta \in C^{1}(\Omega \times [0,T])\atop \sigma{^{\circ }}(\eta )\leq 1}{\int _{0}^{T}\int _{\Omega }} B\cdot \eta \,dx dt \end{equation}for all $B\in L^{1}(\Omega \times (0,T))^{d}$ .We apply (4.18) and the $L^{1}$ -convergence $\phi \circ u_{\varepsilon }\to{c_{0}} \chi$ as $\varepsilon \searrow 0$ to the above estimate, which, recalling Lemma 2.3, yields
(4.19) \begin{align} \liminf _{\varepsilon \searrow 0} \frac{1}{2}{\int _{0}^{T}\int _{\Omega }} &\left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) \zeta dx dt\nonumber \\[5pt] &\geq \liminf _{\varepsilon \searrow 0}{\int _{0}^{T}\int _{\Omega }} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon }))\zeta dx dt \nonumber \\[5pt] &= \liminf _{\varepsilon \searrow 0} \sup _{\eta }{\int _{0}^{T}\int _{\Omega }} \eta \cdot (\!-\!\nabla (\phi \circ u_{\varepsilon }))\zeta dx dt \nonumber \\[5pt] &= \liminf _{\varepsilon \searrow 0} \sup _{\eta }{\int _{0}^{T}\int _{\Omega }}(\phi \circ u_{\varepsilon })\mathrm{div}(\zeta \eta ) dx dt \nonumber \\[5pt] &\geq{c_{0}} \sup _{\eta }{\int _{0}^{T}\int _{\Omega }}\chi \mathrm{div}(\zeta \eta ) dx dt \nonumber \\[5pt] &= -{c_{0}} \sup _{\eta }{\int _{0}^{T}\int _{\Omega }} \zeta \eta \cdot \frac{\nabla \chi }{{\big |\nabla \chi \big |}}{\big |\nabla \chi \big |} \nonumber \\[5pt] &={c_{0}} \sup _{\eta }{\int _{0}^{T}\int _{\Omega }} \zeta \eta \cdot \nu{\big |\nabla \chi \big |}\nonumber \\[5pt] &={c_{0}}{\int _{0}^{T}\int _{\Omega }} \sigma (\nu )\zeta{\big |\nabla \chi \big |}, \end{align}where the supremum is taken over all $\eta \in C^{1}(\Omega \times [0,T])^{d}$ such that $\sigma{^{\circ }}(\eta )\leq 1$ . To prove the estimate from above, we observe that (4.19) also applies to the function $1-\zeta$ instead of $\zeta$ , and use the energy convergence assumption (4.3). Indeed,(4.20) \begin{align} \limsup _{\varepsilon \searrow 0} \frac{1}{2}{\int _{0}^{T}\int _{\Omega }} &\left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) \zeta dx dt\nonumber \\[5pt] &=\limsup _{\varepsilon \searrow 0} \left (\frac{1}{2}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) dx dt \right .\nonumber \\[5pt] &\hspace{8.5pt}\left . - \frac{1}{2}{\int _{0}^{T}\int _{\Omega }} \left ({\varepsilon } f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) (1-\zeta ) dx dt \right )\nonumber \\[5pt] &=\lim _{\varepsilon \searrow 0} \int _{0}^{T}E_{\varepsilon }[u_{\varepsilon }(t)]dt\nonumber \\[5pt] &\quad{- \liminf _{\varepsilon \searrow 0}\frac{1}{2}}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) (1-\zeta ) dx dt\nonumber \\[5pt] &\leq \int _{0}^{T}E[u(t)]dt -{c_{0}}{\int _{0}^{T}\int _{\Omega }} \sigma (\nu ) (1-\zeta ){\big |\nabla \chi \big |} \nonumber \\[5pt] &={c_{0}}{\int _{0}^{T}\int _{\Omega }} \sigma (\nu ) \zeta{\big |\nabla \chi \big |} . \end{align}The combination of the two inequalities (4.19) and (4.20) yields the first claim. -
(ii) In the same manner as in (i), it suffices to show that
\begin{equation*}{\int _{0}^{T}\int _{\Omega }} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon })) \zeta dx dt \to {c_{0}} {\int _{0}^{T}\int _{\Omega }} \sigma (\nu )\zeta {\big |\nabla \chi \big |} \end{equation*}whenever $\zeta \in C^{1}(\Omega \times [0,T])$ and $0 \leq \zeta \leq 1$ .The $\liminf$ inequality follows from the chain of inequalities in (4.19).
For the $\limsup$ inequality, we can use Young’s inequality and (i) to compute
\begin{align*} \limsup _{\varepsilon \searrow 0}{\int _{0}^{T}\int _{\Omega }} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon })) \zeta dx dt &\leq \lim _{\varepsilon \searrow 0}\frac{1}{2}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) \zeta dx dt\\[5pt] &={c_{0}}{\int _{0}^{T}\int _{\Omega }} \sigma (\nu ) \zeta{\big |\nabla \chi \big |}. \end{align*} -
(iii) By taking $\zeta \equiv 1$ as test functions for the weak-* limits in (i) and (ii), we see that
(4.21) \begin{align} \lim _{\varepsilon \searrow 0}\frac{1}{2}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) &dx dt = \lim _{\varepsilon \searrow 0}{\int _{0}^{T}\int _{\Omega }} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon })) dx dt \nonumber \\[5pt] & ={c_{0}}{\int _{0}^{T}\int _{\Omega }} \sigma (\nu ){\big |\nabla \chi \big |}. \end{align}This observation allows us to compute\begin{align*}{\int _{0}^{T}\int _{\Omega }}& \left (\sqrt{\varepsilon f(\!-\!\nabla u_{\varepsilon })}-\sqrt{\frac{1}{\varepsilon }W(u_{\varepsilon })} \right )^{2} dx dt\\[5pt] &={\int _{0}^{T}\int _{\Omega }}\left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) dx dt - 2{\int _{0}^{T}\int _{\Omega }} \sqrt{f(\!-\!\nabla u_{\varepsilon })W(u_{\varepsilon })}dx dt\\[5pt] &={\int _{0}^{T}\int _{\Omega }}\left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) \right ) dx dt - 2{\int _{0}^{T}\int _{\Omega }} \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon }))dx dt\\[5pt] &\longrightarrow 0 \qquad \text{as } \varepsilon \searrow 0. \end{align*} -
(iv) Young’s inequality yields
\begin{align*} \left |\varepsilon f(\!-\!\nabla u_{\varepsilon })-\frac{1}{\varepsilon }W(u_{\varepsilon })\right |&\leq \delta \left (\sqrt{\varepsilon f(\!-\!\nabla u_{\varepsilon })}+\sqrt{\frac{1}{\varepsilon }W(u_{\varepsilon })} \right )^{2} \\[5pt] &\quad + \frac{1}{4\delta }\left (\sqrt{\varepsilon f(\!-\!\nabla u_{\varepsilon })}-\sqrt{\frac{1}{\varepsilon }W(u_{\varepsilon })} \right )^{2} \end{align*}for all $\delta \gt 0$ . Taking the limit $\varepsilon \searrow 0$ and using (iii) as well as (4.21), we obtain\begin{align*} \limsup _{\varepsilon \searrow 0}{\int _{0}^{T}\int _{\Omega }}&\left |\varepsilon f(\!-\!\nabla u_{\varepsilon })-\frac{1}{\varepsilon }W(u_{\varepsilon })\right | dx dt \\[5pt] &\leq \delta \limsup _{\varepsilon \searrow 0}{\int _{0}^{T}\int _{\Omega }} \left (\sqrt{\varepsilon f(\!-\!\nabla u_{\varepsilon })}+\sqrt{\frac{1}{\varepsilon }W(u_{\varepsilon })} \right )^{2} dx dt\\[5pt] &= \delta \limsup _{\varepsilon \searrow 0}{\int _{0}^{T}\int _{\Omega }} \left (\varepsilon f(\!-\!\nabla u_{\varepsilon })+\frac{1}{\varepsilon }W(u_{\varepsilon }) + 2 \sigma (\!-\!\nabla (\phi \circ u_{\varepsilon })) \right ) dx dt\\[5pt] &= 4 \delta{c_{0}}{\int _{0}^{T}\int _{\Omega }} \sigma (\nu ){\big |\nabla \chi \big |}. \end{align*}Since $\delta \gt 0$ is arbitrary, it follows that\begin{equation*} \lim _{\varepsilon \searrow 0} {\int _{0}^{T}\int _{\Omega }}\left |\varepsilon f(\!-\!\nabla u_{\varepsilon })-\frac {1}{\varepsilon }W(u_{\varepsilon })\right | dx dt=0. \end{equation*} -
(v) The two convergence results follow from adding and subtracting (i) and (iv), respectively.
4.3 Construction of the normal velocity
This construction follows the argument in the first step of [Reference Laux and Simon42, Proposition 2.10], where it was carried out by Simon and the first author for the (multiphase) isotropic case. The idea is to introduce the velocity as a Radon–Nikodým density $V\;:\!=\;\frac{\partial _{t}\chi }{{\big |\nabla \chi \big |}}$ .
For simplicity of notation, we denote the energy density of the anisotropic Cahn–Hilliard energy by
Lemma 4.4. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), there exists a $|\nabla \chi |$ -measurable normal velocity $V\;:\;\Omega \times (0,T)\to{\mathbb{R}}$ satisfying ( 2.17 ) and ( 2.18 ).
The first step of the proof is to show that the Radon–Nikodým theorem is applicable: Given a smooth test function $\zeta \in{C_{c}^{\infty }}(\Omega \times (0,T))$ , the definition of distributional derivatives and an application of the Cauchy–Schwarz inequality yield
It follows from (4.13) that
Thus, by taking the limit inferior on both sides of (4.22) and recalling that $\phi \circ u_{\varepsilon } \to{c_{0}} \chi$ in $L^{1}(\Omega \times (0,T))$ as $\varepsilon \searrow 0$ , we find
where the second inequality follows from the optimal energy dissipation inequality (3.4).
It is desirable to reformulate (4.24) in terms of open sets instead of test functions: Given an open set $A\subseteq \Omega \times (0,T)$ , let us maximize the left-hand side of (4.24) over all $\zeta \in{C_{c}^{\infty }}(A)$ with $\big |\zeta \big |\leq 1$ . This provides us with the inequality
Making use of the outer regularity of Radon measures, we see that
for all Borel sets $E \subseteq \Omega \times (0,T)$ such that ${\big |\nabla \chi \big |}(E)=0$ , i.e., $\partial _{t}\chi \ll{\big |\nabla \chi \big |}$ . By the Radon–Nikodým theorem, there exists a $\big |\nabla \chi \big |$ -measurable function $V$ such that $\partial _{t}\chi = V{\big |\nabla \chi \big |}$ on the open set $\Omega \times (0,T)$ .
In order to prove the square integrability (2.17) of $V$ , we go back to (4.24). Let us first fix a finite number $M\gt 0$ and find a sequence $\{\zeta _{k}\}_{k \in{\mathbb{N}}}$ of smooth test functions such that $\zeta _{k} \to V\chi _{\{|V|\leq M\}}$ in $L^{2}({\big |\nabla \chi \big |})$ and $|\zeta _{k}|\leq M$ for all $k \in{\mathbb{N}}$ . Then it follows by dominated convergence that $V\zeta _{k} \to V^{2}\chi _{\{|V|\leq M\}}$ in $L^{1}({\big |\nabla \chi \big |})$ and, therefore,
whereas the $L^{2}$ -convergence gives
Plugging in $\zeta _{k}$ and taking the limit $k \to \infty$ in (4.24), we now obtain
By rearranging this inequality and taking $M \to \infty$ with the help of the monotone convergence theorem, we find the desired integrability statement, namely
The final step is to prove the identity (2.18) (see also [Reference Hensel and Laux35, Lemma 7]). Given a test function $\zeta \in C^{1}(\Omega \times [0,T])$ and an intermediate time ${T^{\prime }} \in (0,T]$ , we introduce a cutoff $\eta _{\alpha }$ in time such that
and set $\zeta _{\alpha }(x,t)\;:\!=\;\zeta (x,t)\eta _{\alpha }(t)$ . Then, in particular, we have $\zeta _{\alpha }\in C_{c}^{1}(\Omega \times (0,T))$ for sufficiently small $\alpha \gt 0$ . The definition of $V$ as a Radon–Nikodým density yields
In the limit $\alpha \searrow 0$ , the left-hand side integral and the first right-hand side integral converge due to the dominated convergence theorem. For the last integral, we observe that $\eta _{\alpha }^{\prime }{\overset{{*}}{\rightharpoonup }} \delta _{0}-\delta _{{T^{\prime }}}$ in ${\mathcal{M}}({\mathbb{R}})$ as $\alpha \searrow 0$ . Furthermore, we have $\chi =u\in C^{0,\frac{1}{2}}\left ([0,T];\;L^{1}(\Omega )\right )$ by Lemma 4.2, and $\zeta$ is uniformly continuous, so that the map $t \mapsto \int _{\Omega }\chi (x,t)\zeta (x,t)dx$ is continuous on $[0,T]$ . We can therefore compute the limits of all three integrals and obtain
which proves (2.18).
4.4 Relative entropies
A key ingredient to derive the sharp interface limit is the following notions of relative entropies.
Definition 4.5.
-
(i) Let $u=\chi \in BV(\Omega )$ take values in $\{0,1\}$ almost everywhere. As in the definition of the anisotropic surface energy, let $\nu \;:\!=\;-\frac{\nabla \chi }{|\nabla \chi |}$ be the measure-theoretic outer unit normal. The relative entropy of $u$ with respect to a vector field $\xi \in C(\Omega )^{d}$ is
(4.28) \begin{equation} {\mathscr{E}\left [u\big |\xi \right ]}\;:\!=\;{c_{0}}\int _{\Omega }\left (\sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu \right ){\big |\nabla \chi \big |}, \end{equation}where $\psi \in C^{\infty }([0,\infty )$ is a cutoff function such that the three properties in (2.2) hold true. -
(ii) Let $\varepsilon \gt 0$ and $u_{\varepsilon } \in H^{1}(\Omega )\cap L^{\infty }(\Omega )$ . The $\varepsilon$ -relative entropy of $u_{\varepsilon }$ with respect to a vector field $\xi \in C(\Omega )^{d}$ is
(4.29) \begin{equation} {\mathscr{E}_{\varepsilon }\left [u_{\varepsilon }\big |\xi \right ]}\;:\!=\;\int _{\Omega }\left (\sigma (\!-\!\nabla u_{\varepsilon })+|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nabla u_{\varepsilon } \right )\sqrt{W(u_{\varepsilon })} dx. \end{equation}
Remark 4.6.
-
(i) For $u = \chi$ as in Definition 4.5(i), we can define the set of finite perimeter $A\;:\!=\;\left \{x \in \Omega \, \big |\,u(x)=1 \right \}$ (up to a null set). By De Giorgi’s structure theorem (see [Reference Ambrosio, Fusco and Pallara2, Theorem 3.59]), one can replace the total variation measure $|\nabla \chi |$ with the $(d-1)$ -dimensional Hausdorff measure and write
$@@$with $\partial ^{*}$ denoting the reduced boundary of a set of finite perimeter. -
(ii) One can also show that $|\nabla \chi (t)|\otimes{\mathcal{L}^{1}} (0,T) = |\nabla \chi |$ as Radon measures on $\Omega \times (0,T)$ . Thus, integrating the relative entropy over time yields
\begin{align*} \int _{0}^{T}{\mathscr{E}\left [u(t)\big |\xi (t)\right ]}dt & ={c_{0}}{\int _{0}^{T}\int _{\Omega }} \left (\sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu \right )|\nabla \chi (t)|dt\\[5pt] &={c_{0}}{\int _{0}^{T}\int _{\Omega }} \left (\sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu \right ){\big |\nabla \chi \big |}. \end{align*}
If $\xi$ is chosen to be a smooth approximation of the normal, then these relative entropies serve as tilt excesses and can be used to control quadratic error terms: Let $\xi \in C\left (\Omega \times [0,T]\right )^{d}$ such that $|\xi |\leq 1$ on $\Omega \times [0,T]$ . By Lemma 2.4(i), we have
The link between the relative entropy $\mathscr{E}$ and the phase-field version $\mathscr{E}_{\varepsilon }$ is the following convergence statement:
Lemma 4.7. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), we have
for every vector field $\xi \in C\left (\Omega \times [0,T]\right )^{d}$ such that $|\xi |\leq 1$ on $\Omega \times [0,T]$ , where $u=\chi$ is the limiting function constructed in Lemma 4.2 .
Proof. A direct computation yields
where the first term converges due to Theorem 4.3(ii). The convergence of the second term follows from the observation that $\nabla (\phi \circ u_{\varepsilon }){\overset{{*}}{\rightharpoonup }}{c_{0}} \nabla \chi$ in ${\mathcal{M}}(\Omega \times [0,T])$ , which is a consequence of the bound in (4.7) and the $L^{1}$ -convergence $\phi \circ u_{\varepsilon }\to{c_{0}} \chi$ .
While we have only used the first inequality in Lemma 2.4 so far, the second inequality helps us to show that the tilt excess can be made arbitrarily small by approximating the normal $\nu$ with suitable vector fields $\xi$ .
Lemma 4.8. For every $\delta \gt 0$ , there exists a smooth vector field $\xi \in C^{1}(\Omega \times [0,T])^{d}$ such that $|\xi |\leq 1$ in $\Omega \times [0,T]$ and
Proof. The outer unit normal $\nu$ is $\big |\nabla \chi \big |$ -measurable, and the estimate
shows that $\nu \in L^{2}({\big |\nabla \chi \big |})$ . Since $\big |\nabla \chi \big |$ is a Radon measure on $\Omega \times [0,T]$ , there exists an approximating sequence $\{\xi _{n}\}_{n\in \mathbb{N}} \subset C^{\infty }(\Omega \times [0,T])$ such that $\xi _{n}\to \nu$ in $L^{2}({\big |\nabla \chi \big |})$ as $n \to \infty$ .
Clearly, $\nu$ takes values in the closed convex set $\overline{B}_{1}=\left \{p \in{\mathbb{R}}^{d}\,\Big |\,|p|\leq 1 \right \}$ almost everywhere with respect to $\big |\nabla \chi \big |$ , so we can choose $\xi _{n}$ in a way that ensures that $|\xi _{n}|\leq 1$ on $\Omega \times [0,T]$ for all $n \in{\mathbb{N}}$ .
By Lemma 2.4(ii) and the Cauchy–Schwarz inequality, we obtain
as $n \to \infty$ .
4.5 Convergence of the curvature term
The following two subsections are dedicated to proving the distributional law of anisotropic mean curvature flow (2.19) for the limit function $u$ . The strategy is to use $B\cdot \varepsilon \nabla u_{\varepsilon }$ as a test function in the distributional formulation of the anisotropic Allen–Cahn equation (3.3), where $B \in C^{1}(\Omega \times [0,T])^{d}$ , and then pass to the limit as $\varepsilon \searrow 0$ on both sides separately.
Here and in the following subsections, $C$ denotes a generic positive constant that may depend on the pair of anisotropies $(\sigma,\mu )$ , the double-well potential $W$ , the time horizon $T$ , and the cutoff function $\psi$ . The constant $C$ is not necessarily the same on every occurrence.
Theorem 4.9. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), we have
for all $B\in C^{1}(\Omega \times [0,T])^{d}$ .
This theorem was first proved by Cicalese, Nagase, and Pisante in [Reference Cicalese, Nagase and Pisante18, Theorem 3.3]. However, we proceed with the alternative strategy of proof proposed in [Reference Laux39, Proposition 4.5].
We define the energy-stress tensor $T_{\varepsilon }$ by
With this definition, the statement of Theorem 4.9 can be rewritten as a weak-* convergence claim for the energy-stress tensor: Indeed, an integration by parts on the flat torus $\Omega$ yields
We observe that the right-hand side is exactly the term appearing in (4.32). Thus, in order to prove Theorem 4.9, it suffices to show that $T_{\varepsilon }{\overset{{*}}{\rightharpoonup }}{c_{0}} \left (\sigma (\nu )I_{d}-\nu \otimes D\sigma (\nu )\right ){\big |\nabla \chi \big |}$ as ${\mathbb{R}}^{d\times d}$ -valued Radon measures on $\Omega \times [0,T]$ .
As for the first summand of the energy-stress tensor $T_{\varepsilon }$ , it follows immediately from the equipartition of energy statement in Theorem 4.3(i) that
The following lemma covers the convergence of the second summand of $T_{\varepsilon }$ , thereby completing the proof of Theorem 4.9.
Lemma 4.10. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), we have
as $\varepsilon \searrow 0$ .
To prove Lemma 4.10, let us first fix a vector field $\xi \in C(\Omega \times [0,T])^{d}$ such that $|\xi |\leq 1$ in $\Omega \times [0,T]$ . This vector field will serve as an approximation for the measure-theoretic normal $\nu$ . Furthermore let $A \in C(\Omega \times [0,T])^{d \times d}$ be a test function.
For shorter notation, we introduce the approximate outer unit normal $\nu _{\varepsilon }$ via
Applying the product rule to $f=\sigma ^{2}$ and exploiting the positive $0$ -homogeneity of $D\sigma$ , one can rewrite the left-hand side of (4.35) as
where one can now conveniently insert the vector field $\xi$ at the cost of two errors controlled by the tilt excess $\mathscr{E}$ : Let us add zero twice and compute
As a consequence of the smoothness and homogeneity of $\sigma$ and since $\psi \equiv 0$ in a neighborhood of zero, the map $p \mapsto \psi (|p|)D\sigma (p)$ is Lipschitz continuous, i.e.,
The first right-hand side integral of (4.38) can now be estimated as follows:
and with the help of the energy convergence assumption (4.3) and Lemma 4.11 below, we obtain
Similarly recalling (4.30), the last integral on the right-hand side of (4.38) can be controlled by the tilt excess via
As for the remaining terms in (4.38), we have to show that
as $\varepsilon \searrow 0$ . Since $\psi (|\xi |) A\,D\sigma (\xi )$ is a continuous vector field on $\Omega \times [0,T]$ , this problem reduces to proving that $\varepsilon \sigma (\!-\!\nabla u_{\varepsilon })\nabla u_{\varepsilon }{\overset{{*}}{\rightharpoonup }} -{c_{0}} \nu{\big |\nabla \chi \big |}$ in ${\mathcal{M}}(\Omega \times [0,T])^{d}$ .
To prove this weak-* convergence statement, we rewrite
The first summand converges strongly to $0$ in $L^{1}(\Omega \times (0,T))$ : This follows by the Cauchy–Schwarz inequality if we recall that $\sqrt{\varepsilon }\nabla u_{\varepsilon }$ is bounded in $L^{2}(\Omega \times (0,T))$ as $\varepsilon \searrow 0$ and $\sqrt{\varepsilon f(\!-\!\nabla u_{\varepsilon })}-\sqrt{\frac{1}{\varepsilon }W(u_{\varepsilon })} \to 0$ in $L^{2}(\Omega \times (0,T))$ by Theorem 4.3(iii).
It has been shown in (4.6)–(4.8) that $\phi \circ u_{\varepsilon }$ is bounded in $W^{1,1}$ as $\varepsilon \searrow 0$ . In particular, the total variation $\int \int |\nabla u_{\varepsilon }|$ is uniformly bounded as $\varepsilon \searrow 0$ . Since $\phi \circ u_{\varepsilon }\to{c_{0}} \chi$ in $L^{1}(\Omega \times (0,T))$ , it follows that $\nabla (\phi \circ u_{\varepsilon }){\overset{{*}}{\rightharpoonup }}{c_{0}} \nabla \chi = -{c_{0}} \nu{\big |\nabla \chi \big |}$ in ${\mathcal{M}}(\Omega \times [0,T])^{d}$ .
In total, we have
By plugging in (4.40), (4.41), and (4.42) into (4.38), one arrives at
This concludes the proof of Lemma 4.10 since, appealing to Lemma 4.8, the time-integrated relative entropy can be made arbitrarily small.
Lemma 4.11. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), and with $\nu _\varepsilon$ defined in ( 4.36 ), we have
Proof. We apply Lemma 2.4(i), the estimate $|\nu _{\varepsilon }-\xi |\leq 2$ , and the Cauchy–Schwarz inequality in order to control the occurring integrals by the anisotropic Cahn–Hilliard energy and the $\varepsilon$ -relative entropy. Precisely,
One can now take the limit superior on both sides and note that, by assumption, we have $\limsup _{\varepsilon \searrow 0}\int _{0}^{T}E_{\varepsilon }[u_{\varepsilon }(t)]dt\lt \infty$ . Thus, applying Lemma 4.7 in the first term and Theorem 4.3(iii) in the second term yields
4.6 Convergence of the velocity term
The analogue of Theorem 4.9 for the left-hand side terms is
Theorem 4.12. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), we have
for all $B\in C^{1}(\Omega \times [0,T])^{d}$ .
Similarly to the argument for the curvature term, we replace the approximate outer normal $\nu _{\varepsilon }$ by a $\nu _{\varepsilon }$ by a vector field $\xi \in C(\Omega \times [0,T])^{d}$ such that $|\xi |\leq 1$ on terms can be controlled by the relative entropy. An additional error term arises as we replace the term $g(\!-\!\nabla u_{\varepsilon })$ by its ‘asymptotic’ version $\frac{\sigma ({\nu _{\varepsilon }})}{\mu ({\nu _{\varepsilon }})}$ : Expanding both sides of (4.44), we obtain
where we have used the identity $-\sigma ({\nu _{\varepsilon }})\nabla u_{\varepsilon }=\sigma ({\nu _{\varepsilon }})|\nabla u_{\varepsilon }|{\nu _{\varepsilon }} = \sigma (\!-\!\nabla u_{\varepsilon }){\nu _{\varepsilon }}$ .
To see that the third line on the right-hand side converges to zero as $\varepsilon \searrow 0$ , it suffices to show that
as $\varepsilon \searrow 0$ for all $\zeta \in C^{1}(\Omega \times [0,T])$ . A density argument then yields the same statement for all $\zeta \in C(\Omega \times [0,T])$ , so that one can choose $\zeta = B\cdot \frac{\psi (|\xi |)}{\mu (\xi )}\xi$ .
In order to prove (4.46), we integrate by parts and write
The first integral converges to zero as $\varepsilon \searrow 0$ due to the equipartiton of energy, Theorem 4.3(iii), and the optimal dissipation. For the other three integrals, we apply the convergence statements for the subsequence $\varepsilon \searrow 0$ given in Lemma 4.2, which leads to
The distributional criterion (2.18) for the normal velocity yields (4.46), and so it follows that the third line on the right-hand side of (4.45) converges to zero.
It remains to estimate the “error” terms in (4.45). For the first integral on the right-hand side, one uses the asymptotic description of $g$ provided by Lemma 3.2(iii): For every $\delta \gt 0$ , there exists a positive constant $R$ such that $\left |g(p)-\frac{\sigma (p)}{\mu (p)}\right |\lt \delta$ whenever $|p|\geq R$ . Therefore, we obtain
from which it follows that
Since $\delta \gt 0$ is arbitrary, we have shown that the first integral on the right-hand side of (4.45) converges to zero as $\varepsilon \searrow 0$ .
As for the second integral on the right-hand side, we use that the map $p \mapsto \frac{\psi (|p|)}{\mu (p)}p$ is Lipschitz continuous on ${\mathbb{R}}^{d}$ , so that
It follows with the help of Lemma 4.11 that
Again making use of the Lipschitz continuity and Lemma 2.4(i), the last integral on the right-hand side of (4.45) can be estimated as
Plugging in the results for each right-hand side integral into (4.45) leads to
The proof of Theorem 4.12 is complete once we take into account that, by Lemma 4.8, the time-integrated relative entropy can be made arbitrarily small for suitable choices of vector fields $\xi$ .
4.7 Optimal energy dissipation inequality
Lemma 4.13. Under the assumptions of Theorem 4.1 , including the energy convergence ( 4.3 ), the inequality
holds true for all ${T^{\prime }} \in [0,T]$ .
The strategy to prove this lemma is to take $\varepsilon \searrow 0$ in the optimal energy dissipation identity (3.4) for the anisotropic Allen–Cahn equation. By the well-preparedness of the initial data (4.1), we have $\lim _{\varepsilon \searrow 0}E_{\varepsilon }[u_{\varepsilon,0}]=E_{0}$ . Furthermore, the $\Gamma$ -convergence $E_{\varepsilon }\overset{\Gamma }{\longrightarrow }E$ on $L^{1}(\Omega )$ (see [Reference Bouchitté9, Theorem 3.5]) yields $\liminf _{\varepsilon \searrow 0}E_{\varepsilon }[u_{\varepsilon }({T^{\prime }})]\geq E[u({T^{\prime }})]$ since it follows from the conditions in Lemma 4.2 that $u_{\varepsilon }(t)\to u(t)$ in all $t \in [0,T]$ .
We remark that Bouchitté [Reference Bouchitté9] defines $\mathrm{dom}(E_{\varepsilon })=\mathrm{Lip}(\Omega )$ , so that the liminf inequality for our definition in (1.7) does not immediately follow. However, the liminf inequality is the easier part of the $\Gamma$ -convergence statement and can be shown by combining the Modica–Mortola argument in Theorem 4.3(i) above with the truncation of $W$ that can be found in [Reference Leoni44, Proof of Theorem 1.6].
Therefore, the proof of the lemma reduces to proving the following inequality:
Claim 2. For every ${T^{\prime }} \in [0,T]$ , we have
Let $\zeta \in C(\Omega \times [0,T])$ and $\xi \in C(\Omega \times [0,T])^{d}$ such that $|\xi |\leq 1$ in $\Omega \times [0,T]$ . We apply Young’s inequality in the form $a^{2}\geq 2ab-b^{2}$ and add zero multiple times in order to replace $g$ with its asymptotic version and $\nu _{\varepsilon }$ with $\xi$ , similarly to the approach in (4.45). In this case, we obtain
To show that the first integral on the right-hand side converges to zero, let $\delta \gt 0$ . By Lemma 3.2(iii), there exists some $R\gt 0$ such that $\left |g(p)-\frac{\sigma (p)}{\mu (p)}\right |\lt \delta$ for all $p \in{\mathbb{R}}^{d}$ with $|p|\geq R$ . Arguing as in (4.47), we find
Taking first $\varepsilon \searrow 0$ , then $\delta \searrow 0$ leads to
For the second line on the right-hand side of (4.52), we argue analogously and obtain
Due to the Lipschitz continuity of the map $p \mapsto \psi (|p|)\frac{\sigma (p)}{\mu (p)}$ , the third integral can be estimated as
so that, by appealing to Lemma 4.11, we obtain
For the fourth line of the right-hand side of (4.52), a very similar computation yields
Finally, it follows from (4.46) and the equipartition of energy, Theorem 4.3(v), that
It is now desirable to let $\xi \to \nu$ in $L^{2}({\big |\nabla \chi \big |})$ so that $\int _{0}^{T}{\mathscr{E}\left [u(t)\big |\xi (t)\right ]}dt \to 0$ . This is possible by Lemma 4.8 and Lemma 2.4(i). By the dominated convergence theorem, we also have $\psi (|\xi |)\frac{\sigma (\xi )}{\mu (\xi )}\to \frac{\sigma (\nu )}{\mu (\nu )}$ in $L^{2}({\big |\nabla \chi \big |})$ . Under this convergence, it follows from (4.52) and the computations (4.53)–(4.57) that
In a last step we let $\zeta \to \frac{V}{\sigma (\nu )}$ in $L^{2}({\big |\nabla \chi \big |})$ , which yields the lower bound as stated in the claim.
4.8 Proof of Theorem 4.1
We simply wrap up the proof of the theorem.
Proof. The compactness follows from Lemma 4.2. Under the energy convergence assumption, the distributional formulation for the time derivative (2.18) follows from Lemma 4.4. The optimal energy dissipation relation (2.20) follows from Lemma 4.13. To obtain the distributional formulation for the curvature in (2.19), we use $B\cdot \varepsilon \nabla u_{\varepsilon }$ as a test function in the distributional formulation of the anisotropic Allen–Cahn equation (3.3), where $B \in C^{1}(\Omega \times [0,T])^{d}$ , where we recall that by Theorem 3.10, we have $B\cdot \varepsilon \nabla u_{\varepsilon } \in L^{2}\left (0,T;\;H^{1}(\Omega )\right )$ , and that these functions are admissible test functions in (3.3). To pass to the limit as $\varepsilon \searrow 0$ in the left-hand side, one applies Theorem 4.12, and likewise for the right-hand side, apply Theorem 4.9.
5. Weak–strong uniqueness for anisotropic mean curvature flow
The goal of this section is to prove that, as long as a strong solution to anisotropic mean curvature flow (1.1) exists, any $BV$ solution with the same initial data coincides with the strong solution. One needs to require additional regularity for $(\sigma,\mu )$ to make sure that strong solutions will be sufficiently smooth, and we will also rely on the higher regularity of $\sigma$ in the proof of the weak–strong uniqueness statement. Thus, we assume that $\sigma,\mu \in C^{\infty }\big ({\mathbb{R}}^{d}\setminus \{0\}\big )$ . Further, without loss of generality, we let $c_0 = 1.$ Following Hensel and Moser [Reference Hensel and Moser37, Definition 10], we define strong solutions to (1.1) as follows:
Definition 5.1. Let $T\gt 0$ be a finite time horizon. A family $\{{\mathscr{A}}(t)\}_{t\in [0,T]}$ of open subsets of $\Omega$ is a strong solution to anisotropic mean curvature flow if
-
• $\partial{\mathscr{A}}(0)$ is an embedded $C^{\infty }$ -submanifold,
-
• there exists a $C^{\infty }$ -map $\Phi \;:\;\Omega \times [0,T]\to \Omega$ such that $\Phi (\cdot,t)$ is a diffeomorphism for all $t\in [0,T]$ , furthermore $\Phi (\cdot,0)={\mathrm{id}}_{\Omega }$ and
\begin{equation*} \Phi ({\mathscr {A}}(0),t)={\mathscr {A}}(t), \qquad \Phi (\partial {\mathscr {A}}(0),t)=\partial {\mathscr {A}}(t) \end{equation*}for all $t \in [0,T]$ , and -
• $\{\partial{\mathscr{A}}(t) \}_{t\in [0,T]}$ evolves by (1.1) in the classical sense.
The setup in [Reference Hensel and Moser37, Definition 10 and Remark 15] also suggests that the $C^{\infty }$ -regularity for $(\sigma,\mu )$ , $\Phi$ , and $\partial{\mathscr{A}}(0)$ can be relaxed.
We are now ready to formulate the central theorem of this section:
Theorem 5.2. Let ${\mathscr{A}(t)}_{t \in [0,T]}$ be a solution of anisotropic mean curvature flow ( 1.1 ), and further, let $\{A(t)\}_{t\in [0,T]}$ be time parametrized collection of sets with $\chi \;:\!=\;\chi _{A}$ a distributional solution of anisotropic mean curvature flow as in Definition 2.8 . If
then
for all $t \in [0,T]$ .
The proof for this theorem is modeled after [Reference Hensel and Laux35, Sections 2.2, 4], where Hensel and the first author derive an analogous result for multiphase isotropic mean curvature flow. The key step in this argument is to find a gradient flow calibration (see below) for the smooth evolution $\left \{\mathscr{A}(t)\right \}_{t \in [0,T]}$ . While the existence of a gradient flow calibration is nontrivial in the multiphase case (see [Reference Hensel and Moser37, Theorem 4] for a gradient flow calibration for multiphase mean curvature flow in $d=2$ with constant contact angle), such a calibration can always be constructed explicitly for a smooth two-phase evolution.
5.1 Gradient flow calibrations
In the remainder of this chapter, $C \lt \infty$ and $c\gt 0$ denote positive constants (‘large’ and ‘small’, respectively) that may depend on the pair of anisotropies $(\sigma,\mu )$ , on the time horizon $T$ , and on the smooth evolution $\left \{\mathscr{A}(t)\right \}_{t \in [0,T]}$ . These constants need not be the same on every occurrence.
Definition 5.3. Let $\left \{{\mathscr{A}}(t) \right \}_{t\in [0,T]}$ be a strong solution to anisotropic mean curvature flow consisting of nonempty open proper subsets of $\Omega$ . A gradient flow calibration for $\left \{{\mathscr{A}}(t) \right \}_{t\in [0,T]}$ is a triple
satisfying
-
• the approximate evolution equations
(5.1) \begin{align} \left | \partial _{t}\xi + (B\cdot \nabla )\xi + (\nabla B){^{T}}\xi \right |(x, t) & \leq C\; \mathrm{dist}(x, \partial{\mathscr{A}}(t))&\text{in } \Omega \times [0,T], \end{align}(5.2) \begin{align} \left | \xi \cdot \left (\partial _{t}\xi + (B\cdot \nabla )\xi \right )\right |(x, t) & \leq C\; \mathrm{dist}^{2}(x, \partial{\mathscr{A}}(t))&\text{in } \Omega \times [0,T], \end{align}(5.3) \begin{align} \left | \partial _{t}\vartheta + (B\cdot \nabla )\vartheta \right |(x, t) & \leq C\; \mathrm{dist}(x, \partial{\mathscr{A}}(t))&\text{in } \Omega \times [0,T], \end{align} -
• the compatibility condition
(5.4) \begin{equation} \left | B \cdot \xi + \mu (\xi )\mathrm{div}(|\xi |\psi (|\xi |)D \sigma (\xi ))\right |(x, t)\leq C\; \mathrm{dist}(x, \partial{\mathscr{A}}(t))\quad \text{in } \Omega \times [0,T], \end{equation} -
• and with $\nu _{\partial{\mathscr{A}}(t)}$ denoting the outer unit normal of the set ${\mathscr{A}}(t)$ , we have the coercivity conditions
(5.5) \begin{align} \xi (x,t)&=\nu _{\partial{\mathscr{A}}(t)}(x)&&\text{on } \bigcup _{t\in [0,T]}\left (\partial{\mathscr{A}}(t)\times \{t\}\right ), \end{align}(5.6) \begin{align} |\xi (x, t)|&\leq 1-c\;\mathrm{dist}^{2}(x, \partial{\mathscr{A}}(t)) &&\text{in } \Omega \times [0,T], \end{align}(5.7) \begin{align} \vartheta (x,t)&\gt c\;\mathrm{dist}(x, \partial{\mathscr{A}}(t))&&\text{in } \bigcup _{t\in [0,T]}\left (\overline{{\mathscr{A}}(t)}^{c}\times \{t\}\right ), \end{align}(5.8) \begin{align} \vartheta (x,t)&\lt -c\;\mathrm{dist}(x, \partial{\mathscr{A}}(t)) &&\text{in } \bigcup _{t\in [0,T]}\left ({\mathscr{A}}(t)\times \{t\}\right ). \end{align}
Intuitively, $\xi$ is an extension of the outer unit normal $\nu _{\partial{\mathscr{A}}(t)}$ (with an additional coercivity property), whereas $B$ extends the normal velocity vector and $\vartheta$ is comparable to a signed distance function. The compatibility condition (5.4) encodes the motion by anisotropic mean curvature. The space $C_{1}^{2}(\Omega \times [0,T])$ is defined as
Let us collect the key inequalities for gradient flow calibrations that will be used in the proof of Theorem 5.2.
Lemma 5.4. Let $(\xi, B, \vartheta )$ be a gradient flow calibration for $\left \{{\mathscr{A}}(t) \right \}_{t\in [0,T]}$ . Then the following estimates hold true for all $(x,t)\in \Omega \times [0,T]$ :
-
(i)
\begin{equation*} c\;\mathrm {dist}^{2}(x, \partial {\mathscr {A}}(t)) \leq 1-\left |\xi \right |(x,t)\leq C\; \mathrm {dist}^{2}(x, \partial {\mathscr {A}}(t)), \end{equation*} -
(ii)
\begin{equation*} c \;\mathrm {dist}^{2}(x, \partial {\mathscr {A}}(t)) \leq 1-\left |\xi \right |^{2}(x,t)\leq C \;\mathrm {dist}^{2}(x, \partial {\mathscr {A}}(t)), \end{equation*} -
(iii)
\begin{equation*} \left |\xi \cdot (\xi \cdot \nabla )B\right |(x,t)\leq C \;\mathrm {dist}(x, \partial {\mathscr {A}}(t)). \end{equation*}
If $\nu \;:\; \Omega \times [0,T]\to{\mathbb{R}}^{d}$ such that $|\nu |\equiv 1$ , then
-
(iv)
\begin{equation*} |\nu -\xi |^{2}\leq C\left (\sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu \right ), \end{equation*} -
(v)
\begin{equation*} 1-|\xi |\leq C\left (\sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu \right ), \text { and} \end{equation*} -
(vi)
\begin{equation*} \left |\xi \cdot (\nu -\xi )\right |\leq C\left (\sigma (\nu ) - |\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu \right ). \end{equation*}
Proof.
-
(i) The lower bound is precisely the coercivity condition (5.6).
For the upper bound, we observe that, due to (5.5) and (5.6), $|\xi |$ attains its maximum on $\partial{\mathscr{A}}(t)$ for every $t \in [0,T]$ . In particular, we have $\nabla |\xi |=0$ on $\partial{\mathscr{A}}(t)$ . A second-order Taylor expansion yields $1-|\xi |(x,t)\leq C\;\mathrm{dist}^{2}(x,\partial{\mathscr{A}}(t))$ . For this argument, we use the smoothness of $\xi$ (i.e., the continuity of $\nabla \xi$ and $\nabla ^2\xi$ on $\Omega \times [0,T]$ ) and the compactness of the domain $\Omega \times [0,T]$ .
-
(ii) This follows immediately from (i) since the condition $|\xi |\leq 1$ allows us to compute
\begin{equation*} 1-|\xi |\leq 1-|\xi |^{2}=(1-|\xi |)(1+|\xi |)\leq 2(1-|\xi |), \end{equation*}i.e., $1-|\xi |^{2}$ is bounded from below and above by a rescaled $1-|\xi |$ . -
(iii) This inequality is a consequence of (5.1) and (5.2): We have
\begin{align*} \left |\xi \cdot (\xi \cdot \nabla )B \right |(x,t) &= \left | \xi \cdot \left (\partial _{t}\xi + (B\cdot \nabla )\xi + (\nabla B){^{T}}\xi \right ) - \xi \cdot \left (\partial _{t}\xi + (B\cdot \nabla )\xi \right )\right |(x,t) \\[5pt] &\leq C\;\mathrm{dist}(x, \partial{\mathscr{A}}(t))+C\;\mathrm{dist}^{2}(x, \partial{\mathscr{A}}(t)) \\[5pt] &\leq C\;\mathrm{dist}(x, \partial{\mathscr{A}}(t)). \end{align*} -
(iv) This is the statement of Lemma 2.4(i).
-
(v) This is another elementary estimate: With the help of Lemma 2.3(ii), (vi), one computes
\begin{align*} \sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu &\geq \sigma (\nu )-|\xi |\psi (|\xi |)\sigma ( \nu )\\[5pt] &\geq (\!\min _{S^{d-1}}\sigma )\,(1-|\xi |\psi (|\xi |))\\[5pt] &\geq (\!\min _{S^{d-1}}\sigma )\,(1-|\xi |). \end{align*} -
(vi) follows from the previous two estimates since
\begin{equation*} \xi \cdot (\nu -\xi )=-\frac {1}{2}\left (1-2\,\nu \cdot \xi +|\xi |^{2}\right ) + \frac {1}{2}\left (1-|\xi |^{2}\right )=-\frac {1}{2}|\nu - \xi |^{2} + \frac {1}{2}\left (1-|\xi |^{2}\right ). \end{equation*}
Lemma 5.5. Every strong solution $\{{\mathscr{A}}(t)\}_{t\in [0,T]}$ to anisotropic mean curvature flow ( 1.1 ) that consists of nonempty open proper subsets of $\Omega$ admits a gradient flow calibration $(\xi, B, \vartheta )$ .
Proof. There exists a positive $\delta \gt 0$ such that in the neighborhood
the signed distance function $\mathrm{sdist}\;:\; U \to (-\delta, \delta )$ and the orthogonal projection
with respect to $\partial{\mathscr{A}}(t)$ are well-defined and regular (see [Reference Gilbarg and Trudinger34, Lemmas 14.16, 14.17]). We use the sign convention that $\mathrm{sdist}(x,t)\lt 0$ for $x \in{\mathscr{A}}(t)\cap B_{\delta }(\partial{\mathscr{A}}(t))$ .
Let $\zeta \in C_{c}^{\infty }((-\delta,\delta ))$ be a cutoff function such that $\zeta (s)=1-s^{2}$ for $|s|\leq \frac{\delta }{2}$ and $s\zeta ^{\prime }(s)\leq 0$ on $(-\delta,\delta )$ . Furthermore, we define another truncation $f \in C^{\infty }({\mathbb{R}})$ such that $f(s)=s$ for $|s|\leq \frac{\delta }{2}$ , $f(s)={\mathrm{sgn}}(s)\delta$ for $|s|\geq \frac{3}{4}\delta$ , and $f^{\prime }\geq 0$ . Let now
Second, we define $\vartheta \;:\!=\; f\circ\!\mathrm{sdist}$ on $U$ , which can be extended locally constantly to $\Omega \times [0,T]$ . Third, let
which can be extended to $\Omega \times [0,T]$ by zero using a cutoff function.
To show that $(\xi, B, \vartheta )$ is a gradient flow calibration for $\{{\mathscr{A}}(t)\}_{t\in [0,T]}$ as in Definition 5.3, we rely on a generic equation for the signed distance of a smoothly evolving surface with normal velocity vector $B$ , valid in the neighborhood $U$ . Precisely, for a smoothly evolving surface, we have
Supposing (5.10), we may differentiate this equation in space and replace $\nabla \mathrm{sdist}$ by $\xi$ when it makes an explicit appearance but also in $B$ (recalling (5.9)) to recover (5.1); note the error $O(\mathrm{dist}(x,\partial{\mathscr{A}}(t)))$ comes from the replacement of $\nabla ^2 \mathrm{sdist}$ by $\nabla \xi$ and $\partial _t (\nabla \, \mathrm{sdist})$ . Unsurprisingly, given the definition of $\vartheta$ above and (5.10), we immediately obtain (5.3). To recover (5.2), we note that $|\xi | \equiv 1$ on $\cup _{t\in [0,T]}\left (\partial{\mathscr{A}}(t)\times \{t\}\right )$ so that
One can then explicitly verify that
which allows one to recover the quadratic error in (5.2). Finally, the remaining properties of the calibration are easily verified by the choice of cutoff functions.
We briefly outline how to prove (5.10) at a space-time point $(x,t)\in U$ . Suppose first that $x\not \in{\mathscr{A}}(t)$ and that surface velocity $B(x,t) \;:\!=\; B(p(x,t),t)$ (a hopefully forgivable abuse of notation) points from the interface in the direction of $x$ . Straightforward geometric reasoning on the line segment $x + \mathbb{R} \nabla \mathrm{sdist}(x,t)$ then shows that
Supposing by contradiction that the previous inequality is strict, we have
for $\varepsilon \gt 0$ sufficiently small, where $\eta \gt 0$ is a small fixed constant. Letting $y_\varepsilon \in \partial{\mathscr{A}}(t+\varepsilon )$ be such that $\mathrm{sdist} (x,t+\varepsilon ) = |x - y_\varepsilon |$ , we let $\bar y_\varepsilon \;:\!=\; p(y_\varepsilon,t)$ . One may argue by smoothness of the interface evolution and the projection operator that for small enough $\varepsilon \gt 0$ (depending on $\eta$ ), we have
where on the right-hand side we used that $B(x,t)\cdot \nabla \mathrm{sdist}(x,t) = |B(x,t)|$ and $C\gt 0$ is a constant depending only on the smooth interface. Synthesizing (5.11) and (5.12) we have
a contradiction to the definition of $\mathrm{sdist}(x,t).$ This shows (5.10) at $(x,t)$ under the contingent hypotheses; the other cases are verified similarly (reversing the flow in time when necessary).
5.2 A Grönwall-type stability estimate
We wish to prove Theorem 5.2 by deriving a Grönwall-type estimate for a suitable quantity. A straightforward way to measure the difference between the calibrated evolution ${\mathscr{A}}(t)$ and the weak solution $A(t)$ ( $\chi _{A(t)}$ satisfying Definition 2.8) at a given time $t \in [0,T]$ is the bulk error
Since $|\vartheta (\cdot, t)|$ is strictly positive outside $\partial{\mathscr{A}}(t)$ , and hence almost everywhere in $\mathbb{R}^d$ this bulk error vanishes if and only if $|{\mathscr{A}}(t)\triangle A(t)|=0$ . However, the available stability estimate for the bulk error is not strong enough to apply Grönwall’s lemma immediately:
Lemma 5.6. Let ${\mathscr{A}(t)}_{t \in [0,T]}$ be a solution of anisotropic mean curvature flow ( 1.1 ) that is calibrated in the sense of Definition 5.3 . Further, let $\{A(t)\}_{t\in [0,T]}$ be time parametrized collection of sets with $\chi \;:\!=\;\chi _{A}$ a distributional solution of anisotropic mean curvature flow as in Definition 2.8 . Recalling the definition of $\mathscr{E}\left [\cdot \big |\cdot \right ]$ in ( 4.28 ) (with $c_0 = 1$ ), the following holds:
-
(i) The bulk error $\mathscr{B}\left [\chi ({T^{\prime }})\big |\vartheta ({T^{\prime }})\right ]$ at an arbitrary time ${T^{\prime }} \in [0,T]$ is given by
\begin{align*}{\mathscr{B}\left [\chi ({T^{\prime }})\big |\vartheta ({T^{\prime }})\right ]}&={\mathscr{B}\left [\chi (0)\big |\vartheta (0)\right ]}+\int _{0}^{{T^{\prime }}}\int _{\Omega }\left (\chi _{A(t)}-\chi _{\mathscr{A}(t)}\right )\left (\partial _{t}\vartheta +(B\cdot \nabla )\vartheta \right )dxdt\\[5pt] &\quad + \int _{0}^{{T^{\prime }}}\int _{\Omega }\left (\chi _{A(t)}-\chi _{\mathscr{A}(t)}\right )\vartheta \mathrm{div} B\, dx dt \\[5pt] &\quad + \int _{0}^{{T^{\prime }}}\int _{{\partial ^{*}} A(t)}\vartheta B\cdot (\xi -\nu )\,d{\mathcal{H}^{d-1}} dt \\[5pt] &\quad + \int _{0}^{{T^{\prime }}}\int _{{\partial ^{*}} A(t)}\vartheta (V-B\cdot \xi )\,d{\mathcal{H}^{d-1}} dt. \end{align*} -
(ii) For every $\delta \gt 0$ , there exists a constant $C(\delta )\gt 0$ , which also depends on the calibrated evolution, such that
(5.13) \begin{align} {\mathscr{B}\left [\chi ({T^{\prime }})\big |\vartheta ({T^{\prime }})\right ]}&\leq{\mathscr{B}\left [\chi (0)\big |\vartheta (0)\right ]} + C(\delta ) \int _{0}^{{T^{\prime }}}\left ({\mathscr{B}\left [\chi (t)\big |\vartheta (t)\right ]}+{\mathscr{E}\left [\chi (t)\big |\xi (t)\right ]}\right )dt\nonumber \\[5pt] &+ \delta \int _{0}^{{T^{\prime }}}\int _{{\partial ^{*}} A(t)}|V-B\cdot \xi |^{2}d{\mathcal{H}^{d-1}} dt \end{align}for all ${T^{\prime }} \in [0,T]$ .
Proof. See [Reference Hensel and Laux35, Sections 4.2, 4.3], where the argument is carried out for isotropic mean curvature flow. The same argument applies in our anisotropic setting since the definition of the bulk error functional has remained unchanged.
To compensate for the additional error term on the right-hand side of (5.13), we make use of another stability estimate for the relative entropy:
Lemma 5.7. Let the hypotheses of Lemma 5.6 hold. For all ${T^{\prime }} \in [0,T]$ , we have
A combination of the two stability estimates yields
In particular, it follows by Grönwall’s lemma that ${\mathscr{B}\left [\chi ({T^{\prime }})\big |\vartheta ({T^{\prime }})\right ]}=0$ for all ${T^{\prime }} \in [0,T]$ if ${\mathscr{B}\left [\chi ({T^{\prime }})\big |\vartheta (0)\right ]}+{\mathscr{E}\left [\chi ({T^{\prime }})\big |\xi (0)\right ]}=0$ , which completes the proof of Theorem 5.2.
5.3 Stability of the relative entropy
The goal of this subsection is to prove Lemma 5.7.
With the help of the divergence theorem and the surface energy (1.2) with $c_0 = 1$ , one can rewrite the relative entropy functional as
which using the distributional formulation of the time derivative from (2.19) leads to
Note, in the last step, we used the optimal energy dissipation inequality (2.20) for the weak solution $\chi$ . We introduce the notation $F(\xi )=|\xi |\psi (|\xi |)D\sigma (\xi )$ and $M(\xi )\;:\!=\;D_{\xi }F(\xi )=|\xi |\psi (|\xi |)\,D^{2}\sigma (\xi )+\left (\frac{\psi (|\xi |)}{|\xi |}+\psi ^{\prime }(|\xi |)\right )D\sigma (\xi )\otimes \xi$ and complete squares three times to write
The second square can be trivially estimated. We will now deal with the other squared terms as well as the remaining integrals separately:
The first square will later be used to compensate for a term in the stability estimate of the bulk error. To this end, it will be useful to estimate the term with the help of the inequality $-a^{2}\leq -\frac{1}{2}(a+b)^{2}+b^{2}$ and (4.30):
Similarly, for the third square, the inequality $a^{2}\leq 2(a-b)^{2}+2b^{2}$ (applied twice), the Lipschitz continuity of $\mu$ , the compatibility condition (5.4), and finally (4.30) yield
It remains to show that
The third integral on the left-hand side can be expanded in a way that resembles the approximate evolution equations (5.1) and (5.2):
Here, the first two integrals in the right-hand side expression are already in a suitable form: For the first integral this can be seen directly from (5.1), whereas for the second term a computation yields
so that one can invoke (5.2). Here we have used a fact that follows from the positive $0$ -homogeneity of $D\sigma$ , namely that $D^{2}\sigma (\xi )\xi = \frac{d}{ds}\Big |_{s=0}D\sigma (\mathrm{e}^{s}\xi )=\frac{d}{ds}\Big |_{s=0}D\sigma (\xi )=0$ .
Furthermore, an application of the chain rule shows that
and we will from now on use this slightly shorter formulation for the third right-hand side integral of (5.19).
By using (5.19) and plugging in the computations for the three right-hand side integrals as well as the weak formulation of anisotropic mean curvature flow in (2.19), we obtain
It is now advisable to expand the first right-hand side integral and to add zero in order to isolate two more errors controlled by the time-integrated relative entropy:
The first two integrals are controlled by $\int _{0}^{{T^{\prime }}}{\mathscr{E}\left [\chi (t)\big |\xi (t)\right ]}dt$ ; in the case of the second integral this follows from the local Lipschitz continuity of $F$ .
We approach the last integral in (5.21) by an application of Taylor’s theorem and show that it cancels with the second to last integral in (5.20). Indeed, one computes that
where now each summand is controlled by $\sigma (\nu )-|\xi |\psi (|\xi |)D\sigma (\xi )\cdot \nu$ by Lemma 5.4 (iii) and (vi). Here we use that $\sigma \in C^{3}\big ({\mathbb{R}}^{d}\setminus \{0\}\big )$ .
Plugging in (5.21) and (5.22) into (5.20) yields
The four remaining surface integrals can be written as one integral involving the divergence of a matrix field since
As a result of an integration by parts and the symmetry relation $\mathrm{div}(\mathrm{div}(a\otimes b))= \mathrm{div}(\mathrm{div}(b\otimes a))$ , one can then see that
Plugging in (5.24) into (5.23) yields (5.18).
In a last step, plugging in (5.16), (5.17) and (5.18) into (5.15) yields
This concludes the proof of Lemma 5.7.
6. Conclusion
In this paper, we began by constructing solutions of the anisotropic Allen–Cahn equation using relatively elementary arguments, relying on convexity for instance, for a naturally associated minimizing movements scheme. With this, we showed that these solutions are precompact in an appropriate functional space and, as the interface width vanishes, converge to a solution of anisotropic mean curvature flow. In contrast to previous methods, our approach relies on control of a relative entropy, which quantifies the distance between the sharp interface and smooth interface coming from the approximation. Having access to the relative entropy naturally allows for further understanding of the BV-solution concept associated with the anisotropic curvature flow. Precisely, we proved that solutions of the anisotropic mean curvature flow, defined in terms of a distributional formulation of the curvature and an optimal energy dissipation relation, actually possess a weak–strong uniqueness property: meaning that when classical, smooth solutions exist, the weak solution with the same initial condition coincides with the smooth solution. Future questions will involve further investigation of convergence in the multiphase setting and obtaining quantitative convergence rates.
Competing interests
None.
Acknowledgements
The present paper is an extension of the third author’s master’s thesis at the University of Bonn.
Financial Support
This project had received funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC-2047/1 – 390685813 and the Graduiertenkolleg (Research Training Group) 2339 “Interfaces, Complex Structures, and Singular Limits in Continuum Mechanics”.