Reynolds-robust solvers for the stationary Navier-Stokes equations using H(div)–L² elements¶

This demo shows a discretisation using H(div)-L² elements and preconditioner for the stationary incompressible Navier-Stokes equations that exhibits Reynolds-robustness. For the discretisation, this means that its error estimates do not depend explicitly on the Reynolds number; for the preconditioner, this means that the number of Krylov iterations per Newton step barely changes as the Reynolds number is varied.

The demo was contributed by Patrick Farrell.

The stationary incompressible Navier-Stokes equations are a fundamental model for viscous fluid flow. A notorious difficulty is that classical iterative solvers degrade badly as the Reynolds number \(\mathrm{Re}\) grows: standard block preconditioners such as pressure convection-diffusion (PCD) or least-squares commutator (LSC) require iteration counts that grow with \(\mathrm{Re}\) [ESW14]. This demo shows how to construct a solver whose iteration count remains independent of both \(\mathrm{Re}\) and the mesh size, following the approach of [BO06, FMW19, HKXZ15].

The strategy combines three ingredients: an H(div)-conforming discretisation that captures the divergence-free constraint exactly at the discrete level; the augmented Lagrangian technique of [FG83] to control the pressure Schur complement at any Reynolds number; and a parameter-robust geometric multigrid preconditioner based on vertex-star space decomposition [Schoberl99] to handle the augmented velocity block efficiently.

The problem¶

We solve the classical lid-driven cavity problem for the stationary incompressible Navier-Stokes equations on the unit square \(\Omega = (0,1)^2\):

\[ \begin{align}\begin{aligned}-\nabla \cdot \bigl(2\,\mathrm{Re}^{-1}\,\varepsilon(u)\bigr) + \nabla \cdot (u \otimes u) + \nabla p &= 0 \quad \text{in } \Omega,\\\nabla \cdot u &= 0 \quad \text{in } \Omega,\end{aligned}\end{align} \]

where \(\varepsilon(u) = \tfrac{1}{2}(\nabla u + \nabla u^\top)\) is the symmetric velocity gradient. We impose no-slip conditions \(u = 0\) on the left, right, and bottom walls (\(\Gamma_1, \Gamma_2, \Gamma_3\)) and a lid velocity \(u = (16 x^2(1-x)^2,\, 0)\) on the top wall (\(\Gamma_4\)).

After discretisation this yields a saddle-point linear system for the Newton update:

\[\begin{split}\begin{pmatrix} A & B^\top \\ B & 0 \end{pmatrix} \begin{pmatrix} \delta u \\ \delta p \end{pmatrix} = \begin{pmatrix} f \\ 0 \end{pmatrix}.\end{split}\]

Here \(A\) is the linearised velocity block and \(B\) encodes the divergence constraint. The key difficulty for preconditioning is the pressure Schur complement \(S = -B A^{-1} B^\top\). For the Stokes equations (no advection), the Schur complement is spectrally equivalent to a scaled pressure mass matrix \(Q\) [ESW14, FG83, SW94], but this equivalence quickly degrades once the convection term is introduced.

The augmented Lagrangian and block preconditioning¶

In the context of the Stokes equations, the augmented Lagrangian method [FG83] adds a penalty term to the Lagrangian for the incompressibility constraint:

\[\mathcal{L}_\gamma(u, p) = \mathcal{L}(u, p) + \frac{\gamma}{2} \int_\Omega (\nabla \cdot u)^2 \,\mathrm{d}x,\]

where \(\gamma \geq 0\) is the augmentation parameter. Since the exact solution satisfies \(\nabla \cdot u = 0\), this term does not alter the solution. Applying this term to the Stokes equations, the corresponding augmented velocity block is

\[A_\gamma = 2\,\mathrm{Re}^{-1} \int_\Omega \varepsilon(\varphi_j) : \varepsilon(\varphi_i)\,\mathrm{d}x + \gamma \int_\Omega (\nabla \cdot \varphi_j)(\nabla \cdot \varphi_i)\,\mathrm{d}x,\]

and the pressure Schur complement approximation becomes \(S_\gamma \approx -(2\,\mathrm{Re}^{-1} + \gamma)^{-1} Q\). For Stokes the Schur complement is already well-controlled, so the augmentation is useful but not critical; for Navier-Stokes it is essential, since for large \(\gamma\) the Schur complement is driven to a scalar multiple of \(Q\), which is trivially invertible. The tradeoff is that large \(\gamma\) makes \(A_\gamma\) increasingly ill-conditioned, so the specialised multigrid strategy described below becomes necessary.

Block Gaussian elimination on the saddle-point system can be carried out in two distinct orderings. Eliminating the velocity block first gives the standard block LDU factorisation

\[\begin{split}\begin{pmatrix} A & B^\top \\ B & -C \end{pmatrix}^{-1} = \begin{pmatrix} I & -A^{-1}B^\top \\ 0 & I \end{pmatrix} \begin{pmatrix} A^{-1} & 0 \\ 0 & -S^{-1} \end{pmatrix} \begin{pmatrix} I & 0 \\ -BA^{-1} & I \end{pmatrix},\end{split}\]

where \(S = C + BA^{-1}B^\top\) is the pressure Schur complement. (For our inf-sup stable discretisation, \(C = 0\).) This requires one application of \(S^{-1}\) and two applications of \(A^{-1}\) per Krylov iteration. Since each application of \(A^{-1}\) corresponds to a full multigrid cycle, paying for it twice per iteration is significant.

An even more efficient alternative arises by eliminating the pressure block first—the block UDL factorisation. We cannot choose the pressure block as our pivot for the original matrix because \(C = 0\), but we can construct a preconditioner by adding the shift \(-\gamma^{-1}Q\). This gives a preconditioner

\[\begin{split}P_{\gamma} = \begin{pmatrix} A & B^\top \\ B & -\gamma^{-1}Q \end{pmatrix}.\end{split}\]

with an invertible \((2,2)\) block. The exact inverse of this matrix admits the factorisation

\[\begin{split}P_{\gamma}^{-1} = \begin{pmatrix} A & B^\top \\ B & -C \end{pmatrix}^{-1} = \begin{pmatrix} I & 0 \\ C^{-1}B & I \end{pmatrix} \begin{pmatrix} A_\gamma^{-1} & 0 \\ 0 & -C^{-1} \end{pmatrix} \begin{pmatrix} I & B^\top C^{-1} \\ 0 & I \end{pmatrix},\end{split}\]

where the velocity Schur complement is

\[A_\gamma = A + B^\top C^{-1} B = A + \gamma\, B^\top Q^{-1} B,\]

exactly the augmented Lagrangian velocity block. This factorisation requires only a single application of \(A_\gamma^{-1}\) per Krylov iteration, together with two applications of \(C^{-1} = \gamma Q^{-1}\). With the right choice of degrees of freedom for the pressure space (passing variant="integral" to the function space construction), \(Q\) is diagonal, so inverting it is essentially free. Using this reverse ordering halves the number of expensive multigrid solves compared with the standard block LDU.

Note that the reverse factorisation requires \(C\) to be invertible, which fails in the limit \(\gamma \to \infty\). Here, however, \(\gamma\) is a parameter chosen by us for our convenience, so this is never an obstacle.

To solve \(A_\gamma\) in a \(\gamma\)-robust way, the multigrid smoother must capture the kernel of the divergence operator in a certain sense. The theoretical foundation is due to Schöberl [Schoberl99]: put coarsely, a multigrid method is parameter-robust if the relaxation captures the kernel and its prolongation maps coarse kernel functions to (nearly) fine kernel functions. It is not known how to devise multigrid components that satisfy these requirements for arbitrary discretisations. The \(\mathrm{Re}\)-robust solver we present here can (at present) be applied to Scott-Vogelius discretisations [FMSW21a, FMSW21b], high-order Taylor-Hood discretisations, and H(div)-L² discretisations as coded in this demo. It also applies to the Mardal-Tai-Winther discretisation [MTW02] and a modification of the Bernardi-Raugel discretisation [FMW19]; its extension to further discretisations is a matter of ongoing research. In particular, it is not known how to apply these ideas to the popular low-order Taylor–Hood, MINI, or stabilised equal-order discretisations. In any case these latter discretisations do not give exactly incompressible velocity approximations, and their error estimates therefore depend on the Reynolds number, which is undesirable.

H(div) conforming discretisation¶

We use the Brezzi–Douglas–Marini (BDM) space of degree \(k = 2\) for velocity and a DG space of degree \(k-1\) for pressure, using the integral variant so that the pressure mass matrix is diagonal (and hence trivially inverted). This element pair yields a strongly divergence free velocity approximation, which causes the discretisation error estimates to become independent of the Reynolds number [JLM+17]. Because BDM functions have only normal continuity across edges, the viscous term must be discretised using a symmetric interior penalty DG formulation. The convective term uses standard DG upwinding.

We build a mesh hierarchy for geometric multigrid; this can be refined to make the simulation more accurate/expensive. The distribution parameters configure the mesh partitioning to enable greater overlap than usual, which is necessary for vertex-star relaxation in parallel.

from firedrake import *
from firedrake.petsc import PETSc
from collections.abc import Iterable

print = PETSc.Sys.Print

distribution_parameters = {"overlap_type": (DistributedMeshOverlapType.VERTEX, 1)}
num_refinements = 2
base = UnitSquareMesh(16, 16, distribution_parameters=distribution_parameters)
mh = MeshHierarchy(base, num_refinements)
mesh = mh[-1]
n = FacetNormal(mesh)
(x, y) = SpatialCoordinate(mesh)

We define the BDM–DG mixed space.

k = 2
V = FunctionSpace(mesh, "BDM", k)
Q = FunctionSpace(mesh, "DG", k-1, variant="integral")
W = MixedFunctionSpace([V, Q])

The boundary conditions impose the no-slip and lid conditions on the velocity component of the mixed space.

Re = Constant(1)
bcs = [DirichletBC(W.sub(0), 0, (1, 2, 3)),
       DirichletBC(W.sub(0), as_vector([16 * x**2 * (1-x)**2, 0]), (4,))]

We set up the solution function and test functions.

w = Function(W, name="Solution")
(u, p) = split(w)
z = TestFunction(W)
(v, q) = split(z)

Variational form¶

The full weak form consists of three groups of terms. The first group discretises the viscous term \(-\nabla \cdot (2\,\mathrm{Re}^{-1}\varepsilon(u))\) using a symmetric interior penalty DG method: a consistency term, a symmetry term, and a penalty term scaled by \(\sigma/h\) (with \(\sigma = 5(k+1)^2\)).

The second group is the DG upwind discretisation of the convective term \(\nabla \cdot (u \otimes u)\). On interior facets the upwind flux is \(\tfrac{1}{2}(u \cdot n + |u \cdot n|) u\).

The final group implements the pressure gradient and the divergence constraint.

gamma = Constant(10000)
sigma = Constant(5 * (k+1)**2)
h = CellVolume(mesh)/FacetArea(mesh)
uflux_int = 0.5*(dot(u, n) + abs(dot(u, n)))*u

F = (
      # Viscous terms
      2/Re * inner(sym(grad(u)), sym(grad(v)))*dx
    - 2/Re * inner(avg(sym(grad(u))), 2*avg(outer(v, n)))*dS
    - 2/Re * inner(2*avg(outer(u, n)), avg(sym(grad(v))))*dS
    + 2/Re * sigma/avg(h) * inner(avg(outer(u, n)), 2*avg(outer(v, n)))*dS
      # Convective terms
    - inner(u, div(outer(v, u)))*dx
    + inner(jump(uflux_int), jump(v))*dS
      # Off-diagonal terms
    - inner(p, div(v))*dx
    - inner(div(u), q)*dx
    )

Boundary conditions are imposed weakly via two helper functions. a_bc contributes the viscous boundary terms (analogues of the interior penalty terms restricted to boundary facets), and c_bc contributes the convective upwind flux on boundary facets.

def a_bc(u, v, bid, g):
    ures = u - g if g else u
    return (
           - 2/Re * inner(sym(grad(u)), outer(v, n))*ds(bid)
           - 2/Re * inner(outer(ures, n), sym(grad(v)))*ds(bid)
           + 1/Re * (sigma/h) * inner(ures, v)*ds(bid)
           )

def c_bc(u, v, bid, g):
    uflux_ext = 0.5*(dot(u, n) + abs(dot(u, n)))*u
    if g:
        uflux_ext += 0.5*(dot(u, n) - abs(dot(u, n)))*g
    return inner(uflux_ext, v)*ds(bid)

We loop over the Dirichlet boundary conditions, adding the weak boundary terms for each marked wall, and separately handle any remaining exterior facets with a zero-inflow flux.

exterior_markers = set(mesh.exterior_facets.unique_markers)
for bc in bcs:
    g = bc.function_arg
    bid = bc.sub_domain
    if isinstance(bid, Iterable):
        [exterior_markers.remove(_) for _ in bid]
    else:
        exterior_markers.remove(bid)
    F += a_bc(u, v, bid, g)
    F += c_bc(u, v, bid, g)
for bid in exterior_markers:
    F += c_bc(u, v, bid, None)

To construct the augmented Lagrangian preconditioner, we first add the penalty term and add the pressure mass matrix weighted by \(\gamma^{-1}\), and then we differentiate to obtain the preconditioning bilinear form Jp from which PETSc will extract \(A_\gamma\) and \(-\gamma^{-1}Q\) required for the Schur factorization.

Fp = F + inner(div(u)*gamma, div(v))*dx - inner(p/gamma, q)*dx
Jp = derivative(Fp, w)

Solver¶

The outer solver is FGMRES (flexible GMRES is needed because the preconditioner is nonlinear—it contains inner iterative solves) preconditioned by a full Schur factorisation fieldsplit. We need to swap the ordering of the fields in order to set the velocity to be the variable that is back-substituted. This is key to ensure that \(A_\gamma^{-1}\) is only applied once per Krylov iteration.

The velocity block applies a full-cycle geometric multigrid preconditioner to the augmented Lagrangian. At each multigrid level, five steps of GMRES are applied, preconditioned by the additive Schwarz method with vertex-star patches (ASMStarPC). A star patch around a vertex consists of all cells sharing that vertex; these patches together stably partition the divergence-free subspace, ensuring that the smoother captures the kernel of \(\nabla \cdot\) as required by Schöberl’s theory [Schoberl99]. The coarse-grid problem is solved exactly with LU factorisation.

The pressure block inverts the pressure mass matrix. For the integral-variant DG space the mass matrix is diagonal, so Jacobi is exact.

sp = {
    'mat_type': 'matfree',
    'snes_monitor': None,
    'snes_converged_reason': None,
    'snes_max_it': 20,
    'snes_atol': 1e-8,
    'snes_rtol': 1e-12,
    'snes_stol': 1e-06,
    'ksp_type': 'fgmres',
    'ksp_converged_reason': None,
    'ksp_monitor_true_residual': None,
    'ksp_max_it': 30,
    'ksp_atol': 1e-08,
    'ksp_rtol': 1e-10,
    'pc_type': 'fieldsplit',
    'pc_fieldsplit_type': 'schur',
    'pc_fieldsplit_schur_factorization_type': 'full',
    'pc_fieldsplit_0_fields': 1,
    'pc_fieldsplit_1_fields': 0,
    'fieldsplit_ksp_type': 'preonly',
    'fieldsplit_0_pc_type': 'jacobi',
    'fieldsplit_1': {
        'pc_type': 'python',
        'pc_python_type': 'firedrake.AssembledPC',
        'assembled': {
           'pc_use_amat': False,
           'pc_type': 'mg',
           'pc_mg_type': 'full',
           'mg_coarse_mat_type': 'aij',
           'mg_coarse_pc_type': 'lu',
           'mg_coarse_pc_factor_mat_solver_type': 'mumps',
           'mg_coarse_mat_mumps_icntl_14': 1000,
           'mg_levels': {
               'ksp_convergence_test': 'skip',
               'ksp_max_it': 5,
               'ksp_type': 'gmres',
               'pc_type': 'python',
               'pc_python_type': 'firedrake.ASMStarPC',
           },
        },
    },
}

Solving over a range of Reynolds numbers¶

We perform continuation in Reynolds number, using as initial guess the converged solution from the previous \(\mathrm{Re}\). We report the total Krylov iterations and the average per Newton step, which should remain nearly constant as \(\mathrm{Re}\) grows. As a diagnostic, we also print \(\|\nabla \cdot u\|_{L^2}\). Since the discretisation is exactly incompressible, this should remain close to machine precision.

(u_, p_) = w.subfunctions
u_.rename("Velocity")
p_.rename("Pressure")
pvd = VTKFile("output/navier_stokes.pvd")

problem = NonlinearVariationalProblem(F, w, bcs, Jp=Jp)
solver = NonlinearVariationalSolver(problem, solver_parameters=sp, pre_apply_bcs=False)

for Re_ in [1, 500]  + list(range(1000, 5001, 1000)):
    Re.assign(Re_)

    # Solve
    print(BLUE % f"Solving for Re = {Re_}")
    solver.solve()

    # Diagnostics
    linear_its = solver.snes.getLinearSolveIterations()
    nonlinear_its = solver.snes.getIterationNumber()
    print(f"  Krylov iterations: {linear_its} ({linear_its/nonlinear_its:.1f} per Newton step)")
    print(f"  ||div u||: {norm(div(u_), 'L2'):.2e}")
    pvd.write(u_, p_)

Across the whole range from \(\mathrm{Re} = 1\) to \(\mathrm{Re} = 5000\), the average Krylov iterations per Newton step remain in the range 4.4–6.5, confirming the Reynolds-robustness of the preconditioner.

A python script version of this demo can be found here.

References

[BO06]

Michele Benzi and Maxim A. Olshanskii. An augmented Lagrangian-based approach to the Oseen problem. SIAM Journal on Scientific Computing, 28(6):2095–2113, 2006. doi:10.1137/050646421.

[ESW14] (1,2)

Howard Elman, David Silvester, and Andy Wathen. Finite elements and fast iterative solvers. Oxford University Press, second edition edition, 2014.

[FMSW21a]

Patrick E. Farrell, Lawrence Mitchell, L. Ridgway Scott, and Florian Wechsung. A Reynolds-robust preconditioner for the Scott–Vogelius discretization of the stationary incompressible Navier–Stokes equations. SMAI Journal of Computational Mathematics, 7:75–96, 2021. doi:10.5802/smai-jcm.72.

[FMSW21b]

Patrick E. Farrell, Lawrence Mitchell, L. Ridgway Scott, and Florian Wechsung. Robust multigrid for nearly incompressible elasticity using macro elements. IMA Journal on Numerical Analysis, 2021. doi:10.1093/imanum/drab083.

[FMW19] (1,2)

Patrick E. Farrell, Lawrence Mitchell, and Florian Wechsung. An augmented Lagrangian preconditioner for the 3D stationary incompressible Navier–Stokes equations at High Reynolds Number. SIAM Journal on Scientific Computing, 41(5):A3073–A3096, 2019. doi:10.1137/18M1219370.

[FG83] (1,2,3)

Michel Fortin and Roland Glowinski. Augmented Lagrangian Methods: Applications to the Numerical Solution of Boundary-Value Problems. North-Holland, 1983.

[HKXZ15]

Qingguo Hong, Johannes Kraus, Jinchao Xu, and Ludmil Zikatanov. A robust multigrid method for discontinuous Galerkin discretizations of Stokes and linear elasticity equations. Numerische Mathematik, 132(1):23–49, 2015. doi:10.1007/s00211-015-0712-y.

[JLM+17]

Volker John, Alexander Linke, Christian Merdon, Michael Neilan, and Leo G. Rebholz. On the divergence constraint in mixed finite element methods for incompressible flows. SIAM Review, 59(3):492–544, 2017. doi:10.1137/15m1047696.

[MTW02]

Kent-Andre Mardal, Xue-Cheng Tai, and Ragnar Winther. A robust finite element method for Darcy–Stokes flow. SIAM Journal on Numerical Analysis, 40(5):1605–1631, 2002. doi:10.1137/s0036142901383910.

[Schoberl99] (1,2,3)

Joachim Schöberl. Robust multigrid methods for parameter dependent problems. PhD thesis, Johannes Kepler Universität Linz, 1999.

[SW94]

David Silvester and Andrew J. Wathen. Fast iterative solution of stabilised Stokes systems. Part II: Using general block preconditioners. SIAM Journal on Numerical Analysis, 31(5):1352–1367, 1994. doi:10.1137/0731070.