Improving the convergence properties of the moving-mesh code AREPO

Abstract

Accurate numerical solutions of the equations of hydrodynamics play an ever more important role in many fields of astrophysics. In this work, we reinvestigate the accuracy of the moving-mesh code arepo and show how its convergence order can be improved for general problems. In particular, we clarify that for certain problems arepo only reaches first-order convergence for its original formulation. This can be rectified by simple modifications we propose to the time integration scheme and the spatial gradient estimates of the code, both improving the accuracy of the code. We demonstrate that the new implementation is indeed second-order accurate under the L¹ norm, and in particular substantially improves conservation of angular momentum. Interestingly, whereas these improvements can significantly change the results of smooth test problems, we also find that cosmological simulations of galaxy formation are unaffected, demonstrating that the numerical errors eliminated by the new formulation do not impact these simulations. In contrast, simulations of binary stars followed over a large number of orbital times are strongly affected, as here it is particularly crucial to avoid a long-term build up of errors in angular momentum conservation.

hydrodynamics, methods: numerical, galaxy: formation

1 INTRODUCTION

Gravity and hydrodynamics provide the foundations for describing almost all phenomena in astrophysics. Often, the dynamics and geometry are sufficiently complicated that analytic solutions, especially for hydrodynamics, cannot be obtained. These limitations can be overcome by numerical approaches, which have developed over the recent decades into powerful tools for studying complex hydrodynamical flow problems. It is however an ongoing and persistent challenge to derive ever better discretization schemes that reduce numerical discretization errors to a minimum, while at the same time offering high adaptivity to different time- and length-scales and a high degree of parallel scalability. Astrophysical code development is therefore best viewed as an iterative effort which never is fully complete and finished. Rather, new generations of numerical ‘instruments’ (i.e. codes) should ideally improve their accuracy and speed, somewhat similar in spirit to the advances regularly realized in observational instrumentation.

Traditionally, stationary mesh codes with or without adaptive mesh refinement (e.g. Fryxell et al. 2000; Teyssier 2002; Stone et al. 2008; Almgren et al. 2010; Bryan et al. 2014) and smoothed particle hydrodynamics (SPH) codes (e.g. Wadsley, Stadel & Quinn 2004; Springel 2005) have dominated astrophysical fluid dynamics. Moving meshes and hybrid techniques combining Eulerian and Lagrangian meshes have been applied only rarely in astrophysics and related fields (e.g. Hirt, Amsden & Cook 1974; Gnedin 1995; Pen 1998). However, in particular in astrophysics they have not seen widespread use so far.

Recently, moving-mesh techniques based on a Voronoi mesh have been proposed (Springel 2010; Duffell & MacFadyen 2011), which offer a quasi-Lagrangian description that inherits some of the advantages of SPH while retaining the accuracy of a Eulerian mesh-based description. These new methods have matured into interesting alternative codes that have already been widely applied in cosmological simulations of structure formation (e.g. Sijacki et al. 2012; Vogelsberger et al. 2012, 2014; Marinacci, Pakmor & Springel 2014; Genel et al. 2014), as well as in first star formation (e.g. Greif et al. 2011, 2012; Smith et al. 2014), and to a smaller extent in stellar astrophysics (Pakmor et al. 2013) and in problems related to planet formation (Duffell & MacFadyen 2012; Muñoz et al. 2014, 2015). It has become clear that it can be very worthwhile to accept the technical complications introduced by a dynamic and unstructured mesh given the benefits realized by combining most of the advantages of fixed mesh codes (e.g. better convergence for smooth flows, good shock capturing) and SPH techniques (e.g. intrinsic adaptivity, lack of advection errors, small numerical diffusion).

A well-defined and rapid convergence rate is a particularly important feature of mesh codes, which are usually constructed to be at least second-order accurate for smooth flows, i.e. in the absence of shocks or other discontinuities. For SPH codes, in contrast, it is very hard to demonstrate proper convergence in the first place (Zhu, Hernquist & Li 2015). In practice, the convergence rate dictates how rapidly a numerical solution improves as more computational effort is invested, an aspect that is arguably even more important than the absolute size of the error itself. For example, while SPH often obtains a qualitatively correct result with acceptable error at low resolution, its poor convergence rate does not allow it to reach comparable accuracy to mesh codes at comparable numerical cost once the resolution is high. In contrast state of the art mesh-based codes are expected to show at least second-order convergence.

Moving meshes in general have been studied extensively (see e.g. Thomas & Lombard 1979; Guillard & Farhat 2000). In this paper, we re-examine the accuracy and convergence rate of the arepo code (Springel 2010), which is at present the most widely employed implementation of the moving-mesh technique in astrophysics. We address weaknesses in the original version of this code and show how they can be overcome to improve the overall accuracy of the scheme. This directly affects the range of applicability of the code and is hence important for further maturing the moving-mesh approach in general.

The paper is structured as follows. In Section 2, we concisely review the essential parts of the implementation of moving-mesh hydrodynamics in arepo and analyse its convergence. In Sections 3 and 4, we describe improvements to the time integration scheme and the gradient estimate, respectively. We apply the new implementation to test problems in Section 5, show results for realistic applications in Section 6, and discuss the implications of our work in Section 7.

2 MOVING-MESH HYDRODYNAMICS IN arepo

The moving-mesh code arepo solves the Euler equations using the finite-volume approach on an unstructured Voronoi mesh that is generated from a set of points (Springel 2010). At any given time during the simulation, the mesh can be uniquely constructed given only the positions of the mesh-generating points. Since the mesh-generating points move, the geometry of the mesh and its connectivity change over time.

The Euler equations can be written as a system of hyperbolic conservation laws for conserved quantities |${\boldsymbol U}$| and a flux function |${\boldsymbol F}({\boldsymbol U})$|⁠,

\begin{equation} \frac{\mathrm{\partial} {\boldsymbol U}}{\mathrm{\partial} t} + \nabla \cdot {\boldsymbol F} = 0. \end{equation}

(1)

For the standard Euler equations the conserved quantities are mass, momentum, and energy. They define a vector of conserved quantities and an associated flux function as

\begin{equation} {\boldsymbol U} = \left(\begin{array}{c}\rho \\ \rho {\boldsymbol v} \\ \rho e \end{array} \right), \ \ \ \ \ \ \ {\boldsymbol F}({\boldsymbol U}) = \left(\begin{array}{c}\rho {\boldsymbol v} \\ \rho {\boldsymbol v} {\boldsymbol v}^T + P \\ \rho e {\boldsymbol v} + P {\boldsymbol v} \end{array} \right), \end{equation}

(2)

where ρ, |${\boldsymbol v}$|⁠, P, and e are density, velocity, pressure, and total specific energy of the fluid, respectively. The latter is defined as the sum of the specific internal energy u and the specific kinetic energy |$\frac{1}{2} {\boldsymbol v}^2$|⁠, thus |$e = u + \frac{1}{2} {\boldsymbol v}^2$|⁠. The system of equation is closed by an equation of state P = ρ(γ − 1)e with the adiabatic index γ.

To solve the Euler equations, the state of the fluid is discretized using the cells of the Voronoi mesh. To this end, averages of the conserved quantities |${\boldsymbol U}$| of the fluid are computed for each cell through integration over the finite volume of a cell i,

\begin{equation} {\boldsymbol Q}_i = \int _{V_i} {\boldsymbol U} \, \mathrm{d}V, \end{equation}

(3)

yielding a census of the conserved physical quantities in the cell. This state is then evolved in time by

\begin{equation} {\boldsymbol Q}_i^{n+1} = {\boldsymbol Q}_i^{n} - \Delta t \sum _j A_{ij} \boldsymbol {\hat{F}}_{ij}^{n+1/2}, \end{equation}

(4)

where A_ij is the oriented area of the face between cells i and j and |$\boldsymbol {\hat{F}}$| is a time-averaged approximation of the true flux |${\boldsymbol F}_{ij}$| across the interface between cells i and j.

The calculation of the fluxes |$\boldsymbol {\hat{F}}$| is done in the original version of arepo using a MUSCL-Hancock scheme (van Leer 1984; Toro 1999), which has been shown to provide second-order accuracy in time and space for all kinds of stationary meshes (see e.g. Fryxell et al. 2000). This method uses a slope-limited piece-wise linear spatial reconstruction step in each cell and a first-order time extrapolation of the fluid states by half a timestep to obtain the states on both sides of all interfaces. Finally, a Riemann solver uses the states on both sides of an interface to compute the flux that is exchanged during the timestep.

The extrapolation is carried out in primitive variables

\begin{equation} {\boldsymbol W} = \left(\begin{array}{c}\rho \\ {\boldsymbol v} \\ P \end{array} \right), \end{equation}

(5)

which are straightforwardly calculated from the conserved quantities and the geometry of a cell with the equation of state, combined with estimates for their local spatial gradients. By construction, they represent the values at the centre of mass of a cell. Then, the left and right states of an interface are computed by a linear spatial extrapolation from the centre of mass |$\boldsymbol {s}$| of a cell to the centre |$\boldsymbol {f}$| of a cell face, and by a half-step prediction forward in time (where Δt is the full timestep) as

\begin{equation} {\boldsymbol W}^{\prime }_{\mathrm{L,R}} = {\boldsymbol W}_{\mathrm{L,R}} + \left. \frac{\mathrm{\partial} {\boldsymbol W}}{\mathrm{\partial} {\boldsymbol r}} \right|_{\mathrm{L,R}} \left( \boldsymbol {f} - \boldsymbol {s}_\mathrm{L,R} \right) + \left. \frac{\mathrm{\partial} {\boldsymbol W}}{\mathrm{\partial} {\boldsymbol t}} \right|_{\mathrm{L,R}} \frac{\Delta t}{2}. \end{equation}

(6)

The time derivatives of the primitive variables in equation (6) are not calculated directly. Instead, the Euler equations are used to express them in terms of the primitive variables and their spatial derivatives only:

\begin{eqnarray} \frac{ \mathrm{\partial} \rho }{ \mathrm{\partial} t } &=& - {\boldsymbol v} \nabla \rho - \rho \nabla {\boldsymbol v}, \end{eqnarray}

(7)

\begin{eqnarray} \frac{ \mathrm{\partial} {\boldsymbol v} }{ \mathrm{\partial} t } &=& - \frac{\nabla P}{ \rho } - {\boldsymbol v} \nabla {\boldsymbol v}^T, \end{eqnarray}

(8)

\begin{eqnarray} \frac{ \mathrm{\partial} P }{ \mathrm{\partial} t } &=& - \gamma P \nabla {\boldsymbol v} - {\boldsymbol v} \nabla P. \end{eqnarray}

(9)

The extrapolation and computation of the fluxes over an interface are all done in the rest frame of the (moving) interface, i.e. the interface velocity is subtracted from the equations above. This allows solutions that are manifestly Galilean-invariant, an important conceptual advantage over traditional Eulerian methods where the numerical truncation error grows with the fluid velocity.

In the original implementation of arepo, the spatial gradients are calculated using an improved Green–Gauss estimator that makes use of certain mathematical properties of the Voronoi mesh. Specifically, the gradient estimate for a primitive variable ϕ in cell i is given by

\begin{equation} \left< \nabla \phi \right>_i = \frac{1}{V_i} \sum _j A_{ij} \left( \left[ \phi _j - \phi _i \right] \frac{ {\boldsymbol c}_{ij} }{ r_{ij} } - \frac{ \phi _i + \phi _j }{2} \frac{ {\boldsymbol r}_{ij} }{ r_{ij} } \right), \end{equation}

(10)

where

\begin{equation} {\boldsymbol c}_{ij} = \frac{1}{A_{ij}} \int _{A_{ij}} \left( {\boldsymbol r} - \frac{{\boldsymbol r}_i + {\boldsymbol r}_j}{2} \right) \mathrm{d}A \end{equation}

(11)

is the centre of mass of the face between i and j, and |${\boldsymbol r}_{ij} = {\boldsymbol r}_i - {\boldsymbol r}_j$| is the difference between the positions of the mesh-generation points of cells i and j, and |$r_{ij} = | {\boldsymbol r}_{ij} |$| is its length. The sum runs over all cells that share an interface with cell i. Note that this estimate of the spatial gradient of a primitive variable depends only on the values of the primitive variable in the cell itself and its direct neighbours, and on the local geometry of the cell. If the values of the primitive variables sample an underlying smooth field at the positions of the mesh-generating points, then the gradient estimate is second-order accurate for an arbitrary mesh geometry, i.e. it reproduces a constant gradient in the field to machine precision. However, the primitive variables |${\boldsymbol Q}$| used to compute the gradient estimates represent volume-averaged quantities rather than the value of the fluid at the mesh-generating point of a cell.

In the original arepo implementation of Springel (2010), all computations that require geometrical quantities, e.g. volumes, areas, or positions of face centroids, use the geometry of the mesh at the beginning of the timestep. Nevertheless, it has been argued that the scheme is second-order accurate, based on simple sound-wave tests (Springel 2010) using the L¹ norm, and based on the much more-demanding stationary isentropic vortex flow (Yee, Vinokur & Djomehri 2000), where the error decreases quadratically with increasing spatial resolution (Springel 2011).

In this paper, we define the L^p norm of a continuous field as

\begin{equation} L^p = \left( \frac{1}{V} \int _V \left| f \left({\boldsymbol r} \right) \right|^p \mathrm{d}V \right)^{1/p}, \end{equation}

(12)

where |$f \left({\boldsymbol r} \right)$| is, for example, the deviation of the density field from its analytical value at a position |${\boldsymbol r}$|⁠. For our finite-volume discretization this becomes

\begin{equation} L^p = \left( \frac{1}{V} \sum _{i=1}^{N_\mathrm{cells}} \left| f_i \right|^p V_i \right)^{1/p}. \end{equation}

(13)

The L¹ norm of the density field for the Yee vortex (for the detailed setup, see Appendix A) evolved until t = 10 with the original arepo code is shown in Fig. 1. In contrast to previous claims, the error only decreases linearly with resolution, i.e. arepo shows first-order convergence only. The difference to previous results (in particular Springel 2011) arises from an inappropriate definition of the L²-norm employed there.

L1 norm of the density field as a function of the number of resolution elements per dimension at t = 10 for the isentropic Yee vortex, simulated with the standard arepo implementation using MUSCL-Hancock time integration and the improved Green–Gauss gradient estimates.

Figure 1.

L¹ norm of the density field as a function of the number of resolution elements per dimension at t = 10 for the isentropic Yee vortex, simulated with the standard arepo implementation using MUSCL-Hancock time integration and the improved Green–Gauss gradient estimates.

Open in new tab Download slide

On static Cartesian meshes, we find that arepo is fully second-order convergent for smooth problems. On moving meshes, however, an inaccuracy is introduced since the time integration only uses the geometry of the mesh at the beginning of the timestep, ignoring changes of the mesh geometry during a timestep. One possibility to account for these changes in the time integration is to do an additional mesh construction at the middle of the timestep and use the geometrical properties at t = t₀ + Δt/2 to calculate the fluxes. However, this requires at least one further (expensive) mesh construction per timestep, nearly doubling the computational cost. Such an approach combined with a Runge–Kutta time integration scheme is followed in tess (Duffell & MacFadyen 2011).

3 RUNGE–KUTTA TIME INTEGRATION

Alternatively, one can abandon the MUSCL-Hancock scheme and try to use a different Runge–Kutta time integration scheme that avoids a mid-step mesh construction. Of particular interest for us is Heun's method, which is a second-order Runge–Kutta variant that calculates the flux as an average of the fluxes at the beginning and the end of the timestep. Applying Heun's method to our moving mesh leads to the following update equations of the conservative variables for a cell i

\begin{eqnarray} {\boldsymbol Q}^\prime _i = {\boldsymbol Q}^n_i - \Delta t \sum _j A^n_{ij} \boldsymbol {\hat{F}}_{ij}^{n} ( {\boldsymbol U}^n ) , \end{eqnarray}

(14)

\begin{eqnarray} {\boldsymbol r}^\prime &=& {\boldsymbol r}^n + \Delta t \, {\boldsymbol w}^n, \end{eqnarray}

(15)

\begin{eqnarray} {\boldsymbol Q}^{n+1}_i = {\boldsymbol Q}^n_i -\frac{\Delta t}{2} \, \left( \sum _j A^n_{ij} \boldsymbol {\hat{F}}_{ij}^{n} ( {\boldsymbol U}^n ) + \sum _j A^\prime _{ij} \boldsymbol {\hat{F}}_{ij}^{\prime } ( {\boldsymbol U}^\prime) \right)\! , \end{eqnarray}

(16)

\begin{eqnarray} {\boldsymbol r}^{n+1} = {\boldsymbol r}^n + \frac{\Delta t}{2} \, ( {\boldsymbol w}^n + {\boldsymbol w}^\prime). \end{eqnarray}

(17)

Here, the vectors |${\boldsymbol r}$| denote again the coordinates of the mesh-generating points, |${\boldsymbol w}$| their velocities, and |$\boldsymbol {\hat{F}}_{ij}$| is an approximation for the fluxes of the conserved variables over the interface between cells i and j. In principle it is possible to allow the velocities of the mesh-generating points to change during a timestep (Duffell & MacFadyen 2011), but we choose to always keep them constant over the course of a full timestep.

Note that this update rule also requires the geometry of two different meshes (⁠|${\boldsymbol r}^n$| and |${\boldsymbol r}^\prime$|⁠), and the fluxes have to be computed twice for every timestep. However, since for Heun's scheme and a constant velocity |${\boldsymbol w}^\prime = {\boldsymbol w}^n$| of the mesh-generating points over the whole timestep we can show that

\begin{equation} {\boldsymbol r}^{n+1} = {\boldsymbol r}^n + \frac{\Delta t}{2} \, ( {\boldsymbol w}^n + {\boldsymbol w}^\prime) = {\boldsymbol r}^n + \Delta t \, {\boldsymbol w}^n = {\boldsymbol r}^\prime , \end{equation}

(18)

we can reuse the mesh we constructed for the second half of the current timestep for the first half of the next timestep. Thus, we only need to construct the mesh effectively once per timestep, which keeps the mesh construction effort equal to arepo's original implementation. The fluxes need to be calculated twice as often, but the computational cost required for this is a relatively small effort compared to the 3D mesh construction needed in a moving-mesh code, and therefore does not increase the overall computational cost significantly.

In practice, computing the intermediate state |${\boldsymbol Q}^\prime$| through equation (14) is inconvenient for several reasons. In particular, it significantly complicates the internal bookkeeping of the conserved variables since we need to know |${\boldsymbol Q}^n_i$| and |${\boldsymbol Q}^\prime _i$| at the same time in equation (16). While this can be resolved in principle, further complications arise when one tries to consistently implement this update scheme for individual time steps that vary locally.

These problems can be circumvented in a simple way by extrapolating to the intermediate step using spatial derivatives, like in the previous MUSCL-Hancock scheme. This leads to an update of the conservative variables through

\begin{eqnarray} {\boldsymbol W}^\prime _i &=& {\boldsymbol W}^n_i + \Delta t \, \frac{\mathrm{\partial} {\boldsymbol W}}{\mathrm{\partial} {\boldsymbol t}}, \end{eqnarray}

(19)

\begin{eqnarray} {\boldsymbol r}^\prime &=& {\boldsymbol r}^n + \Delta t \, {\boldsymbol w}^n, \end{eqnarray}

(20)

\begin{eqnarray} {\boldsymbol Q}^{n+1}_i = {\boldsymbol Q}^n_i -\frac{\Delta t}{2} \, \left( \sum _j A^n_{ij} \boldsymbol {\hat{F}}_{ij}^{n} ( {\boldsymbol W}^n ) + \sum _j A^\prime _{ij} \boldsymbol {\hat{F}}_{ij}^{\prime } ( {\boldsymbol W}^\prime) \right)\!, \end{eqnarray}

(21)

\begin{eqnarray} &&{{\boldsymbol r}^{n+1} = {\boldsymbol r}^\prime . } \end{eqnarray}

(22)

Here, we only need to do the time extrapolation for the primitive variables, and we again use the Euler equations to replace time derivatives with spatial derivatives. Note that this update scheme can be easily generalized to local individual time steps. For the calculation of the fluxes between two cells on different time steps, the time extrapolation (equation 19) is in this case done for each cell always from the last time the cell was active. At this time the estimates for its primitive variables and gradients were obtained. Similarly, the linear spatial extrapolation to the current centre of the interface is always done from the centre of mass for which primitive variables and their gradients have been calculated. Note, that with the time extrapolation to the intermediate step our scheme is not a Runge–Kutta scheme anymore, but becomes a hybrid between Runge–Kutta and MUSCL-Hancock schemes.

4 LEAST-SQUARE GRADIENT ESTIMATES

Another source of accuracy degradation in the original arepo implementation lies in the quality of the gradient estimates, which can suffer in general problems. The issue appears when a cell is distorted and its mesh-generating point deviates significantly from the position of the centre of mass of the cell. In this case, the Voronoi-optimized Green–Gauss estimate of equation (10) introduces a systematic error, because it assumes that the primitive variables are known at the mesh-generating points, whereas, by construction, the primitive variables represent volume-averaged quantities that represent the value of the underlying field at the cell's centre of mass. Although the small steering motions added by arepo to the velocities of the mesh-generating points usually manage to keep the mesh nicely regular with only a small offset between the mesh-generating point and the centre of mass of a cell, the typical distance between these two points can amount to a few per cent of the radius of the cell, enough to significantly degrade the accuracy of the gradient estimate in some situations.

We investigate this in Fig. 2 explicitly, where we show that such a small deviation already compromises the accuracy of the gradient estimate, spoiling its convergence rate for higher resolutions. In fact, a one per cent deviation of the mesh-generating points from the centres of mass in an almost Cartesian mesh already stops the improvement of the gradient estimate with resolution for resolutions better than 320² cells. Even worse, for a completely random mesh in which the mesh-generating points are a Poisson sample of the simulation domain the gradient estimate does not improve with resolution at all. Note, however, that the hydro scheme still converges with first order, even though the error in the gradient estimate is constant.

L1 error norm of the Voronoi-optimized Green–Gauss gradient estimate of the density field for different types of meshes for the initial state of the Yee vortex at t = 0.

Figure 2.

L¹ error norm of the Voronoi-optimized Green–Gauss gradient estimate of the density field for different types of meshes for the initial state of the Yee vortex at t = 0.

Open in new tab Download slide

To more robustly reach second-order convergence of the hydrodynamical scheme, we therefore need better gradient estimates that are correct to at least first order for severely distorted meshes where centre-of-mass values are known, equivalent to requiring that linear gradients are still reproduced exactly in this case.

Assuming that we know the local geometry of the mesh and the values of a quantity ϕ at the centroids of the cells, we can obtain such a gradient estimate through a local least-squares fit. We note that polynomial least-squares reconstructions are well known for unstructured meshes, and form a standard technique in particular for high-order ENO and WENO schemes (e.g. Ollivier-Gooch 1997). Also, such methods are well known for the estimation of gradients (e.g. Maron & Howes 2003). They have also been recently used for SPH and in new variants of mesh-less methods (Hopkins 2015) and to compute the cell-centred magnetic fields in constrained transport schemes for unstructured meshes (Mocz, Vogelsberger & Hernquist 2014). The method assumes that the quantity ϕ can be approximated everywhere within cell i as

\begin{equation} \phi \left( {\boldsymbol r} \right) = \phi \left( {\boldsymbol s}_i \right) + \left< \nabla \phi \right>_i ( {\boldsymbol r} - {\boldsymbol s}_i ). \end{equation}

(23)

The value of ϕ at the centre of mass of the cell |$\phi \left( {\boldsymbol s}_i \right) \equiv \phi _i$| is known as a pre-requisite, and we are now looking for an estimate of the gradient 〈∇ϕ〉 _i. We also know ϕ at the centres of mass of all neighbouring cells j, so we can require that our gradient estimate reproduces those ϕ_j as well as possible. Thus, for every neighbouring cell, we would like to have

\begin{equation} \phi _j = \phi _i + \left< \nabla \phi \right>_i \left( {\boldsymbol s}_j - {\boldsymbol s}_i \right). \end{equation}

(24)

For N neighbouring cells this is an overdetermined set of N equations with only up to three free variables (or two in 2D). We select the best linear approximation for 〈∇ϕ〉 _i by requiring that it minimises the sum of the deviations for all neighbours,

\begin{equation} S_\mathrm{{tot}} = \sum _j g_j \left( \phi _j - \phi _i - \left< \nabla \phi \right>_i \left( {\boldsymbol s}_j - {\boldsymbol s}_i \right) \right)^2. \end{equation}

(25)

Here, g_j is the relative weight for neighbour j. Different choices for these weights are possible, but according to our experiments the results are not very sensitive to the particular choice made. We follow de Vasconcellos & da Silva 2014 and set it to |$g_j = A_{ij} / | \textbf{s}_j - \textbf{s}_i |^2$|⁠.

To minimize S_tot, we use the normal equations which yield

\begin{eqnarray} \sum _j g_j {\boldsymbol n}_{ji} \otimes {\boldsymbol n}_{ji} \langle \nabla \phi \rangle_i | {\boldsymbol s}_j - {\boldsymbol s}_i|^2 =\sum _j g_j (\phi _j - \phi _i) {\boldsymbol n}_{ji} | {\boldsymbol s}_j - {\boldsymbol s}_i|,\nonumber\\ \end{eqnarray}

(26)

where |${\boldsymbol n}_{ji} = ( {\boldsymbol s}_j - {\boldsymbol s}_i ) / | {\boldsymbol s}_j - {\boldsymbol s}_i |$|⁠. This corresponds to a 2 × 2 or 3 × 3 matrix inversion problem in two- or three-dimensional problems, respectively, which can be solved by an appropriate solver to obtain 〈∇ϕ〉 _i.

The accuracy of the new least-squares-fit gradient estimate is shown in Fig. 3. On the nearly Cartesian mesh examined earlier, the gradient estimate is as accurate as the Voronoi-optimized Green–Gauss estimate. For increasing deviations from the Cartesian mesh, the accuracy of the least-squares gradient estimate slowly deteriorates, but importantly, it never becomes worse than first order. In particular, even for the random mesh it still accurately recovers the linear gradient, unlike the Voronoi-optimized Green–Gauss estimate.

L1 norm of the least-square-fit gradient estimate of the density field for different types of meshes for the initial state of the Yee vortex at t = 0.

Figure 3.

L¹ norm of the least-square-fit gradient estimate of the density field for different types of meshes for the initial state of the Yee vortex at t = 0.

Open in new tab Download slide

5 TEST PROBLEMS

To examine the difference between the original arepo implementation that employs a MUSCL-Hancock time integration scheme (MH) and the Voronoi-adjusted Green–Gauss gradient estimate (GG) versus the Runge–Kutta time integrator (RK) combined with the least-squares gradient estimator (LSF), we compare these schemes for several representative test problems. In obtaining the results for the different implementations, we always use identical code configurations, parameters, and initial conditions, and only vary the time integration method and/or gradient estimate.

All test problems in this paper are run using a Courant factor of 0.3 to determine the timestep and employ the same strategy and parameters to evolve the mesh while keeping it regular. To regularize the mesh, the mesh-generating points are moved towards the centre of mass of their cell. Their total velocity is calculated as

\begin{equation} {\boldsymbol v}_{\mathrm{vertex}} = {\boldsymbol v} -\frac{1}{2} \Delta t \frac{\nabla P}{\rho } + {\boldsymbol v}_{\mathrm{reg}}, \end{equation}

(27)

where |${\boldsymbol v}$|⁠, P, ρ, Δt, and |${\boldsymbol v}_{\mathrm{reg}}$| are the fluid velocity, pressure, and density, timestep of the cell, and a regularization component that is added purely to improve the mesh. It is given by

\begin{equation} {\boldsymbol v}_{\mathrm{reg}} = \epsilon \max \left( c, r\, \left| \nabla \times {\boldsymbol v} \right| \right) \frac{ {\boldsymbol d} }{ \left| {\boldsymbol d} \right| }. \end{equation}

(28)

Here, c is the sound speed in the cell, r the approximate extent of the cell calculated from its area (volume) in 2D (3D) assuming the cell is a circle (sphere), and |${\boldsymbol d} = {\boldsymbol r} - {\boldsymbol s}$| is the separation between mesh-generating point and centre of mass of the cell. Note that this separation is interesting in its own right, as it can be used to quantify the regularity of the mesh. Moreover, ϵ is defined as

\begin{equation} \epsilon = \max \left( 0, \min \left( 0.5, 0.5\, \frac{\alpha - 0.75 \beta }{0.25 \beta } \right) \right) \end{equation}

(29)

and α is given by

\begin{equation} \alpha = \max _i \left( \frac{1}{N_{\mathrm{dim}}} \frac{2 A_i}{\left| {\boldsymbol r} - {\boldsymbol r}_i \right|} \right), \end{equation}

(30)

where the maximum runs over all neighbours of a cell and their interfaces connecting a cell to them. We choose the parameter β to be 2.25 in all simulations in this paper.

5.1 Yee vortex

We first re-evaluate the convergence rate for the isentropic Yee vortex. Since this vortex flow is completely smooth, we in principle expect to see second-order convergence in the strong L¹ norm if the implementation is correct. As shown in Fig. 4, we indeed obtain second-order convergence up to very high resolution for arepo in the moving-mesh case if we use both the least-squares gradient estimate and the Runge–Kutta time integration scheme. Using only one of the two improvements while keeping the MUSCL-Hancock integration scheme or the improved Green–Gauss gradient estimate, respectively, does not change the convergence rate at all and makes the implementation drop back to first-order convergence in the L¹ norm. The detailed setup including the initial mesh can be found in Appendix A.

L1 norm of the density field for the evolved Yee vortex at t = 10 when different schemes for the hydrodynamics are employed. For the detailed setup of the Yee vortex problem see Appendix A.

Figure 4.

L¹ norm of the density field for the evolved Yee vortex at t = 10 when different schemes for the hydrodynamics are employed. For the detailed setup of the Yee vortex problem see Appendix A.

Open in new tab Download slide

It is interesting to note that the tess code has been shown to be second-order accurate for an isentropic sound wave with a Runge–Kutta time integration scheme (Duffell & MacFadyen 2011), while the same Voronoi-optimized Green–Gauss gradient estimate as implemented originally in arepo has been used. With this gradient estimate, the convergence order is however expected to break down for meshes in which the mesh-generating points are not close to the centres of mass of their cells. The difference here is most likely that the one-dimensional propagation of the wave in this particular problem only stretches the mesh, without displacing the mesh-generating points from the centres of mass of their cells. As discussed before, in this special case the Voronoi-optimized gradient estimate is sufficient and does not negatively impact the convergence properties. For the more demanding Yee vortex this is not the case, however, and the convergence is expected to break down in tess just as in the original arepo code.

Another interesting observation to make is that second-order convergence in arepo for the new implementation as measured by computing the L¹ norm between the current density field and the analytical solution is only reached at very high resolution when the initial mesh-generating points are set up in rings around the centre of the vortex. If an initially Cartesian mesh is adopted instead, the convergence degrades to first order at sufficiently high resolution. As shown in Fig. 5 there is only a weak correlation between the density error in a cell and its distortion, as both are largest close to the centre of the vortex.

This is fundamentally caused by the discretization of the analytical problem on to our mesh. There are two discretizations involved, a first discretization on the initial mesh to generate the discretized initial conditions and a second discretization on the current mesh when we measure the L¹ norm. If the structure of the mesh changes systematically between the initial and the current mesh, there is also a systematic difference in the two discretizations, which turns out to lead to a first-order error that dominates the total error at sufficiently high resolution. Because an initially Cartesian mesh will eventually be transformed to a spherical mesh by the mesh regularization, the measured L¹ norm after the mesh changed will not be better than first order at high resolution. To overcome this problem, we arrange the mesh-generating points in the initial mesh already on circles around the centre of the vortex. This mesh configuration is stable for the problem, thus there is no systematic change in the discretization over time.

Another approach is to look at error measures intrinsic to the discrete problem that do not require a second discretization of the analytical problem. Analysing the conservation of total angular momentum (which also may be viewed as a norm) instead of the L¹ norm of the total density field is therefore considerably simpler and also more robust. In particular, here the measured convergence rate is independent of the initial mesh, and the re-arrangement from an initially Cartesian to a circular mesh does not give rise to additional errors. As shown in Fig. 6 , the angular momentum conservation is again only improved if both, a better time integration method and a more accurate gradient estimate, are used. Interestingly, the angular momentum even shows third-order convergence in the new implementation, although this may be the result of the special spherical symmetry of the vortex flow and may be lost for more general problems.

Figure 6.

Conservation of total angular momentum for the Yee vortex at t = 10 when different hydrodynamical schemes are used.

Open in new tab Download slide

Figure 5.

Density error compared to the analytical solution (top panel) and relative separation between mesh-generating point and centre of mass (bottom panel) at t = 100 for the RKLSF Yee vortex.

Open in new tab Download slide

5.2 Keplerian disc

Evolving a cold Keplerian disc for many orbits is a common problem in astrophysics, with applications from planetary to galactic discs. Using such a set-up in the limit of a pressure-less gas disc is a demanding test problem (Cullen & Dehnen 2010; Hopkins 2015). In fact, as highlighted by Hopkins (2015), many state-of-the-art static mesh or SPH codes have severe problems in coping accurately with this situation. Instead, the disc is typically destroyed during the first 10 orbits by these methods.

The original arepo code with the MUSCL-Hancock time integration and improved Green–Gauss gradients already does quite well on this problem, as shown in Fig. 7 where we display the disc after roughly 15 inner orbits with a resolution of 320 × 320 cells. However, whereas the outer parts of the disc are very stable, the inner boundary of the disc moves outwards with time owing to systematic errors in angular momentum conservation. Our new implementation with Runge–Kutta time integration and least-squares gradients works significantly better, keeping the disc essentially perfectly stable until t = 100, and showing similar errors only much later, at times t ≃ 600 or later. These residual errors can now be much more efficiently improved to essentially arbitrary precision by an increase of the spatial resolution, thanks to the improved convergence order. This is in line with previous results on conservation of angular momentum in arepo for more realistic problems (Muñoz et al. 2015).

Figure 7.

Evolution of the surface density of a two-dimensional cold Keplerian disc. The top panels show the initial conditions and the evolved state at t = 100 obtained with the original arepo implementation based on the improved Green–Gauss gradient estimate and the MUSCL-Hancock time integration. The bottom panels show the new implementation with least-squares gradient estimate and Runge–Kutta time integration at t = 100 and t = 600, respectively. At the latter time, the inner disc has finished more than 250 orbits. The detailed setup can be found in Appendix B.

Open in new tab Download slide

These results compare very favourably not only to stationary Eulerian grid codes on Cartesian meshes, but also to the new mesh-free hydrodynamical method proposed by Hopkins (2015). Note, however, that the optimal mesh configuration for this specific problem is most likely a polar grid.

6 TESTING ON REALISTIC APPLICATIONS

6.1 Evolution of a stellar binary

In light of our previous results, it is interesting to investigate whether the improvements in the arepo code proposed here affect the results of real world scientific applications of the code. Compared to the Yee vortex and the Keplerian disc, which are smooth hydrodynamics-only problems, or hydrodynamical problems with a constant external gravitational field, one example of a more complex problem including discontinuities, shocks, and self-gravity is the inspiral and merger of a close binary system of two white dwarfs.

Such simulations are essential to understanding the complex evolution of the merger process and to determining the fate of the binary system (see e.g. Pakmor et al. 2013; Zhu et al. 2013; Dan et al. 2014). The initial mesh for the white dwarfs is constructed from shells that are tessellated using the healpix algorithm to obtain regular cells with roughly the same mass (Pakmor et al. 2012). The simulation box is then filled by a low-resolution Cartesian background grid. Fig. 8 shows the total angular momentum for the merger of two white dwarfs with masses of 0.65 M_⊙ and 0.625 M_⊙, an initial orbital period of 44 s, identical to the setup of the same system in Zhu et al. (2013). We employ a mass resolution of 10⁻⁶ M_⊙ (high resolution) and 5 10⁻⁶ M_⊙ (low resolution), respectively.

Figure 8.

Conservation of total angular momentum for the merger of two white dwarfs with an initial orbital period of 44 s for the MUSCL-Hancock time integration and improved Green–Gauss gradient estimate and the Runge–Kutta time integration combined with the least-squares gradient estimate.

Open in new tab Download slide

The binary merges at around t = 200 s, after initial mass transfer reduced the separation and lead to runaway mass transfer. It then forms a differentially rotating merger remnant. The differential rotation in the merger remnant is crucial for its further evolution. As shown in Fig. 8, conservation of angular momentum during the further evolution of the merger remnant is significantly violated for the implementation with MUSCL-Hancock time integration and improved Green–Gauss gradient estimates. The system loses a significant fraction of the total angular momentum present in the simulation. In contrast, the new implementation with Runge–Kutta time integration and least-squares gradient estimates conserves the total angular momentum in the simulation to a relative error of about one per cent even after many orbits for the high-resolution simulation and still loses less than 10 per cent of the total angular momentum in the low-resolution simulation.

6.2 Cosmological zoom simulation of galaxy formation

An even more complex problem are simulations of galaxy formation and evolution. They not only involve self-gravity, but also feature high Mach numbers and turbulent flows, as well as a large number of additional source terms to model effects like radiative cooling of gas or the energy injection of evolving stars. In addition, sometimes explicit subgrid models for unresolved physics are used that are introduced as a modified equation of state or pressure floors and the like.

To compare the performance of our new code with the original arepo version in this regime we have repeated one of the recent Milky Way galaxy formation simulations of Marinacci et al. (2014) and Pakmor et al. (2014). These are advanced calculations that can serve as an example for current state of the art simulations of cosmic structure formation (see e.g. Agertz, Teyssier & Moore 2011; Scannapieco et al. 2012; Stinson et al. 2013; Hopkins et al. 2014; Vogelsberger et al. 2014; Khandai et al. 2015; Schaye et al. 2015). As shown in the results overview of Fig. 9, there is no significant difference between the results obtained for both implementations.

Figure 9.

Comparison of two cosmological zoom runs of a galaxy similar to the Milky Way (as in Marinacci et al. 2014) using either the combination of MUSCL-Hancock time integration and improved Green–Gauss gradient estimate, or the combination of Runge–Kutta time integration and least-squares gradient estimate, respectively. Initial conditions, code configuration and parameters including all subgrid physics are identical for both runs and chosen in line with Marinacci et al. (2014) and Pakmor, Marinacci & Springel (2014). The top row shows, from left to right, stellar projections of the two runs at z = 0 and the evolution of the star formation rate in the disc of the main galaxy. The bottom row shows circularities of stars in the disc, evolution of the total mass of the halo compared to its stellar mass, and the average root mean square magnetic field strength in the disc for the two runs.

Open in new tab Download slide

A possible interpretation of the lack of differences for the cosmological runs is that the properties of galaxies and their dynamics are already captured sufficiently accurately by the standard arepo implementation. The accuracy improvements brought about by the new formulation proposed here are either irrelevant for this problem, or are completely dominated by first-order errors introduced by the subgrid models for radiative cooling, star formation, and feedback. The latter is particularly likely as it is now well understood that the outcome of galaxy formation simulations depends very sensitively on the treatment of highly non-linear feedback processes. Thus, in regions of the simulation that are crucially shaped by feedback we do not expect our improvements to the code to lead to significant changes of the results. Changing this to improve the solution requires implementing and coupling the source terms such that the combined system is second-order accurate. This may be different in regions where hydrodynamics and gravity are the only relevant phenomena. For example, conceivably some differences may be found in the approximately hydrostatic atmosphere of rich galaxy clusters, where our new formulation may result in a marginally better representation of turbulence.

7 CONCLUSIONS

In this paper, we have discussed in detail two simple modifications of the cosmological moving-mesh code arepo, which are nevertheless quite important to recover full second-order convergence in the L¹ norm for general smooth problems with non-trivial mesh motions. One of these changes concerns the time integration, where a Runge–Kutta time integration scheme is adopted instead of the MUSCL-Hancock approach in order to account for changes of the mesh geometry during a timestep at second order. This does not increase the number of mesh-constructions needed, but does double the number of required flux computations. The other change is the adoption of a more general (and slightly more expensive) gradient estimate that retains the necessary accuracy even for large offsets between the mesh-generating points and the centres of masses of cells.

These improvements are most relevant for smooth, pure hydrodynamics problems where the influence of self-gravity and of complicated source terms is limited. Among the astrophysical problems that fall into this regime and that have already been tackled with the original version of arepo are protoplanetary discs (Muñoz et al. 2014), cold gas in galactic discs (Smith et al. 2014), and dynamical stellar mergers (Pakmor et al. 2013). Such simulations and similar problems will benefit in the future from the added accuracy facilitated by the improvements proposed here. It is also prudent to carry out additional tests to see whether previous simulations were sometimes degradated in a noticeable way by an accuracy loss of the code, for example by an unnecessarily large error in the conservation of angular momentum. We expect such noticeable errors only for problems that evolve a system for many dynamical time-scales.

Cosmological simulations of galaxy formation seem to be unaffected by the improvements of the hydrodynamical moving-mesh scheme proposed here. This is of course reassuring as it means that previous results from galaxy formation studies carried out with arepo, both with zoom-in techniques (Marinacci et al. 2014; Pakmor et al. 2014) and in large cosmological boxes (Genel et al. 2014; Vogelsberger et al. 2014), have not suffered from the convergence rate issues discussed above. Nevertheless, it is clearly desirable to use the improved scheme in the future in this regime as well, especially since it offers higher accuracy at an insignificant increase of the computational cost.

This work has been supported by the European Research Council under ERC-StG grant EXAGAL-308037 and by the Klaus Tschira Foundation. VS and KS acknowledge support through subproject EXAMAG of the Priority Programme 1648 ‘Software for Exascale Computing’ of the German Science Foundation. Part of the simulations of this paper used the SuperMUC system at the Leibniz Computing Centre, Garching. PM acknowledges support by the National Science Foundation Graduate Research Fellowship under grant no. DGE-1144152. STO acknowledges financial support from Studienstiftung des deutschen Volkes. AB and KS acknowledge support by the IMPRS for Astronomy & Cosmic Physics at the University of Heidelberg.

REFERENCES

Agertz

Teyssier

Moore

2011

MNRAS

410

1391

Month:	Total Views:
November 2016	1
January 2017	1
February 2017	6
March 2017	14
April 2017	11
May 2017	18
June 2017	7
July 2017	11
August 2017	12
September 2017	9
October 2017	11
November 2017	8
December 2017	4
January 2018	6
February 2018	7
March 2018	5
April 2018	4
May 2018	5
June 2018	8
July 2018	2
August 2018	3
September 2018	8
October 2018	4
November 2018	4
December 2018	6
January 2019	21
February 2019	28
March 2019	19
April 2019	38
May 2019	24
June 2019	22
July 2019	19
August 2019	29
September 2019	33
October 2019	41
November 2019	25
December 2019	21
January 2020	24
February 2020	19
March 2020	18

Article Contents

Abstract

1 INTRODUCTION

2 MOVING-MESH HYDRODYNAMICS IN arepo

3 RUNGE–KUTTA TIME INTEGRATION

4 LEAST-SQUARE GRADIENT ESTIMATES

5 TEST PROBLEMS

5.1 Yee vortex

5.2 Keplerian disc

6 TESTING ON REALISTIC APPLICATIONS

6.1 Evolution of a stellar binary

6.2 Cosmological zoom simulation of galaxy formation

7 CONCLUSIONS

REFERENCES

APPENDIX A: YEE VORTEX

APPENDIX B: KEPLERIAN DISC

Citations

Views

Altmetric

Email alerts

Astrophysics Data System

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only