\documentclass[oneside]{amsart}

\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{color}
\usepackage{graphicx}
\usepackage[latin9]{inputenc}
% \usepackage{mathpazo}
\usepackage[hyphens]{url}

% Put this package last
\usepackage[bookmarks, bookmarksopen, bookmarksnumbered]{hyperref}
% Put this package after hyperref
\usepackage[all]{hypcap}
% Don't use tt font for urls
\urlstyle{rm}


% Make a comment stand out visually
\newcommand{\todo}[1]{{\color{blue}$\blacksquare$~\textsf{[TODO: #1]}}}

% Name of a code
\newcommand{\code}[1]{\texttt{#1}}


\hyphenation{Cac-tus-Ein-stein Schwarz-schild South-amp-ton}

\sloppypar


\begin{document}

\title[Domain Decomposition in Carpet]{Domain Decomposition in
  Carpet:\\Regridding and Determination of the Communication Schedule}

\author{Erik Schnetter$^{(1,2)}$}

% \email
\thanks{\emph{Email}:~\url{mailto:schnetter@cct.lsu.edu}}

% \urladdr
\thanks{\emph{Web}:~\url{http://www.cct.lsu.edu/}}

% \address
\thanks{{}$^{(1)}$ Center for Computation \& Technology,
  Louisiana State University, Baton Rouge, LA, USA}

\thanks{{}$^{(2)}$ Department of Physics and Astronomy,
  Louisiana State University, Baton Rouge LA, USA}


\date{March 9, 2008}


\begin{abstract}
  This text describes the algorithms that Carpet
  \cite{Schnetter-etal-03b, carpetweb} uses for regridding, domain
  decomposition, and setting up the communication schedule.  It is
  intended for readers interested in more details than a casual user
  would need.  This text explains the concepts that Carpet uses
  internally to describe the grid hierarchy and to ensure its
  consistency.
\end{abstract}

\maketitle


\section{Introduction}

Setting up a grid hierarchy (``regridding'') is in Carpet handled by
three different entities: \emph{Carpet} itself decides the extent of
the domain, the type of outer boundary conditions, and distributes the
domain onto processors; a \emph{regridding thorn} is responsible for
deciding the shape of the grid hierarchy, and \emph{CarpetLib} handles
the details and actually manages the data.  (Technically speaking, it
is the regridding thorn which has to do the domain decomposition;
however, it can simply call a convenient helper routine in Carpet for
this task.)

This separation leaves the decision on the shape of the grid hiearchy
to a thorn which can be replaced or augmented as necessary.  All
handling of data happens in CarpetLib, which is thus the only entity
which needs to be optimised for speed.  Finally, Carpet retains the
overview of the regridding process.

In the following we assume that there is a single patch (block) which
contains a mesh refinement hierarchy.  If there are multiple patches,
then each patch is conceptually handled independently.  Certain
conditions may have to be satisfied if a refined mesh touches an
inter-patch boundary, but these are not discussed here.

We assume that the domain has $D$ spatial dimensions, usually $D=3$.


\section{Domain description}

We assume that boundary location and boundary discretisation are set
up via \emph{CoordBase}.  This is necessary since other methods do not
allow specifying sufficient details to handle e.g.\ refined regions
intersecting mesh refinement boundaries.  Below we give an overview
of the information that needs to be specified to describe a
boundary.  See the CoordBase thorn guide for details.

The extent of the overall simulation domain is described in terms of
real-valued coordinates.  The domain is cuboidal, i.e., it can be
described in terms of two vectors $x^i_\mathrm{min}$ and
$x^i_\mathrm{max}$.

The type of boundary conditions for each of the $2D$ faces is
described by three quantities:
\begin{itemize}
\item the total number of boundary points,
\item how many of these points are outside of the domain boundary,
\item whether the boundary is staggered with respect to the domain
  boundary, or whether one of the boundary points is located on the
  domain boundary.
\end{itemize}
\todo{Show the relevant figures from the CoordBase thorn guide.}

The extent of the domain $L^i := x^i_\mathrm{max} - x^i_\mathrm{min}$
must be commensurate with the coarse grid spacing $h^i$.  This means
that $L^i$ must be a multiple of $h^i$, modulo the fact that the
boundary may be staggered.

An \emph{outer boundary condition} can either be a \emph{physical}
boundary condition, such as e.g.\ a Dirichlet condition, or a
\emph{symmetry} boundary condition, such as e.g.\ a reflection
symmetry.  This distinction is irrelevant for Carpet.  Both kinds of
outer boundary conditions are applied by thorns which are scheduled in
the appropriate way.

The main distinction between an \emph{outer boundary point} and an
\emph{interior point} from Carpet's point of view is that an outer
boundary point is not evolved in time.  Instead, the value of boundary
points must be completely determined by the value of interior points.
(This is clearly the case for Dirichlet or symmetry boundary
conditions.)  The reason for this distinction is that Carpet cannot
fill outer boundary points on refined grids via interpolation, because
this requires off-centred interpolation stencils which are not
implemented at the moment.  Therefore, Carpet does not fill any outer
boundary points through interpolation.

\todo{We should introduce mesh refinement and parallelisation before
  this paragraph.}
%
If the refined region abuts the outer boundary, then outer boundary
points can also be refinement boundary points.  Carpet does not fill
these outer boundary points through synchronisation (prolongation)
either.  This requires that the outer boundary condition is applied
after every synchronisation.  This is usally automatically the case
when using the Cactus boundary condition selection mechanism, i.e.,
when the group \texttt{ApplyBCs} is scheduled.  Synchronising outer
boundary points would be possible, but this is not done so that
refinement boundaries and inter-processor boundaries are treated in
the same way.


\section{Regridding}

A regridding thorn, such as e.g.\ \texttt{CarpetRegrid} or
\texttt{CarpetRegrid2}, sets up the \emph{grid hierarchy}.  The grid
hierarchy consists of several \emph{refinement levels}, and each
refinement level consists of several \emph{refined regions}.  A
refined region is a cuboid set of grid points which is assigned to a
particular processor.  (The ``refined regions'' which are specified
e.g.\ in a parameter file are broken up during domain is decomposition
into a set of regions, so that each region is assigned to exactly one
processor.)  There can be zero or multiple regions per processor, and
different processors may own different numbers of regions.  Each face
of a region is either an outer or an internal boundary.  Refined
regions are described by the data structure \code{region\_t}, which is
declared in \code{CarpetLib/src/region.hh}.

The refined regions on one level may not overlap.  (If they do not
touch each other or the outer boundary, then they usually even have to
keep a certain minimum distance, as described below.)

The ``semantics'' (the result obtained in a simulation) of a grid
hierarchy is independent of the number of refined regions, or of the
processors which are assigned to them.  The only relevant quantity is
the set of refined grid points, i.e., the conjunction of all refined
regions.  When running on more processors, the regions will be split
up so that there are more and smaller regions.

The assignment of regions to processors is handled during regridding
because it affects performance.  When there are only few processors
available, then it may be more efficient to use fewer regions.
Combining regions may require some fill-in.  \todo{Show figure.}  No
current regridding thorn in Carpet offers this at the moment;
currently, the set of refined points is independent of the number of
processors.

Each face of a refined region is either an interior or an outer
boundary face.  Carpet adds no ghost or buffer zones to outer boundary
faces, and marks this face as outer boundary when calling thorns.  An
outer boundary face must extend up to the outermost outer boundary
point, since otherwise thorns will apply the outer boundary conditions
incorrectly.  The regridding thorn has to ensure this property.  Faces
which are not outer boundary faces must be sufficiently far away from
outer boundaries, so that any ghost or buffer zones which are added to
that face do not intersect the outer boundary.  \todo{Show figure.}
The regridding thorn also has to ensure this property.

Each face which is not an outer boundary face is either an
inter-processor or a refinement boundary face.  Inter-processor faces
have no buffer zones, and the ghost zones can be filled in completely
from other refined regions on the same level.  Refinement boundary
faces do have buffer zones, and at least some of the ghost zones must
be filled via prolongation from the next coarser grid.  It is possible
to have faces which are partly an inter-processor boundary and partly
a refinement boundary.  These are counted as and handled as refinement
boundaries, although some of the ghost zones may still be filled via
synchronisation.  (Any grid points which can be filled via
synchronisation are always also filled via synchronisation.)  This is
explained in more detail below.

As described below, ghost zones are added at the outside of regions by
Carpet.  Buffer zones need to be taken into account by the regridding
thorn, most likely by extending the refined regions appropriately.
(In terms of previous terminology: buffer zones are always ``inner''
buffer zones; ``outer'' buffer zones do not exist any more.  However,
if they are added by the regridding thorn, they do not need to be
taken into account when specifying the refinement hierarchy in a
parameter file -- this depends on the regridding thorn.
\todo{CarpetRegrid2 does this, CarpetRegrid does this not.})  Carpet
assumes buffer zones at all faces which are marked as refinement
boundaries.

It is the responsibility of the regridding thorn to mark outer
boundaries and refinement boundaries appropriately.  If the outer
boundary mark is chosen wrongly, then the simulation is inconsistent,
since the outer boundary condition may be applied at a the wrong
location, or the outer boundary may be filled by interpolation which
is less accurate.  Depending on the number of boundary points and
stencil sizes, this may or may not be detected later by
self-consistency checks.


\section{Zoning}

This section defines the algorithm which Carpet uses to define which
grid points are defined by what action.  This algorithm is codified in
\code{dh.cc}.  Since the code in \code{dh.cc} is tested, it should be
assumed to be correct where it differs with this description.  This
algorithm is applied to every refinement level.

\emph{Grid arrays} are handled in the same way as grid functions,
except that there are no refined regions.  This may seem like overkill
at first, but in fact it greatly simplifies the implementation since
grid arrays have (almost) only a subset of the capabilities of grid
functions.

\subsection{Domain}

Let $dom$ be the simulation domain.  Let $domact$ be the set of grid
points which are evolved in time, and which is required to be cuboidal
and not empty.\footnote{While grid functions cannot be empty, $domact$
  for grid arrays can be empty.}  (An empty region is also counted as
cuboidal.)

Each face has a layer of outer boundary points.  Let $domob$ be the
set of these layers.  We require that all values in $domob$ can be
calculated from the values in $domact$.  Let $domext := domact \cup
domob$.  It follows that $domext$ is cuboidal and not
empty.

\todo{Show figure.}

The following properties hold for $dom$:
\begin{itemize}
\item $domact$ is cuboidal and not empty
\item $domob$ is a possibly empty layer around $domact$
\item $domob$ has a width of \code{nboundaryzones} as obtained from
  \code{GetBoundarySpecification}
\item $domact \cap domob = \emptyset$
\item $domext = domact \cup domob$ is cuboidal and not empty
\item $domext$ has the size \code{cctk\_gsh}
\end{itemize}
For completeness, one can define $dombnd = dombuf = \emptyset$, and
$domint = domown = domact$.  Then the same relations hold for $dom$ as
for $reg$ below.

\subsection{Refined regions}

Let $reg$ be a refined region.  Let $regint$ be its interior, which is
required to be cuboidal and not empty.  $regint$ includes buffer zones
and outer boundary points.  We require that $regint \subseteq domext$.

There may be other refined regions $reg'$ on the same level.  We
require that $\forall_{reg' \ne reg}: regint \cap regint' =
\emptyset$.

Each face of $regint$ which is not an outer boundary face has a layer
of \code{cctk\_nghostzones} ghost zones added to it.  (These are what
Cactus calls ``ghost zones'', but note that not all ghost zones are
filled via synchronisation.)  Let $regghost$ be the set of these
layers.  We require that $regghost \subseteq domact$.  Let $regext :=
regint \cup regghost$ be the interior with the ghost zones added.  It
follows that $regext$ is cuboidal and not empty.  We require that
$regext \subseteq domext$.

\todo{Show figure.}

On each face of $regext$, the outermost layer of \code{nboundaryzones}
points are not communicated.  Let $regcomm$ be the set of communicated
points.  The $regcomm \subseteq regext$, we require it to be not
empty, and we require that $recomm \subseteq domact$.

Let $regob := regext - regcomm$ be the set of outer boundary points.
It follows that $regob \subseteq domob$.

The owned region $regown$ is the interior region, extended up to the
boundary, i.e., $regint$ with a layer of \code{nboundaryzones} points
added to it at all outer boundary faces.  Thus $regown$ is cuboidal,
and we require it to be not empty.  It is also $regown \subseteq
domact$, and $regown \subseteq regext$.  It is also $\forall_{reg' \ne
  reg}: regown \cap regown = \emptyset$.

Let $regbnd := regcomm - regown$.  Then $regbnd \subseteq domact$.
\todo{Does this follow, or is this a requirement?}  These points are
filled via synchronisation \todo{Is this so?  Is this always via
  inter-processor communiocation, or also via prolongation?}, and they
cannot be too close to the outer boundary.

Let $regbuf$ be the set of grid points which have a distance less than
\code{buffer\_width} from the boundary of the conjunction of all
active grid points.  Then $regbuf \subseteq regown$.  Let $regact :=
regown - regbuf$.  Then also $regact \subseteq regown$.  In general,
$regact$ is not cuboidal.  $regact$ can be empty.  \todo{This may be
  controversial.}

\todo{Show figure.}

Let $regsync := regbnd \cap allown$ be the boundary points which can
be filled via synchronisation.  Note that $regbnd \cap regown =
\emptyset$ by construction, which means that a region cannot
synchronise from itself.  Note that outer boundary points are
synchronised for backward compatibility.

Let $regref := (regbnd - regsync) \cup regbuf$ be the set of all
points which need to be filled via prolongation.  Note that $regref
\cap regsync = \emptyset$, which means that no point is both
synchronised and prolongated.  Note also that $regref \cap regob =
\emptyset$, which means that no outer boundary point is prolongated.

\todo{Show figure.}

The following properties hold for $reg$:
\begin{itemize}
\item $regint \subseteq domext$ is cuboidal and not empty
\item $regghost$ is a layer on the outside of $regint$ on those faces
  which are not marked ``outer boundary''
\item $regext = regint \cup regghost \subseteq domext$ is cuboidal and
  not empty
\item $recgomm \subseteq domact$
\item $regob \subseteq domob$
\item $regown \subseteq domact$ is cuboidal and not empty
\item $\forall_{reg' \ne reg}: regown \cup regown' = \emptyset$
\item $regbnd$ is a layer around $regown$ on those faces which are not
  marked ``outer boundary''
\item $regbuf$ is a layer around $regact$ on those faces which are
  marked ``refinement boundary''
\item $regsync$ are those boundary points which can be filled via
  synchronisation
\item $regref$ are those boundary or buffer points which are filled
  via prolongation
\end{itemize}
$regint$ is not important for Carpet's working, but only for Cactus.
Since Cactus has no notion of outer boundaries, we introduce $regint$,
which corresponds to $regown$, but includes outer boundary points.


\section{Algorithmic details}

It is straightforward to extend or shrink a cuboidal region $C$ by a
layer of grid points with a given width.  It is also straightforward
to extend an arbitrary region $R$, which is internally represented as
a set of non-overlapping cuboidal regions ${C^i}$.  One sets
$\mathrm{ext}[R] := \bigcup \mathrm{ext}[C^i]$.  The individual
extended cuboidal regions $\mathrm{ext}[C^i]$ will overlap in general,
but this overlap is handled correctly by the operator $\bigcup$.
However, there is no straightforward way to shrink an arbitrary region
$R$.  One possibility is to extend the complement of $R$ instead.

The layer of buffer zones $allbuf$ can be calculated in the following
way.  Let $allown := \bigcup regown$ as above be the set of all
refined grid points.  Then $domact - allown$ is the set of all
not-refined grid points in the domain.  In order to handle the outer
boundary of the domain correctly, we define $domlarge$ to be $domact$,
sufficiently enlarged in each direction, i.e., enlarged at least by
the number of buffer zones.  Let $notown := domlarge - allown$ be the
complement of $allown$.  Let $notact$ be $notown$, enlarged by the
number of buffer zones.  Then $allbuf := allown - notact$.


\appendix
\section{Terminology}

\todo{To be filled in}
\begin{description}
\item[Block:] See map.
\item[Buffer zone:] ???
\item[Component:] ???
\item[Ghost zone:] ???
\item[Grid hierarchy:] ???
\item[Inter-processor boundary:] ???
\item[Map:] ???
\item[Outer boundary:] ???
\item[Patch:] See map.
\item[Physical boundary:] ???
\item[Refinement boundary:] ???
\item[Refinement level:] ???
\item[Region:] ???
\item[Regridding:] ???
\item[Symmetry boundary:] ???
\end{description}


% With titles and urls
\bibliographystyle{bibtex/amsplain-url}
\bibliography{bibtex/references}

\end{document}