Electronic Transactions on Numerical Analysis. ETNA
Volume 45, pp. 524–544, 2016.
© Kent State UniversityCopyright c 2016, Kent State University.
http://etna.math.kent.edu
ISSN 1068–9613.
AN ADAPTIVE CHOICE OF PRIMAL CONSTRAINTS
FOR BDDC DOMAIN DECOMPOSITION ALGORITHMS∗
JUAN G. CALVO† AND OLOF B. WIDLUND‡
Abstract. An adaptive choice for primal spaces based on parallel sums is developed for BDDC deluxe methods
and elliptic problems in three dimensions. The primal space, which forms the global, coarse part of the domain
decomposition algorithm and which is always required for any competitive algorithm, is defined in terms of generalized
eigenvalue problems related to subdomain edges and faces; selected eigenvectors associated to the smallest eigenvalues
are used to enhance the primal spaces. This selection can be made automatic by using tolerance parameters specified
for the subdomain faces and edges. Numerical results verify the results and provide a comparison with primal spaces
commonly used. They include results for cubic subdomains as well as subdomains obtained by a mesh partitioner.
Different distributions for the coefficients are also considered with constant coefficients, highly random values, and
channel distributions.
Key words. elliptic problems, domain decomposition, BDDC deluxe preconditioners, adaptive primal constraints
AMS subject classifications. 65F08, 65N30, 65N35, 65N55
1. Introduction. There has recently been a considerable amount of activity in developing
adaptive methods for the selection of primal constraints for BDDC algorithms and, in particular,
for BDDC deluxe variants. The primal constraints of a BDDC or FETI–DP algorithm provide
the global, coarse part of such a preconditioner, and they are of crucial importance for obtaining
rapid convergence of these preconditioned conjugate gradient methods for the case of many
subdomains. When the primal constraints are chosen adaptively, we aim at selecting a primal
space which for a certain dimension of the coarse space provides the fastest rate of the
convergence for the iterative method. In the alternative, we can try to develop criteria which
will guarantee that the condition number of the iteration stays below a given tolerance. We
note that a fair comparison between different algorithms must also take into account the cost
of the set-up phase and the iterations to follow. So far a comparative study of these issues for
three dimensional problems, similar to the recent study [14] for problems in two dimensions,
appears to be missing.
A particular inspiration for our own work has been a talk (see [7]) by Clark Dohrmann at
DD21, the twenty-first international conference on domain decomposition methods held in
Rennes in June 2012. Dohrmann had then started a joint work with Clemens Pechstein; see
also [23]. Their work has recently resulted in a significant contribution to the theory; see [24].
This rich paper also provides a historical context and many references to the literature.
Much of the earlier work for adaptive BDDC and FETI-DP iterative substructuring
algorithms, which has been supported by theory, has been confined to developing primal
constraints for equivalence classes related to two subdomain boundaries such as those for the
subdomain edges for problems defined on domains in the plane; see, in particular, the paper
by Klawonn, Radtke, and Rheinbach [14]. In our context, the equivalence classes are sets of
finite element nodes which belong to the boundaries of more than one subdomain with the
equivalence relation defined by the sets of subdomain boundaries to which the nodes belong.
In other words, two nodes on the interface Γ between the subdomains belong to the same
∗Received February 25, 2016. Accepted November 21, 2016. Published online on December 12, 2016. Recom-
mended by Ulrich Langer. The work of J. G. C. was supported in part by the National Science Foundation Grant
DMS-1216564. The work of O. B. W. was supported in part by the National Science Foundation Grants DMS-1216564
and DMS-1522736.
†CIMPA, Universidad de Costa Rica, San Jose, Costa Rica, 11501 (juan.calvo@ucr.ac.cr).
‡Courant Institute of Mathematical Sciences, 251 Mercer Street, New York, 10012, USA
(widlund@cims.nyu.edu).
524
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 525
equivalence class if they belong to the same set of subdomain boundaries ∂Ωi. While it is
important to further study the best way of handling all cases, the basic issues appear to be well
settled when the equivalence classes all are defined by just two subdomain boundaries.
We note that this work is relevant for problems posed in H(div) even in three dimensions
(3D) since the degrees of freedom on the interface between subdomains for Raviart-Thomas
and Brezzi-Douglas-Marini elements are associated only with faces of the elements; see [22,
29]. But for other elliptic problems in 3D, there is, except for quite special subdomain
configurations, a need to develop algorithms and results for equivalence classes with three or
more subdomain boundaries.
There is early work by Mandel, Šístek, and Sousedík, who developed condition number
indicators; cf. [19, 20]. A firm theoretical foundation for these algorithms has now been
developed for problems in two dimensions and more recently for an enhanced variant and three
dimensions; see [14] and [13], respectively. Talks by Clark Dohrmann and Axel Klawonn
at DD23, the twenty-third international conference on domain decomposition methods held
in July 2015 on Jeju Island, Korea, have reported on recent progress. A talk by Hyea Hyun
Kim in the same session on joint work with Eric Chung and Junxian Wang also has reported
considerable progress for a different kind of algorithm. Their main new algorithm for problems
in three dimensions is similar but not the same as ours; see further [12] and (3.6). The main
result of this paper, which had been developed independently, has been reported on by the
second author at the same DD23 mini-symposium.
This paper will focus on using parallel sums for general equivalence classes. The use
of parallel sums for equivalence classes with two subdomain boundaries has proven very
successful in simplifying the formulas and arguments; see in particular Pechstein [23, 24] and
also Section 2.1. We note that algorithms using parallel sums for equivalence classes with
more than two subdomain boundaries have been quite successful in numerical experiments by
Simone Scacchi and Stefano Zampini, reported in [5], for problems arising in isogeometric
analysis and also by Zampini [28] in a study of problems formulated in H(curl) based in part
on [9].
We also note that we previously have attempted to design adaptive algorithms, which
resulted in quite complicated formulas and limited success. Among other complications, in one
of these approaches, the primal constraints then had to be extracted by using a QR factorization
of a matrix generated from several bases for spaces of prospective primal constraint vectors
related to pairs of Schur complements. We note that an alternative would be to carry out several
changes of variables, thus enhancing the primal space in several steps, as is done in [11]. See
also a discussion in [24].
In this paper, we will focus on low-order, nodal finite element approximations for scalar
elliptic problems in three dimensions
(1.1) −∇ · (ρ(x)∇u) = f(x), x ∈ Ω, ρ(x) > 0,
resulting in a linear system of equations to be solved using BDDC domain decomposition
algorithms, in particular, its deluxe variant. We will always assume that the choice of boundary
conditions results in a positive definite, symmetric stiffness matrix. Future work is planned on
what is known as the economic variant of the BDDC deluxe algorithm, (e-deluxe), cf. [9], and
on linear elasticity including the almost incompressible case.
The outline of this paper is as follows: in Section 2, we briefly introduce the BDDC
algorithms. This is followed by a discussion of the case of equivalence classes with two
subdomain boundaries and a related generalized eigenvalue problem. The success of the
adaptive algorithm in this case can be explained by examining the eigenvalues of a generalized
eigenvalue problem which is closely related to a face lemma. Such a lemma provides a standard
ETNA
Kent State University
http://etna.math.kent.edu
526 J. G. CALVO AND O. B. WIDLUND
technical tool in domain decomposition theory; see [26, Subsection 4.6.3]. Several numerical
experiments reported in Section 2.2 highlight the fact that a small number of primal constraints
often can result in a very favorable bound.
We then, in Section 3, focus on the case of equivalence classes with three subdomain
boundaries. This is the main part of our paper and is relevant, in particular, for contributions of
subdomain edges to the values of a jump operator PD acting on the elements in a product space
related to the subdomains and the finite element space. We derive an upper bound of the square
of a norm based on a Schur complement and note that it has been known for over a decade that
such bounds provide an estimate of the condition number of the FETI–DP algorithm; see (2.1)
and [16]. Given the close connection of the BDDC and FETI–DP algorithms, the bound for
that same jump operator is equally relevant for our work; see [18]. We find that the square of
this norm of a subdomain edge contribution to PDw can be bounded from above in terms of
three parallel sums of single Schur complements and sums of two others. We then attempt
to find a common upper bound of these expressions in terms of the parallel sum of all the
relevant Schur complements. This is not successful, and we instead work directly with the
operators obtained in the estimate of the jump operator and formulate a generalized eigenvalue
problem for each of the edges of the subdomains. We can then select a few eigenvectors
associated with the smallest eigenvalues and generate a primal constraint from each of these
eigenvectors. These generalized eigenvalue problems are defined in terms of principal minors
of relevant Schur complements and Schur complements of these Schur complements associated
with a minimal energy extension, e.g., from a subdomain edge of a three-dimensional finite
element problem. We also provide a bound on the condition number in terms of the smallest
eigenvalues for the subdomain faces and edges which have been left out when constructing the
primal space. We note that Kim et al. [12] have followed a quite similar path although their
upper bound for the norm of the jump operator differs from ours.
In a section that follows, we indicate how to extend our preconditioner and bounds to
equivalence classes with four subdomain boundaries; no new ideas are required. Section 3 is
concluded by the definition of alternative generalized eigenvalue problems which has been
used with success in earlier work by Simone Scacchi and Stefano Zampini [5, 29]. So far these
algorithms lack a theoretical foundation.
Our paper concludes, in Section 4, by demonstrating the performance of our algorithm in
a series of numerical experiments using regular subdomains as well as subdomains generated
by a METIS mesh partitioner; see [10]. We demonstrate that we can obtain fast convergence
for problems with a quite irregular coefficient inside the subdomains. We also report on
experiments with two alternative algorithms based on other generalized eigenvalue problems
using parallel sums and sums of the two sets of Schur complements; we have not been able to
provide a theoretical justification for these variants.
2. Equivalence classes and BDDC algorithms. This section begins with a short intro-
duction to BDDC algorithms; for more details, see, e.g., [17]. For an introduction to its deluxe
variant, see, e.g., [27].
BDDC algorithms are domain decomposition algorithms based on the decomposition
of the domain Ω of an elliptic operator into non-overlapping subdomains Ωi, each often
associated with tens of thousands of degrees of freedom. The subdomain interface Γi of Ωi
does not cut through any elements and is defined by Γi := ∂Ωi \ ∂Ω⋃. Its equivalence classes
are associated with the subdomain faces, edges, and vertices of Γ := i Γi, the interface of the
entire decomposition. Thus, for a problem in three dimensions, a subdomain face is associated
with the degrees of freedom of the nodes belonging to the interior of the intersection of two
boundaries of two neighboring subdomains Ωi and Ωj and does not include any nodes on the
boundary of the face. If such a set consists of several disjoint components, then each of them
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 527
will be classified as a face. Those of a subdomain edge are typically associated with a set of
nodes common to three or more subdomain boundaries, while the endpoints of the subdomain
edges are the subdomain vertices which are associated with even more subdomains.
Given the stiffness matrix A(i) of the subdomain Ωi, we obtain a subdomain Schur
complement S(i) by eliminating the interior variables, i.e., all those that do not belong to Γi.
We will also work with principal minors of these Schur complements associated with faces F
and edges E denoting them by (i)SFF and
(i)
SEE , respectively.
The interface space is divided into a primal subspace of functions which are continuous
across Γ and a complementary, dual subspace for which we will allow multiple values across
the interface during part of the iteration. In this study, all the subdomain vertex variables will
always belong to the primal set.
The BDDC and FETI–DP algorithms can be described in terms of three product spaces of
finite element functions/vectors defined by their interface nodal values:
ŴΓ ⊂ W̃Γ ⊂WΓ.
WΓ is a product space of the spaces defined on the interfaces Γi without any continuity
constraints across the interface. Elements of W̃Γ have common values of the primal variables
but allow multiple values of the dual variables while the elements of ŴΓ are continuous at
all nodes on Γ. We will change variables, explicitly introducing the primal variables and a
complementary sets of dual variables in order to simplify the presentation; see, e.g., [17]. After
eliminating the interior variables, we can t[hen write the]subdomain Schur complements as
(i) (i)
S(i)
S S
= ∆∆ ∆Π .
(i) (i)
SΠ∆ SΠΠ
We will partially subassemble the S(i), obtaining S̃, enforcing the continuity of the primal
variables only. Thus, we then work in W̃Γ. In each step of the iteration, we solve a linear
system with the coefficient matrix S̃. In the alternative, we could also work with a linear system
with a matrix obtained by partially subassembling the subdomain stiffness matrices A(i). We
note that solving these linear systems will be considerably much faster than working with the
fully assembled system provided that the dimension of the primal space is modest. At the
end of each iteration, the approximate solution is made continuous at all nodal points of the
interface; continuity is restored by applying a weighted averaging operator ED, which maps
W̃Γ into ŴΓ.
In each iteration, we first compute the residual of the fully assembled Schur complement
system. We then apply ETD to obtain a right-hand side for the partially subassembled linear
system, solve this system, and then apply ED. This last step changes the values on Γ unless
the iteration has converged and can result in non-zero residuals at nodes not on Γ. In a final
step of each iteration step, we eliminate these residuals by solving a Dirichlet problem on
each of the subdomains. We always accelerate the iteration with the preconditioned conjugate
gradient algorithm.
2.1. BDDC deluxe. When designing a BDDC algorithm, we have to choose an effective
set of primal constraints and also a good recipe for the averaging across the interface. This
paper concerns the choice of the primal constraints while we will always use the deluxe recipe
in the construction of the averaging operator ED.
We note that in work on three-dimensional problems formulated in H(curl), it was found
that traditional averaging recipes did not always work well; cf. [8, 9]. The same is true for
problems in H(div); see [22]. This occasional failure has its roots in the fact that there are two
ETNA
Kent State University
http://etna.math.kent.edu
528 J. G. CALVO AND O. B. WIDLUND
sets of material parameters in these applications. The deluxe scaling that was then introduced
has also proven quite successful for a variety of other applications including isogeometric
analysis; cf. [5, 4]. For a survey, see [27].
A face component of the average operator ED across a subdomain face F ⊂ Γ common
to two subdomains Ωi and Ωj is defined in terms of the principal minors
(k)
SFF of the matrix
S(k), k = i, j. The deluxe averagin(g operator for)F is(then defined by−1 )
(i) (j) (i) (i) (j) (j)
w̄F := (EDw)F := SFF + SFF SFFwF + SFFwF .
Here, (i)wF is the restriction of w
(i) to the face F, etc.
The action of (i) (j)(SFF + S
−1
FF ) can be implemented by solving a Dirichlet problem on
Ωi ∪ F ∪ Ωj , where F is the face between the two subdomains. This can add significantly
to the cost. We can also compute the Schur complements, add them, and factor the sum. We
note that if we would use the popular numerical factorization package MUMPS [1], we could
exploit that these Schur complements are provided when the subdomain matrices are factored.
In an economic variant (e-deluxe), we replace this large domain by a thin domain built from
one or a few layers of elements next to the face, and this often results in very similar iteration
counts; see, e.g., [9]. The advantage of using the e-deluxe variant will depend considerably on
the software used in assembling a program; a discussion of these matters can be found in a
recent paper on problems posed in H(div); see [22].
Deluxe averaging operators are also developed for subdomain edges and any other equiva-
lence classes of interface variables, and the operator ED is assembled from all these compo-
nents; see further Section 3. Our bound for this operator will be obtained from bounds for the
individual equivalence sets and will include factors that depend on the number of equivalence
classes associated with the faces and edges of the individual subdomains; see Theorems 3.2
and 3.4.
The core of any estimate for a BDDC algorithm is the norm of the averaging operator ED.
By an algebraic argument known f(or FETI–DP)since 2002, we know that
(2.1) κ M−1BDDC Ŝ ≤ ‖ED‖S̃ ;
see [16]. Here, κ is the condition number of the iteration matrix, M−1BDDC denotes the BDDC
preconditioner, and Ŝ the fully assembled Schur complement of the problem.
The analysis of any BDDC deluxe algorithm can be reduced to establishing bounds for
the individual subdomains. The analysis of traditional BDDC algorithms requires the use of
an extension theorem, cf. [15]; the deluxe version does not.
Instead of developing an estimate forED, we will work with PD := I−ED. Instead of es-
timating the norm ofED, we will bound that of PD; we note that as proven in [24, Appendix A],
the norm of these operators are the same. Thus, instead of estimating (RT TF w̄F ) S
(i)RTF w̄F ,
we will work with the S(i)-norm of T (i)RF (wF − w̄F ). Here, RF denotes the restriction to the
face F. By elementary algebra, we(find that )−1
(i) − (i) (j) (j) (i) (j)wF w̄F = SFF + SFF SFF (wF − wF ).
M(ore algebra gives,)by usin(g that (i)S := R)S(i) TFF F RF ,T
(i) (i)
RTF (w − w̄ ) S(i) R(TF F F (wF − w̄F))−1 ( )−1
(i) − (j) T (j) (i) (j) (i) (i) (j) (j) (i) (j)= (wF wF ) SFF SFF + SFF SFF SFF + SFF SFF (wF − wF ).
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 529
Adding a similar contribution from Ωj , we obtain, following Clemens Pechstein, that the
relevant expression of the energy(is )−1
(i) − (j) T (i) (i) (j) (j) (i) (j)(wF wF ) SFF SFF + SFF SFF (wF − wF )
(i)
= (wF −
(j) (i) (j) (i) (j)
wF )
TSFF : SFF (wF − wF ).
The matrix of this positive definite, symmetric quadratic form is a parallel sum; we will use
the notation
A : B := A(A+B)−1B;
cf. [2]. We note that if A and B are positive definite, then A : B = (A−1 +B−1)−1. If A+B
is only positive semi-definite, then we can replace (A + B)−1 by (A + B)†, a generalized
inverse, without any complications. However, see [25] and Section 3 for a discussion of the
case of parallel sums of more than two positive semi-definite operators. We can also work with
shifted, positive definite operators obtained by adding a small positive multiple of the identity
operator to the operators that are singular. This idea has worked well in our experiments; using
generalized inverses have produced results of quite similar quality.
In standard BDDC theory, the required estimate can then be obtained by using a face
lemma, cf. [26, subsection 4.6.3], where such a result is established for constant coefficients
in each subdomain and for polyhedral subdomains. For an adaptive algorithm, this result is
replaced by the use of a generalized eigenvalue problem. Thus, we first solve the generalized
eigenvalue problem
(2.2) (i) (j) (i) (j)S̆FF : S̆FFφ = λSFF : SFFφ.
Here,
(i) (i) − (i)T (i)−1 (i)S̆FF := SFF SF ′F SF ′F ′ SF ′F ,
with (i)SF ′F ′ the principal minor of S
(i) with respect to \ and (i)Γi F SF ′F an off-diagonal block
of S(i). This Schur complement of a Schur complement represents the minimum norm exten-
sion of any finite element function defined on F and will therefore provide a uniform bound for
any extension of the values on to the rest of In other words, (i)T (i) (i)F Γi. wF S̆FFwF ≤ ai(w,w)
for any w that equals (i)wF on the face F. We note that the choice of
(i) (j)
S̆FF : S̆FF as one of
the matrices of the generalized eigenvalue problem is fully motivated in the proof of [24,
Lemma 5.24].
We note that the eigenvalue problem (2.2) can be replaced by an eigenvalue problem which
uses the inverses of the operators and that, following Zampini, we can extract the inverses of
all the matrices (i) (i)S̆FF from the inverse of S
(i); the inverse of S̆FF appears as a principal minor
of the inverse of S(i). The same trick can also be used to obtain all the matrices required for
eigenvalue problems related to the subdomain edges which are developed in Section 3. The
inverses of (i)SFF , etc., are computed via Cholesky factorizations. There are of course also
costs in the set-up phase of the algorithm to solve a generalized eigenvalue problem for each
subdomain face and each subdomain edge. We note that all these computations are local and
can be carried out in parallel. Detailed information on the relative cost of the set-up phase and
the iteration are provided for some large-scale parallel computations in [22].
Primal constraints are then generated by using the eigenvectors of a few of the smallest
eigenvalues of (2.2) and making (i) (j) (i) (j)(S̆FF : S̆FF )(wF − wF ) orthogonal to these eigenvectors.
ETNA
Kent State University
http://etna.math.kent.edu
530 J. G. CALVO AND O. B. WIDLUND
This orthogonality condition allows us to conclude that( )
T (i) (j) ≤ 1 T (i) (j)wF∆(SFF : SFF )wF∆ w S̆λF F∆ FF
: S̆FF wF∆
tol
for any element wF∆ ∈ W∆, where λFtol is the smallest eigenvalue of the eigenvectors that
have not been chosen when we select the primal constraints. By using all the eigenvectors
as a new basis for the subspace associated with an individual subdomain face, we can easily
separate primal and dual subspaces.
We note that the subdomain matrices A(i) are singular for interior subdomains and so
are the Schur complements (i)S̆FF . As it was previously pointed out, we can replace the
inverses in the definition of the parallel sum, etc., by generalized inverses without any further
complications.
We now note that (i)wF −
(j)
wF belongs to the dual space and that therefore
(i) − (j) T (i) (j) (i) (j)(wF wF ) SFF : SFF (wF − wF )
≤ 1 (i) − (j) T (i) (j) (i) (j)(wF wF ) S̆FF : S̆FF (wF − wF ).λFtol
What remains is to estimate the expression on the right hand side by the energy attributed
to the two subdomains Ωi and Ωj . To do so, we will use a formula for the parallel sum of two
operators; cf. [3, Theorem 9], see also [24, Corollary 5.11].
LEMMA 2.1. Let A and B be two symmetric, positive semi-definite matrices of the same
order. Then, zTA : Bz = inf (xTAx+ yTz=x+y By).
Choosing (i)x = wF and y = −
(j)
wF , we find that we have established the following result.
LEMMA 2.2. Let λFtol be the smallest eigenvalue of (2.2) not chosen when selecting the
primal constraints for a subdomain face F . We then have
1
(2.3) ‖(PDw)|F ‖2 ≤ (aS̃ F i(w,w) + aj(w,w)) ,λtol
where ai(·, ·) is the bilinear form associated with (1.1) obtained by restricting the integration
to Ωi, etc.
We note that a less precise version of formula (2.3) has previously been developed with
the factor 2F ; see, e.g., [14] as well as in the original version of this paper.λtol
2.2. Convergence of eigenvalues. The success of this kind of algorithm is closely related
to the rapid convergence of the eigenvalues of (2.2) to 1. Numerical experiments reported
in four plots illustrate a rapid decay of the eigenvalues of (i)−1 (i) (i)SFF (SFF − S̆FF ) even for
problems with highly oscillatory coefficients; see Figure 2.1. The same can be said for
subdomains generated by the METIS mesh partitioner software; see Figure 2.2. This shows
that the eigenvalues of (i)−1 (i)SFF S̆FF approach 1 quite rapidly and that, except on a very small
subspace, the action of (i)SFF and
(i)
S̆FF are virtually the same. Therefore the same can be said
of (i) (j) and (i) (j)SFF : SFF S̆FF : S̆FF . This fact is illustrated in four additional plots; see Figures 2.3
and 2.4.
In the case of a random coefficient ρ(x), we use a uniform distribution to pick a number r
in the interval [−3, 3] and then assign the value 10r to ρ in individual elements.
As a consequence of these findings, the eigenvalues of (2.2) converge to 1 quite rapidly
even for problems with large changes in the coefficients inside the subdomains. Therefore, we
do not need to expand the primal space very much.
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 531
0 0
10 10
−5
10 −5
10
−10
10
−10
10
−15
10
−15
10
0 50 100 150 200 0 50 100 150 200
ρ = 1. Random ρ.
FIG. 2.1. Eigenvalues of (i)−1 (i)SFF (SFF −
(i)
S̆FF ) for a face with 225 nodes of 3D problems with cubic subdomains.
0 0
10 10
−5
10
−5
10
−10
10
−10
10
−15
10
0 10 20 30 40 50 60 70 80 90 0 10 20 30 40 50 60 70 80 90
ρ = 1. Random ρ.
FIG. 2.2. Eigenvalues of (i)−1 (i) (i)SFF (SFF − S̆FF ) for a face with 90 nodes of 3D problems with METIS
subdomains.
Figures 2.1 and 2.2 suggest that the operator (i)−1 (i) (i)SFF (SFF − S̆FF ) is associated with a
compact operator. We can offer an explanation at least for the case with constant coefficients.
We first recall that the trace class of H1(Ωi) is H1/2(∂Ωi). For a face F ⊂ ∂Ωi, the trace
semi-norm is defined by ∫ ∫
| |2 |u(x)− u(y)|
2
(2.4) u H1/2(F ) := dS dS .|x− y|3 x yF F
We obtain the H1/2(F )-norm by adding 1/HF ‖u‖2L2(F ), where HF is the diameter of F. We
find that the (i)SFF -norm is equivalent to the norm obtained by restricting (2.4) to the finite
element space; cf. [26, Lemma 4.6].
It is also known that the H1/2(Γi)-norm of the minimal norm extension of any element
u ∈ H1/2(F ) is bounded uniformly by ‖u‖H1/2(F ). But it is also known that an extension by
zero even for 1/2H0 (F ), the closure of C
∞
0 (F ) in this norm, fails to be uniformly bounded.
The subspace for which the extension by zero is bounded is known as 1/2H00 (F ). A norm for
this subspace is given by ∫
|u(x)|2
(2.5) ‖u‖2 1/2 := |u|2 1/2 + dS ,
H00 (F )
H (F ) x
F d(x)
where d(x) is the distance from x to ∂F. The formula (2.5) can be derived for any Lipschitz
ETNA
Kent State University
http://etna.math.kent.edu
532 J. G. CALVO AND O. B. WIDLUND
1 1
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0 0
0 50 100 150 200 0 50 100 150 200
ρ = 1. Random ρ.
FIG. 2.3. Eigenvalues of (i) (j) (i) (j)S̆FF : S̆FFφ = λSFF : SFFφ for a face with 225 nodes of 3D problems with
cubic subdomains.
1 1
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0 0
0 10 20 30 40 50 60 70 80 90 0 10 20 30 40 50 60 70 80 90
ρ = 1. Random ρ.
F . 2.4. Eigenvalues of (i) (j) (i) (j)IG S̆FF : S̆FFφ = λSFF : SFFφ for a face with 90 nodes of 3D problems with
METIS subdomains.
region by considering the square of the H1/2(Γi)-norm of the extension of u(x) by zero onto
F ′ := Γi \ F ; see, e.g., [21].
A reflection of the fact that 1/2 1/2H00 is a true subspace of H0 (F ) is the well-known bound
for finite element spaces
‖uh‖2 2 21/2 ≤ C (1 + log(Hi/hH (F ) i)) ‖uh‖H1/2(F ),00
which is known to be sharp; see [6, Lemma 4.2 and Remark 4.3] and also [26, Lemma 4.24]. It
is interesting to note that this estimate gives us an estimate of the smallest non-zero eigenvalue
of (2.2); we can establish that in the s∫pecial case considered, this eigenvalue is proportional to1/(1 + log(H/h))2; see also [26, Subsubsection 4.6.3].2
The restriction of the new term |u(x)|d(x) dx to the finite element space gives a weightedF
mass matrix which is spectrally equivalent to a diagonal matrix with elements varying in
proportion to 1/d(x). This matrix is easily seen to be well approximated by a matrix of low
rank since the weight function 1/d(x) varies between values of 2/Hi and of 1/hi. It then
follows that the matrix (i)−1 (i)SFF (SFF −
(i)
S̆FF ) can be approximated well by a matrix of low
rank.
3. Equivalence classes with three subdomain boundaries. We begin this section by
considering parallel sums of more than two operators. We will work with symmetric matrices
which all are at least positive semi-definite. We recall that for a pair of symmetric, positive
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 533
definite matrices A and B, their parallel sum is given by A : B := A(A + B)−1B or
(A−1 +B−1)−1. If A+B is singular, we can work with a generalized inverse.
For three positive definite matrices, we can define their parallel sum by
A : B : C := (A−1 +B−1 + C−1)−1,
with similar formulas for four or more matrices. A quite complicated formula for A : B : C
is given in [25] for the general case when some or all of the matrices might be only positive
semi-definite. It is also shown in [25, Theorem 3] that A : B : C = (A† +B† + C†)† if and
only if the three operators A,B, and C have the same range. In our context, this is not always
the case since the matrix (i)S̆EE defined below will be singular if Ωi is an interior subdomain
while it will be non-singular if ∂Ωi intersects a part of ∂Ω where a Dirichlet condition is
imposed. This issue can be avoided by making all operators non-singular by adding a small
positive multiple of the identity to the singular operators.
As we previously have pointed out, our interest in working with parallel sums with more
than two operators is inspired by Scacchi’s and Zampini’s success in using parallel sums of
more than two operators.
We will first focus on a case of an equivalence class common to three subdomain bound-
aries as it arises for most subdomain edges in a three-dimensional finite element context if the
subdomains are generated using a mesh partitioner. We will use the notation (i) (j)SEE , SEE , and
(k)
SEE for the principal minors, of the degrees of freedom of an edge E, of the subdomain Schur
complements of the three subdomains that have this subdomain edge in common. The Schur
complements of the Schur complements representing the minimal energy extensions to indi-
vidual subdomains of given values on the subdomain edge E will be denoted by (i) (j)S̆EE , S̆EE ,
etc., and are defined by
(3.1) (i) (i) − (i)T (i)−1 (i)S̆EE := SEE SE′ESE′E′ SE′E , etc.
Here (i)SE′E′ is the principal minor of S
(i) of Γi \ E, and (i)SE′E is an off-diagonal block.
We can now i(ntroduce the deluxe av)erag(e over the edge E by−1 )
(3.2) (i) (j) (k) (i) (i) (j) (j) (k) (k)w̄E := SEE + SEE + SEE SEEwE + SEEwE + SEEwE .
We then establish that the contribution of the subdomain Ωi to the square of the norm of the
contribut(ion of the edge to PD)w =(w(− EDw is th)e square of the
(i)
SEE-norm of
−1 )
(i) (j) (k) (j) (k) (i) (j) (j) (k) (k)
SEE + SEE + SEE SEE + SEE wE − SEEwE − SEEwE ,
which c(an be estima)te(d by the sum of )−1 ( )−1( )
(i)T (j)( (k) (i) (j) (k) (i) (i) (j) (k) (j) (k) (i)3wE SEE + SEE SEE + SE)E + SEE( SEE SEE + SE)E + SEE SEE + SEE wE ,−1 −1
(j)T (j) (i) (j) (k) (i) (i) (j) (k) (j) (j)
3wE SEE SEE + SEE + SEE SEE SEE + SEE + SEE SEEwE ,
and ( )−1 ( )−1
(k)T (k) (i) (j) (k) (i) (i) (j) (k) (k) (k)
3wE SEE SEE + SEE + SEE SEE SEE + SEE + SEE SEEwE .
Here, we can replace (i) by the difference (i)wE wE∆ between the original
(i)
wE and an appropriate
element in the primal space. The same shift is used to construct (j)wE∆ and
(k)
wE∆.
ETNA
Kent State University
http://etna.math.kent.edu
534 J. G. CALVO AND O. B. WIDLUND
The other two subdomains also contribute terms which can be obtained from the formulas
above b(y changing )su(perscripts appropria)tely. Th(e three terms that in)volv
(i)
(e wE∆ are−1 −1 )
(i)T (j() (k) (i) )(j) (k) (i) (i) (j) (k) (j) (k) (i)3wE∆ SEE + SEE SEE + SEE + SEE( SEE SEE + SE)E + SEE SEE + SEE wE∆,−1 −1
(i)T (i) (i) (j) (k) (j) (i) (j) (k) (i) (i)
3wE∆ SEE SEE + SEE + SEE SEE SEE + SEE + SEE SEEwE∆,
and ( )−1 ( )−1
(i)T (i) (i) (j) (k) (k) (i) (j) (k) (i) (i)
3wE∆ SEE SEE + SEE + SEE SEE SEE + SEE + SEE SEEwE∆.
By first adding the second and third terms and then usingA(A+B)−1B = B(A+B)−1A,
we find that we can write the sum of these(three terms as)
(i)T (i) (j) (k) (i)
3wE∆ SEE : SEE + SEE wE∆.
There are also two additional terms representing the squares of certain norms of (j)wE∆ and
(k)
wE∆.
They are obtained by appropriately permuting the superscripts.
This formula represents a simplification of what was worked out in a joint paper with
Beirão da Veiga et al. [4] and also in the development of the theory in a more recent paper
with Dohrmann on three-dimensional problems in H(curl); see [9]. We can now immediately
obtain a bound of t(he square of the norm of this edge-component of PDw )by
(i)T (i) (i) (j)T (j) (j) (k)T (k) (k)
3 wE∆ SEEwE∆ + wE∆ SEEwE∆ + wE∆ SEEwE∆
by using only that (i) (j) (k) (i)SEE : (SEE + SEE) ≤ SEE , etc. For certain problems, e.g., those
with constant coefficients in each subdomain and polyhedral subdomains, we can then obtain
respectable bounds even without solving any generalized eigenvalue problems. This typically
results in a bound involving a factor C(1 + log(H/h)); cf. [26, Lemma 4.16]. A fully
satisfactory proof of this result in given in [9].
Returning to the search for adaptive primal spaces, we note that ideally, we would now
like to prove that the three operators ( )
(i) (i) ( (j) (k)TE := SEE : SEE + SEE) ,
(3.3) (j) (j) ( (i) (k)TE := SEE : SEE + SEE) ,
(k) (k) (i) (j)
TE := SEE : SEE + SEE
all can be bounded uniformly from above(by )−1
(3.4) (i) (j) (k) (i)−1 (j)−1 (k)−1SEE : SEE : SEE := SEE + SEE + SEE .
If this were possible, we could use that same matrix for estimates for (i) (j) (k)wE∆, wE∆, and wE∆.
But we are not that lucky. Before we look at the details, we note that if we were to use the
generalized eigenvalues obtained from two parallel sums with three Schur complements as
in (3.4), we could complete our argument by noting that
(i) (j) (k) ≤ (i)S̆EE : S̆EE : S̆EE S̆EE , etc.,
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 535
and using similar arguments as in the previous section. Thus, a second parallel sum would be
constructed by using the Schur complements of the previous Schur complements, associated
with the minimal energy extension given by (3.1).
Let us now make an att(empt to find a)bound such as
(i) (j) (k)
SEE : SEE + SEE ≤ Const.
(i) (j) (k)
SEE : SEE : SEE .
The operator on the left equal(s ( ) )−1 −1
(i)−1 (j) (k)
SEE + SEE + SEE
and the one on the right is given by (3.4). The desi(red inequality)would hold if−1
(j)−1 (k)−1 (j) (k)
SEE + SEE ≤ Const. SEE + SEE ,
but by using the eigensystem of the generalized eigenvalue problem, (j) (k)SEEφ = µSEEφ, we
find that the best constant above would be (i)−1 (j) (i)−1 (k)maxµ(µ+ 2 + 1/µ). If SEE SEE and SEE SEE
were well-conditioned, then we would obtain a good bound. In our experience, this is not at
all the case for many problems.
If we like to prove a bound which does not require any additional assumptions, we have
to find a different common upper bound for (i) (j) (k)TE , TE , and TE defined in (3.3). This can be
accomplished by using the trivial inequality
(i) (i) (j) (k)
TE ≤ TE + TE + TE , etc.
We will therefore de(fine our generalized)eigenva(lue problem as )
(3.5) (i) (j) (k) (i) (j) (k)S̆EE : S̆EE : S̆EE φ = λ TE + TE + TE φ .
This is the recipe that we have used in most of our numerical experiments to select primal
constraints by using a selection of eigenvectors. We note that these eigenvalue problems are
different from those of Section 2.2 and less attractive; see further the next section. Given the
success of others with using parallel sums of each of the two sets of three Schur complements,
we have also carried out experiments with that alternative generalized eigenvalue problem
although we have not been able to justify this choice theoretically. We have also tested an
additional alternative.
We note that in their recent paper, Kim, Chung, and Wang [12] used an approach similar
to ours but u(sing the operator )−1 ( )−1
(j) (i) (j) ((k) (i) (i) )(j) (k) (j)SE SE + SE + SE SE SE + SE + SE( SE(3.6) −1 )−1
(k) (i) (j) (k) (i) (i) (j) (k) (k)
+ SE SE + SE + SE SE SE + SE + SE SE
in place of our (i)TE , etc.
An alternative generalized eigenvalue problem could be obtained by replacing the sum on
the right hand side of (3.5) by (i) (j) (k) (i) (i)SEE + SEE + SEE . Since TE ≤ SEE , we see that we again
can obtain a solid bound. But we would then be one step further away from the expression of
the energy of (PDw)|E .
ETNA
Kent State University
http://etna.math.kent.edu
536 J. G. CALVO AND O. B. WIDLUND
We can now write down a bound similar to the one of (2.3) in terms of a tolerance for the
eigenvalues of (3.5) by using that
(i) (j) (k) (i)
S̆EE : S̆EE : S̆EE ≤ S̆EE , etc.
We have the following lemma:
LEMMA 3.1. Let λEtol be the smallest eigenvalue of (3.5) not chosen when selecting the
primal constraints for a subdomain edge E shared by three subdomains Ωi,Ωj , and Ωk. We
then have
‖ 3(P 2Dw)|E‖ ≤ (ai(w,w) + aj(w,w) + ak(w,w)) ,S̃ λEtol
where ai(·, ·) is the bilinear form associated to (1.1) and the subdomain Ωi, etc.
We can now combine the estimates of Lemmas 2.2 and 3.1 into what is our main theoretical
result. Our result is similar to that of [12, Lemma 4.1].
THEOREM 3.2. Assume that all subdomain edges are common to no more than three
subdomains. The S̃-norm of the op(erator PD then satisfies )
2 2
‖P w‖2D ≤
2NF 6N+ E ‖w‖2 .
S̃ min λF min λE ŜF tol E tol
Here NF is the maximum number of faces of any subdomain, and NE is the maximum number
of edges. Therefore, the co(ndition num)ber of the deluxe BDDC algorithm satisfies2N2 6N2
κ M−1 F EBDDC Ŝ ≤ + ·minF λFtol minE λEtol
Proof. We first note that we can write PDw as the sum of the sum of (PDw)|F over all
subdomain faces of the interface Γ and the sum of (PDw)|E over all subdomain edges of Γ.
By an elementary estimate, we find that ‖PDw‖2 can be bounded by 2 times the sum of theS̃
squares of the norms of the two sums. We next consider the square of the S(i)-norm of the sum
of (PDw)|F over the faces of Ωi. The fact that there are at most NF such faces introduces
an additional factor NF in front of the sum of ‖(P w)| ‖2D F (i) . Lemma 2.2 shows that eachS
such term can be bounded by 1F (ai(w,w) + aj(w,w)). We find that the factor multiplyingλtol
2
ai(w,w) is bounded by
2NF
F .minF λtol
The estimate of the edge terms can be derived very similarly. The factor will be larger
since we have an extra factor 3 in Lemma 3.1.
We note that we have found this bound to be quite pessimistic given the quadratic factors
2N2F and 6N
2
E . A similar bound is provided in Section 3.2 for the case when the maximum
number of the subdomains common to a subdomain edge equals 4; the factor 6 in the second
term will be replaced by 8.
3.1. Some eigenvalue distributions. Following the example of Section 2.2, we have
computed the eigenvalues of (i)−1 (i) − (i)SEE (SEE S̆EE). The four plots provide information on the
eigenvalues of the generalized eigenvalue problem defined by (i)SEE and
(i)
S̆EE in four different
cases.
We note that while in all cases we have one eigenvalue equal to 1, the decay of the rest
of the spectra is much less pronounced than for the faces. We note that a subdomain edge
typically will be associated with much fewer degrees of freedom than a subdomain face and
that therefore the need for a very rapid decay of these eigenvalues might be less important.
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 537
0 0
10 10
−1
10
−2
10
−1
10
−3
10
0 5 10 15 20 25 30 35 0 5 10 15 20 25 30 35
ρ = 1. Random ρ.
FIG. 3.1. Eigenvalues of (i)−1 (i) − (i)SEE (SEE S̆EE) for an edge with 31 nodes of 3D problems with cubic
subdomains.
0 0
10 10
−2
10
−1
10
−4
10
0 5 10 15 20 25 30 35 0 5 10 15 20 25 30 35
ρ = 1. Random ρ.
FIG. 3.2. Eigenvalues of (i)−1 (i) (i)SEE (SEE − S̆EE) for an edge with 32 nodes of 3D problems with METIS
subdomains.
3.2. Equivalence classes with four subdomain boundaries. This case very closely
parallels(the previous. The average de)fined(by (3.2) is replaced by−1 )
(i) (j) (k) (`) (i) (i) (j) (j) (k) (k) (`) (`)
w̄E := SEE + SEE + SEE + SEE SEEwE + SEEwE + SEEwE + SEEwE .
We find that the energy of PDw can be estimated by the sum of four terms, the first two of
which are ( )
(i)T (i) (j) (k) (`) (i)
4wE∆ SEE : SEE + SEE + SEE wE∆
and ( )
(j)T (j) (i) (k) (`) (j)
4wE∆ SEE : SEE + SEE + SEE wE∆.
If a bound of (i)SEE in terms of
(i)
S̆EE and similar bounds for the other pairs of Schur
complements were available, then we could obtain a bound right away for the BDDC algorithm,
without adaption, with a factor 4. This is also an improvement as far as the constant is concerned
in comparison to previous results. But here we will focus on selecting the primal constraints
adaptively.
With four su(bdomains in the equiv)alence class, we use f(our operators )
(i) (i) (j) (k) (`) (j) (j) (i) (k) (`)
TE := SEE : SEE + SEE + SEE , TE := SEE : SEE + SEE + SEE , etc.
ETNA
Kent State University
http://etna.math.kent.edu
538 J. G. CALVO AND O. B. WIDLUND
All these operators are symmetric, positive definite, and they appear directly in our estimate of
the energy of PDw. We can now use the trivial inequality
(i) ≤ (i) (j) (k) (`)TE TE + TE + TE + TE
and very sim(ilar bounds for the other te)rms and (arrive at the generalized eige)nvalue problem
(3.7) (i) (j) (k) (`) (i) (j) (k) (`)S̆EE : S̆EE : S̆EE : S̆EE φ = λ TE + TE + TE + TE φ.
Both operators of (3.7) are symmetric with respect to the Schur complements. What will
be featured in the final bound would be the smallest eigenvalue not taken into account, i.e.,
with eigenvectors not associated with the primal space, and a fixed factor similar to what
appears in the bounds of Lemmas 2.2 and 3.1 and Theorem 3.2.
LEMMA 3.3. Let λEtol be the smallest eigenvalue of (3.5) not chosen when selecting the
primal constraints for a subdomain edge E shared by four subdomains Ωi,Ωj , Ωk, and Ω`.
We then have
‖(P 2Dw)|E‖ ≤
4
(ai(w,w) + aj(w,w) + ak(w,w) + a`(w,w)) ,S̃ λEtol
where ai(·, ·) is the bilinear form associated to (1.1) and the subdomain Ωi, etc.
We can now combine the estimates of Lemmas 2.2 and 3.3 into what is our main theoretical
result for this case.
THEOREM 3.4. Assume that all subdomain edges are common to no more than four
subdomains. The S̃-norm of the operator PD then satisfies
( 2
‖ ‖2 ≤ 2NF 8N
2 )
PDw +
E ‖w‖2 .
S̃ min λFF tol min
E Ŝ
E λtol
Here NF is the maximum number of faces of any subdomain, and NE is the maximum number
of edges. Therefore, the co(ndition num)ber of the deluxe BDDC algorithm satisfies2 2
κ M−1
2N
Ŝ ≤ F 8NEBDDC + ·min FF λtol min λEE tol
The proofs of this lemma and theorem follow exactly the same lines as those of Lemma 3.1
and Theorem 3.2.
3.3. Recipes of some previous work. Several other generalized eigenvalue problems
have been employed quite successful but so far lack full theoretical justifications.
Scacchi and Zampini have used what would correspond to the operators (i) (j) (k)SEE : SEE : SEE
and (i) (j) (k) (i) (j) (k)S̆EE + S̆EE + S̆EE or S̆EE : S̆EE : S̆EE for difficult, very ill-conditioned problems
arising in isogeometric analysis. Stefano Zampini has also used (i) (j) (k)SEE : SEE : SEE and
(i) (j) (k)
S̆EE : S̆EE : S̆EE successfully for subdomain edges and three-dimensional H(curl) prob-
lems; see [28]. So far, we have not found as full a justification for these recipes as for the one
based on using the generalized eigenvalue problems (2.2), (3.5), and (3.7).
4. Numerical results. We present some numerical results for our adaptive BDDC deluxe
algorithm. We consider a triangulation of the unit cube into tetrahedral elements and decom-
positions of this domain into cubic subdomains or subdomains obtained by using a METIS
mesh partitioner; see Figure 4.1.
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 539
METIS decomposition. One of the METIS subdomains.
FIG. 4.1. Domain decomposition obtained by METIS for the unit cube, N = 27.
Channels with ρ = 103. Coefficient distribution.
FIG. 4.2. Coefficient distribution with channels: black elements on the right have ρ = 103, and all the other
elements have ρ = 1.
We solve the resulting linear systems with random right-hand sides using BDDC pre-
conditioners to a relative residual tolerance of 10−6. We always use a zero initial guess. The
number of iterations and condition number estimates (in parentheses) are reported for each
experiment.
EXAMPLE 4.1. We first consider the scalability of the BDDC deluxe algorithm for a cubic
subdomain partitioning of the unit cube and for different standard choices of the primal space.
The column label “Corners” represents the common choice with only primal constraints for the
subdomain vertices. “Edges” adds the average over each edge to the set of primal constraints
while “Edges and Faces” additionally uses the averages over each face; see Table 4.1. We then
compare these results with adaptive algorithms based on generalized eigenvalue problems;
see Table 4.2. The numbers in its first two columns represent the fraction of the range of the
eigenvalues related to the subdomain edges that are incorporated into the primal space through
their eigenvectors. Thus, given the interval between the smallest and the largest eigenvalues,
we use, as primal constraints, the eigenvectors of all the eigenvalues that lie in the leftmost 5%
or 50% of this interval. For the faces, we always use a fixed 5%. Finally, for the last column,
“Adaptive”, we use λFtol = (1 + log(H/h))
−1, λEtol = (kH/h)
−1, where k is the number of
subdomains that share the edge E to select the eigenvalues. These formulas are borrowed
from [12] and have allowed us to make direct comparisons with results of that study. We have
also found that this recipe selects a relatively small number of effective primal constraints.
ETNA
Kent State University
http://etna.math.kent.edu
540 J. G. CALVO AND O. B. WIDLUND
TABLE 4.1
Performance for different choices of primal constraints with H/h = 8 and cubic subdomains. NE is the
number of subdomain edges, and DOF is the number of degrees of freedom of the problem.
Corners Edges Edges and Faces NE DOF
ρ N I(κ) |WΠ| I(κ) |WΠ| I(κ) |WΠ|
1 33 12(14.9) 8 12(13.9) 44 12(13.9) 98 36 15626
43 17(16.6) 27 17(15.6) 135 17(15.6) 279 108 35937
53 24(17.2) 64 24(16.1) 304 23(16.1) 604 240 68921
63 26(17.6) 125 25(16.5) 575 25(16.5) 1115 450 117649
R 33 23(42.9) 8 21(39.2) 44 23(39.1) 98 36 15626
43 34(77.9) 27 33(64.8) 135 37(62.3) 279 108 35937
53 51(83.4) 64 48(75.5) 304 51(75.2) 604 240 68921
63 68(106) 125 66(90.0) 575 61(90.0) 1115 450 117649
S 33 24(176) 8 24(174) 44 23(173) 98 36 15626
43 37(1068) 27 37(985) 135 33(981) 279 108 35937
53 60(1994) 64 59(1812) 304 55(1804) 604 240 68921
63 74(2234) 125 71(2022) 575 64(2013) 1115 450 117649
TABLE 4.2
Scalability for adaptive primal constraints with H/h = 8 and cubic subdomains.
Primal 5% Primal 50% Adaptive
ρ N I(κ) |WΠ| I(κ) |WΠ| I(κ) |WΠ| [f ]
1 33 6(1.5) 98 6(1.5) 122 8(2.2) 50 [0.6]
43 6(1.5) 279 6(1.5) 333 8(2.2) 189[0.8]
53 7(1.5) 604 6(1.5) 700 8(2.2) 460[0.9]
63 7(1.5) 1115 7(1.5) 1265 8(2.1) 905[0.9]
R 33 14(5.9) 115 11(3.2) 213 10(2.5) 237[2.6]
43 16(7.4) 336 13(7.3) 622 11(3.1) 746[3.0]
53 19(12.1) 765 13(4.0) 1361 11(3.1) 1698[3.1]
63 22(20.9) 1368 14(5.5) 2534 11(3.5) 3140[3.2]
S 33 9(10.5) 102 9(10.5) 119 10(10.6) 65[0.7]
43 10(14.3) 285 10(13.8) 340 11(11.6) 197[0.8]
53 11(15.2) 612 11(14.5) 708 11(15.2) 473[0.9]
63 12(15.3) 1125 12(14.6) 1272 12(15.3) 918[0.9]
However, we note that in numerical experiments not reported here, we have found that the
smallest eigenvalues of the generalized eigenvalue problems for the subdomain edges remain
above a positive constant when H/h increases.
The number [f ] in square brackets represents the ratio of the dimension of the primal
space and the total number of edges and faces. Different choices of ρ are considered in all
cases: ρ = 1, ρ = R (random values for each element), and ρ = S (a distribution with rods
and with jumps in the coefficients; see Figure 4.2). In the case of random values, we use a
uniform distribution to pick a number r in the interval [−3, 3] and then assign to each element
the value 10r.
These experiments show that the standard choices of primal constraints can fail quite
badly for problems with a coefficient that varies considerably inside the subdomains. The
results for the adaptive choices of primal constraints are much more satisfactory. We also
find that we can have success with only a small number of primal constraints even for the
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 541
TABLE 4.3
Performance for different choices of primal constraints with H/h = 8 and METIS subdomains. NE is the
number of subdomain edges.
Corners Edges Edges and Faces NE
ρ N I(κ) |WΠ| I(κ) |WΠ| I(κ) |WΠ|
1 33 17(7.0) 51 16(6.4) 169 15(6.3) 268 126
43 20(7.4) 164 19(6.4) 516 17(6.3) 793 389
53 22(8.2) 417 25(11.2) 1265 23(11.1) 1886 951
63 26(10.0) 658 28(10.1) 1977 25(9.9) 2912 1458
R 33 21(15.5) 51 32(44.1) 169 31(54.9) 268 126
43 27(14.7) 164 46(124) 516 48(236) 793 389
53 34(19.5) 417 64(384) 1265 61(383) 1886 951
63 39(24.1) 658 68(108) 1977 71(242) 2912 1458
S 33 29(147) 51 50(173) 169 56(171) 268 126
43 35(263) 164 64(242) 516 69(232) 793 389
53 52(254) 417 110(1911) 1265 117(1859) 1886 951
63 57(398) 658 161(1125) 1977 160(1121) 2912 1458
TABLE 4.4
Scalability for adaptive primal constraints with H/h = 8 and METIS subdomains.
Primal 5% Primal 50% Adaptive
ρ N I(κ) |WΠ| I(κ) |WΠ| I(κ) |WΠ| [f ]
1 33 7(2.0) 252 8(2.0) 299 9(2.4) 108[0.5]
43 8(1.9) 732 7(1.8) 855 9(2.4) 363[0.5]
53 12(5.9) 1697 12(5.9) 1884 13(6.0) 927[0.6]
63 15(5.7) 2688 14(5.7) 2967 14(5.7) 2319[0.9]
R 33 15(15.1) 275 12(4.9) 379 13(4.8) 264[1.2]
43 16(12.9) 798 14(7.9) 1143 21(24.3) 834[1.2]
53 23(15.3) 1852 19(11.8) 2607 20(13.1) 2036[1.2]
63 24(16.0) 3003 22(15.7) 4216 20(15.8) 4115[1.7]
S 33 15(47.9) 263 14(37.8) 300 14(32.4) 143[0.6]
43 20(50.8) 748 20(50.3) 864 20(31.3) 402[0.6]
53 23(86.0) 1723 25(85.9) 1908 23(86.6) 981[0.6]
63 33(55.0) 2710 33(55.0) 3019 35(55.0) 2379[0.6]
subdomain edges. We also note that adaptive choices of the primal constraints can result in
much smaller condition numbers even for the case of a constant coefficient ρ.
EXAMPLE 4.2. We verify the scalability of our algorithm for METIS subdomains with
the same coefficient distribution as in Example 4.1; see Tables 4.3 and 4.4. The results are in
many cases quite similar to those for cubic subdomains.
EXAMPLE 4.3. This example is used to study the behavior of our algorithm for increasing
values of H/h with 27 cubic subdomains; see Table 4.5. We find the results, all obtained with
the tolerances used in the "Adaptive" columns of Tables 4.2 and 4.4, quite satisfactory.
EXAMPLE 4.4. This example is used to compare the behavior of different eigenvalue
problems with 27 cubic subdomains and H/h = 16; see Table 4.6. Here, ET refers to the
generalized e(igenvalue problem (3.7), E)par refer(s to )
(i) (j) (k) (`) (i) (j) (k) (`)
S̆EE : S̆EE : S̆EE : S̆EE φ = λ SEE : SEE : SEE : SEE φ,
ETNA
Kent State University
http://etna.math.kent.edu
542 J. G. CALVO AND O. B. WIDLUND
TABLE 4.5
Results for the adaptive algorithm with 27 cubic subdomains for increasing values of H/h.
ρ = 1 ρ = R ρ = S
H/h I(κ) |WΠ| [f ] I(κ) |WΠ| [f ] I(κ) |WΠ| [f ]
4 5(1.2) 62[0.7] 7(1.7) 154[1.7] 5(1.2) 62[0.7]
8 8(2.2) 50[0.6] 10(2.5) 237[2.6] 10(10.7) 65[0.7]
12 9(2.7) 50[0.6] 12(4.0) 265[2.9] 11(19.7) 89[1.0]
16 10(3.1) 50[0.6] 13(4.3) 270[3.0] 12(5.4) 60[0.7]
TABLE 4.6
Results for different eigenvalue problems with N cubic subdomains and H/h = 8.
ET Epar Emix
ρ N I(κ) |WΠ| [f ] I(κ) |WΠ| [f ] I(κ) |WΠ| [f ]
1 33 8(2.1) 50[0.6] 8(2.1) 50[0.6] 8(2.2) 38[0.5]
43 8(2.1) 189[0.8] 8(2.1) 189[0.8] 8(2.2) 141[0.6]
53 8(2.1) 460[0.8] 8(2.1) 460[0.8] 8(2.2) 352[0.7]
63 8(2.1) 905[0.9] 8(2.1) 905[0.9] 9(2.2) 713[0.7]
R 33 10(2.5) 237[2.6] 13(4.3) 71[0.8] 13(4.3) 59[0.7]
43 11(7.3) 746[3.0] 15(9.6) 246[1.0] 14(4.4) 198[0.8]
53 11(3.1) 1698[3.1] 16(8.8) 604[1.1] 14(4.8) 496[0.9]
63 11(3.5) 3140[3.2] 18(10.3) 1158[1.2] 17(8.5) 966[1.0]
S 33 10(10.6) 65[0.7] 10(12.0) 57[0.7] 11(13.6) 41[0.5]
43 11(11.6) 197[0.8] 14(30.0) 189[0.8] 14(29.1) 140[0.6]
53 11(15.2) 473[0.9] 15(30.0) 463[0.9] 14(29.1) 354[0.7]
63 12(15.3) 918[0.9] 17(30.0) 906[0.9] 16(29.4) 713[0.7]
and Emix to( ) ( )
(i) (j) (k) (`) (i) (j) (k) (`)
S̆EE + S̆EE + S̆EE + S̆EE φ = λ SEE : SEE : SEE : SEE φ.
5. Conclusions. We have developed adaptive choices for the primal spaces for BDDC
deluxe methods and elliptic problems and a theoretical bound for the condition number of the
preconditioned system. We have first observed that adaptivity can considerably improve the
performance since classical choices with primal vertices, edge averages, and faces averages
can fail in case of large variations in the coefficients; see Table 4.1 and 4.3. Second, numerical
experiments show that the primal constraints related to the subdomain faces generally are easy
to handle; this is supported by the discussion in Section 2.2. Therefore, the 5%-option has
been used in many of the experiments resulting in just one or two constraints per face in most
of the cases. For the subdomain edges, we note that there is no significant difference in the
case of constant coefficients if we use 5% or 50% of the interval. For the other two cases
considered, extending this interval beyond 5% can be more important; it is clear that such an
increase will improve the condition number and the iteration count as illustrated in Tables 4.2
and 4.4. Here, the results in the "Adaptive" column show that we can keep the primal space
small.
As we have already observed, the tolerances used for subdomain faces and edges seem to
work well since they produce small primal spaces with good condition numbers. In most of
the cases, the ratio between the dimension of the primal space and the total number of edges
and faces [f ] is smaller than 1, which means that, on average, we use fewer than one constraint
ETNA
Kent State University
http://etna.math.kent.edu
BDDC WITH ADAPTIVE PRIMAL SPACES 543
per subdomain face/edge. Finally, Table 4.6 exemplifies that different eigenvalue problems
considered by others can have a similar performance as ours.
Acknowledgments. The authors wishes to thank one of the referees who provided
several suggestions that have helped improve our paper. They also wish to thank Dr. Clemens
Pechstein for the suggestion to use [24, Corollary 5.11] to improve our Lemma 2.2 and its
proof.
REFERENCES
[1] P. R. AMESTOY, I. S. DUFF, J.-Y. L’EXCELLENT, AND J. KOSTER, A fully asynchronous multifrontal solver
using distributed dynamic scheduling, SIAM J. Matrix Anal. Appl., 23 (2001), pp. 15–41.
[2] W. N. ANDERSON, JR. AND R. J. DUFFIN, Series and parallel addition of matrices, J. Math. Anal. Appl., 26
(1969), pp. 576–594.
[3] W. N. ANDERSON, JR. AND G. E. TRAPP, Shorted operators. II, SIAM J. Appl. Math., 28 (1975), pp. 60–71.
[4] L. BEIRÃO DA VEIGA, L. F. PAVARINO, S. SCACCHI, O. B. WIDLUND, AND S. ZAMPINI, Isogeometric
BDDC preconditioners with deluxe scaling, SIAM J. Sci. Comput., 36 (2014), pp. A1118–A1139.
[5] , Adaptive selection of primal constraints for isogeometric BDDC deluxe preconditioners, SIAM J. Sci.
Comput., to appear, 2016.
[6] S. C. BRENNER AND Q. HE, Lower bounds for three-dimensional nonoverlapping domain decomposition
algorithms, Numer. Math., 93 (2003), pp. 445–470.
[7] C. R. DOHRMANN AND C. PECHSTEIN, Constraint and weight selection algorithms for BDDC, Talk by
C. R. Dohrmann in Rennes, France, June 2012.
www.osti.gov/scitech/servlets/purl/1117109
[8] C. R. DOHRMANN AND O. B. WIDLUND, Some recent tools and a BDDC algorithm for 3D problems in
H(curl), in Domain Decomposition Methods in Science and Engineering XX, R. E. Bank, M. Holst,
O. Widlund, and J. Xu, eds., vol. 91 of Lect. Notes Comput. Sci. Eng., Springer, Heidelberg, 2013,
pp. 15–26.
[9] A BDDC algorithm with deluxe scaling for three-dimensional H(curl) problems, Comm. Pure Appl.
Math., 69 (2016), pp. 745–770.
[10] G. KARYPIS, R. AGGARWAL, K. SCHOEGEL, V. KUMAR, AND S. SHEKHAR, METIS home page.
http://glaros.dtc.umn.edu/gkhome/views/metis
[11] H. H. KIM AND E. T. CHUNG, A BDDC algorithm with enriched coarse spaces for two-dimensional elliptic
problems with oscillatory and high contrast coefficients, Multiscale Model. Simul., 13 (2015), pp. 571–
593.
[12] H. H. KIM, E. T. CHUNG, AND J. WANG, BDDC and FETI-DP algorithms with adaptive coarse spaces
for three-dimensional elliptic problems with oscillatory and high contrast coefficients, Preprint on arXiv,
2015. http://arxiv.org/abs/1606.07560
[13] A. KLAWONN, M. KÜHN, AND O. RHEINBACH, Adaptive coarse spaces for FETI-DP in three dimensions,
SIAM J. Sci. Comput., 38 (2016), pp. A2880–A2911.
[14] A. KLAWONN, P. RADTKE, AND O. RHEINBACH, A comparison of adaptive coarse spaces for iterative
substructuring in two dimensions, Electron. Trans. Numer. Anal., 45 (2016), pp. 75–106.
http://etna.ricam.oeaw.ac.at/vol.45.2016/pp75-106.dir/pp75-106.pdf
[15] A. KLAWONN, O. RHEINBACH, AND O. B. WIDLUND, An analysis of a FETI-DP algorithm on irregular
subdomains in the plane, SIAM J. Numer. Anal., 46 (2008), pp. 2484–2504.
[16] A. KLAWONN, O. B. WIDLUND, AND M. DRYJA, Dual-primal FETI methods for three-dimensional elliptic
problems with heterogeneous coefficients, SIAM J. Numer. Anal., 40 (2002), pp. 159–179.
[17] J. LI AND O. B. WIDLUND, FETI-DP, BDDC, and block Cholesky methods, Internat. J. Numer. Methods
Engrg., 66 (2006), pp. 250–271.
[18] J. MANDEL, C. R. DOHRMANN, AND R. TEZAUR, An algebraic theory for primal and dual substructuring
methods by constraints, Appl. Numer. Math., 54 (2005), pp. 167–193.
[19] J. MANDEL AND B. SOUSEDÍK, Adaptive selection of face coarse degrees of freedom in the BDDC and
the FETI-DP iterative substructuring methods, Comput. Methods Appl. Mech. Engrg., 196 (2007),
pp. 1389–1399.
[20] J. MANDEL, B. SOUSEDÍK, AND J. ŠÍSTEK, Adaptive BDDC in three dimensions, Math. Comput. Simulation,
82 (2012), pp. 1812–1831.
[21] J. NEČAS, Les Méthodes Directes en Théorie des Équations Elliptiques, Academia, Prague, 1967.
[22] D.-S. OH, O. B. WIDLUND, S. ZAMPINI, AND C. R. DOHRMANN, BDDC algorithms with deluxe scaling
and adaptive selection of primal constraints for Raviart-Thomas vector fields, Tech. Report TR2015-978,
Courant Institute, New York University, 2015.
ETNA
Kent State University
http://etna.math.kent.edu
544 J. G. CALVO AND O. B. WIDLUND
[23] C. PECHSTEIN AND C. R. DOHRMANN, Modern domain decomposition methods, BDDC, deluxe scaling, and
an algebraic approach, Talk by C. Pechstein at the University Linz, December 2013.
http://people.ricam.oeaw.ac.at/c.pechstein/pechstein-bddc2013.pdf
[24] , A Unified Framework for Adaptive BDDC, Tech. Report 2016-20, Johann Radon Institute for Compu-
tational and Applied Mathematics (RICAM), University Linz, 2016.
https://www.ricam.oeaw.ac.at/files/reports/16/rep16-20.pdf
[25] Y. TIAN, How to express a parallel sum of k matrices, J. Math. Anal. Appl., 266 (2002), pp. 333–341.
[26] A. TOSELLI AND O. WIDLUND, Domain Decomposition Methods—Algorithms and Theory, Springer, Berlin,
2005.
[27] O. B. WIDLUND AND C. R. DOHRMANN, BDDC deluxe domain decomposition, in Domain Decomposition
Methods in Science and Engineering XXII, T. Dickopf, M. J. Gander, L. Halpern, R. Krause, L. F. Pavarino,
eds., vol. 104 of Lecture Notes in Comput. Sci., Springer, Cham, 2016, pp. 93–103.
[28] S. ZAMPINI, Adaptive BDDC deluxe for H(curl), in Proceedings of the 23rd International Conference on
Domain Decomposition Methods, 2015, to appear.
[29] , PCBDDC: a class of robust dual-primal preconditioners in PETSc, SIAM J. Sci. Comput., 38 (2016),
pp. S282–S306.