We investigate the proximal map for the weighted mean absolute error function. An algorithm for its efficient and vectorized evaluation is presented. As a demonstration, this algorithm is applied to a nonsmooth energy minimization problem.
1 INTRODUCTION
The proximity operator, or proximal map, plays a fundamental role in nonsmooth optimization; see, for instance, [1-3]. Given a function $f \colon \mathbb{R} \to \mathbb{R} \cup \{\infty\}$ and a parameter $\gamma > 0$, the proximal map $\operatorname{prox}_{\gamma f}$ is defined as the solution of the problem

$\operatorname{prox}_{\gamma f}(x) := \operatorname*{arg\,min}_{z \in \mathbb{R}} \; \gamma \, f(z) + \frac{1}{2} (z - x)^2.$ (1)

Under the mild condition that f is proper, lower semicontinuous, and convex, $\operatorname{prox}_{\gamma f}$ is well defined. We refer the reader to [4, Ch. 12.4] for details and further properties. We also mention http://proximity-operator.net/, where formulas and implementations of the proximal map for various functions have been collected.
In this paper, we present theory and an efficient algorithm for the evaluation of $\operatorname{prox}_{\gamma f}$, where $f \colon \mathbb{R} \to \mathbb{R}$ is defined as

$f(z) := \sum_{i=1}^{N} w_i \, |z - d_i|.$ (2)

Here $w_1, \ldots, w_N > 0$ are given, positive weights and $d_1, \ldots, d_N \in \mathbb{R}$ are given data, for some $N \in \mathbb{N}$. We refer to (2) as the weighted mean absolute error. Any of its minimizers is known as a weighted median of the data $d_1, \ldots, d_N$. Clearly, f is proper, continuous, and convex, and so is $\gamma f$ for any $\gamma > 0$.
By definition, the proximal map for f as in (2) is given by

$\operatorname{prox}_{\gamma f}(x) = \operatorname*{arg\,min}_{z \in \mathbb{R}} \; \gamma \sum_{i=1}^{N} w_i \, |z - d_i| + \frac{1}{2} (z - x)^2.$ (3)

In the case of a single data point ($N = 1$), problem (3) reduces to the well-known problem

$\operatorname*{min}_{z \in \mathbb{R}} \; \gamma \, w \, |z - d| + \frac{1}{2} (z - x)^2$ (4)

with $w = w_1$ and $d = d_1$, whose unique solution is explicitly given in terms of the soft-thresholding operator $S_\tau(x) := \operatorname{sign}(x) \max\{|x| - \tau, \, 0\}$. In this case, we have

$\operatorname{prox}_{\gamma f}(x) = d + S_{\gamma w}(x - d).$ (5)

This map, often with $d = 0$, arises in many iterative schemes for the solution of problems involving the 1-norm; see, for instance, [5, 6]. We can therefore view (3) as a multithresholding operation.
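To illustrate the single-data-point case (4)-(5), here is a minimal Python sketch; the function names are ours and not part of the implementation [8].

```python
import math

def soft_threshold(x, tau):
    """Soft-thresholding operator S_tau(x) = sign(x) * max(|x| - tau, 0)."""
    return math.copysign(max(abs(x) - tau, 0.0), x)

def prox_single(x, gamma, w, d):
    """Proximal map of z -> gamma * w * |z - d| evaluated at x, cf. (5)."""
    return d + soft_threshold(x - d, gamma * w)
```

For example, with gamma = 1, w = 0.5, and d = 1, the point x = 3 is pulled toward d by gamma * w = 0.5, while any x within distance 0.5 of d is mapped onto d exactly.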
We point out that [7] have considered the slightly more general problem

$\operatorname*{min}_{z \in \mathbb{R}} \; \sum_{i=1}^{N} w_i \, |z - d_i| + F(z)$

with F strictly convex and differentiable with a bijective derivative. The prototypical examples are the functions $F(z) = \frac{1}{\gamma p} \, |z - x|^p$ with $p > 1$. We concentrate on the case $p = 2$, which agrees with (3). The algorithm in [7] computes the solution of (3) by finding the median in the union of two sorted lists of cardinality N. By contrast, our implementation finds an index in a single sorted list of cardinality N. In contrast to [7], we provide a vectorized, open-source implementation of (3) in [8].
This paper is structured as follows: We establish an algorithm for the evaluation of the proximal map of the weighted mean absolute error (3) in Section 2 and prove its correctness in Theorem 2.1. In Section 3, we briefly discuss the structural properties of the proximal map. We conclude by showing an application of the proposed algorithm to a nonsmooth energy minimization problem.
2 ALGORITHM FOR THE EVALUATION OF THE PROXIMAL MAP
In this section, we derive an efficient algorithm for the evaluation of the proximal map (3) and prove its correctness in Theorem 2.1. To this end, we assume that the points have been sorted and duplicates have been removed and their weights added. As a result, we can assume $d_1 < d_2 < \cdots < d_N$. Moreover, we assume $\gamma > 0$, $N \ge 1$, and $w_i > 0$ for all $i = 1, \ldots, N$. Summands with $w_i = 0$ can obviously be dropped from the sum in (3).
We divide the real line into the intervals

$I_i := [d_i, \, d_{i+1}], \quad i = 1, \ldots, N-1,$ (6)

which overlap in the given data points. It is also useful to set $d_0 := -\infty$ and $d_{N+1} := \infty$, so that $I_0 = (-\infty, d_1]$ and $I_N = [d_N, \infty)$. We further introduce the forward and reverse cumulative weights as

$W_k := \sum_{i=1}^{k} w_i \quad \text{and} \quad \overline{W}_k := \sum_{i=k}^{N} w_i.$

We extend these definitions by setting $W_0 := 0$ and $\overline{W}_{N+1} := 0$. We therefore have $W_k + \overline{W}_{k+1} = W_N$ for all $k = 0, \ldots, N$. Using this notation, we can rewrite the derivative of f as

$f'(z) = W_i - \overline{W}_{i+1} \quad \text{for } z \in \operatorname{int} I_i, \quad i = 0, \ldots, N.$

This formula reflects the fact that f is piecewise linear and convex because $W_i - \overline{W}_{i+1}$ is monotonically increasing with i. Moreover, $f'(z) = -W_N$ holds for all $z < d_1$ (points to the left of the smallest data point $d_1$), and $f'(z) = W_N$ holds for all $z > d_N$ (points to the right of the largest data point $d_N$). At $z = d_i$, $i = 1, \ldots, N$, f is nondifferentiable but we can specify its subdifferential, which is

$\partial f(d_i) = \left[ W_{i-1} - \overline{W}_i, \; W_i - \overline{W}_{i+1} \right].$
The objective

$\Phi(z) := \gamma \sum_{i=1}^{N} w_i \, |z - d_i| + \frac{1}{2} (z - x)^2$ (7)

of (3) is piecewise quadratic and strongly convex. Its subdifferential $\partial \Phi(z) = \gamma \, \partial f(z) + z - x$ is thus strongly monotone and it satisfies

$\max \partial \Phi(x - \gamma W_N) \le 0 \le \min \partial \Phi(x + \gamma W_N).$

Consequently, the unique minimizer of Φ lies between these bounds.
The idea for finding the unique minimizer $z^*$ of (7) is to locate the smallest index $k \in \{1, \ldots, N\}$ such that $\max \partial \Phi(d_k) \ge 0$ holds, that is, $d_k$ is the nearest data point to the right of $z^*$ (or equal to it). In other words, we need to find $k$ such that

$\gamma \, (W_{k-1} - \overline{W}_k) + d_{k-1} - x < 0,$ (8a)
$\gamma \, (W_k - \overline{W}_{k+1}) + d_k - x \ge 0$ (8b)

holds. Now we can distinguish two cases: $z^* = d_k$ and $z^* \in \operatorname{int} I_{k-1} = (d_{k-1}, d_k)$. The first case applies if and only if

$\gamma \, (W_{k-1} - \overline{W}_k) + d_k - x \le 0.$

Otherwise, $x - \gamma \, (W_{k-1} - \overline{W}_k)$ lies in $\operatorname{int} I_{k-1}$, and thus, it is the unique minimizer of the locally quadratic objective Φ. In either case, once the index $k$ has been identified, $\operatorname{prox}_{\gamma f}(x)$ is given by

$\operatorname{prox}_{\gamma f}(x) = \min \left\{ d_k, \; x - \gamma \, (W_{k-1} - \overline{W}_k) \right\}.$ (9)
The considerations above lead to Algorithm 1, whose key step is to test the sign of the quantities

$s_k := \gamma \, (W_k - \overline{W}_{k+1}) + d_k - x, \quad k = 1, \ldots, N.$ (10)

In our implementation, we evaluate (10) for all k simultaneously and benefit from the quantities $s_k$ being monotone increasing with k when finding the first nonnegative entry.
ALGORITHM 1. Evaluation of (3), the proximal map of the weighted mean absolute error.
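Since the listing of Algorithm 1 may not reproduce well here, the following Python sketch reflects the steps derived above: form the cumulative weights, locate the first index k with the quantity (10) nonnegative, and evaluate formula (9). Function and variable names are our own, not those of the implementation [8].

```python
import numpy as np

def prox_wmae(x, gamma, w, d):
    """Prox of z -> gamma * sum_i w_i |z - d_i| at the point x.

    Assumes d is sorted in strictly increasing order and w > 0, cf. Section 2.
    """
    w = np.asarray(w, dtype=float)
    d = np.asarray(d, dtype=float)
    N = d.size
    W = np.concatenate(([0.0], np.cumsum(w)))      # W[k] = w_1 + ... + w_k, W[0] = 0
    total = W[N]                                   # W_N; note Wbar_{k+1} = total - W[k]
    # s_k = gamma * (W_k - Wbar_{k+1}) + d_k - x for k = 1, ..., N, cf. (10)
    s = gamma * (2.0 * W[1:] - total) + d - x
    # s is nondecreasing; the first nonnegative entry yields the index k
    k = int(np.searchsorted(s, 0.0)) + 1           # k in {1, ..., N + 1}
    d_k = d[k - 1] if k <= N else np.inf           # d_{N+1} = +infinity
    # formula (9): min{ d_k, x - gamma * (W_{k-1} - Wbar_k) }
    return min(d_k, x - gamma * (2.0 * W[k - 1] - total))
```

With a single data point this reproduces the shifted soft-thresholding (5); for large γ it returns a weighted median of the data.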
Theorem 2.1. Under the assumptions stated in Algorithm 1, it returns $\operatorname{prox}_{\gamma f}(x)$, the unique solution of (3).

Proof. Let $k$ be the index found in Algorithm 1. First, suppose $2 \le k \le N$. Then $d_{k-1}$ and $d_k$ are both finite, and (8) and (10) imply

$\max \partial \Phi(d_{k-1}) < 0 \le \max \partial \Phi(d_k).$

Owing to the properties of the subdifferential of strongly convex functions, there exists a unique point $z^* \in (d_{k-1}, d_k]$ such that $0 \in \partial \Phi(z^*)$, that is, $z^*$ is the unique minimizer of (3). This point either belongs to $\operatorname{int} I_{k-1} = (d_{k-1}, d_k)$, or else $z^* = d_k$ holds. In the first case, Φ is differentiable at $z^*$, so that $\Phi'(z^*) = \gamma \, (W_{k-1} - \overline{W}_k) + z^* - x = 0$ holds, yielding

$z^* = x - \gamma \, (W_{k-1} - \overline{W}_k) < d_k.$

Otherwise, we have $z^* = d_k$, and $0 \in \partial \Phi(d_k)$ implies

$x - \gamma \, (W_{k-1} - \overline{W}_k) \ge d_k.$

In either case, the unique solution of (3) is determined by (9), which is the quantity returned in Algorithm 1.

It remains to verify the marginal cases $k = 1$ and $k = N + 1$. In case $k = N + 1$, we have $\max \partial \Phi(d_N) < 0$ due to the minimality of $k$. Hence, $z^* = x - \gamma \, W_N > d_N$, which agrees with (9) in view of $d_{N+1} = \infty$. A similar reasoning applies in the case $k = 1$.
We provide an efficient and vectorized open-source Python implementation of Algorithm 1 in [8]. It allows the simultaneous evaluation of (3) for multiple values of x, provided that each instance of (3) has the same number N of data points. The weights and data points as well as the prox parameter γ may vary between instances. The discussion so far assumed positive weights for simplicity, but the case $w_i = 0$ is a simple extension and it is allowed in our implementation. This is convenient in order to simultaneously solve problem instances that differ with respect to the number of data points N. In this case, we can easily pad all instances to the same number of data points using zero weights. In addition, data points are allowed to be duplicate, that is, we only require $d_i \le d_{i+1}$ for $i = 1, \ldots, N-1$. Notice that since the data points are assumed to be sorted, finding the index $k$ in Algorithm 1 is of complexity $\mathcal{O}(\log N)$.
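The batched evaluation and the zero-weight padding described above can be sketched as follows (again with our own naming, not the interface of [8]): many points x are processed at once with a single vectorized search, and zero-weight data points leave the result unchanged.

```python
import numpy as np

def prox_wmae_batch(x, gamma, w, d):
    """Evaluate the prox of z -> gamma * sum_i w_i |z - d_i| at many points x.

    w, d have shape (N,) with d nondecreasing and w >= 0; zero weights
    merely pad the instance and do not affect the result.
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    d = np.asarray(d, dtype=float)
    W = np.concatenate(([0.0], np.cumsum(w)))       # forward cumulative weights
    total = W[-1]
    bounds = gamma * (2.0 * W[1:] - total) + d      # s_k >= 0 iff x <= bounds[k]
    k = np.searchsorted(bounds, x)                  # 0-based index in {0, ..., N}
    d_k = np.append(d, np.inf)[k]                   # d_{N+1} = +infinity
    return np.minimum(d_k, x - gamma * (2.0 * W[k] - total))   # formula (9)
```

Padding an instance with a zero-weight data point reproduces the unpadded result, which is what allows instances of different size N to be stacked.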
3 STRUCTURE OF $\operatorname{prox}_{\gamma f}$
In this section, we briefly discuss the structure of the map $x \mapsto \operatorname{prox}_{\gamma f}(x)$. Since it generalizes the soft-thresholding operation (5), it is not surprising that we obtain a graph that features a staircase pattern. An illustrative plot for certain choices of weights $w_i$, data points $d_i$, and prox parameter γ is shown in Figure 1. Each of the N distinct data points provides one plateau in the graph.

Two alternating regimes occur for $\operatorname{prox}_{\gamma f}(x)$, as x ranges over $\mathbb{R}$. First, when $x - \gamma \, (W_{k-1} - \overline{W}_k) < d_k$ holds, then (9) implies that $\operatorname{prox}_{\gamma f}(x)$ is an affine function of x with slope 1. This is the case for x whose associated index k is constant, that is,

$x \in \left( d_{k-1} + \gamma \, (W_{k-1} - \overline{W}_k), \; d_k + \gamma \, (W_{k-1} - \overline{W}_k) \right).$

As x increases beyond the upper bound of the interval above, $\operatorname{prox}_{\gamma f}(x) = d_k$ enters a constant regime that applies to

$x \in \left[ d_k + \gamma \, (W_{k-1} - \overline{W}_k), \; d_k + \gamma \, (W_k - \overline{W}_{k+1}) \right].$

Notice that the case $N = 1$ reduces to the soft-thresholding map (5) with only one plateau.
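In particular, the constant regime at $d_k$ has width $\gamma \big( (W_k - \overline{W}_{k+1}) - (W_{k-1} - \overline{W}_k) \big) = 2 \gamma w_k$. A small self-contained check of this fact, with arbitrarily chosen weights:

```python
# Cumulative weights for an arbitrary example with N = 3 data points
gamma, w = 0.25, [1.0, 3.0, 2.0]
total = sum(w)
W = [0.0]
for wi in w:
    W.append(W[-1] + wi)                     # W[k] = w_1 + ... + w_k
# width of the k-th plateau: gamma * ((W_k - Wbar_{k+1}) - (W_{k-1} - Wbar_k))
widths = [gamma * ((2 * W[k] - total) - (2 * W[k - 1] - total)) for k in range(1, 4)]
# each entry simplifies to 2 * gamma * w_k
```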
4 APPLICATION TO A DEFLECTION ENERGY MINIMIZATION PROBLEM
In this section, we consider the minimization of the nonsmooth energy of a deflected membrane. We apply an alternating direction method of multipliers (ADMM) scheme that requires the parallel evaluation of the proximal map via Algorithm 1 in each inner loop of the scheme.
Consider a bounded domain $\Omega \subset \mathbb{R}^2$ occupied by a thin membrane. In our model, the energy of this membrane is given in terms of its unknown deflection (displacement) function $z \colon \Omega \to \mathbb{R}$ and it takes the following form:

$E(z) := \frac{c}{2} \int_\Omega |\nabla z|^2 \, \mathrm{d}x - \int_\Omega f \, z \, \mathrm{d}x + \sum_{j=1}^{N} b_j \int_\Omega \max(0, \, z - \psi_j) \, \mathrm{d}x + \frac{\alpha}{2} \int_\Gamma z^2 \, \mathrm{d}s.$ (11)

The first term describes the potential energy of the membrane, where $c > 0$ is the stiffness constant. The second term accounts for an external area force density f, which we assume to be a nonnegative function on Ω. Consequently, the deflection will be nonnegative as well. The specialty of our model is the third term, which can be interpreted as follows. The nonnegative constants $\psi_1 < \psi_2 < \cdots < \psi_N$ serve as thresholds. Once the deflection z at a point in Ω exceeds any of these thresholds $\psi_j$, an additional downward force of size $b_j$ activates, contributing an energy of $b_j$ times the excess deflection. Finally, the fourth term models an additional potential energy due to springs along the boundary Γ of the domain with stiffness constant α. A related problem has been studied in [9] with an emphasis on a posteriori error analysis and adaptive solution.
Notice that the minimization of the convex, nonsmooth energy among all displacements $z \in H^1(\Omega)$ corresponds to the weak form of a partial differential equation (PDE), or rather a variational inequality. In fact, the necessary and sufficient optimality conditions for the minimization of (11) amount to

$c \int_\Omega \nabla z \cdot \nabla (v - z) \, \mathrm{d}x + \alpha \int_\Gamma z \, (v - z) \, \mathrm{d}s + \sum_{j=1}^{N} b_j \int_\Omega \chi_{\{z > \psi_j\}} \, (v - z) \, \mathrm{d}x + \sum_{j=1}^{N} b_j \int_\Omega \chi_{\{z = \psi_j\}} \, \max(0, \, v - z) \, \mathrm{d}x \ge \int_\Omega f \, (v - z) \, \mathrm{d}x \quad \text{for all } v \in H^1(\Omega).$ (12)

Here, $\chi_A$ is the characteristic function of the set A with values in $\{0, 1\}$. Provided that the sets $\{z = \psi_j\}$ are of Lebesgue measure zero, the variational inequality (12) becomes an equation, whose strong form—using integration by parts—can be seen to be

$-c \, \Delta z + \sum_{j=1}^{N} b_j \, \chi_{\{z > \psi_j\}} = f \quad \text{in } \Omega, \qquad c \, \partial_n z + \alpha \, z = 0 \quad \text{on } \Gamma.$

From here, we also learn that the boundary term in the energy (11) leads to boundary conditions of Robin type.
We employ a standard Galerkin approach to numerically discretize the energy (11). To this end, we replace the displacement space $H^1(\Omega)$ by some finite-dimensional subspace of piecewise linear, globally continuous finite element functions defined over a triangulation of Ω. Denoting the associated nodal basis by $\varphi_1, \ldots, \varphi_n$, we define the stiffness matrix

$K := \left( c \int_\Omega \nabla \varphi_i \cdot \nabla \varphi_j \, \mathrm{d}x + \alpha \int_\Gamma \varphi_i \, \varphi_j \, \mathrm{d}s \right)_{i,j=1}^{n}$

as well as the lumped mass matrix

$M := \operatorname{diag} \left( \int_\Omega \varphi_i \, \mathrm{d}x \right)_{i=1}^{n},$

with $M_{ij} = 0$ when $i \ne j$. The reason for using mass lumping to approximate all area integrals in (11) that do not involve derivatives is to translate the pointwise maximum operator into a coefficientwise one. This technique is crucial for an efficient numerical realization and has been used before, for example, in [10, 11].
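The lumped mass matrix can equivalently be obtained from the consistent mass matrix by moving each row sum to the diagonal, which preserves the total mass; a generic sketch (the 1D example matrix is ours, not data from the paper):

```python
import numpy as np

def lump_mass(M):
    """Replace a consistent mass matrix by its lumped (diagonal) counterpart.

    The diagonal entries are the row sums, which preserves the
    total mass 1^T M 1.
    """
    M = np.asarray(M, dtype=float)
    return np.diag(M.sum(axis=1))

# 1D example: consistent mass matrix for two linear elements of length h = 1
Mc = np.array([[2.0, 1.0, 0.0],
               [1.0, 4.0, 1.0],
               [0.0, 1.0, 2.0]]) / 6.0
Ml = lump_mass(Mc)
```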
We continue to use z to denote the vector of nodal values of the finite element discretization of the deflection and thus obtain the discrete energy

$E_h(z) := \frac{1}{2} z^\top K z - (M f)^\top z + \sum_{j=1}^{N} b_j \, \mathbf{1}^\top M \max(0, \, z - \psi_j \mathbf{1}).$ (13)

Here, $\mathbf{1}$ denotes the vector of all ones, f now denotes the vector of nodal values of the force density, and the max operation is understood coefficientwise.
The minimization of the convex but nonsmooth energy (13) is not straightforward. Our method of choice here is an ADMM scheme, which moves the nonsmooth terms into a separate subproblem that can then be efficiently solved using Algorithm 1. We refer the reader to [12] and the references therein for an overview of ADMM.
To obtain a subproblem of type (3), we use the identity $\max(0, s) = \frac{1}{2}(s + |s|)$ and arrive at

$E_h(z) = \frac{1}{2} z^\top K z - (M \tilde{f})^\top z + \frac{1}{2} \sum_{j=1}^{N} b_j \, \mathbf{1}^\top M \, |z - \psi_j \mathbf{1}| + C,$ (14)

where $C \in \mathbb{R}$ is a constant and $\tilde{f} := f - \frac{1}{2} \sum_{j=1}^{N} b_j \mathbf{1}$ is a modification of the force vector. Moreover, the absolute value is understood coefficientwise. Dropping the constant C, introducing a second variable y, the constraint $y = z$, and associated (scaled) Lagrange multiplier μ gives rise to the augmented Lagrangian

$L_\rho(z, y, \mu) := \frac{1}{2} z^\top K z - (M \tilde{f})^\top z + \frac{1}{2} \sum_{j=1}^{N} b_j \, \mathbf{1}^\top M \, |y - \psi_j \mathbf{1}| + \frac{\rho}{2} \, \|z - y + \mu\|_M^2 - \frac{\rho}{2} \, \|\mu\|_M^2,$ (15)

where $\rho > 0$ is a penalty parameter and $\|v\|_M^2 := v^\top M v$ is the squared norm induced by the positive diagonal matrix M.
An ADMM computes iterates $z^{(k)}, y^{(k)}, \mu^{(k)}$, $k = 0, 1, 2, \ldots$, according to the following update scheme:

$z^{(k+1)} := \operatorname*{arg\,min}_z \; \frac{1}{2} z^\top K z - (M \tilde{f})^\top z + \frac{\rho}{2} \, \|z - y^{(k)} + \mu^{(k)}\|_M^2,$ (16a)
$y^{(k+1)} := \operatorname*{arg\,min}_y \; \frac{1}{2} \sum_{j=1}^{N} b_j \, \mathbf{1}^\top M \, |y - \psi_j \mathbf{1}| + \frac{\rho}{2} \, \|z^{(k+1)} - y + \mu^{(k)}\|_M^2,$ (16b)
$\mu^{(k+1)} := \mu^{(k)} + z^{(k+1)} - y^{(k+1)}.$ (16c)

For the problem at hand, (16a) amounts to the solution of the discretized Poisson-like problem

$(K + \rho M) \, z^{(k+1)} = M \tilde{f} + \rho \, M \, (y^{(k)} - \mu^{(k)}).$
The y-update (16b) on the other hand can be cast as the problem

$\operatorname*{min}_y \; \frac{1}{2} \sum_{j=1}^{N} b_j \, \mathbf{1}^\top M \, |y - \psi_j \mathbf{1}| + \frac{\rho}{2} \, \|y - (z^{(k+1)} + \mu^{(k)})\|_M^2.$ (17)

Owing to the diagonal structure of the mass matrix, this problem fully decouples, the mass matrix cancels, and we obtain

$\operatorname*{min}_{y_i} \; \frac{1}{2} \sum_{j=1}^{N} b_j \, |y_i - \psi_j| + \frac{\rho}{2} \left( y_i - (z_i^{(k+1)} + \mu_i^{(k)}) \right)^2$ (18)

for each component $y_i$ of y. This problem fits into the pattern of (3) with weights $w_j = b_j / 2$, data $d_j = \psi_j$, and prox parameter $\gamma = 1/\rho$, and thus, can be solved efficiently and simultaneously for all components by Algorithm 1.
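A full ADMM sweep (16a)-(16c) might then look as follows; this is a sketch under our own assumptions, with K, M, f_tilde, b, psi, and rho as placeholder arguments, and with a single threshold (N = 1) so that the y-update reduces to the shifted soft-thresholding (5).

```python
import numpy as np

def admm_sketch(K, M, f_tilde, b, psi, rho=1.0, tol=1e-10, max_iter=1000):
    """One possible realization of the ADMM iteration (16a)-(16c).

    Single threshold (N = 1) for brevity: the y-update (16b) is the shifted
    soft-thresholding (5) with gamma = 1/rho and weight b/2.
    """
    n = K.shape[0]
    z = np.zeros(n)
    y = np.zeros(n)
    mu = np.zeros(n)
    for _ in range(max_iter):
        y_old = y
        # (16a): (K + rho M) z = M f_tilde + rho M (y - mu)
        z = np.linalg.solve(K + rho * M, M @ f_tilde + rho * (M @ (y - mu)))
        # (16b): componentwise shifted soft-thresholding, cf. (5) and (18)
        v = z + mu
        tau = 0.5 * b / rho
        y = psi + np.sign(v - psi) * np.maximum(np.abs(v - psi) - tau, 0.0)
        # (16c): scaled dual update
        mu = mu + z - y
        # stop when both primal and dual residuals are small
        if np.linalg.norm(z - y) < tol and np.linalg.norm(y - y_old) < tol:
            break
    return z, y, mu
```

For K = M = I the scheme converges to the componentwise soft-thresholding of f_tilde, which provides a quick sanity check of the update formulas.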
To validate the method, we consider two simple, convex domains $\Omega_1$ and $\Omega_2$. We use meshes with 2521 vertices for Ω1 and 2637 vertices for Ω2. We choose the stiffness constant c, the force density f, the boundary stiffness constant α, the additional downward forces $b_j$, and the threshold deflections $\psi_j$ to be constant over the domains in each case, as

(19)
We apply the ADMM scheme (16). The iterations are terminated as soon as the stopping criterion

(20)

is satisfied, where we recall that $\|\cdot\|_M$ denotes the norm induced by the lumped mass matrix M. With this setup, the ADMM algorithm required 186 iterations for the problem on Ω1 and 278 iterations in case of Ω2. The resulting solutions z are shown in Figure 2.
FIGURE 2. Resulting deflections z that minimize the discrete energy functional defined in (13) over two different domains, with data given in (19). Deflections were computed with the ADMM scheme (16) and stopping criterion (20). The subdomains are colored according to the number of active max terms $\max(0, \, z - \psi_j \mathbf{1})$ in (13).
ACKNOWLEDGMENTS
This work was supported by DFG grants HE 6077/10–2 and SCHM 3248/2–2 within the Priority Program SPP 1962 (Nonsmooth and Complementarity-based Distributed Parameter Systems: Simulation and Hierarchical Optimization), which is gratefully acknowledged.
Open access funding enabled and organized by Projekt DEAL.
REFERENCES
[1] Chambolle, A., & Pock, T. (2011). A first-order primal-dual algorithm for convex problems with applications to imaging. Journal of Mathematical Imaging and Vision, 40(1), 120–145. https://doi.org/10.1007/s10851-010-0251-1
[2] Combettes, P. L., & Pesquet, J. C. (2011). Proximal splitting methods in signal processing. In H. Bauschke, R. Burachik, P. Combettes, V. Elser, D. Luke, H. Wolkowicz (Eds.), Fixed-point algorithms for inverse problems in science and engineering, Springer Optimization and Its Applications (Vol. 49, pp. 185–212). Springer. https://doi.org/10.1007/978-1-4419-9569-8_10
[4] Bauschke, H. H., & Combettes, P. L. (2011). Convex analysis and monotone operator theory in Hilbert spaces. CMS Books in Mathematics/Ouvrages de Mathématiques de la SMC. Springer. https://doi.org/10.1007/978-1-4419-9467-7
[5] Daubechies, I., Defrise, M., & De Mol, C. (2004). An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Communications on Pure and Applied Mathematics, 57(11), 1413–1457. https://doi.org/10.1002/cpa.20042
[7] Li, Y., & Osher, S. (2009). A new median formula with applications to PDE based denoising. Communications in Mathematical Sciences, 7(3), 741–753. https://doi.org/10.4310/cms.2009.v7.n3.a11
[8] Baumgärtner, L., Herzog, R., Schmidt, S., & Weiß, M. (2023). The proximal map of the weighted mean absolute error. https://doi.org/10.5281/zenodo.7620815
[9] Bostan, V., Han, W., & Reddy, B. D. (2005). A posteriori error estimation and adaptive solution of elliptic variational inequalities of the second kind. Applied Numerical Mathematics, 52(1), 13–38. https://doi.org/10.1016/j.apnum.2004.06.012
[10] Wachsmuth, G., & Wachsmuth, D. (2011). Convergence and regularization results for optimal control problems with sparsity functional. ESAIM: Control, Optimisation and Calculus of Variations, 17(3), 858–886. https://doi.org/10.1051/cocv/2010027
[11] Casas, E., Herzog, R., & Wachsmuth, G. (2012). Approximation of sparse controls in semilinear equations by piecewise linear functions. Numerische Mathematik, 122(4), 645–669. https://doi.org/10.1007/s00211-012-0475-7
[12] Boyd, S., Parikh, N., Chu, E., Peleato, B., & Eckstein, J. (2010). Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 3(1), 1–122. https://doi.org/10.1561/2200000016