Volume 2013, Issue 1 680768

Research Article

Open Access

Convergence Analysis of Alternating Direction Method of Multipliers for a Class of Separable Convex Programming

Zehui Jia

School of Mathematical Science and Key Laboratory for NSLSCS of Jiangsu Province, Nanjing Normal University, Nanjing, Jiangsu 210023, China nnu.cn

Search for more papers by this author

Ke Guo,

Ke Guo

College of Mathematics and Information, China West Normal University, Nanchong, Sichuan 637009, China cwnu.edu.cn

Search for more papers by this author

Xingju Cai,

Corresponding Author

Xingju Cai

[email protected]

School of Mathematical Science and Key Laboratory for NSLSCS of Jiangsu Province, Nanjing Normal University, Nanjing, Jiangsu 210023, China nnu.cn

Search for more papers by this author

Zehui Jia,

Zehui Jia

School of Mathematical Science and Key Laboratory for NSLSCS of Jiangsu Province, Nanjing Normal University, Nanjing, Jiangsu 210023, China nnu.cn

Search for more papers by this author

Ke Guo,

Ke Guo

College of Mathematics and Information, China West Normal University, Nanchong, Sichuan 637009, China cwnu.edu.cn

Search for more papers by this author

Xingju Cai,

Corresponding Author

Xingju Cai

[email protected]

School of Mathematical Science and Key Laboratory for NSLSCS of Jiangsu Province, Nanjing Normal University, Nanjing, Jiangsu 210023, China nnu.cn

Search for more papers by this author

First published: 24 September 2013

https://doi.org/10.1155/2013/680768

Citations: 1

Academic Editor: Xu Minghua

Share a link

Email
Wechat
Bluesky

Abstract

The purpose of this paper is extending the convergence analysis of Han and Yuan (2012) for alternating direction method of multipliers (ADMM) from the strongly convex to a more general case. Under the assumption that the individual functions are composites of strongly convex functions and linear functions, we prove that the classical ADMM for separable convex programming with two blocks can be extended to the case with more than three blocks. The problems, although still very special, arise naturally from some important applications, for example, route-based traffic assignment problems.

1. Introduction

In this paper, we consider the convex programming with separable functions:

(1)

where

(i = 1,2, …, m) are closed proper convex functions (not necessarily smooth);

(i = 1,2, …, m);

(i = 1,2, …, m) are closed convex sets; b ∈ ℛ^l and

. Throughout the paper, we assume that the solution set of (1) is nonempty.

For the special case of (1) with m = 2,

(2)

the problem has been studied extensively. Among lots of numerical methods, one of the most popular methods is the alternating direction method of multipliers (ADMM) which was presented originally in [1, 2]. The iterative scheme of ADMM for (2) is as follows:

(3)

where λ^k is Lagrange multiplier associated with the linear constraints and β > 0 is the penalty parameter. The convergence of ADMM for (2) was also established under the condition that the involved functions are convex and the constrained sets are convex too.

While there are diversified applications whose objective function is separable into m ≥ 3 individual convex functions without coupled variables, such as traffic problems, the problem of recovering the low-rank, sparse components of matrices from incomplete and noisy observation in [3], the constrained total-variation image restoration and reconstruction problem in [4, 5], and the minimal surface PDE problem in [6], it is thus natural to extend ADMM from 2 blocks to m blocks, resulting in the iterative scheme:

(4)

Unfortunately, the convergence of the natural extension is still open under convex assumption, and the recent convergence results [7] are under the assumption that all the functions involved in the objective functions are strongly convex. This lack of convergence has inspired some ADM-based methods, for example, prediction-correction type method [3, 8–11], that is, the iterate is regarded as a prediction, and the next iterate is a correction for it. However, the numerical results show that the algorithm (4) always performs better than these variants. Recently, Han and Yuan [7] show that the global convergence of the extension of ADMM for m ≥ 3 is valid if the involved functions are further assumed to be strongly convex. This result does not answer the open problem regarding the convergence of the extension of ADMM under the convex assumption, but it makes a key progress towards this objective.

In this paper, we consider the separable convex optimization problem (1) where each individual function f_i is the combination of a strongly convex function g_i and a linear transform B_i. That is, (1) takes the following form:

(5)

where

(i = 1,2, …, m) are closed proper strongly convex function with the modulus μ_i (not necessarily smooth);

(i = 1,2, …, m);

(i = 1,2, …, m) are closed convex sets; b ∈ ℛ^l and

;

(i = 1,2, …, m), where B_i may not have full column rank (if B_i has full column rank, the composite function is strongly convex and reduces to the case considered in [7]). Note that although (5) is very special, it arises frequently from many applications. One example is under the route-based traffic assignment problem [12], where g_i is the link traffic cost, B_i is the link-path incidence matrix, and x is the path follow vector.

In the following, we abuse a little the notation and still write g_i with f_i; that is, the problem under consideration is

(6)

where

(i = 1,2, …, m) are closed proper strongly convex function with the modulus μ_i (not necessarily smooth).

The rest of the paper is organized as follows. In the next section, we list some necessary preliminary results that will be used in the rest of the paper. We then describe the algorithm formally and analyze its global convergence under reasonable conditions in Section 3. We complete the paper with some conclusions in Section 4.

2. Preliminaries

In this section, we summarize some basic concepts and their properties that will be useful for further discussion.

Let ∥·∥_p denote the standard definition of the l^p-norm, and particularly, let ∥·∥ = ∥·∥₂ denote the Euclidean norm. For a symmetric and positive definite matrix G, we denote ∥·∥_G the G-norm, that is, . If G is the product of a positive parameter β and the identity matrix I, that is, G = βI, we use the simpler notation: ∥·∥_G = ∥·∥_β.

Let f : ℛⁿ → ℛ ∪ {+∞}. If the domain of f denoted by dom f = {x ∈ ℛⁿ∣f(x)<+∞} is not empty, then f is said to be proper. If for any x ∈ ℛⁿ and y ∈ ℛⁿ, we have

(7)

then f is said to be convex. Furthermore, f is said to be strongly convex with the modulus μ > 0 if and only if

(8)

A set-valued operator T defined on ℛⁿ is said to be monotone if and only if

(9)

and T is said to be strongly monotone with modulus μ > 0 if and only if

(10)

Let Γ₀(ℛⁿ) denote the set of closed proper convex functions from ℛⁿ to ℛ ∪ {+∞}. For any f ∈ Γ₀(ℛⁿ), the subdifferential of f which is the set-valued operator, defined by

(11)

is monotone. Moreover, if f is strongly convex function with the modulus μ, ∂f is strongly monotone with the modulus μ.

Let F be a mapping from a set Ω ⊂ ℛⁿ → ℛⁿ. Then F is said to be co-coercive on Ω with modulus γ > 0, if

(12)

Throughout the paper, we make the following assumptions.

Assumption 1. (i) n_i∥B_ix_i∥ ≥ ∥A_i∥∥x_i∥, , i ∈ {1,2, …, m}; (ii) the solution set of (1) is nonempty.

Remark 2. Assumption 1 is a little restrictive. However, some problems can satisfy it. A remarkable one is the following route-based traffic assignment problem.

Consider a transportation network G(𝒩, E), where 𝒩 is the set of nodes. We denote the set of links by 𝒜, and the number of the element of 𝒜 by N_𝒜, respectively. Let RS denote the set of origin-destination (O-D) pairs. For an O-D pair rs ∈ RS, let q^rs be its traffic demand; let P^rs be the set of routes connecting rs, and p ∈ P^rs; 𝒩^rs denotes the number of the routes connecting rs; let be the route flow on p. The feasible route flow vector h = (p ∈ P^rs∣rs ∈ RS) is thus given by

(13)

Define E as the link-route incidence matrix such that

(14)

Then, link flow f_a can be written as

(15)

By denoting the link cost function as C_a(f) and for the additive case, the route cost function as C_p(h), they can be related by

(16)

The user equilibrium traffic assignment problem can be formulated as a VI: find f^* ∈ F such that

(17)

or equivalently, find h^* ∈ H such that

(18)

where C = {C_a} is the vector of the link cost function.

In general, it is easy to show that e is a row of E and E is not a full column rank (if E is, then the above variational inequality is strongly monotone).

For simplicity, in the following, we only consider the case for m = 3. Notice that for m ≥ 3, it can be proved similarly following the processing of m = 3.

3. The Method

In this section, we consider the following convex minimization problem with linear constraint, where the objective function is in the form of the sum of three individual functions without coupled variable:

(19)

where

(i = 1,2, 3) are closed proper strongly convex function with the modulus μ_i (not necessarily smooth);

(i = 1,2, 3),

(i = 1,2, 3);

(i = 1,2, 3) are closed convex sets; b ∈ ℛ^l and

The iterative scheme of ADMM for problem (19) is as follows:

(20)

where λ^k is the Lagrangian multiplier associated with the linear constraints and β > 0 is the penalty parameter.

4. Convergence

In this section, we prove the convergence of the extended ADMM for problem (19). As the assumptions aforementioned, by invoking the first-order necessary and sufficient condition for convex programming, we easily see that the problem (19) under the condition is characterized by the following variational inequality (VI): find u^* ∈ 𝒰 and

such that

(21)

where

(22)

We denote the VI (21)-(22) by MVI(𝒰, Q).

Similarly, in [7], we propose an easily implementable stopping criterion for executing (20):

(23)

and its rationale can be seen in the following lemma.

Lemma 3 (see [7].)If and (i = 1,2, 3), then is a solution of MVI(𝒰, Q).

Lemma 3 implies that the iterate is a solution of MVI(𝒰, Q) when the inequality (23) holds with ϵ = 0. Some techniques of establishing the error bounds in [13] can help us analyze how precisely the iterate satisfies the optimality conditions when the proposed stopping criterion is satisfied with a tolerance ϵ > 0.

Lemma 4. Let be the solution of the problem (19), and let λ^* be a corresponding Lagrange multiplier associated with the linear constraint. Then, the sequence generated by (20) satisfies

(24)

Proof. By invoking the first-order optimality condition for the related subproblem in (20), for any x_i ∈ 𝒳_i, i = 1,2, 3, we get

(25)

Setting

(i = 1,2, 3) in (25), we have

(26)

On the other hand, setting

in (21), it follows that

(27)

Adding (26) and (27), we obtain

(28)

With the rearrangement of the above inequalities, we derive that

(29)

Adding the above inequalities (29), we have

(30)

The proof is complete.

Hereafter, we define a matrix which will make the notation of proof more succinct. More specifically, let

(31)

Obviously, M is a positive semidefinite matrix, only for analysis convenience; we denote

(32)

Lemma 5. Let be a solution of MVI(𝒰, Q), and let the sequence be generated by (20). Then, one has

(33)

Proof. From (20) and Lemma 4, we have

(34)

Since

(35)

and

, we can get

(36)

Using Cauchy-Schwarz inequality, we have

(37)

Substituting (36) and (37) into (34), we get

(38)

Since f_i is strongly convex, from the strong monotonicity of the subdifferential mapping ∂f_i (with the modulus μ_i), then we have

(39)

where

, for any i ∈ {1,2, 3}.

By using the notion of , from (38) we have

(40)

The proof is complete.

Theorem 6. Under Assumption 1, for any

(41)

the sequence

generated by (20) converges to a solution of MVI(𝒰, Q).

Proof. From Lemma 5, we have

(42)

where

(43)

From Assumption 1, it follows that

(44)

Consequently,

(45)

From (45), we have

(46)

which means that the generated sequence {u^k} is bounded.

Furthermore, it follows that

(47)

which means that

(48)

Therefore, we have

(49)

Since ∥A_i∥ is nonzero and bounded, from (48) we have

(50)

Since {u^k} is bounded, {λ^k} has at least one cluster point, say

. Let

be the corresponding subsequence that converges to

. Taking a limit along this subsequence in (25) and (49), we obtain

(51)

which follows that

is an optimal Lagrange multiplier. Since λ^* is arbitrary, we can set

in (46) and conclude that the whole generated sequence converges to a solution of MVI(𝒰, Q).

5. Conclusions

In this paper, we extend the convergence analysis of the ADMM for the separable convex optimization problem with strongly convex functions to the case in which the individual functions are composites of strongly convex functions with a linear transform. Under further assumptions, we established the global convergence of the algorithm.

It should be admitted that although some problems arising from applications such as traffic assignment fall into our analysis, the problems considered here are too special. Thus, it is far away to solve the open problem of convergence of the ADMM with more than three blocks.

Acknowledgments

Xingju Cai was supported by the National Natural Science Foundation of China (NSFC) Grants nos. 11071122 and 11171159 and by the Doctoral Fund of Ministry of Education of China no. 20103207110002.

References

1 Gabay D. and Mercier B., A dual algorithm for the solution of nonlinear variational problems via finite element approximation, Computers and Mathematics with Applications. (1976) 2, no. 1, 17–40, 2-s2.0-0002211517, https://doi.org/10.1016/0898-1221(76)90003-1, ZBL0352.65034.
10.1016/0898-1221(76)90003-1
Google Scholar
2 Gabay D., M. Fortin and R. Glowinski, Applications of the method of multipliers to variational inequalities, Augmented Lagrangian Methods: Applications to Numerical Solution of Boundary-Value Problems, 1983, North-Holland Publisher, Amsterdam, The Netherland, 299–331.
10.1016/S0168-2024(08)70034-1
Google Scholar
3 Tao M. and Yuan X., Recovering low-rank and sparse components of matrices from incomplete and noisy observations, SIAM Journal on Optimization. (2011) 21, no. 1, 57–81, 2-s2.0-79957510064, https://doi.org/10.1137/100781894, ZBL1218.90115.
10.1137/100781894
Web of Science® Google Scholar
4 Ng M. K., Weiss P., and Yuan X., Solving constrained total-variation image restoration and reconstruction problems via alternating direction methods, SIAM Journal on Scientific Computing. (2010) 32, no. 5, 2710–2736, 2-s2.0-78149331304, https://doi.org/10.1137/090774823, ZBL1217.65071.
10.1137/090774823
Web of Science® Google Scholar
5 Rudin L. I., Osher S., and Fatemi E., Nonlinear total variation based noise removal algorithms, Physica D. (1992) 60, no. 1–4, 259–268, 2-s2.0-44049111982, https://doi.org/10.1016/0167-2789(92)90242-F, ZBL0780.49028.
10.1016/0167-2789(92)90242-F
Web of Science® Google Scholar
6 Wen Z., Goldfarb D., and Yin W., Alternating direction augmented Lagrangian methods for semidefinite programming, Mathematical Programming Computation. (2010) 2, no. 3-4, 203–230, 2-s2.0-77956945296, https://doi.org/10.1007/s12532-010-0017-1, ZBL1206.90088.
10.1007/s12532-010-0017-1
Google Scholar
7 Han D. and Yuan X., A note on the alternating direction method of multipliers, Journal of Optimization Theory and Applications. (2012) 155, 227–238, 2-s2.0-84856787939, https://doi.org/10.1007/s10957-012-0003-z, ZBL1255.90093.
10.1007/s10957-012-0003-z
Web of Science® Google Scholar
8 Han D. R., Yuan X. M., Zhang W. X., and Cai X. J., An ADM-based splitting method for separable convex programming, Computational Optimization and Applications. (2013) 54, 343–369, https://doi.org/10.1007/s10589-012-9510-y.
10.1007/s10589-012-9510-y
Web of Science® Google Scholar
9 He B. S., Tao M., and Yuan X. M., Alternating direction method with Gaussian back substitution for separable convex programming, SIAM Journal on Optimization. (2012) 22, 313–340, https://doi.org/10.1137/110822347.
10.1137/110822347
Web of Science® Google Scholar
10 He B. S., Tao M., Xu M. H., and Yuan X. M., Alternating directions based contraction method for generally separable linearly constrained convex programming problems, Optimization. (2013) 62, 573–596, https://doi.org/10.1080/02331934.2011.611885.
10.1080/02331934.2011.611885
Web of Science® Google Scholar
11 He B. S., Tao M., and Yuan X. M., A splitting method for separable convex programming, IMA Journal of Numerical Analysis. In presshttps://doi.org/10.1080/02331934.2011.611885.
10.1080/02331934.2011.611885
Google Scholar
12 Han D. and Lo H. K., Solving non-additive traffic assignment problems: a descent method for co-coercive variational inequalities, European Journal of Operational Research. (2012) 159, no. 3, 529–544.
10.1016/S0377-2217(03)00423-5
Google Scholar
13 Facchinei F. and Pang J. S., Finite-Dimensional Variational Inequalities and Complementary Problems. Volume I and II, 2003, Springer, New York, NY, USA.
Google Scholar

All articles

Convergence Analysis of Alternating Direction Method of Multipliers for a Class of Separable Convex Programming

Abstract

1. Introduction

2. Preliminaries

3. The Method

4. Convergence

5. Conclusions

Acknowledgments

References

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Convergence Analysis of Alternating Direction Method of Multipliers for a Class of Separable Convex Programming

Abstract

1. Introduction

2. Preliminaries

3. The Method

4. Convergence

5. Conclusions

Acknowledgments

References

References

Related

Information