Volume 2012, Issue 1 410137
Research Article
Open Access

Strong Convergence of a Projected Gradient Method

Shunhou Fan

Shunhou Fan

Department of Mathematics, Tianjin Polytechnic University, Tianjin 300387, China tjpu.edu.cn

Search for more papers by this author
Yonghong Yao

Corresponding Author

Yonghong Yao

Department of Mathematics, Tianjin Polytechnic University, Tianjin 300387, China tjpu.edu.cn

Search for more papers by this author
First published: 05 April 2012
Academic Editor: Yeong-Cheng Liou

Abstract

The projected-gradient method is a powerful tool for solving constrained convex optimization problems and has extensively been studied. In the present paper, a projected-gradient method is presented for solving the minimization problem, and the strong convergence analysis of the suggested gradient projection method is given.

1. Introduction

In the present paper, our main purpose is to solve the following minimization problem:
()
where C is a nonempty closed and convex subset of a real Hilbert space H, f : HR is a real-valued convex function.
Now it is well known that the projected-gradient method is a powerful tool for solving the above minimization problem and has extensively been studied. See, for instance, [18]. The classic algorithm is the following form of the projected-gradient method:
()
where γ > 0 is an any constant, PC is the nearest point projection from H onto C, and ∇f denotes the gradient of f.
It is known [1] that if f has a Lipschitz continuous and strongly monotone gradient, then the sequence {xn} generated by (1.2) can be strongly convergent to a minimizer of f in C. If the gradient of f is only assumed to be Lipschitz continuous, then {xn} can only be weakly convergent if H is infinite dimensional. An interesting problem is how to appropriately modify the projected gradient algorithm so as to have strong convergence? For this purpose, recently, Xu [9] introduced the following algorithm:
()
Under some additional assumptions, Xu [9] proved that the sequence {xn} converges strongly to a minimizer of (1.1). At the same time, Xu [9] also suggested a regularized method:
()
Consequently, Yao et al. [10] proved the strong convergence of the regularized method (1.4) under some weaker conditions.

Motivated by the above works, in this paper we will further construct a new projected gradient method for solving the minimization problem (1.1). It should be pointed out that our method also has strong convergence under some mild conditions.

2. Preliminaries

Let C be a nonempty closed convex subset of a real Hilbert space H. A bounded linear operator B is said to be strongly positive on H if there exists a constant α > 0 such that
()
A mapping T : CC is called nonexpansive if
()
A mapping T : CC is said to be an averaged mapping, if and only if it can be written as the average of the identity I and a nonexpansive mapping; that is,
()
where α ∈ (0,1) is a constant and R : CC is a nonexpansive mappings. In this case, we call T is α-averaged.
A mapping T : CC is said to be ν-inverse strongly monotone (ν-ism), if and only if
()
The following proposition is well known, which is useful for the next section.

Proposition 2.1 (See [9]). (1) The composite of finitely many averaged mappings is averaged. That is, if each of the mappings is averaged, then so is the composite T1, …, TN. In particular, if T1 is α1-averaged and T2 is α2-averaged, where α1, α2 ∈ (0,1), then the composite T1T2 is α-averaged, where α = α1α2α1α2.

(2) T is ν-ism, then for γ > 0, γT is (ν/γ)-ism.

Recall that the (nearest point or metric) projection from H onto C, denoted by PC, assigns, to each xH, the unique point PC(x) ∈ C with the property
()
We use S to denote the solution set of (1.1). Assume that (1.1) is consistent, that is, S. If f is Frechet differentiable, then x*C solves (1.1) if and only if x*C satisfies the following optimality condition:
()
where ∇f denotes the gradient of f. Observe that (2.6) can be rewritten as the following VI
()
(Note that the VI has been extensively studied in the literature, see, for instance [1125].) This shows that the minimization (1.1) is equivalent to the fixed point problem
()
where γ > 0 is an any constant. This relationship is very important for constructing our method.
Next we adopt the following notation:
  • (i)

    xnx means that xn converges strongly to x;

  • (ii)

    xnx means that xn converges weakly to x;

  • (iii)

    Fix (T): = {x : Tx = x} is the fixed points set of T.

Lemma 2.2 (See [26]). Let {xn} and {yn} be bounded sequences in a Banach space X and let {βn} be a sequence in [0,1] with

()
Suppose
()
for all n ≥ 0 and
()
Then, lim nynxn∥ = 0.

Lemma 2.3 (See [27] (demiclosedness principle)). Let C be a closed and convex subset of a Hilbert space H and let T : CC be a nonexpansive mapping with Fix (T) ≠ . If {xn} is a sequence in C weakly converging to x and if {(IT)xn} converges strongly to y, then

()
In particular, if y = 0, then x ∈ Fix (T).

Lemma 2.4 (See [28]). Assume {an} is a sequence of nonnegative real numbers such that

()
where {γn} is a sequence in (0,1) and {δn} is a sequence such that
  • (1)

    ;

  • (2)

    limsup nδn/γn ≤ 0 or .

Then, lim nan = 0.

3. Main Results

Let C be a closed convex subset of a real Hilbert space H. Let f : CR be a real-valued Frechet differentiable convex function with the gradient ∇f. Let A : CH be a ρ-contraction. Let B : HH be a self-adjoint, strongly positive bounded linear operator with coefficient α > 0. First, we present our algorithm for solving (1.1). Throughout, we assume S.

Algorithm 3.1. For given x0C, compute the sequence {xn} iteratively by

()
where σ > 0, γ > 0 are two constants and the real number sequence {θn}⊂[0,1].

Remark 3.2. In (3.1), we use two projections. Now, it is well-known that the advantage of projections, which makes them successful in real-word applications, is computational.

Next, we show the convergence analysis of this Algorithm 3.1.

Theorem 3.3. Assume that the gradient ∇f is L-Lipschitzian and σρ < α. Let {xn} be a sequence generated by (3.1), where γ ∈ (0,2/L) is a constant and the sequence {θn} satisfies the conditions: (i) lim nθn = 0 and (ii) . Then {xn} converges to a minimizer of (1.1) which solves the following variational inequality:

()

By Algorithm 3.1 involved in the projection, we will use the properties of the metric projection for proving Theorem 3.3. For convenience, we list the properties of the projection as follows.

Proposition 3.4. It is well known that the metric projection PC of H onto C has the following basic properties:

  • (i)

    PC(x) − PC(y)∥≤∥xy∥, for all x, yH;

  • (ii)

    xy, PC(x) − PC(y)〉≥∥PC(x) − PC(y)∥2, for every x, yH;

  • (iii)

    xPC(x), yPC(x)〉≤0, for all xH, yC.

The Proof of Theorem 3.3 Let x*S. First, from (2.8), we note that PC(Iγf)x* = x*. By (3.1), we have

()
Thus, by induction, we obtain
()

Note that the Lipschitz condition implies that the gradient ∇f is (1/L)-inverse strongly monotone (ism), which then implies that γf is (1/γL)-ism. So, Iγf is (γL/2)-averaged. Now since the projection PC is (1/2)-averaged, we see that PC(Iγf) is ((2 + γL)/4)-averaged. Hence we have that

()
where R, T are nonexpansive and β = (2 + γL)/4 ∈ (0,1). Then we can rewrite (3.1) as
()
where
()
Set zn = (1 − β)xn + βTxn for all n. Since {xn} is bounded, we deduce {A(xn)}, {B(xn)}, and {Txn} are all bounded. Hence, there exists a constant M > 0 such that
()
Thus,
()
It follows that
()
This together with Lemma 2.2 implies that
()
So,
()
Since
()
we deduce
()
Next we prove
()
where x* is the unique solution of VI (3.2).

Indeed, we can choose a subsequence of {xn} such that

()
Since is bounded, there exists a subsequence of which converges weakly to a point . Without loss of generality, we may assume that converges weakly to . Since γ ∈ (0, 2/L), PC(Iγf) is nonexpansive. Thus, from (3.14) and Lemma 2.3, we have . Therefore,
()
Finally, we show . By using the property of the projection PC, we have
()
It follows that
()
It is obvious that . Then we can apply Lemma 2.4 to the last inequality to conclude that . The proof is completed.

In (3.1), if we take A = 0 and B = I, then (3.1) reduces to the following.

Algorithm 3.5. For given x0C, compute the sequence {xn} iteratively by

()
where σ > 0, γ > 0 are two constants and the real number sequence {θn}⊂[0,1].

From Theorem 3.3, we have the following result.

Theorem 3.6. Assume that the gradient ∇f is L-Lipschitzian and σρ < α. Let {xn} be a sequence generated by (3.20), where γ ∈ (0, 2/L) is a constant and the sequences {θn} satisfies the conditions: (i) lim nθn = 0 and (ii) . Then {xn} converges to a minimizer of (1.1) which is the minimum norm element in S.

Proof. As a consequence of Theorem 3.3, we obtain that the sequence {xn} generated by (3.20) converges strongly to which satisfies

()
This implies
()
Thus,
()
That is, is the minimum norm element in S. This completes the proof.

Acknowledgment

Y. Yao was supported in part by Colleges and Universities Science and Technology Development Foundation (20091003) of Tianjin, NSFC 11071279 and NSFC 71161001-G0105.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.