Volume 2013, Issue 1 593549
Research Article
Open Access

Solving Optimization Problems on Hermitian Matrix Functions with Applications

Xiang Zhang

Corresponding Author

Xiang Zhang

Department of Computer Science and Information, Guizhou University, Guiyang 550025, China gzu.edu.cn

Search for more papers by this author
Shu-Wen Xiang

Shu-Wen Xiang

Department of Computer Science and Information, Guizhou University, Guiyang 550025, China gzu.edu.cn

Search for more papers by this author
First published: 21 April 2013
Academic Editor: K. Sivakumar

Abstract

We consider the extremal inertias and ranks of the matrix expressions f(X, Y) = A3B3X − (B3X) *C3YD3 − (C3YD3) *, where   B3,   C3, and D3 are known matrices and Y and X are the solutions to the matrix equations A1Y = C1, YB1 = D1, and A2X = C2, respectively. As applications, we present necessary and sufficient condition for the previous matrix function f(X, Y) to be positive (negative), non-negative (positive) definite or nonsingular. We also characterize the relations between the Hermitian part of the solutions of the above-mentioned matrix equations. Furthermore, we establish necessary and sufficient conditions for the solvability of the system of matrix equations A1Y = C1, YB1 = D1, A2X = C2, and B3X + (B3X) * + C3YD3 + (C3YD3) * = A3, and give an expression of the general solution to the above-mentioned system when it is solvable.

1. Introduction

Throughout, we denote the field of complex numbers by , the set of all m × n matrices over by m×n, and the set of all m × m Hermitian matrices by . The symbols A* and (A) stand for the conjugate transpose, the column space of a complex matrix A respectively. In denotes the n × n identity matrix. The Moore-Penrose inverse [1] A of A, is the unique solution X to the four matrix equations:
(1)
Moreover, LA and RA stand for the projectors LA = IAA,   RA = IAA induced by A. It is well known that the eigenvalues of a Hermitian matrix An×n are real, and the inertia of A is defined to be the triplet
(2)
where i+(A), i(A), and i0(A) stand for the numbers of positive, negative, and zero eigenvalues of A, respectively. The symbols i+(A) and i(A) are called the positive index and the negative index of inertia, respectively. For two Hermitian matrices A and B of the same sizes, we say AB (AB) in the Löwner partial ordering if AB is positive (negative) semidefinite. The Hermitian part of X is defined as H(X) = X + X*. We will say that X is Re-nnd (Re-nonnegative semidefinite) if H(X) ≥ 0,   X is Re-pd (Re-positive definite) if H(X) > 0, and X is Re-ns if H(X) is nonsingular.
It is well known that investigation on the solvability conditions and the general solution to linear matrix equations is very active (e.g., [29]). In 1999, Braden [10] gave the general solution to
(3)
In 2007, Djordjević [11] considered the explicit solution to (3) for linear bounded operators on Hilbert spaces. Moreover, Cao [12] investigated the general explicit solution to
(4)
Xu et al. [13] obtained the general expression of the solution of operator equation (4). In 2012, Wang and He [14] studied some necessary and sufficient conditions for the consistence of the matrix equation
(5)
and presented an expression of the general solution to (5).
Note that (5) is a special case of the following system:
(6)
To our knowledge, there has been little information about (6). One goal of this paper is to give some necessary and sufficient conditions for the solvability of the system of matrix (6) and present an expression of the general solution to system (6) when it is solvable.

In order to find necessary and sufficient conditions for the solvability of the system of matrix equations (6), we need to consider the extremal ranks and inertias of (10) subject to (13) and (11).

There have been many papers to discuss the extremal ranks and inertias of the following Hermitian expressions:
(7)
(8)
(9)
(10)
Tian has contributed much in this field. One of his works [15] considered the extremal ranks and inertias of (7). He and Wang [16] derived the extremal ranks and inertias of (7) subject to A1X = C1, A2XB2 = C2. Liu and Tian [17] studied the extremal ranks and inertias of (8). Chu et al. [18] and Liu and Tian [19] derived the extremal ranks and inertias of (9). Zhang et al. [20] presented the extremal ranks and inertias of (9), where X and Y are Hermitian solutions of
(11)
(12)
respectively. He and Wang [16] derived the extremal ranks and inertias of (10). We consider the extremal ranks and inertias of (10) subject to (11) and
(13)
which is not only the generalization of the above matrix functions, but also can be used to investigate the solvability conditions for the existence of the general solution to the system (6). Moreover, it can be applied to characterize the relations between Hermitian part of the solutions of (11) and (13).

The remainder of this paper is organized as follows. In Section 2, we consider the extremal ranks and inertias of (10) subject to (11) and (13). In Section 3, we characterize the relations between the Hermitian part of the solution to (11) and (13). In Section 4, we establish the solvability conditions for the existence of a solution to (6) and obtain an expression of the general solution to (6).

2. Extremal Ranks and Inertias of Hermitian Matrix Function (10) with Some Restrictions

In this section, we consider formulas for the extremal ranks and inertias of (10) subject to (11) and (13). We begin with the following Lemmas.

Lemma 1 (see [21].)(a) Let A1, C1, B1, and D1 be given. Then the following statements are equivalent:

  • (1)

    system (13) is consistent,

  • (2)

    (14)

  • (3)

    (15)

In this case, the general solution can be written as
(16)
where V is arbitrary.

(b) Let A2 and C2 be given. Then the following statements are equivalent:

  • (1)

    equation (11) is consistent,

  • (2)

    (17)

  • (3)

    (18)

In this case, the general solution can be written as
(19)
where W is arbitrary.

Lemma 2 ([22, Lemma 1.5, Theorem 2.3]). Let , Bm×n, and , and denote that

(20)
Then one has the following
  • (a)

    the following equalities hold

    (21)
    (22)

  • (b)

    if (B)⊆(A), then i±(L) = i±(A)   +   i±(DB*AB)). Thus i±(L) = i±(A) if and only if (B)⊆(A) and i±(DB*AB) = 0,

  • (c)

    (23)

Lemma 3 (see [23].)Let Am×n, Bm×k, and Cl×n. Then they satisfy the following rank equalities:

  • (a)

    r[AB] = r(A) + r(EAB) = r(B) + r(EBA),

  • (b)

    ,

  • (c)

    ,

  • (d)

    ,

  • (e)

    ,

  • (f)

    ,

Lemma 4 (see [15].)Let , Bm×n, , Qm×n, and Pp×n be given, and Tm×m be nonsingular. Then one has the following

  • (1)

    i±(TAT*) = i±(A),

  • (2)

    ,

  • (3)

    ,

  • (4)

    .

Lemma 5 (see [22], Lemma  1.4.)Let S be a set consisting of (square) matrices over m×m, and let H be a set consisting of (square) matrices over . Then Then one has the following

  • (a)

    S has a nonsingular matrix if and only if max XSr(X) = m;

  • (b)

    any XS is nonsingular if and only if min XSr(X) = m;

  • (c)

    {0} ∈ S if and only if min XSr(X) = 0;

  • (d)

    S = {0} if and only if max XSr(X) = 0;

  • (e)

    H has a matrix X > 0 (X < 0) if and only if max XHi+(X) = m  (max XHi(X) = m);

  • (f)

    any XH satisfies X > 0 (X < 0) if and only if min XHi+(X) = m  (min XHi(X) = m);

  • (g)

    H has a matrix X ≥ 0 (X ≤ 0) if and only if min XHi(X) = 0  (min XHi+(X) = 0);

  • (h)

    any XH satisfies X ≥ 0 (X ≤ 0) if and only if max XHi(X) = 0  (max XHi+(X) = 0).

Lemma 6 (see [16].)Let p(X, Y) = ABX − (BX) *CYD − (CYD) *, where A, B, C, and D are given with appropriate sizes, and denote that

(24)
Then one has the following:
  • (1)

    the maximal rank of p(X, Y) is

    (25)

  • (2)

    the minimal rank of p(X, Y) is

    (26)

  • (3)

    the maximal inertia of p(X, Y) is

    (27)

  • (4)

    the minimal inertias of p(X, Y) is

    (28)

where
(29)

Now we present the main theorem of this section.

Theorem 7. Let A1m×n, C1m×k, B1k×l, D1n×l,  A2t×q, C2t×p, , B3p×q,   C3p×n, and D3p×n be given, and suppose that the system of matrix equations (13) and (11) is consistent, respectively. Denote the set of all solutions to (13) by S and (11) by G. Put

(30)
Then one has the following:
  • (a)

    the maximal rank of (10) subject to (13) and (11) is

    (31)

  • (b)

    the minimal rank of (10) subject to (13) and (11) is

    (32)

  • (c)

    the maximal inertia of (10) subject to (13) and (11) is

    (33)

  • (d)

    the minimal inertia of (10) subject to (13) and (11) is

    (34)

Proof. By Lemma 1, the general solutions to (13) and (11) can be written as

(35)
where W and Z are arbitrary matrices with appropriate sizes. Put
(36)
Substituting (36) into (10) yields
(37)

Clearly P is Hermitian. It follows from Lemma 6 that

(38)
(39)
(40)
(41)
where
(42)
Now, we simplify the ranks and inertias of block matrices in (38)–(41).

By Lemma 4, block Gaussian elimination, and noting that

(43)
we have the following:
(44)
By C1B1 = A1D1, we obtain
(45)
By Lemma 2, we can get the following:
(46)
(47)
Substituting (44)-(47) into (38) and (41) yields (31)–(34), respectively.

Corollary 8. Let A1, C1, B1, D1, A2, C2, A3, B3, C3, D3, and Ei, (i = 1,2, …, 5) be as in Theorem 7, and suppose that the system of matrix equations (13) and (11) is consistent, respectively. Denote the set of all solutions to (13) by S and (11) by G. Then, one has the following:

  • (a)

    there exist XG and YS such that A3B3X − (B3X) *C3YD3 − (C3YD3) * > 0 if and only if

    (48)

  • (b)

    there exist XG and YS such that A3B3X − (B3X) *C3YD3 − (C3YD3) * < 0 if and only if

    (49)

  • (c)

    there exist XG and YS such that A3B3X − (B3X) *C3YD3 − (C3YD3) * ≥ 0 if and only if

    (50)

  • (d)

    there exist XG and YS such that A3B3X − (B3X) *C3YD3 − (C3YD3) * ≤ 0 if and only if

    (51)

  • (e)

    A3B3X − (B3X) *C3YD3 − (C3YD3) * > 0 for all XG and YS if and only if

    (52)

  • (f)

    A3B3X − (B3X) *C3YD3 − (C3YD3) * < 0 for all XG and YS if and only if

    (53)

  • (g)

    A3B3X − (B3X) *C3YD3 − (C3YD3) * ≥ 0 for all XG and YS if and only if

    (54)

  • (h)

    A3B3X − (B3X) *C3YD3 − (C3YD3) * ≤ 0 for all XG and YS if and only if

    (55)

  • (i)

    there exist XG and YS such that A3B3X − (B3X) *C3YD3 − (C3YD3) * is nonsingular if and only if

    (56)

3. Relations between the Hermitian Part of the Solutions to (13) and (11)

Now we consider the extremal ranks and inertias of the difference between the Hermitian part of the solutions to (13) and (11).

Theorem 9. Let A1m×p, C1m×p, B1p×l, D1p×l, A2t×p, and C2t×p, be given. Suppose that the system of matrix equations (13) and (11) is consistent, respectively. Denote the set of all solutions to (13) by S and (11) by G. Put

(57)
Then one has the following:
(58)

Proof. By letting A3 = 0, B3 = −I, C3 = I, and D3 = I in Theorem 7, we can get the results.

Corollary 10. Let A1, C1, B1, D1, A2, C2, and Hi, (i = 1,2, …, 5) be as in Theorem 9, and suppose that the system of matrix equations (13) and (11) is consistent, respectively. Denote the set of all solutions to (13) by S and (11) by G. Then, one has the following:

  • (a)

    there exist XG and YS such that (X + X*)>(Y + Y*) if and only if

    (59)

  • (b)

    there exist XG and YS such that (X + X*)<(Y + Y*) if and only if

    (60)

  • (c)

    there exist XG and YS such that (X + X*)≥(Y + Y*) if and only if

    (61)

  • (d)

    there exist XG and YS such that (X + X*)≤(Y + Y*) if and only if

    (62)

  • (e)

      (X + X*)>(Y + Y*) for all XG and YS if and only if

    (63)

  • (f)

      (X + X*)<(Y + Y*) for all XG and YS if and only if

    (64)

  • (g)

      (X + X*)≥(Y + Y*) for all XG and YS if and only if

    (65)

  • (h)

      (X + X*)≤(Y + Y*) for all XG and YS if and only if

    (66)

  • (i)

    there exist XG and YS such that (X + X*)−(Y + Y*) is nonsingular if and only if

    (67)

4. The Solvability Conditions and the General Solution to System (6)

We now turn our attention to (6). We in this section use Theorem 9 to give some necessary and sufficient conditions for the existence of a solution to (6) and present an expression of the general solution to (6). We begin with a lemma which is used in the latter part of this section.

Lemma 11 (see [14].)Let , , C1q×m, and be given. Let , , , M = RAB*, N = A*LB, and S = B*LM. Then the following statements are equivalent:

  • (1)

    equation (5) is consistent,

  • (2)

    (68)

  • (3)

    (69)

In this case, the general solution of (5) can be expressed as
(70)
where U1, U2, V1, V2, W1, and W2 are arbitrary matrices over with appropriate sizes.

Now we give the main theorem of this section.

Theorem 12. Let Ai, Ci, (i = 1,2, 3), Bj, and Dj, (j = 1,3) be given. Set

(71)
(72)
(73)
Then the following statements are equivalent:
  • (1)

    system (6) is consistent,

  • (2)

    the equalities in (14) and (17) hold, and

    (74)

  • (3)

    the equalities in (15) and (18) hold, and

    (75)

In this case, the general solution of system (6) can be expressed as
(76)
where
(77)
where U1, U2, V1, V2, W1, and W2 are arbitrary matrices over with appropriate sizes.

Proof. (2) ⇔ (3): Applying Lemma 3 and Lemma 11 gives

(78)
By a similar approach, we can obtain that
(79)
(1) ⇔ (2): We separate the four equations in system (6) into three groups:
(80)
(81)
(82)
By Lemma 1, we obtain that system (80) is solvable if and only if (14), (81) is consistent if and only if (17). The general solutions to system (80) and (81) can be expressed as (16) and (19), respectively. Substituting (16) and (19) into (82) yields
(83)
Hence, the system (5) is consistent if and only if (80), (81), and (83) are consistent, respectively. It follows from Lemma 11 that (83) is solvable if and only if
(84)
We know by Lemma 11 that the general solution of (83) can be expressed as (77).

In Theorem 12, let A1 and D1 vanish. Then we can obtain the general solution to the following system:
(85)

Corollary 13. Let A2, C2, B1, D1, B3, C3, D3, and be given. Set

(86)
Then the following statements are equivalent:
  • (1)

    system (85) is consistent

  • (2)

    (87)
    (88)

  • (3)

    (89)

In this case, the general solution of system (6) can be expressed as
(90)
where
(91)
where U1, U2, V1, V2, W1, and W2 are arbitrary matrices over with appropriate sizes.

Acknowledgments

The authors would like to thank Dr. Mohamed, Dr. Sivakumar, and a referee very much for their valuable suggestions and comments, which resulted in a great improvement of the original paper. This research was supported by the Grants from the National Natural Science Foundation of China (NSFC (11161008)) and Doctoral Program Fund of Ministry of Education of P.R.China (20115201110002).

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.