Positive definite solution of the matrix equation X = Q − A∗X −1A + B∗X −1B via Bhaskar-Lakshmikantham fixed point theorem

Abstract

Purpose

The purpose of this paper is to study the existence and uniqueness of a positive definite solution to the nonlinear matrix equation X  =  Q  −  A X  −1 A + B X  −1 B , which is a special stochastic rational Riccati equation arising in stochastic control theory.

Methods

Our technique is based on the Bhaskar and Lakshmikantham coupled fixed point theorem.

Results

A new result on the existence of a unique positive definite solution is derived. An iterative method is constructed to compute the unique positive definite solution. Finally, some numerical examples are used to show that the iterative method is feasible.

Conclusion

Coupled fixed point theory on ordered metric spaces can be a useful tool to solve some classes of nonlinear matrix equations.


Introduction

We consider the matrix equation:

X=QAX1A+BX1B,

where Q is an n  ×  n Hermitian positive definite matrix, and A and B are arbitrary n  ×  n matrices. Equation ( 1 ) is a special stochastic rational Riccati equation arising in stochastic control theory, and it can be described below. Some stochastic control problems lead to computing the positive definite solution of the following stochastic rational Riccati equation[ 1 ]:

CXCX+S+π1(X)(L+CXP+π12(X))(R+PXP+π2(X))+(L+CXP+π12(X))=0,

where Z + stands for the Moore-Penrose inverse of a matrix Z and C; P, S, R and L are given matrices of size n × n n × m n × n m × m , and n × m , respectively, such that

T=SLLR

is a Hermitian matrix, and the operator

π(X)=π1(X)π12(X)π12(X)π2(X)

is positive, i.e. X  ≥ 0 implies π ( X ) ≥ 0. Consider the following case: C is the identity matrix, P is an n  ×  n nonsingular matrix, S is an n  ×  n positive definite matrix, L is the zero matrix, and π 12 ( X ) =  π 2 ( X ) = 0, π 1 ( X ) = ( R + P XP ) −1 , where R + P XP is positive definite for all positive semidefinite matrices X . Meanwhile, the stochastic rational Riccati Equation ( 2 ) has the form

S+(R+PXP)1XP(R+PXP)1PX=0.

Set

Y=R+PXP,

then

P(YR)=XP.

By Equations 3 to 5 , we have

S+Y1P(YR)Y1(YR)P1=0,

which implies that

Y+RY1RPY1P=2R+PSP.

Set

Q=2R+PSP,A=R,B=P,

then Equation 3 can be equivalently written as Equation 1 . Therefore, Equation 1 is a special stochastic rational Riccati equation (Equation 2). Moreover, some special cases of Equation 1 are also problems of practical importance, such as the matrix equation X + M X −1 M  =  Q that arises in the control theory, ladder networks, dynamic programming, stochastic filtering, statistics, and so on[ 24 ]. The matrix equation XM X −1 M  =  Q arises in the analysis of stationary Gaussian reciprocal processes over a finite interval[ 5 , 6 ].

Since 1993, the matrix equations X + M X −1 M  =  Q and XM X −1 M  =  Q have been extensively studied, and the research results mainly concentrated on the following:

sufficient conditions and necessary conditions for the existence of a (unique) positive definite solution[ 2 , 68 ];

numerical methods for computing the (unique) positive definite solution[ 46 , 913 ];

properties of the positive definite solution[ 2 , 4 ]; and

perturbation bound for the positive definite solution[ 3 , 14 ].

In addition, other nonlinear matrix equations such as A X 2 + BX + C  = 0[ 15 ], X s  ±  A X t A  =  Q [ 16 , 17 ], X+i=1mAiX1Ai=I [ 18 , 19 ], X  ±  A X q A  =  Q [ 3 , 2027 ], Xi=1mAiXδiAi=Q [ 28 ], X + A F ( X ) A  =  Q [ 29 , 30 ] have been investigated by many authors. However, results on the general nonlinear matrix equation (Equation 1) are few as far as we know.

In this paper, we first use the the Bhaskar and Lakshmikantham fixed point theorem to study the positive definite solution of the nonlinear matrix equation (Equation 1). A new sufficient condition for the existence of a unique positive definite solution to Equation 1 is derived. An iterative method is constructed to compute the unique Hermitian positive definite solution, and the error estimation formal is also given. In the end, we use some numerical examples to illustrate that the iterative method is feasible to compute the unique positive definite solution of Equation 1 .

Methods

Throughout this paper, we denote by (N) and (N) the set of N  ×  N complex and N  ×  N Hermitian matrices, respectively. For A,B(N) , A  ≥ 0 ( A  > 0) means that A is positive semi-definite (positive definite). Moreover, A  ≥  B ( A  >  B ) means that A  −  B  ≥ 0 ( A  −  B  > 0), and X ∈[ A , B ] means A  ≤  X  ≤  B . A and r ( A ) denote the complex conjugate transpose and the spectral radius of A , respectively. We denote by ∥·∥ the spectral norm, i.e., A=λ+(AA) , where λ + ( A A ) is the largest eigenvalue of A A . The N  ×  N identity matrix will be written as I . We denote by ∥·∥ tr the trace norm. Recall that this norm is given by

Atr=j=1Nσj(A),

where σ j ( A ), j  = 1,…, N are the singular values of A .

The following lemmas will be useful later.

Lemma 2.1 (See[31])

Let A  ≥ 0 and B  ≥ 0 be N  ×  N matrices, then 0 ≤  tr ( AB ) ≤ ǁ A ǁ tr( B ).

Lemma 2.2 (See[32])

If 0 <  θ  ≤ 1, and P and Q are positive definite matrices of the same order with P , Q  ≥  bI  > 0, then for every unitarily invariant norm ||| P θ  −  Q θ ||| ≤  θ b θ −1 ||| P  −  Q ||| and ||| P θQ θ ||| ≤  θ b −( θ + 1) ||| PQ |||.

Lemma 2.3 (See[32])

Let A(N) satisfying −  I  <  A  <  I , then ∥ A ∥ < 1.

Let ( X ,≼) be a partially ordered set and F : X  ×  X  →  X be a given mapping. We say that F has the mixed monotone property if for any x , yX ,

x1,x2X,x1x2F(x1,y)F(x2,y),y1,y2X,y1y2F(x,y1)F(x,y2).

We say that ( x , y ) is a coupled fixed point of F if x  =  F ( x , y ) and y  =  F ( y , x ).

The proof of our main result is based on the following two fixed point theorems.

Theorem 2.1 ([33])

Let ( X ,≼) be a partially ordered set endowed with a metric d such that ( X , d ) is complete. Let F : X  ×  X  →  X be a continuous mapping having the mixed monotone property on X . Assume that there exists a δ  ∈[0,1), such that

d(F(x,y),F(u,v))δ2[d(x,u)+d(y,v)],

for all ( x , y ),( u , v ) ∈  X  ×  X with x  ≽  u and y  ≼  v . We suppose that there exist x 0 , y 0  ∈  X , such that x 0  ≼  F ( x 0 , y 0 ) and y 0  ≽  F ( y 0 , x 0 ). Then,

In addition, suppose that every pair of elements has a lower bound and an upper bound, then

For other results concerning fixed point theorems on ordered sets, we refer to[ 3437 ].

Theorem 2.2 (Schauder Fixed point theorem)

Let S be a nonempty, compact, convex subset of a normed vector space. Every continuous function f : S  →  S mapping S into itself has a fixed point.

Results and discussion

There exist a  > 0, b  > 0 (real numbers), such that the following assumptions were considered:

We denote by Ω the set of matrices defined by

Ω={X(N):XaI}.

Our main result is discussed below:

Theorem 3.1

Under the assumptions 1 to 4, we have

converge to x¯ , that is,

limnXnX¯tr=limnYnX¯tr=0,

and the error estimation is given by

maxXnX̂tr,YnX̂trδn1δmaxX1X0tr,Y1Y0tr,

where 0 <  δ  < 1.

Proof

For all X,Y(N) , let

F(X,Y)=QAX1A+BY1B.

We claim that F ( Ω  ×  Ω ) ⊂  Ω . Indeed, let X , Y  ∈  Ω , that is, X  ≥  aI and Y  ≥  aI . This implies that

QAX1A+BY1BQAX1AQa1AA.

On the other hand, from assumption 1, we have

QAX1AaI.

Thus, we have

F(X,Y)=QAX1A+BY1BaI,

which implies that F ( X , Y ) ∈  Ω . Then, our claim holds.

Now, the mapping F  :   Ω   ×  ΩΩ is well defined. Let X , Y , U , V  ∈  Ω , such that X  ≥  U and Y  ≤  V . We have

F(X,Y)F(U,V)tr=A(U1X1)A+B(Y1V1)BtrA(U1X1)Atr+B(Y1V1)Btr=trA(U1X1)A+trB(Y1V1)B=trAA(U1X1)+trBB(Y1V1).

Since U −1  −  X −1  ≥ 0 and Y −1  −  V −1  ≥ 0, using Lemma 2.1, we get

F(X,Y)F(U,V)trAAtr(U1X1)+BBtr(Y1V1).

On the other hand, since X , Y , U , V  ≥  aI , using Lemma 2.2, we have

tr(U1X1)a2tr(XU)

and

tr(Y1V1)a2tr(VY).

Thus, we get

F(X,Y)F(U,V)trAAa2XUtr+BBa2VYtr.

This implies that

F(X,Y)F(U,V)trδ2XUtr+VYtr,

where

δ=2a2maxAA,BB.

From condition 4 and Lemma 2.3, we can easily show that 0 ≤  δ  < 1. Now, taking X 0  =  aI and Y 0  =  bI , from conditions 2 and 3, we can easily show that X 0  ≤  F ( X 0 , Y 0 ) and Y 0  ≥  F ( Y 0 , X 0 ). On the other hand, for every X,Y(N) , there is a greatest lower bound and a least upper bound. Note also that F is a continuous mapping. Now, (I) and (III) follow immediately from Theorem 2.1. Let x¯ be the unique solution to Equation 1 in Ω .

To prove (II), we shall use the Schauder fixed point theorem. We define the mapping G :[ F ( aI , bI ), F ( bI , aI )] →  Ω by

G(X)=F(X,X),for allX[F(aI,bI),F(bI,aI)].

We claim that G ([ F ( aI , bI ), F ( bI , aI )])⊆[ F ( aI , bI ), F ( bI , aI )]. Let X ∈[ F ( aI , bI ), F ( bI , aI )], that is,

F(aI,bI)XF(bI,aI).

Using the mixed monotone property of F , we get

F(F(aI,bI),F(bI,aI))F(X,X)=G(X)F(F(bI,aI),F(aI,bI)).

On the other hand, from conditions 2 and 3, we have

aIF(aI,bI)andbIF(bI,aI).

Again, using the mixed monotone property of F , we get

F(F(bI,aI),F(aI,bI))F(bI,aI)andF(F(aI,bI),F(bI,aI))F(aI,bI).

From Equations 7 and 8 , it follows that

F(aI,bI)G(X)F(bI,aI).

Thus, our claim that G ([ F ( aI , bI ), F ( bI , aI )])⊆[ F ( aI , bI ), F ( bI , aI )] holds.

Now, G maps the compact convex set [ F ( aI , bI ), F ( bI , aI )] into itself. Since G is continuous, it follows from Schauder fixed point theorem (see Theorem 2.2 ) that G has at least one fixed point in this set. However, fixed points of G are solutions of Equation 1 , and we proved already that Equation 1 has a unique solution in Ω . Thus, this solution must be in the set [ F ( aI , bI ), F ( bI , aI )], that is,

X¯[Q+b1BBa1AA,Q+a1BBb1AA].

Thus, we proved (II). This makes end to the proof. □

The following results are immediate consequences of our Theorem 3.1.

Theorem 3.2

Consider Equation 1 with Q  =  I . Suppose that

( 1 ) 0<a12,b1+a2 ; and

( 2 ) AA<a22I,BB<a22I .

Then, items I to III of Theorem 3.1 hold.

Theorem 3.3

Consider Equation 1 with A and B which are unitary matrices. Suppose that

Then, items I to III of Theorem 3.1 hold.

Theorem 3.4

Consider Equation 1 with A  = 0. Suppose that

Then, items I to III of Theorem 3.1 hold.

Theorem 3.5

Consider Equation 1 with B  = 0. Suppose that

Then, items I to III of Theorem 3.1 hold.

Numerical experiments

All programs are written in MATLAB version 7.1.

Example 1

In this example, we consider Equation 1 with

Q=701071118,A=2.110.010.010.051.980.180.10.192.38,B=3.090.010.010.013.150.090.040.12.94.

All the hypotheses of Theorem 3.1 are satisfied with a  = 5 and b  = 14. We consider the sequences { X n } and { Y n } defined in item III of Theorem 3.1 with X 0  =  aI and Y 0  =  bI . For each iteration k , we consider the errors

R(Xk)=Xk(QAXk1A+BXk1B),
R(Yk)=Yk(QAYk1A+BYk1B)

and

Rk=max{R(Xk),R(Yk)}.

After 23 iterations, we get

X¯X23=Y23=7.680201122270050.029506336696800.889174866125000.029506336696807.796938173834590.925604525774540.889174866125000.925604525774548.34452699090856

with

R23=2.42861287e017.

Example 2

In this example, we consider Equation 1 with

Q=100010001,A=0.30.010.0100.280.020.020.030.34,B=0.340000.3400.010.010.32.

All the hypotheses of Theorem 3.2 are satisfied with a  = 0.5 and b  = 5. After 20 iterations, we get

X¯X20=Y20=1.024447459494210.0035616230998368260.012962823383459680.0035616230998368261.0348236752821710.0082185789803086370.012962823383459680.0082185789803086390.9861513844061653

with

R20=2.09918957e016.

Example 3

We consider Equation 1 with

Q=1053.45106.73.46.710,A=0.05910.07370.03280.07370.03280.05910.03280.05910.0737,B=0.5910.7370.3280.7370.3280.5910.3280.5910.737.

In this case, A and B are unitary matrices. All the hypotheses of Theorem 3.3 are satisfied with a  = 1.514 and b  = 101.5. After 7 iterations, we get

X¯X7=Y7=10.064126899410095.0132637235503493.3450793249298845.01326372355034910.139999446575516.7198879398948023.3450793249298846.71988793989480210.29931432720346

with

R7=1.77635684e015.

Example 4

We consider Equation 1 with

Q=100503450100673467100,A=000000000,B=10.500.5100.50.51.5.

All the hypotheses of Theorem 3.4 are satisfied with a  = 3.5 and b  = 300. After 3 iterations, we get

X¯X3=Y3=100.010462998708950.0045068006224934.0043507679599750.00450680062249100.010522175965566.9953801120922234.0043507679599766.99538011209222100.0407917033456

with

R3=3.00990733e014.

Example 5

We consider Equation 1 with

Q=1053.45106.73.46.710,A=0.50.2500.250.500.250.250.75,B=000000000.

All the hypotheses of Theorem 3.5 are satisfied with a  = 2 and b  = 100. After 10 iterations, we get

X¯X10=Y10=9.9737389153364334.9887612642282043.3888191290125714.9887612642282049.9735426757535656.7120617143630093.3888191290125716.7120617143630099.89541012219485

with

R10=1.32107728e014.

Conclusion

Fixed point theory on ordered metric spaces can be a useful tool to solve various classes of nonlinear matrix equations. In this work, to solve the nonlinear matrix equation X  =  Q  −  A X −1 A + B X −1 B , we suggested an iterative method based on a coupled fixed point theorem of Bhaskar and Lakshmikantham for mixed monotone mappings. The numerical experiments demonstrated that the proposed method is satisfactory.


Acknowledgements

The second author acknowledges the supports from the National Natural Science Foundation of China (grant no.: 11101100) and the Natural Science Foundation of Guangxi Province (grant no.: 2012GXNSFBA053006).


Competing interests

The authors declare that they have no competing interests.


Authors’ contributions

All authors contributed equally and significantly in writing this paper. All authors read and approved the final manuscript.


References

  1. Yong and Zhou (1999) Springer-Verlag
  2. Engwerda et al. (1993) Necessary and sufficient conditions for the existence of a positive definite solution of the matrix equation X + A∗ X−1 A = Q (pp. 255-275) 10.1016/0024-3795(93)90295-Y
  3. Hasanov (2005) Positive definite solutions of the matrix equations X ± A∗ X−q A = Q (pp. 166-182) 10.1016/j.laa.2005.02.024
  4. Zhan (1996) Computing the extreme positive definite solutions of a matrix equation (pp. 632-645) 10.1137/S1064827594277041
  5. Benner and Fabbender (2007) On the solution of the rational matrix equation X = Q + LX−1 L∗ (pp. 1-10)
  6. Ferrante and Levy (1996) Hermitian solutions of the equations X = Q + NX−1 N∗ (pp. 359-373) 10.1016/0024-3795(95)00121-2
  7. Engwerda (1993) On the existence of a positive definite solution of the matrix equation X +  ATX−1A = I (pp. 91-108) 10.1016/0024-3795(93)90115-5
  8. Zhan and Xie (1996) On the matrix equation X + ATX−1 A = I (pp. 337-345) 10.1016/0024-3795(95)00120-4
  9. Fital and Guo (2008) A note on the fixed-point iteration for the matrix equations X ± A∗X−1 A = I (pp. 2098-2112) 10.1016/j.laa.2008.06.005
  10. Guo and Lancaster (1999) Iterative solution of two matrix equations (pp. 1589-1603) 10.1090/S0025-5718-99-01122-9
  11. Ivanov et al. (2004) Improved methods and starting values to solve the matrix equations X ± A∗X−1 A = I iteratively (pp. 263-278) 10.1090/S0025-5718-04-01636-9
  12. Monsalve and Raydan (2010) A new inversion-free method for a rational matrix equation (pp. 64-71) 10.1016/j.laa.2010.02.006
  13. Meini (2001) Efficient computation of the extreme solutions of X + A∗X−1 A = Q and X − A∗X−1 A = Q (pp. 1189-1204) 10.1090/S0025-5718-01-01368-0
  14. Xu (2001) Perturbation analysis of the maximal solution of the matrix equation X + A∗X−1 A = P (pp. 61-70) 10.1016/S0024-3795(01)00300-7
  15. Bai et al. (2005) On two iteration methods for the quadratic matrix equation (pp. 114-122)
  16. Duan and Liao (2008) On the existence of Hermitian positive definite solutions of the matrix equation Xs + A∗X−t A = Q (pp. 673-687) 10.1016/j.laa.2008.03.019
  17. Liu and Gao (2003) On the positive definite solutions of the matrix equation Xs ± A∗X−t A = In (pp. 83-97) 10.1016/S0024-3795(02)00661-4
  18. He and Long (2010) On the Hermitian positive definite solution of the nonlinear matrix equationIX+∑i=1mAi∗X−1Ai= (pp. 3480-3485) 10.1016/j.amc.2010.04.041
  19. Long et al. (2008) On the Hermitian positive definite solution of the nonlinear matrix equation IX+A1∗X−1A1+A2∗X−1A2= (pp. 645-654)
  20. Duan and Liao (2009) On the nonlinear matrix equation X + A∗X−q A = Q(q ≥ 1) (pp. 936-945) 10.1016/j.mcm.2008.10.009
  21. Hasanov and El-Sayed (2006) On the positive definite solutions of the nonlinear matrix equations X + A∗X−δ A = Q (pp. 154-160) 10.1016/j.laa.2005.06.026
  22. Ivanov (2006) On positive definite solutions of the family of matrix equations X + A∗X−n A = Q (pp. 277-301) 10.1016/j.cam.2005.06.007
  23. Ivanov and El-Sayed (1998) Properties of positive definite solutions of the equation X + A∗X−2 A = I (pp. 303-316) 10.1016/S0024-3795(98)00023-8
  24. Peng and El-Sayed (2007) On positive definite solution of a nonlinear matrix equation (pp. 99-113) 10.1002/nla.510
  25. P eng et al. (2007) Iterative methods for the extremal positive definite solution of the matrix equation X + A∗X−α A = Q (pp. 520-527) 10.1016/j.cam.2006.01.033
  26. Berzig (2012) Comment to: Perturbation estimates for the nonlinear matrix equation X − A∗Xq A = Q (0 < q < 1) by G. Jia and D. Gao
  27. Berzig and Samet (2011) Solving systems of nonlinear matrix equations involving Lipshitzian mappings
  28. Duan et al. (2008) On the nonlinear matrix equationQX-∑i=1mAi∗Xδ1Ai= (pp. 110-121) 10.1016/j.laa.2008.02.014
  29. El-Sayed and Ran (2001) On an iterative method for solving a class of nonlinear matrix equations (pp. 632-645) 10.1137/S0895479899345571
  30. Berzig (2012) Solving a class of matrix equations via Bhaskar-Lakshmikantam coupled fixed point theorem (pp. 1638-1643) 10.1016/j.aml.2012.01.028
  31. Ran and Reurings (2004) A fixed point theorem in partially ordered sets and some applications to matrix equations (pp. 1435-1443) 10.1090/S0002-9939-03-07220-4
  32. Bhatia R: Matrix Analysis. Graduate Texts in Mathematics . Vol. 169. Springer-Verlag, New York (1997) Vol. 169. Springer-Verlag, New York (1997)
  33. Bhaskar and Lakshmikantham (2006) Fixed point theorems in partially ordered metric spaces and applications (pp. 1379-1393) 10.1016/j.na.2005.10.017
  34. Berzig and Samet (2012) Positive solutions to periodic boundary value problems involving nonlinear operators of Meir-Keeler-type (pp. 279-296) 10.1007/s12215-012-0089-z
  35. Berzig and Samet (2012) Positive fixed points for a class of nonlinear operators and applications
  36. Berzig and Samet (2012) An extension of coupled fixed point’s concept in higher dimension and applications (pp. 1319-1334) 10.1016/j.camwa.2012.01.018
  37. Samet (2010) Coupled fixed point theorems for a generalized Meir-Keeler contraction in partially ordered metric spaces (pp. 4508-4517) 10.1016/j.na.2010.02.026