Redacted for Privacy particular, characterize Master

advertisement
AN ABSTRACT OF THE THESIS OF
for the
CHARLES GORDON LINDBERG
(Name)
Mathematics
(Major)
in
Master of Science
(Degree)
August 31, 1967
presented on
(Date)
Title A CONE ASSOCIATED WITH THE LYAPUNOV MAPPING
Abstract approved:
Redacted for Privacy
(C. S
In this
+
In
of the cone of
under this mapping. In Section
matrix
`
hermitian matrix
A.
P
is a hermitian
particular, for special positive stable
characterize the image
nX n
PA
is a positive stable matrix and
A
matrix.
Ballantine)
paper we investigate the Lyapunov mapping
P -- AP
where
.
H
V
A
we
positive definite matrices
we give five conditions on an
and a general
nXn
that are equivalent to the condition:
for some positive definite (hermitian)
P.
positive stable
H
=
AP
+
PA
A Cone
Associated with the Lyapunov Mapping
by
Charles Gordon Lindberg
A THESIS
submitted to
Oregon State University
partial fulfillment of
the requirements for the
in
degree of
Master
of
Science
June 1968
APPROVED:
Redacted for Privacy
Associate Professor
of Mathematics
In Charge of Major
Redacted for Privacy
Chairman of Department
of
Mathematics
Redacted for Privacy
Dean of Graduate School
Date thesis is presented
August 31, 1967
Typed by Carol Baker for
Charles Gordon Lindberg
ACKNOWLEDGMENT
This author is indeed indebted to Dr. C.S. Ballantine for his
guidance and encouragement during the work on this thesis.
TABLE OF CONTENTS
Page
I.
INTRODUCTION
II.
SOME PROPERTIES OF THE MAPPING
III.
[A
IV.
[A]sfP
V.
]a)
1
FOR DIAGONAL
FORA GENERAL
CONDITIONS ON
HE[A]P
BIBLIOGRAPHY
H
AND
P
_ AP+ PA''`
A
6
15
2X2
A
A
25
EQUIVALENT TO
32
42
A CONE ASSOCIATED WITH THE LYAPUNOV MAPPING
A
large amount
I.
INTRODUCTION
of
research in matrix theory in the past few
years has been concerned with the matrix equation
Y
=
Lyapunov in his 1892 monograph
of Motion 1/
[
8]
AX
+
XA*
(1.1)
.
The General Problem of Stability
established that the equation
(1. 1) could be used to
obtain information pertaining to the location of the characteristic
values
a
of
variety
A.
of
More recently, interest in equation (1.
directions.
A
has taken
survey of results linked with this matrix
equation can be found in Givens' report
Elementary Divisors and
Some Properties of the Lyapunov Mapping
70]
1)
X
-
AX
+ XA
[
5, p. 62-
.
One
direction
of study was
suggested by Olga Taus sky in the
Bulletin of the American Mathematical Society [ 15, p. 711] when
she queried:
Let A be an nXn matrix with complex elements
and characteristic roots with negative real parts. It is
known (theorem of Lyapunov) that such matrices are
1/
The translation of the title of Lyapunov's monograph used here is
taken from Givens [5, p.
3]
.
2
characterized by the fact that
a positive definite G
exists with AG + GA* negative definite. What is
the range of AG + GA* if G runs .4through all
positive definite nXn matrices? A is the
complex conjugate and transposed matrix.
This thesis addresses itself to this question. After considering
some of the properties of the matrix equation (1.
mapping, we characterize the set
as
V
P
=
AP
and the related
PA* for some special
A,
ranges over the set of positive definite matrices. In Section
and a positive stable
A
of
+
(Theorem 6) we give five conditions on an nXn
H
H
AP
1)
+
PA''`
that are equivalent to the condition:
A
for some positive definite
brief background
of
hermitian matrix
equation (1.
1)
P.
is given now for the sake
completeness. As noted, Lyapunov is credited with the use
the equation (1.
1)
of
for the location of characteristic values. What he
did was to establish a connection between the location of the charac-
teristic values
of a
general
matrix and the signature
nXn
quadratic form. The equation (1.
1)
of a
results when one considers the
matrices associated with the quadratic forms H(x, x) and P(x,
which are related in the following way:
dt
P(x, x)
where
dx
dt
=
xA
=
H (x, x)
,
x)
3
A
is a real
If we
nXn
matrix, and x is a real
1
Xn
row vector.
differentiate and make the appropriate substitutions, the
following equation results:
P(xA, x)
In
+
P(x, xA)
matrix form, equation
(xA)PxT
+
H(x, x).
=
(1. 2) can be
xP(xA)T
(1. 2)
expressed as
T
xHx,
=
or as
AP
P =[p..]
(i,j
=
+
PAT
1,2,...,n)
=
H.
and
(1.3)
H
= [
h..]
(i,
j =
1,2
...,n)
,
in
(1.3) are the symmetric matrices formed respectively from the
coefficients
transpose
of
of the
1.
and
P(x, x),
and
AT
is the
The central result of Lyapunov is
1.
The
positive definite
of
A
1
+
A
is stable (all of the
have negative real part) if and only if
matrix P
nXn
AP
Proofs for Theorem
real matrix
nXn
characteristic values
a
H(x, x)
the matrix A.
given in Theorem
Theorem
forms
PAT
=
-I
=
PT
exists such that
.
can be found in [ 10,
16]
.
4
We can
immediately generalize Lyapunov's work to the case
arbitrary complex matrix
of an
The quadratic forms
H(x, x)
A
a..]
= [
and
(i, j
=
1, 2,
,
n).
are replaced by the
P(x, x)
hermitian forms
n
H (x, x)
n
h,X.X.
i3 1
=
3
i=1
j=1
n
n
and
P(x, x)
p..13 x.1 x
=
i=1 j=1
Correspondingly, the matrix equation
(1. 3) is
replaced by the
matrix equation
H
where
H
= [
h. .]
i
(1,
=
j=
AP
+
PA',
1, 2,
,
n)
and
P
= [
P. .]
i
(i,
j = 1, 2,
,
n)
are the hermitian matrices formed respectively from the coefficients
of the
hermitian forms H(x,
x)
and
jugate transpose of A.
We follow
consider positive stable
A
P(x, x).
A''`
Ballantine
[
1, p. 2] in
rather than stable
A.
matrix is called positive stable provided each
of its
values has positive real part. We note that if
A
-A
is stable, and
-H
denotes the con-
An
that we will
nXn
characteristic
is positive stable,
is negative definite for positive definite
H.
5
If we
1
consider positive stable
rather than stable A,
A
can be generalized to complex matrices
Theorem 1*.
if and only if
The
nXn
AP
+
in the following way:
A
complex matrix A
a positive definite
PA*
=
matrix P
I
.
=
Theorem
P*
is positive stable
exists such that
6
II.
P- AP+ PA'''
SOME PROPERTIES OF THE MAPPING
Since we wish to characterize the set
AP
+
PA*
as
P
ranges over the positive definite hermitian matrices, it will at
times be more descriptive to utilize the mapping
[A]: P -AP+
PA *,
rather than the equation
H
=
AP+ PA*
P- AP
The fact that the mapping
by
specification
of the
of
A
notation [A]
= [
.
a,
ij
.
(i,
]
+
PA'
is completely determined
j = 1, 2,
for the mapping.
,
n)
justifies the use
(Since brackets are also
used in this thesis to enclose the components of a matrix and for
grouping, every effort will be made to make it clear from context
which meaning is intended.
the mapping
Lemma
of
order
list
a number of
properties
of
[A]
[
A
in the space
x
of
complex
is a linear transformation.
n
Proof: The mappings
Thus the mapping
We now
which we will call upon later in the thesis.
The mapping
1.
matrices
[A],
)
-AP and P -PA* are
P -AP + PA''` is linear.
P
]
:
clearly linear.
7
It follows
basis
x, [A]
of
matrix
from Lemma
of
A
[
product
]
determines an n2 X n2
[A] relative to this basis).
the ordered basis
by
that, with respect to each ordered
1
of
x
matrix (namely, the
We
will now show that if
is chosen properly, the matrix determined
where ® denotes the Kronecker
is
A ®I + I®A,
For
a definition and some of the
properties
of the
Kronecker product (which is also known as the direct or tensor
product) see [ 9, p.
81 -88]
or
[
2, p. 227
-229]
.
In each of the cited
references the authors found no need to define the Kronecker product
for nonsquare matrices. We will now define the Kronecker product
a row
vector by a column vector, since this concept will be used to
exhibit the particular basis of which we have spoken.
nX
1
of
column vector and
Kronecker product
z®y
y
a
1
row vector.
Xn
Let
z
be an
Then the
is defined to be the ordinary matrix product
zy. We shall now describe the above mentioned basis of X. Let e.
be an nX 1 column vector with 1 in the ith position and zeros
i
elsewhere. Similarly, let f.
the
jth
be a
1
Xn
row vector with
1
in
position and zeros elsewhere. Then
(2. 1)
where the matrix on the right side of (2.
1)
designates the
nXn
8
matrix with
in the
1
position and zeros elsewhere (this
(i, j)
matrix is sometimes denoted E..).
13
precede
{
e
.
1
e!
®f.: i
J
,
Q<)
j
for
V
J
=
1, 2,
,
If we
or for
require that e. ® f.
i <
i'
n
forms an ordered basis
}
i
=
and
i'
j'
j <
of
X
then
,
The
.
ordering described above is called standard lexicographic ordering.
To show that the matrix of
A
®I
+
IQ A,
[
A
with respect to this basis is
]
we will consider the mapping
individually. The mapping
respect to the basis
{
e. ®
P --AP
f.:
i, j
=
-AP
P
and P --PA
is linear and its matrix with
1, 2,
is determined by
n1
,
i
its action on the basis elements
(i,j
f.
1,2,,n).
=
have
j
We
column
^allal2
a21a22
a
a
_ nl n2
0-
aln^
i
all
-0
0._
ali
a2n
a
th
nn_
0
0
a
0
0
.
ms.
so that
ei®
f.
J
- k=1
Therefore the matrix
aki (ek® f.)
(i, j
-=
1, 2,
.
n)
.
J
of
P
- AP,
ordering the
{eX
1
f.J
:i,j
=
1,2,,n}
9
standard lexicographically, is
A
®
Proceeding in a similar
I.
P. PA*,
J.
fashion with the transformation
we find that a typical
basis element maps as follows:
n
ei®f.
® fk)
ak (ei
J
J
(i, j
=
2,
1 ,
. .
,
n)
.
k=1
Thus the matrix of this mapping with respect to the indicated basis
is
I®
le. ®f.:
J
[A]
Hence the matrix of
.
i, j
=
1, 2,
,
n
with respect to the
ordered standard lexicographically,
} ,
is
A®I+ I®Á.
Lemma
X. + X.
i
2.
(i,
If
A
j = 1, 2,
has characteristic values
(i= 1,2, '
X.
i
then
,n) are the characteristic values of A ®I+ I®A
J
and hence are the characteristic values of the mapping
Proof:
It is
well known that we can choose
that Al =PAP -1
diagonal
of
A1,
teristic values
is
, n),
of
is
triangular with
provided the
A
[
9,
clearly triangular with
characteristic values
calculations verify that
=
X. +
T.
J
(i,
1, 2,
1, 2,
,
Theorem 41.3, p.
X. + X.
i
(i
X.
=
A
]
.
nonsingular such
P
(i
X.
[
75]
.
n)
,
on the
n)
are the charac A1®I
+
I
©A1
on the diagonal and thus has
j = 1, 2,
,
n).
The following
J
Al ©
I +
I©
Al
is
similar to A©
I +
I®
A:
10
Al ®I
IQA1
+
=
PAP -1 ®I
I ©PAP -1
+
1
_
(P®P)(A®I)(P- ®P
_
(P®P)(A®I
+
-1)+
i®A)(P®P)-1,
(POP) is nonsingular. (The properties
since
1
®17)(I®A)(P- 017)-1)
of the
Kronecker
product used above are proved or references to their proofs are
given in
A®I
of
[
+
9, p. 82 -83]
are
I®A
Lemma 3.
If
A ®I + I®A
Proof:
Let
(i
I®A
that
+
=
,
Re (X.
+
X.)
(
X. + X.
for i,
(i, j
> 0
X.+
X.
i
of
Let
[A]
2,
,
n).
[A]
are nonsingular.
characteristic values
=
,n
j = 1, 2,
Re(X. +-X.)
j
(Lemma 2). We have
and therefore
,n)
1, 2,
A®I
+
I®A
means the real part of the com-
.
J
A
be any matrix of
all nXn hermitian matrices
Then
1 ,
positive stable matrix, then the
nXn
a
Lemma 4.
=
characteristic values
Then the characteristic values of
is nonsingular. Here
plex number
i, j
and thus the mapping
n).
are
I®A
+-X.
be positive stable with
A
1, 2,
.
is an
A
matrix
X.
X
It follows that the
. )
x
and let
47/4
be the set
is a subset of
(thus 90/
is a (real) linear mapping of 2Ì
into
x ).
and at
least one (real) matrix representing this real linear mapping is
similar (over the complex field) to
A
®I
+
I ®A
.
First, for
Proof:
for all real
Further,
[A]
to
P
V
maps
í7%
P'
=
r2.
and
r1
and
P1
Thus
implies
into PT
P2
/
in
r1P1
4
PA'``)'
+
r2P2
is in N
is a real linear space.
%;i
(AP
+
AP
=
so that
PA *,
+
The fact that the restriction of
.
matrices Ekk,
E
J
a
basis
)
(j
of
x
kj
,
k
,n,
1, 2,
=
form a basis
j < k)
are matrices
[A]:
x
-
x
+Ek.,
J
Thus the (real) matrix
(over the complex field).
of
k
'7/ and also
of
associated with this basis and the matrix
M[A]
a
and the real field being a subfield of the complex
x
One can routinely verify that the
i(E Jk -E
[A]
7,/ being
is a linear mapping follows from Lemma 1,
subset of
field.
11
.iI+ I® A
A
with respect to different bases and
hence are similar.
Before proceeding we make the following definition:
be a subset of a
r1R1
+
r2R2
real linear space. Then JR is
is in
positive (real)
.A for all
r2.
and
r1
and
R1
R2
definite matrices forms a cone. Further, if
nXn
(or any complex
let
H1
and
tive. Then
P2
in
kip
H2
H1
,
= [
matrix),
be in
A
]
P1
[A]
[ALP,
and
[
A
,,
A ] P2
r
and all
set iP of positive
is positive stable
is a cone.
;U"
and let
H2-
a cone provided
in
We note that the
Let ,R,
and
for some
To show this
r2
be posi-
Pl
so we have
r1Hi+r2H2
rl[A]P1+r2[A]P2
_
[A](r1P1+r2P2),
and
12
[A] JP,
which is in
since
,60
is a cone.
Since we are primarily interested in how
cone of positive definite matrices JP
[
A
] :
P
-AP +
definite and
we will call the mapping
,
P
be positive
be positive stable, the Lyapunov mapping.
Lyapunov mapping is thus
restricted to
maps the
subject to the restriction that
PA
A
[A]
[A]
(for some positive stable
and its range is
,
we are interested in
[
characterizing.
A
]
0.
[
We note
A ] X9
that both
the Lyapunov mapping depend on the positive stable
A
The
A)
is the cone
[A]
0)
and
under con-
sideration at the time.
We now
the cone
[
Theorem
2.
state three theorems which give information concerning
A] (P.
If
is positive stable and
A
H
is positive definite,
then
A P + PA
can be solved with
Proof:
H
=
RR's
We follow
P
(2.
H
2)
positive definite.
Taussky
[
13]
for some nonsingular
AP
which is equivalent to
=
+
Since
.
is positive definite,
Then we have (to solve)
R.
PA*
H
=
RR*
,
13
R- 1ARR -1P(R -1)yß+ R- 1P(R- 1)^RJA >4(R *)
=
I,
which can be written
A1P1
P1AI*
+
(2. 3)
I,
=
where
A1
=
PI
=
R
-1AR
1P(R
,;.
Now since
Al
values as
A),
with
positive definite. Hence it follows that
P1
is positive stable (it has the same
it follows from Theorem l''` that (2.
positive definite and satisfies (2.
Theorem 3.
characteristic
H
=
positive definite
AP
+
P
PA
P
3)
=
can be solved
RP1R*
is
2).
for some positive stable
if and only if
A
and some
has at least one positive
H
characteristic value.
Stein gives a proof of Theorem
Functions
of
and Theorem
3
in his paper On the Ranges of Two
Positive Definite Matrices
3
[
11 ]
.
From Theorem
2
it follows that the range of the Lyapunov mapping
contains the cone
of
positive definite matrices but does not contain
any nonpositive definite matrices.
In Lemma 3, when we
established
14
that
[A]
was nonsingular for positive stable
the existence of
[A] -1.
characterize the range
This existence of
[
A
]
)P
A,
[A] -1
we
assured
allows us to
of the Lyapunov mapping in the
following way.
Theorem
H
4.
is in
In the next
Let A
[A]
0
be positive stable and
if and only if
[
A
]
-1H
H
be hermitian. Then
is positive definite.
section (and in later sections) we use Theorem
characterize the range
of the Lyapunov mapping for some
positive stable matrices.
4
to
special
15
III.
In this
[
A ],a) FOR DIAGONAL
A
section we consider only positive stable matrices
which are diagonal.
A
These matrices display their characteristic
values, and since [A]
and [A] -1H
P
are easily calculated,
characterize the image cone [A]6).
we can use Theorem 4 to
The
simplest kind of diagonal matrix is a scalar matrix, which is just a
scalar multiple
case where
tion that
of
A
is a positive stable scalar matrix
A
H
0
[A]
rather
a
be hermitian.
i. e.
,
Then
A
Re(X)
Ps
A] P
P(XI)*
= [
if and only if
(XI)P
+
=
XI
restric-
> 0,
be positive stable
and so
A
is in
H
,
_ (X
is positive definite (since
Thus for scalar
is a cone).
itself by [A]
H
=
The
XI.
severe one, and the results are indicative
if and only if, for some
H
the cone
S
X
+ X > 0
and
is mapped onto
.
is an
When A
of
is
XI
=
identity. For this reason we first examine the
the severity of the restriction. Let
and let
a)
of the
nXn
nonscalar diagonal matrix, the range
the Lyapunov mapping is not so conveniently described.
considering the
case in detail.
it is of the form
nXn
A
2 X 2
Before
case, it is interesting to examine the
2X2
complex matrix is hermitian if and only if
16
w
x+iy
L-iy
where
the
are real and
w, x, y, z
2X2
=.
i
If we
fix the basis (for
hermitian matrices) to be the one mentioned in Lemma
4, with the ordering
2X2
J
E
,
E
12
+E
21
i(E
,
hermitian matrix is associated with
12
-E
21
), E
then each
22,
a 4 -tuple by the following
mapping:
w
x+iy
x-iy
z
(w,x,y,z),
which is an isomorphism from the space of
to the space of real 4- tuples.
2X2
Now if we apply the
space of 4- tuples, we can describe the cones
geometrically since they are subsets
matrices. First, a point
(w, x, y, z)
hermitian matrix) corresponds to
matrix
usual distance to the
and
set of
of the
[
2X2
(this represents a
A] W
hermitian
2 X 2
positive definite (hermitian)
if and only if
w > 0
(We get
a
QP
hermitian matrices
and
wz - (x2 +y2) >
from the above conditions that
0
.
z > 0
also.
)
The set of
points satisfying this system of inequalities forms the interior of
17
(one nappe of) a four -dimensional cone.
The traces of the boundary of
this cone in each of the three -dimensional coordinate hyperplanes
(a c oor -
dinate hyperplane is the set obtained by setting one of the coordinates
equal to zero and is labeled by the remaining coordinates) are given
by the following:
(1)
If
x
=
we have the
0,
w,
hyperplane.
y, z
The condi-
tions which describe the boundary of the cone in this hyperplane are
and
w > 0
wz -y2
=
0.
This is an elliptic cone of one nappe with axis
is tangent to the
positive
w
(2)
If
plane and the
wy
and
z
y
0,
=
y
=
w
-z
= 0
and which
plane respectively along the
yz
axes.
we get the
hyperplane. The condi-
w, x, z
tions that define the boundary in this hyperplane are
and
w > 0
wz
2
=
0,
and the locus is similar to the one described in (1).
(3)
or
line;
If
w
= 0
or
=
0,
we get the hyperplanes
x, y,
z
The trace in either of these hyperplanes is just a half -
w, x, y.
the positive half
half of the
z
w
of the
z
axis in the latter.
axis in the former and the positive
18
[A]
Now if we apply the mapping
to the cone
lU"
of
posi-
tive definite matrices, Lemma 4 assures that the matrices in the
[A] a) will be hermitian. Thus the image can be described
image
geometrically in the same manner as a)
A
2X
2
diagonal matrix is positive stable and nonscalar if
and only if it is of the form
and
Re(µ)
> O.
Thus let
w
P
X
0
0
µ
and
with
X
Re(X) >
0
this form and let
a
b+ic
b-ic
d
I¡'
/ µ,
H=
z
hermitian. Then
_
A be of
x+iy
=
x-iy
be
.
is in the range of the Lyapunov mapping if
H
and only if
P
is positive definite.
Lemma 3.
a
H
With
=
[A] -1H
The existence of
A, P and H
b+ic
X
0
w
d
0
µ
x-iy
[A]
-1
is guaranteed by
as given we have
x+iy
w
x+iy
z
Lx-iY
z
X
0 1
0
µ
=
b-ic
(\+
.
)w
(µ +X)(x -iy)
From this we get
(X
+µ) (x +iy)
(µ +µ )z
19
b+ic
a
+µ
X+-X'
x±iy
w
P
=
=
x-iy
[A]
H,
z
b-ic
d
µ+
and it follows that
µ+µ
is in the range of the Lyapunov mapping if
H
and only if
X
a>
+ X.
0
ad
and
b2
(R+T)(µ+µ)
Thus the point with coordinates
hermitian matrix in [A]
a > 0
and
XP
(X
c2
+
> 0
+µ)(µ +X)
corresponds to a
(a, b, c, d)
2X2
if and only if
Kad -(b2 +c 2)
> 0
,
where
K
(X+1-0(X+11)
(X+\)(+)
The following calculations verify that
every
?
K >
)
1
is of this form (with
K >
Re(X)
1
and conversely that
> 0,
Re(p.)
> 0
and
Namely, we have that the following inequalities are pair-
wise equivalent:
20
2>
IX-p.1 >
0
µ+µ
XX + 1111>
1X +
µµ
+-X
+ X11 > X11 +
(X
+µ)(X+µ)
> (X +
(X
+µ)(µ+X)
>
(X
+fi)(µ +µ)
The conditions which
H
µ).
+
11
Xµ
+
)(µ+µ)
1
must satisfy to be in the range of the
Lyapunov mapping resemble those conditions a matrix
satisfy to be in 6)
K
For
scalar) the cone
= 1
(i.
e.,
have already seen.
must
the one difference being the coefficient K.
,
Therefore, how this
K
P
A
affects the cone is
of
prime interest.
[Al
To see what happens as
as we
_63%
gets large, we
K
shall look at the traces of the boundary locus in the respective
coordinate hyperplanes.
When
a
or
d
equals zero,
not affect the locus and thus it is the same as that of 62
corresponding hyperplanes. If we let
boundary in the
a, b,
d
c
of
hyperplane, the effect of
this trace are the following:
a >
0
and
Kad -b 2 -
O.
does
in the
and consider the
= 0
The necessary and sufficient conditions for
K
H
K
can be seen.
to be on the boundary
21
It is
clear from the equation Kad -b2
positive definite.
any
As
H
such that the diagonal entries of
are positive we could find an
such that
of the
in
[
A
(2X2
similar locus arises when
A
contains the cone
A,
when
When
-
(2)
n
= 2
is a cone which
and whose size depends
is diagonal, in the manner just described.
A
n >
for all diagonal A
VP
(1)
1:
is set equal to zero.
b
Thus the range of the Lyapunov mapping when
on
be in the range
H
corresponding Lyapunov mapping (see Lemma
1, p. 3] ).
H
positive stable and diagonal)
was large enough to assure that
K
need not be
H
increases, b2 /(ad) must increase. For
K
hermitian
(2 X2)
that
= 0
it is no longer necessary that the
2
characteristic
values be distinct for the matrix to be nonscalar. In general a diagonal positive stable matrix A
where the
Re(X.)
(i
> 0
=
can be denoted by
1, 2,
,
are not necessarily distinct. Now if
is hermitian and
P
= [
p..]
(i, j
=
H
1, 2,
AP
=
"
` ,
+
n),
PA
n)
H
and the
=
[h..]
ij
diag
X
(X1,X2,
.(i =1, 2,
(i, j
= 1,
2,
for some positive definite
we have
,
,
,
n)
°, n)
Xn),
22
hllh12.
hln
h21h22
h2n
hnlhn2
hnn
..
0
A2
0
p
X
0
0
0
X
n
+-X
(X.
h..
3.3
Hence the
(i, j)
_
P21P22
p2n
Pn1Pn2
pnnr
p1 n
P21P22
P2n
Pn1Pn2
Pnn
-
-
2+-X.
pin
pl 1p12
(X1 +X.1)p11
(X
pllp12
1)p21
)pn1
(X.
(X
1
+X2)p12
...
+X. 2)p22
..
(ñn +X2)pn2
...
X
0
0
X2
0
0
(X.1
(X2
-
.
0
...
0
X.
n
_
+Xn)pin
+Xn)p2n
(Xn +X
)pnn
n)
(..+X..)p13
(i, j
entry
positive definite matrix P is given by
i
.
=
1, 2,
J
of the
23
pij
1J
-
j)
(X. +
=
-1
h..
1J
J
i
(i, j
=
1, 2,
.
n).
.
One of the well known conditions for positive definiteness applied to
would give a system of requirements that
P
H
must satisfy
to be in the range of the Lyapunov mapping.
To get more information we now change the notation as follows:
let
X.
(so
k < n).
i
(i
1, 2,
=
,
Then
characteristic values
be the distinct
k)
is
A
similar to
a
A
of
block diagonal matrix, each
diagonal block of which is a scalar matrix, with distinct blocks cor-
responding to distinct scalars. Thus we may assume
in this form.
(i
P
=
1, 2,
and
,
H
Let the
k,
where
diagonal block of
m.
i
->
for
m.
J
H
P.
,
1J
is in the cone
[A
H..
=
for some positive definite
are each
]
kP
(i,
(i,
j = 1, 2,
dimensions
,k)
m. Xm..
1
if and only if
j = 1, 2,
Note that the
P.
of
m.
and let
)
.
j)Pij
J
iJ
(X. +
i < j
l
H..
and
1J
have order
A
P.131.
and H.J
il
be partitioned into blocks
respectively, where
We find that
ith
itself is
A
m1
.
X
,
k)
m1
leading
principal submatrix
H11 -
is positive definite, since
)P11
(X1 +
1
X1
+
X1
> 0
and
P11
is the
leading principal submatrix of a positive definite matrix.
mXm
It is
easily
J
24
verified that thus
H
has at least
values. We recall that ml
m1
>
-
m.
theorem
(i
of
1, 2,
=
,
positive characteristic
m1
is the multiplicity of
N.1
and that
This result is a special case of a
k).
P. Stein and A. M Pfeffer.
That theorem is given
next without proof in the present author's wording.
Theorem
Let A
5.
be an
positive stable matrix with
nXn
m
equal to the maximum number of Jordan blocks associated with
one
characteristic value.
If
H
has at least
m
with at least
similar to
A
m
P
=
is positive definite, then
AP
+
PA*
positive roots; further, for every hermitian
H
positive characteristic values there exists a
B
and a positive definite
H
P
=BP+ PB*
such that
.
2/
Mentioned by O. Taussky in an address at the Argonne Matrix
Conference March 30 - 3 1, 1967; to appear soon in the Bulletin
of the International Computation Centre.
25
[A I
IV.
FOR A GENERAL
In this section we use
[A161,
when
Theorem
this case we hoped that
A
to characterize the cone
general positive stable matrix.
is a 2X 2
A
4
2X2
[A]-1H might be familiar and thus
might be able to generalize our results to larger
were unable to see any such generalization
and thus we feel that Theorem
characterizing the cone [A ]
4
of the
In
we
(n > 3).
n
We
results obtained,
is limited as a practical method of
in the general nXn
W
case if
n > 3.
We now
ated with
matrix.
[
A
proceed to find the inverse
]
where
,
is a general
A
4X4
of a
2X
2
matrix associ-
positive stable
Let
a
A
=
d
c
be a positive stable matrix.
the basis
{
e.
®
f,
:
i,
The matrix of
j = 1, 2,
,
n
}
[ A. ]
(ordered standard lexicograph-
J
ically) then will be
A®
I
+
I® A
with respect to
a+a
b
b
0
c
a+d
0
b
c
0
d+a
b
0
c
c
d+d
=
26
We will use the adjoint method to find the
inverse of A® I
+
I®A.
The adjoint matrix is defined to be the transpose of the cofactor
matrix, and thus
[AQI+ I© A]
The factor
ad -bc
-1
=
= [
det(AQI
+
IQA)]
adj(A® I
occurs in many
det A
of
I®A)
+
the components of
the adjoint matrix; therefore, we will use the substitution
A
also find that expressions
We
ad -bc.
=
of the
form
A -L
occur frequently.
Thus a substitution of the form A
transpose
such that
A'®
I© A'
I +
siderably.
a
and
d'
=
A)
of
d
=
A ®I + I ®A,
We note
E
A'
=
det A'
AQI + IQA,
FaT
b
c
d'
so we let
a'
=
a -E
and
Then set
=
a'd'-bc. Before solving the equation
=a+d+a+d
(not the
is real and such that
A'
=
we let
T
A'
would also simplify the adjoint con-
is pure imaginary.
A'
so that
=
-
that subtracting the same imaginary number from
does not change
where
d -E,
0'
A(a +d) +L(d +a)
and
=trA+trA
= X
+
µ
+X+µ>
>
A'
for
E,
27
where
since
D'
=
and
X
of
A.
Now,
is pure imaginary we find by routine calculation that
E
A'
are the characteristic values
µ
only if
E
-
A-0
T
(which is obviously pure imaginary).
A
Conversely, if
T
then the same calculation shows that
E,
A'
=
A'
Using this value for
.
we see that indeed
A'
and hence that
®I
[
A'
+I
®A'
=
[A]
]
=
A
a..
(i, j
=
+I®A
We will now
.
components of the adjoint of A®
denote them
®I
1, 2, 3,
repeatedly use the fact that
A'
I + I®A
(=
A' © I
+
I ®A'
)
and
(During these calculations we
4).
=
determine the sixteen
A'
To simplify the notation
)
.
during the calculation we will drop primes until otherwise specified.
all
-
_
a+d)(d+a)(d+d)
-
bc(a+d)
-
(ad-bc)(a+d)+ (ad-cb)(d+a)
(a+d)
_ (A
+ A
+dd)T
(d+a)
+
dd(a+d
+
cb(d+a)
+
dd(a+d
a+d)
+
a+d)
28
a12
{b[ (d+a)(d+d) - cb]
= -
=
-b[ (d 2+ad + ad + dd - cb
=
-bdT
cb)
+
ad
-
ad]
=
b(-cb)
,
=b[ -cb
a14
+
}
-0+d(d+a+a+d)]
_
a13
cbb
+
-
-
(a+d) b(d+d)
(ad
+
+ad+dd+d2+cb)+ad
b[
=
-bd
=
-[ -bb(d+a) - (a+d) bb]
=
bbT.
-
+
-
ad]
A+
=
T
cb2
,
The remaining components are found in analogous fashion.
simplify to the following
a21
=
a22
=
a23
=
a24
=
-cdT,
(/A+ ad)T
bcT
-abr
,
,
:
They
29
=
-cdT,
a32
=
cbT,
a33
=
(ad+A)T,
a34
=
-abT,
a41
= CCT,
a
=
-caT,
=
-ac T,
a
a
31
42
43
a44
If we
=
(A+
denote the determinant of
D
We can see
Re(µ)
aa)T
> O.
= (X +
from
X)(X +
(4. 1)
that
A ®I + I ®A
µ)(µ
+
X)(µ+
by
we have
D,
(4.1)
).
is positive since
Re(X)
> 0
Thus the inverse of the matrix A ©I+ I©A
is
D
A+ dd
-bd
-db
bb
-cd
A+ad
bc
-ab
-cd
cb
A+ad
-ab
cc
-ca
-ac
T
D
and
A+aa
amp
and we can now find
[A]
1H.
To do this we
order
H
(standard)
30
lexicographically, getting a
perform the operation (AOI+ I ©A)
We can now
H.
column vector which we denote
4 X1
rearrange the result in the usual
given by the following,
when H
matrix form.
2X 2
and then
1H
[
A
]
-1H
is
w
=
z
ú
w(A+dd) - ubd --u-db
+
zbb
u(A+ad)
-
wcd
-
zab
+
ucb
(A+ad) - wcd - zab
+
ucb
z(A+aa)
-
uca
-
uca
+
wcc
T
D
u-
It
turns out that
[A
where
A
[
A
H
]
]
-1
can be written in the form
H
D (OH +
=
*
AHA'),
(4. 2)
is the transpose of the cofactor matrix of
A.
We note
that
(Á
,.
-dc
-b
rd
-c
rd
a
-b
a
-(A*),
and thus there should be no confusion in the expression (4. 2).
this point we restore the primes and consider the conditions
must satisfy to be in the cone
[
A
]
1E1
[A
]P
.
(4. 2)
D (A'H
=
D
=
ti
D{(A'-E )H+A.HA"+EAH
{
A'H
H
becomes
A.'HA' *)
=
+
At
+
(A-EI)H(A'+EI)}
- EHA
.
(4.3)
31
Thus
H is in
definite.
the cone
[
A
]
pif and
only if (4.3) is positive
32
CONDITIONS ON H AND A
TO HE[A]a)
V.
Using the equation
H
=
AP
+
EQUIVALENT
or the related mapping is
PA*
not the only method one might consider to characterize the cone
[
A]0).
In this
one might use to
section (Theorem
four equivalent methods
6) we give
characterize the cone. Some motivation for these
is as follows:
The integral formula in (2) (in Theorem
Bellman
[
2,
Theorem 6, p.
175]
,
6)
is essentially given by
where he assumes the existence
integral, but later fails to discuss that existence as he promises.
of the
The equation in (3) is one used by Stein
[ 11 ]
and by Taussky [13]
in a manner very close to that used here.
motivated by an exercise in Bellman
(4) was
Lemma
5
2, p. 99, Ex. 7]
Qp
by means of its dual cone
all nonzero nonnegative definite matrices
of
of (1) and (4)
means
of
In (5) we
Our
.
is a corrected and simplified version of that exercise, and
characterizes the cone
set
[
(in Theorem 6)
its dual cone
([
A* ] - 1É9'
assume
[A
A
),
the
The equivalence
similarly characterizes
simply observe that
In this section we
B.
(6)'
[
A]
a) by
)
1]
i
and
=
H
[Au.
are given.
We now
33
state and prove Theorem
Theorem
6.
Let
H
5.
be an
nXn hermitian matrix and
positive stable matrix.
nXn
A
be an
Then the following five statements are
equivalent:
(1)
H
=
(2)
AP+ PA*
e
for some positive definite
AtHe- A`tdt
is positive definite
P;
;
S0
(3)
C
(4)
H
=
=
(A+I) -1(A - I)
tr(KH)
A *K +KA
(5)
P- CPC* for some positive definite
H
Proof:
=
where
;
for every nonzero hermitian
> 0
P,
is nonnegative definite
A 1P + P(A
We will show
1)*
that
such that
;
for some positive definite
P.
equivalent to each of the other four
(1) is
statements. To show that
(1) and (2)
inte g
in (2) is
the integral
-114,
[A]
K
are equivalent
we will show
that
and thus equivalence follows from
Theorem 4. To verify this let
P
=
s ° e -AtHe -A
*
t
dt,
(5. 1)
0
where the exponential function
of a
matrix is given by the usual power
34
series, which always converges
Theorem
2,
matrix such that Al
be a nonsingular
Jordan blocks
[
of the
=
2, p. 166]
1AS
S
.
Let
S
is the direct sum of
form
n
0
..
.
0
1
0
.
1
1
=
Then to show that (5.
1)
.
X.
1
Lo
AI+R.i
i
exists it is sufficient to show that the integral
co
P1
=
exists, where
(c. f.
[
H1
=
S
{-Alt}
exp {-Alt}H1 exp
`
`J
dt
0
1HS
1
I\,
since
4, Theorem 10. 2, p. 196])
exp
{
-(SAS- 1) }= S(exp{ -AIDS
and hence one easily sees that the
given integral equals
co
S
1
exp {-A.lt}H1
exp(-
and that
SH 1S*.
y
m
t}dt S',
JO
i. e.
,
P
=
SP 1S
H
=
Let
H1
be
partitioned
in such a way as to make the block multiplication
exp
{
-Alt
}H1 exp
{
-Al t
}
defined
Then the
(i, j)
block of
Pl
1
35
will be given by
oo
(P1)i3 -
g0
('
=
exp{-(XiI+Ri)t } (H1)iexp{-(.J I+RJ^ )t } dt
co
-
exp { - (Ai+j )t } exp { -R it
`
}
(H 1L.exp
{
-R t } dt
.
0
Note that
R.
and
are nilpotent, so the exponentials involving
R.
them are polynomials in t.
blocks of
vergence
and thus the existence of
P1
of
Consequently, the existence of the
integrals
°0k
t exp
J0
{
follow from the con-
form
of the
,
P
-(?.+.)t
}dt.
1
Churchill in [ 3, Theorem
J
1,
absolutely convergent for
p. 171] proves that this
k > 0
and for
Re(X. +X.
integral is
)
> 0
Since
.
3
we know that the
stability
of
A
guarantees the uniqueness
of
P
(Lemma 3), all that remains is to show that the integral formula does
in fact yield the solution.
Calculating, we have
36
oo
AP
+
PA
=
5(Ae -At He-A
e-AtH -A >tA
t
e
)dt
0
d
_
s:-
=
lim
T
=
We used the
e
(
dt
-
[
AtH
e -A t ) dt
e-AtH e-Ay4t]
0
co
H
T
.
fact that
e
-AT
which is verified in
-i
[
as
0
2, p. 241
We next show (1) is
T
--
oo,
-2421.
equivalent to
(3)
using the equivalent
transformations
(A - I)
C
=
(A + I) -
A
=
(I-C) 1(I+C)
1
and
.
To show that (1) implies (3) we need only substitute for
terms
of
C
by the above
transformation, and we get
A
given in
37
H
=
(I-C)-1 (I+C)P
+
P(I+C `)(I-C
-1
r)
1
where
(I-C)P (I+C*)] (I-C*)
=
(I-C)-1[ (I+C)P(I-C*)
=
(I-C)-1[ 2P
=
2(I-C)-1P(I-C F)-1-C 2(I-C)-1P(I-Cry4)-IC*
+
2CPC*] (I-C>;.)-
-
=P1
-CP1C,
P1
2(I- C) -1P[ (I -C) -1]
=
fact that
We used the
C
1
and thus is positive definite.
commutes with
assume
for some positive definite
(3)
implies
P
and make the substitution given above.
H
(1) we
=
P
=
(A +I)
=
(A +I)
-
(A+I)-1(A-I) P
P1
=
=
P -CPC
This yields
(A `-I)(A ' + I )
-1[ (A +I)P(A +I) -(A -I) P (A -I)] (A *+I)
1
[
2AP+ 2PA.*]
=AP1 +PlA
where
H
(I-C)-1. To show that
1
(A.,, +I) -1
,
2(A +I) 1P[ (A +I) -
and is thus positive definite.
]
Before we establish the equivalence of parts
(1) and (4), we
state and prove an essential lemma.
Lemma
5.
Let
P
be an
nXn
hermitian matrix. Then P
positive definite if and only if tr(PB)>
0
for every nonzero
is
38
nonnegative definite (hermitian)
Proof:
tr[
,,
(T
We
B.
first note that for nonsingular T,
PT)B]
=
tr[ P(TBT''`)]
only consider diagonal
Thus in proving the lemma we need
.
P,
since every hermitian matrix is con-
junctive with a diagonal matrix, and a matrix is nonzero nonnegative
definite if and only if every matrix conjunctive with it is nonzero
nonnegative definite. Let
definite, and
B
= [
b..]
iJ
definite. Then
X..
> 0
P
(i, j,
(i
=
i
with
b..u
>
=
diag
1, 2,
=
1, 2,
for at least one
O
(X1, X2,
n),
be positive
Xn)
be nonzero nonnegative
n)
,
,
,
b..ii
and
-> 0
(i= 1,2,
,n)
Thus
i.
n
tr(PB)
=
)
L
X.b.. >
1 ii
0
1=1
for any nonzero nonnegative definite
B.
for every nonzero nonnegative definite
(i, j, k
= 1,
2,
'
,
n)
be the n
1
0
Then the
Bk
Now
B.
assume that tr(PB)>
Let
Bk
matrices with elements
if
i
= j =
k
otherwise
are nonzero nonnegative definite and
tr(PBk) = Xk
(k
= 1,
2,
,
n)
.
bi'J , k
0
39
Therefore
for
Ak > 0
k
,n,
1, 2,
=
and it follows that
P
is positive definite.
We
are now ready to show that
we must show that
such that
tr(KH)
for some positive definite
Note that
P.
for some positive definite
tr [ K(AP
tr
=
To do this
for every nonzero hermitian
> 0
and thus is nonzero for all nonzero
=
implies (4).
is nonnegative definite, whenever
A K + KA
tr(KH)
(1)
(KAP
+
P.
K.
A
H
=
is
KA
Suppose that
K
AP +PA*
K
H
=AP +PA
Then
PAN
+
KPA
*
)
=
tr(KAP)
+
tr(KPA
=
tr (KAP)
+
tr
(A
)
KP)
=tr[(AK+KA)P]
(Lemma
> 0
whenever
(4)
A K + KA
implies (1),
such that
Then
H
let
A K
+
A
P,
= [
]
is nonzero nonnegative definite.
tr(KH)
KA
5)
> 0
for every nonzero hermitian
is nonnegative definite.
and so
To show that
Let
P
=
[A]
K
1H.
40
tr(KH)
=
tr[ (AP
=
tr (APK)
+
tr (PA
=
tr(PKA)
+
tr(PA K)
=
tr [ P(KA
+ A JFK)]
=
tr[ P(A *K
PA*)K]
+
+
KA)]
K)
.
For every nonzero nonnegative definite
hermitian)
AK
+
KA
namely
K,
=
[A ]K
tr(PB)
=
=
K
[A*] -1B,
a (nonzero
such that
and hence such that
B,
tr[ P(A*K
It follows from Lemma
=
there exists
B,
5
+
KA)]
that
=
tr(KH)
P
> 0
.
is positive definite, which com-
pletes this part of the proof.
To show (1) implies (5), let (1) be given.
H
=
AP
=
(A - 1A)
=
where
if we
+
PA
PA
A-1P1
P1
=
assume
Then
APA
+AP(AA'-1)
P1(A-1),*.
and is therefore positive definite.
(5) is given, we have
Similarly,
41
H
=
A- 1P
=
A-1P[ (A,.)-lA*]
P(A
+
=AP1+ PIA
where
(5)
P1
implies
=
A
-1
P(A
)
*
+
(AA-1)P(A-1)*
',
-1
)
*
and is thus positive definite. To show that
(1) we could have
applied the part already proven ((1)
implies (5)) to the positive stable matrix
proof of Theorem
5.
A
-1
.
This completes the
42
BIBLIOGRAPHY
1.
Ballantine, C.S. Associate Professor, Department of Mathematics. A note on the matrix equation H = AP + PA *.
Corvallis, Oregon State University, 1967. 16 p.
(Unpublished typescript)
2.
Bellman, Richard. Introduction to matrix analysis.
McGraw -Hill, 1960. 328 p.
3.
Churchill, Ruel
4.
Finkbeiner, Daniel T. Introduction to matrices and linear
transformations. San Francisco, W. H. Freeman, 1960.
Operational mathematics.
York, McGraw-Hill, 1958. 337 p.
V.
New York,
2d ed.
New
246 p.
5.
Givens, Wallace. Elementary divisors and some properties of
the Lyapunov mapping X -- AX + XA *. 1961. 80 p.
(U.S. Atomic Energy Commission. ANL -6456)
6.
Hoffman, Kenneth and Ray Kunze. Linear algebra.
Englewood Cliffs, Prentice -Hall, 1964. 322 p.
7.
Lang, Serge.
2d ed.
Linear algebra. Reading, Addison -Wesley, 1966.
294 p.
Probleme general dè la stabilite du mouvement.
Princeton, N.J., Princeton Univérsity, 1947. 270 p. (Annals
of Mathematics Studies, no. 17; Reprint of French translation of Russian original)
8.
Lyapunov, A. M.
9.
MacDuffee, C.C.
The theory of matrices.
(Ergebnisse
1933.
Grenzgebiete, vol. 2, no.
110 p.
der
Berlin, Springer,
Mathematik und ihrer
5)
10.
Ostrowski, Alexander and Hans Schneider. Some theorems on
the inertia of general matrices. Journal of Mathematical
Analysis and Applications 4: 72-84. 196 2.
11.
Stein, P.
ranges of two functions of positive definite
matrices. Journal of Algebra 2: 350 -353. 1965.
On the
43
12.
Taussky, Olga. A generalization of a theorem of Lyapunov.
Journal of the Society of Industrial and Applied Mathematics
9: 640-643. 1961.
Matrices
13.
Algebra
5-10.
C
with
Cri
--
0.
Journal of
1964.
of
A remark on a theorem of Lyapunov. Journal
Mathematical Analysis and Applications 2: 105-107. 1961.
of
Research problems: matrix theory. Bulletin
the American Mathematical Society 71: 711. 1965.
14.
15.
16.
1:
Taus sky, Olga and H. Wielandt. On the matrix function AX +XA'.
Archive for Rational Mechanics and Analysis 9: 93 -96. 1962.
Download