On The Computation of Matrix Function having Mixed

advertisement
Computing matrix functions using Vandermonde matrix and
Lagrange- Sylvester’s method
Mohamed A. Ramadan * a and Adel A. El-Sayed
*b
* a Department of Mathematics, Faculty of Science, Menoufia University, Egypt
*b Department of Mathematics, Faculty of Science, Fayoum University, Egypt
Abstract
Recently, computing the matrix functions ๐‘“(A) of a square matrix A with distinct real
eigenvalues is studied using Vandermonde matrix and Lagrange-Sylvester’s definition for square
matrices. These methods concentrate only on matrices having real eigenvlues.
In this paper,
first we discuss the computation of the matrix functions ๐‘“(๐ด) of square matrices having complex
eigenvalues using Vandermonde matrix and Lagrange-Sylvester’s definition. Secondly, we apply
Sylvester’s definition on square matrices having complex or mixed eigenvalues. In addition, we
apply Sylvester’s definition on some important types of square matrices (Block diagonal
matrices) which help us in calculating two forms of matrix functions ๐‘“(๐ด) and drive a new
formula of Sylvester’s definition to compute matrix functions of a square matrix A having
repeated real eigenvalues. Finally, the proposed methods are tested on several problems to
illustrate their applicability and accuracy.
Keywords: Matrix functions, Vandermonde matrix, Lagrange-Sylvester’s definition, Sylvester’s
definition, Block matrices, a square root of matrix, a square root of complex
number.
1. Introduction
There are many studies for computing matrix functions ๐‘“(๐ด) , see [6-9] of square matrices
having real eigenvalues, however using these methods to compute matrix functions of square
matrices having complex eigenvalues is not studied.
In this paper, first we discuss the
computation of the matrix functions ๐‘“(๐ด) for square matrices having complex eigenvalues using
Vandermonde matrix and Lagrange-Sylvester’s definition. Secondly, we apply Sylvester’s
definition on some important types of matrices (diagonal matrices see [1, 8]) which help us in
calculating two forms of matrix functions ๐‘“(๐ด). In case of large block diagonal matrices, we
compute ๐‘“(๐ด) by computing matrix functions of square matrices of a main diagonal only, see [8].
______________________________________________________________________________
1
Corresponding Author: Mohamed A. Ramadan: mramadan @eun.eg ; ramadanmohamed13@yahoo.com
Generally, we consider the problem of the computation of ๐‘“(๐ด) where A is an n × n matrix and
and ๐‘“ is a given analytic function in a certain domain of complex plane containing the spectrum
of A that we denote σ(A) = {λ1 , λ2 , โ‹ฏ , λn } where the eigenvalues λ1 , λ2 , โ‹ฏ , λn may be of the
following forms: all eigenvalues are pure complex, mixed (complex and distinct real), all are real
and distinct, all are real and repeated, some are repeated eigeanvalues.
This paper is organized as follows: In section 2, we give some important definitions for
computing matrix functions of square matrices [2-9]. In section 3, we present a generalization to
Vandermonde matrix for computing matrix functions of square matrices having pure complex
eigenvalues and illustrate the applicability of this method by an example. In section 4, an
extension of Lagrange- Sylvester’s definition for square matrices having pure complex
eigenvalues and illustrate the applicability and accuracy of this method by giving some examples.
In section 5, we extend Sylvester’s definition for square matrices having mixed eigenvalues and
also illustrate the accuracy of this method by considering some different numerical examples. In
section 6, a new formula of Sylvester’s definition to compute ๐‘“(๐ด) in case of repeated real
eigenvalues of a square matrix A is deduced. The applicability and the accuracy of this technique
is illustrated by considering some numerical examples. Finally, a brief conclusion ends this
paper.
2. Preliminaries
In this section, we introduce some needed definitions for computing matrix functions of
square matrices which will be used to implement our new techniques. The following definitions
are the most generally useful ones:
(I) Vandermonde matrix for computing matrix functions [4]
If ๐‘“(๐‘ง) is a scalar function defined on the spectrum of ๐ด ∈ โ„‚๐‘›×๐‘› , Then,
๐‘›−1
๐‘“(๐ด) = ∑ ๐›ผ๐‘– ๐ด๐‘– = ๐›ผ0 ๐ผ + ๐›ผ1 ๐ด + ๐›ผ2 ๐ด2 + โ‹ฏ + ๐›ผ๐‘›−1 ๐ด๐‘›−1
(2.1)
๐‘–=0
where the constants ๐›ผ0 , ๐›ผ1 , ๐›ผ2 , โ‹ฏ , ๐›ผ๐‘›−1 can be computed by using Vandermonde matrix
1 ๐œ†0
1
๐œ†1
โ‹ฎ
โ‹ฎ
(1 ๐œ†๐‘›−1
๐œ†20
๐œ†12
โ‹ฎ
๐œ†2๐‘›−1
โ‹ฏ
โ‹ฏ
โ‹ฏ
โ‹ฏ
๐›ผ0
๐œ†๐‘›−1
๐‘“(๐œ†0 )
0
๐‘›−1
๐›ผ
๐‘“(๐œ†1 )
1
๐œ†1
( โ‹ฎ )=(
)
โ‹ฎ
โ‹ฎ
๐›ผ๐‘›−1
๐‘“(๐œ†๐‘›−1 )
๐œ†๐‘›−1
๐‘›−1 )
where ๐œ†0 , ๐œ†1 , โ‹ฏ , ๐œ†๐‘›−1 are distinct real eigenvalues of ๐ด.
2
(2.2)
(II) Lagrange-Sylvester interpolation polynomial [5]
Let ๐ด ∈ โ„‚๐‘›×๐‘› be a square matrix, then we denote
๐›น(๐œ†) = (๐œ† − ๐œ†1 )๐‘š1 (๐œ† − ๐œ†2 )๐‘š2 โ‹ฏ (๐œ† − ๐œ†๐‘› )๐‘š๐‘›
(2.3)
to the minimal polynomial of ๐ด (where ๐œ†1 , ๐œ†2 , โ‹ฏ , ๐œ†๐‘› are all the distinct char-acteristic values of
the matrix ๐ด). The degree of this polynomial is ๐‘š = ∑๐‘›๐‘˜=1 ๐‘š๐‘˜ .
If the function ๐‘“(๐œ†) is defined on the spectrum of the matrix ๐ด, then ๐‘“(๐œŽ(๐ด)) = ๐‘”(๐œŽ(๐ด)),
where ๐‘”(๐œ†) is an arbitrary polynomials that assumes on the spectrum of the matrix ๐ด the same
values as does ๐‘“(๐œ†):
๐‘“(๐ด) = ๐‘”(๐ด).
Among all the polynomials with complex coefficients that assume on the spectrum of the
matrix ๐ด the same values as ๐‘“(๐œ†) there is one and only one polynomial ๐‘Ÿ(๐œ†) that is of degree less
than ๐‘š (this polynomial is obtained from any other polynomial having the same spectral values
by taking the remainder on division by ๐›น(๐œ†) of that polynomial. The polynomial ๐‘Ÿ(๐œ†) is
uniquely determined by the interpolation conditions:
๐‘Ÿ(๐œ†๐‘˜ ) = ๐‘“(๐œ†๐‘˜ ), ๐‘Ÿฬ (๐œ†๐‘˜ ) = ๐‘“ฬ (๐œ†๐‘˜ ), โ‹ฏ , ๐‘Ÿ (๐‘š๐‘˜ −1) (๐œ†๐‘˜ ) = ๐‘“ (๐‘š๐‘˜ −1) (๐œ†๐‘˜ ), ๐‘˜ = 1,2, โ‹ฏ , ๐‘›.
(2.4)
We consider the case in which the characteristic equation |๐ด − ๐œ†๐ผ| = 0 has no multiple roots.
The roots of this equation are the characteristic values of the matrix ๐ด which will be denoted by
๐œ†1 , ๐œ†2 , โ‹ฏ , ๐œ†๐‘› .
Then, ๐›น(๐œ†) = |๐ด − ๐œ†๐ผ| = (๐œ† − ๐œ†1 )(๐œ† − ๐œ†2 ) โ‹ฏ (๐œ† − ๐œ†๐‘› ) and conditions in (2.4) can be rewritten
as follows:
๐‘Ÿ(๐œ†๐‘˜ ) = ๐‘“(๐œ†๐‘˜ ),
(๐‘˜ = 1,2, โ‹ฏ , ๐‘›).
(2.5)
In this case, ๐‘Ÿ(๐œ†) is the ordinary Lagrange interpolation polynomial for the function ๐‘“(๐œ†) at the
points ๐œ†1 , ๐œ†2 , โ‹ฏ , ๐œ†๐‘› :
๐‘›
๐‘Ÿ(๐œ†) = ∑
๐‘˜=1
(๐œ† − ๐œ†1 ) โ‹ฏ (๐œ† − ๐œ†๐‘˜−1 )(๐œ† − ๐œ†๐‘˜+1 ) โ‹ฏ (๐œ† − ๐œ†๐‘› )
๐‘“(๐œ†๐‘˜ ).
(๐œ†๐‘˜ − ๐œ†1 ) โ‹ฏ (๐œ†๐‘˜ − ๐œ†๐‘˜−1 )(๐œ†๐‘˜ − ๐œ†๐‘˜+1 ) โ‹ฏ (๐œ†๐‘˜ − ๐œ†๐‘› )
(2.6)
Then, ๐‘“(๐ด) = ๐‘Ÿ(๐ด), this means:
๐‘›
๐‘“(๐ด) = ∑
๐‘˜=1
(๐ด − ๐œ†1 ๐ผ) โ‹ฏ (๐ด − ๐œ†๐‘˜−1 ๐ผ)(๐ด − ๐œ†๐‘˜+1 ๐ผ) โ‹ฏ (๐ด − ๐œ†๐‘› ๐ผ)
๐‘“(๐œ†๐‘˜ ).
(๐œ†๐‘˜ − ๐œ†1 ) โ‹ฏ (๐œ†๐‘˜ − ๐œ†๐‘˜−1 )(๐œ†๐‘˜ − ๐œ†๐‘˜+1 ) โ‹ฏ (๐œ†๐‘˜ − ๐œ†๐‘› )
(2.7)
(III) Sylvester definition for matrix functions [2]
Sylvester, in 1883, proposed the following definition of a matrix function corresponding to the
scalar function ๐‘“ (๐‘ง) as:
3
๐‘›
๐‘“(๐ด) = ∑ ∏
๐‘—=0 ๐‘–≠๐‘—
๐ด − ๐œ†๐‘– ๐ผ
๐‘“(๐œ†๐‘— )
๐œ†๐‘— − ๐œ†๐‘–
(2.8)
where ๐ด ∈ โ„‚๐‘›+1×๐‘›+1 is a square matrix with distinct characteristic roots ๐œ†0 , ๐œ†1 , โ‹ฏ , ๐œ†๐‘› (real). This
definition is a direct extension of the Lagrange interpolation formula for a polynomial ๐‘ (๐‘ง) of
degree ๐‘›, which is applicable only when ๐ด has distinct real roots.
(IV) Buchheim’s definition [3]
In 1886, Buchheim generalized Sylvester’s definition to the case where the characteristic roots
are not necessarily distinct. Buchheim’s definition can be represented as:
n
sj −1
๐‘“(๐ด) = ∑ ∏(A − λi I)si ∑
i=0 i≠j
k=0
dk
๐‘“(z)
[
] (A − λj I)k ,
k
dz ∏h≠j(z − zh )sh
(2.9)
where sj is the multiplicity of λj as a root of the minimum polynomial of A.
(V) M. Dehghan and M. Hajarian definition for matrix functions [2]
Let ๐ด be an (๐‘› + 1)-by- (๐‘› + 1) real matrix where its eigenvalues are not necessarily
distinct, ๐œŽ(๐ด) = {๐œ†0 , ๐œ†1 , โ‹ฏ , ๐œ†๐‘› } where ๐œ†0 ≤ ๐œ†1 ≤ โ‹ฏ ≤ ๐œ†๐‘› , and ๐‘“: โ„‚ → โ„‚ be defined on the
spectrum of ๐ด (๐œŽ(๐ด)) and ๐‘“(๐‘ง) be a scalar analytic defined function at ๐‘ง = ๐œ†๐‘– for
๐‘– = 0, 1, … , ๐‘›. Now they defined the matrix function ๐‘“(๐ด) as follows:
๐‘›
๐‘–−1
๐‘“(๐ด) = ∑ ๐พ[๐œ†0 , ๐œ†1 , … , ๐œ†๐‘– ] ∏(๐ด − ๐œ†๐‘— ๐ผ),
๐‘–=0
(2.10)
๐‘—=0
in which
๐‘“(๐œ†1 ) − ๐‘“(๐œ†0 )
,
๐œ†1 − ๐œ†0
๐‘“ (๐‘˜) (๐œ†๐‘– )
๐พ[๐œ†๐‘– , ๐œ†๐‘–+1 , … , ๐œ†๐‘–+๐‘˜ ] =
,
if ๐œ†๐‘– = ๐œ†๐‘–+๐‘˜
๐‘˜!
๐‘“[๐œ†๐‘–+1 , … , ๐œ†๐‘–+๐‘˜ ] − ๐‘“[๐œ†๐‘– , … , ๐œ†๐‘–+๐‘˜−1 ]
๐พ[๐œ†๐‘– , ๐œ†๐‘–+1 , … , ๐œ†๐‘–+๐‘˜ ] =
, ๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’.
(๐œ†๐‘–+๐‘˜ − ๐œ†๐‘– )
{
๐พ[๐œ†0 , ๐œ†1 ] =
(2.11)
Note that: if all eigenvalues of a square matrix ๐ด ∈ โ„‚๐‘›+1×๐‘›+1 are equal to λ then, the matrix
function takes the form:
๐‘“ฮ„ฮ„(๐œ†)
๐‘“ (๐‘›) (๐œ†)
2
(๐ด − ๐œ†๐ผ) + โ‹ฏ +
(๐ด − ๐œ†๐ผ)๐‘› .
(2.12)
๐‘“(๐ด) = ๐‘“(๐œ†)๐ผ + ๐‘“ฮ„(๐œ†)(๐ด − ๐œ†๐ผ) +
2!
๐‘›!
3. Extension of Vandermonde matrix for computing matrix functions of square matrices
having pure complex eigenvalues
Our purpose in this section is to obtain approximation for computing the matrix functions
๐‘“(๐ด) of square matrices having pure complex eigenvalues. We propose our method as follows:
4
General case:
Let ๐ด ∈ โ„‚๐‘›×๐‘› having pure complex eigenvalues λ0 , λ1 , โ‹ฏ , λn−1 with their complex conjugates
then the matrix function ๐‘“(๐ด) takes the general form:
๐‘›−1
๐‘“(๐ด) = ∑ ๐›พ๐‘– ๐ด๐‘– = ๐›พ0 ๐ผ + ๐›พ1 ๐ด + ๐›พ2 ๐ด2 + โ‹ฏ + ๐›พ๐‘›−1 ๐ด๐‘›−1
(3.1)
๐‘–=0
Now, need to find the constants ๐›พ0 , ๐›พ1 , ๐›พ2 , โ‹ฏ , ๐›พ๐‘›−1 using Vandermonde matrix.
Suppose eigenvalues of ๐ด are ๐œ†๐‘— = ๐›ผ๐‘— + ๐‘–๐›ฝ๐‘— with their complex conjugates
๐‘›−2
๐œ†๐‘—ฬ… = ๐›ผ๐‘— − ๐‘–๐›ฝ๐‘— where ๐›ผ๐‘— , ๐›ฝ๐‘— ∈ โ„, ๐›ฝ๐‘— ≠ 0 and ๐‘— = 0,1, โ‹ฏ ,
.
2
Then, the extension of Vandermonde matrix takes the form:
1
๐œ†0
1 ๐œ†ฬ…0
1
๐œ†1
1
๐œ†1ฬ…
โ‹ฎ
โ‹ฎ
1 ๐œ†๐‘›−2
2
(
1 ๐œ†ฬ…๐‘›−2
2
๐œ†20
(๐œ†ฬ…0 )2
๐œ†12
(๐œ†1ฬ… )2
โ‹ฎ
2
๐œ†๐‘›−2
โ‹ฏ
๐œ†๐‘›−1
0
โ‹ฏ (๐œ†ฬ…0 )๐‘›−1
โ‹ฏ
๐œ†1๐‘›−1
โ‹ฏ
(๐œ†1ฬ… )๐‘›−1
โ‹ฏ
โ‹ฎ
โ‹ฏ (๐œ†๐‘›−2 )๐‘›−1
2
2
(๐œ†ฬ…๐‘›−2 )2
๐›พ0
๐›พ1
๐›พ2
๐›พ3
=
โ‹ฎ
๐›พ๐‘›−2
(๐›พ๐‘›−1 )
โ‹ฏ (๐œ†ฬ…๐‘›−2 )๐‘›−1
2
2
)
๐‘“(๐œ†0 )
๐‘“(๐œ†ฬ…0 )
๐‘“(๐œ†1 )
๐‘“(๐œ†1ฬ… )
โ‹ฎ
๐‘“(๐œ†๐‘›−2 )
(3.2)
2
(
๐‘“(๐œ†ฬ…๐‘›−2 )
2
)
Solving the system of linear equations in (3.2) then, the constants ๐›พ0 , ๐›พ1 , ๐›พ2 , โ‹ฏ , ๐›พ๐‘›−1 can be
found and now, substitute in (3.1) to get ๐‘“(๐ด) .
3.1 Numerical example
1 2
) ∈ โ„‚2×2 , ๐‘“(๐‘ง) = √๐‘ง then compute ๐‘“(๐ด)
−2 1
It is not difficult to obtain the eigenvalues of ๐ด which are {1 + 2๐‘–, 1 − 2๐‘–}
Let ๐ด = (
then , ๐‘“(๐ด) = ๐›พ0 ๐ผ + ๐›พ1 ๐ด
(3.3)
To find the constants ๐›พ0 and ๐›พ1 using extension of Vandermonde matrix as in Eq. (3.2) then,
1
1
(
๐‘“(1 + 2๐‘–)
1 + 2๐‘– ๐›พ0
) (๐›พ ) = (
)
๐‘“(1 − 2๐‘–)
1 − 2๐‘–
1
Since ๐‘“(1 + 2๐‘–) = √1 + 2๐‘–
(3.4)
then, using polar coordinate we can get
๐‘“(1 + 2๐‘–) ≅ 1.27202 + 0.78615๐‘–,
๐‘“(1 − 2๐‘–) ≅ 1.27202 − 0.78615๐‘– ,
then substitute in (3.4) get the matrix equation as the form:
1 1 + 2๐‘– ๐›พ0
1.272 + 0.78615๐‘–
(
) (๐›พ ) = (
)
1 1 − 2๐‘–
1
1.272 − 0.78615๐‘–
(3.5)
Now, solving system of equations (3.5) we get:
๐›พ0 = 0.878945 and ๐›พ1 = 0.393075
(3.6)
5
Then, substitute from Eq. (3.6) in Eq. (3.3) we have:
1
0
๐‘“(๐ด) = 0.878945 (
0
1 2
1.27202
) + 0.393075 (
)≅(
1
−2 1
−0.78615
0.78615
)
1.27202
Note: since ๐‘“(๐ด) = √๐ด then (๐‘“(๐ด))2 = (√๐ด)2 = √๐ด. √๐ด = ๐ด
In this example we test the accuracy of the proposed method:
1.27202
−0.78615
√๐ด. √๐ด = (
0.78615
1.27202
).(
1.27202
−0.78615
0.78615
1 2
)≅(
)=๐ด
−2 1
1.27202
This shows that our technique maintains high accuracy.
4. The Lagrange-Sylvester interpolation polynomial for computing matrix functions of
square matrices having pure complex eigenvalues
Since for computing matrix functions of square matrices having distinct real eigenvalues is
previously investigated (see [5]) and definition (II). Therefore, we need to investigate the
computation of matrix functions of square matrices having pure complex eigenvalues. The
following method is proposed through after several steps:
Method I: General method for computing matrix functions of square matrices having pure
complex eigenvalues
Let ๐ด ∈ โ„‚๐‘›×๐‘› be a square matrix having pure complex eigenvalues, that is the spectrum
of ๐ด(๐œŽ(๐ด)) is of the form ๐œŽ(๐ด) = { ๐œ†1 , ๐œ†1ฬ… , ๐œ†2 , ๐œ†ฬ…2 , โ‹ฏ , ๐œ†๐‘› , ๐œ†ฬ…๐‘› } and ๐‘“(๐‘ง) be a scalar analytic
2
2
function defined on the spectrum of ๐ด . Then,
๐‘›
2
๐‘›
2
(๐ด − ๐œ†ฬ…๐‘˜ ๐ผ)(๐ด − ๐œ†๐‘— ๐ผ)(๐ด − ๐œ†๐‘—ฬ… ๐ผ)
๐‘“(๐ด) = ∑ ∏
๐‘“(๐œ†๐‘˜ )
(๐œ†๐‘˜ − ๐œ†ฬ…๐‘˜ )(๐œ†๐‘˜ − ๐œ†๐‘— )(๐œ†๐‘˜ − ๐œ†๐‘—ฬ… )
๐‘˜=1,3 ๐‘—=1,
๐‘—≠๐‘˜
๐‘›
2
๐‘›
2
+ ∑ ∏
๐‘˜=1,3 ๐‘—=1,
๐‘—≠๐‘˜
๐‘›
๐‘›
2
2
+ ∑ ∏
๐‘˜=2,4 ๐‘—=1,
๐‘—≠๐‘˜
๐‘›
๐‘›
2
2
+ ∑ ∏
๐‘˜=2,4 ๐‘—=1,
๐‘—≠๐‘˜
(๐ด − ๐œ†๐‘˜ ๐ผ)(๐ด − ๐œ†๐‘— ๐ผ)(๐ด − ๐œ†๐‘—ฬ… ๐ผ)
๐‘“(๐œ†ฬ…๐‘˜ )
(๐œ†ฬ…๐‘˜ − ๐œ†ฬ…๐‘˜ )(๐œ†ฬ…๐‘˜ − ๐œ†๐‘— )(๐œ†ฬ…๐‘˜ − ๐œ†๐‘—ฬ… )
(๐ด − ๐œ†๐‘— ๐ผ)(๐ด − ๐œ†๐‘—ฬ… ๐ผ)(๐ด − ๐œ†ฬ…๐‘˜ ๐ผ)
๐‘“(๐œ†๐‘˜ )
(๐œ†๐‘˜ − ๐œ†๐‘— )(๐œ†๐‘˜ − ๐œ†๐‘—ฬ… )(๐œ†๐‘˜ − ๐œ†ฬ…๐‘˜ )
(๐ด − ๐œ†๐‘— ๐ผ)(๐ด − ๐œ†๐‘—ฬ… ๐ผ)(๐ด − ๐œ†๐‘˜ ๐ผ)
๐‘“(๐œ†ฬ…๐‘˜ )
(๐œ†ฬ…๐‘˜ − ๐œ†๐‘— )(๐œ†ฬ…๐‘˜ − ๐œ†๐‘—ฬ… )(๐œ†ฬ…๐‘˜ − ๐œ†๐‘˜ )
6
(4.1)
where:
๐‘›
๐œ†๐‘˜ = ๐›ผ๐‘˜ + ๐‘–๐›ฝ๐‘˜ , ๐›ผ๐‘˜ , ๐›ฝ๐‘˜ ∈ โ„, ๐›ฝ๐‘˜ ≠ 0, ๐‘˜ = 1,2,3, โ‹ฏ , 2,
๐‘›
๐‘› ∈ 2๐‘งฦต+ , ๐‘› ≥ 2, ๐‘— = 1,2,3, โ‹ฏ , 2 , ๐‘˜ ≠ ๐‘—
and ๐œ†ฬ…๐‘˜ is a complex conjugate of ๐œ†๐‘˜ .
4.1 Numerical example
1
2
Let ๐ด = (
0
0
−2
1
0
0
0 0
0 0
) ∈ โ„‚4×4 , ๐‘“(๐‘ง) = √๐‘ง then compute ๐‘“(๐ด)
3 −4
4 3
Eigenvalues [๐ด] = {3 + 4๐‘–, 3 − 4๐‘–, 1 + 2๐‘–, 1 − 2๐‘–}
1
0
Suppose ๐œ†1 = 3 + 4๐‘–; ๐œ†1ฬ… = 3 − 4๐‘–; ๐œ†2 = 1 + 2๐‘–; ๐œ†ฬ…2 = 1 − 2๐‘–; ๐ผ = (
0
0
0
1
0
0
0
0
1
0
0
0
)
0
1
๐‘“[๐‘ง− ]: = ๐‘[√๐‘ง]
Now, using our suggested method I we have:
๐‘“(๐ด) =
๐‘“[๐œ†1 ]
(๐œ†1 − ๐œ†1ฬ… )(๐œ†1 − ๐œ†2 )(๐œ†1 − ๐œ†ฬ…2 )
(๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†2 I). (๐ด − ๐œ†ฬ…2 I)
+
๐‘“[๐œ†1ฬ… ]
(๐ด − ๐œ†1 I). (๐ด − ๐œ†2 I). (๐ด − ๐œ†ฬ…2 I)
(๐œ†1ฬ… − ๐œ†1 )(๐œ†1ฬ… − ๐œ†2 )(๐œ†1ฬ… − ๐œ†ฬ…2 )
+
๐‘“[๐œ†2 ]
(๐ด − ๐œ†1 I). (๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†ฬ…2 I)
(๐œ†2 − ๐œ†1 )(๐œ†2 − ๐œ†1ฬ… )(๐œ†2 − ๐œ†ฬ…2 )
+
๐‘“[๐œ†ฬ…2 ]
(๐ด − ๐œ†1 I). (๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†2 I)
(๐œ†ฬ…2 − ๐œ†1 )(๐œ†ฬ…2 − ๐œ†1ฬ… )(๐œ†ฬ…2 − ๐œ†2 )
= {{1.27202 + 0. ๐‘–, −0.786151 + 0. ๐‘–, 0 + 0. ๐‘–, 0 + 0. ๐‘–},
{0.786151 + 0. ๐‘–, 1.27202 + 0. ๐‘–, 0. ๐‘–, 0 + 0. ๐‘–, 0 + 0. ๐‘–},
{0. ๐‘–, 0 + 0. ๐‘–, 0 + 0. ๐‘–, 2 + 0. ๐‘–, −1 + 0. ๐‘–}, {0 + 0. ๐‘–, 0 + 0. ๐‘–, 1 + 0. ๐‘–, 2 + 0. ๐‘–}}
MatrixForm [%]
1.27202 + 0. ๐‘–
0.786151 + 0. ๐‘–
๐‘“(๐ด) = (
0 + 0. ๐‘–
0 + 0. ๐‘–
−0.786151 + 0. ๐‘–
1.27202 + 0. ๐‘–
0 + 0. ๐‘–
0 + 0. ๐‘–
0 + 0. ๐‘–
0 + 0. ๐‘–
2 + 0. ๐‘–
1 + 0. ๐‘–
7
0 + 0. ๐‘–
0 + 0. ๐‘–
)
−1 + 0. ๐‘–
2 + 0. ๐‘–
It is known that any ๐พ such that K 2 = A, then K is the a square root of matrix A. Now we obtain:
1
2
2
(๐‘“(A)) = (√A) = √A. √A ≅ (
0
0
2
−2
1
0
0
0 0
0 0
)=๐ด
3 −4
4 3
This shows that our method gives a very high accuracy.
5. Extension of Sylvester’s definition for computing matrix functions of square matrices
having mixed (distinct real and pure complex) eigenvalues
Our purpose in this section is to obtain approximation for computing the matrix function
๐‘“(๐ด) for some types of square matrices having mixed eigenvalues using extension Sylvester’s
definition. For Sylvester’s definition (III) as in Eq. (2.8), we extend it in the following two stages
depending on the type of the eigenvalues of a square matrix A:
Stage 5.1
Let A ∈ โ„‚3×3 be a square matrix having mixed eigenvalues (one real and the other is complex
with its complex conjugate). Now, Assume λ1 is real eigenvalue and λ2 is the complex
eigenvalue with its complex conjugate λฬ…2 then,
๐‘“(๐ด) =
(A − λ2 I)(A − λฬ…2 I)
(A − λ1 I)(A − λฬ…2 I)
(A − λ1 I)(A − λ2 I)
๐‘“(๐œ†1 ) +
๐‘“(๐œ†2 ) +
๐‘“(๐œ†ฬ…2 )
(λ1 − λ2 )(λ1 − λฬ…2 )
(λ2 − λ1 )(λ2 − λฬ…2 )
(λฬ…2 − λ1 )(λฬ…2 − λ2 )
(5.1)
Stage 5.2
Let A ∈ โ„‚4×4 be a square matrix having mixed eigenvalues (two real and one complex
eigenvalue with its complex conjugate). Then, suppose A has two real eigenvalues λ1 , λ2 and one
complex eigenvalue λ3 with its complex conjugate λฬ…3 . Then,
๐‘“(A) =
(A − λ2 I)(A − λ3 I)(A − λฬ…3 I)
(A − λ1 I)(A − λ3 I)(A − λฬ…3 I)
๐‘“(λ1 ) +
๐‘“(λ2 )
(λ1 − λ2 )(λ1 − λ3 )(λ1 − λฬ…3 )
(λ2 − λ1 )(λ2 − λ3 )(λ2 − λฬ…3 )
+
(A − λ1 I)(A − λ2 I)(A − λ3 I)
(A − λ1 I)(A − λ2 I)(A − λฬ…3 I)
๐‘“(λ3 ) +
๐‘“(λฬ…3 )
(λ3 − λ1 )(λ3 − λ2 )(λ3 − λฬ…3 )
(λฬ…3 − λ1 )(λฬ…3 − λ2 )(λฬ…3 − λ3 )
(5.2)
Remark: A generalization of this extension of Sylvester’s definition can be easily derived for any
size of block diagonal matrices; also this derivation is applicable for computing matrix functions
of this type of square matrices. Also, we can compute matrix functions for any ๐ด ∈ โ„‚๐‘›×๐‘› having
mixed eigenvalues by follow the same steps as in these stages. This generalization will be
illustrated in the following examples.
8
5.1 Numerical examples
In this section we propose several problems to support our investigated techniques.
Example 5.1
1 −3 11
Let ๐ด = (2 −6 16) ∈ โ„‚3×3 , ๐‘“(๐‘ง) = √๐‘ง then compute ๐‘“(๐ด)
1 −3 7
It is not difficult to obtain σ(๐ด) = {1 + ๐‘–, 1 − ๐‘–, 0}. Suppose ๐œ†1 = 1 + ๐‘–; ๐œ†1ฬ… = 1 − ๐‘–; ๐œ†2 = 0;
1
๐ผ = (0
0
0 0
1 0) ,
0 1
๐‘“[๐‘ง− ]: = ๐‘[√๐‘ง]
Now, using stage 5.1 as in Eq. (5.1) we have:
๐‘“(๐ด) =
๐‘“[๐œ†1ฬ… ]
๐‘“[๐œ†1 ]
(๐ด − ๐œ†1 I). (๐ด − ๐œ†2 I)
(๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†2 I) +
(๐œ†1 − ๐œ†1ฬ… )(๐œ†1 − ๐œ†2 )
(๐œ†1ฬ… − ๐œ†1 )(๐œ†1ฬ… − ๐œ†2 )
+
๐‘“[๐œ†2 ]
(๐ด − ๐œ†1 I). (๐ด − ๐œ†1ฬ… I)
(๐œ†2 − ๐œ†1 )(๐œ†2 − ๐œ†1ฬ… )
−0.832099 + 0. ๐‘–
= ( 0.266585 + 0. ๐‘–
0.45509 + 0. ๐‘–
2.4963 + 0. ๐‘–
−0.799756 + 0. ๐‘–
−1.36527 + 0. ๐‘–
−0.78636 + 0. ๐‘–
5.35066 + 0. ๐‘– )
3.82922 + 0. ๐‘–
Note: if the results are tested then obtain approximately the exact value of ๐ด.
Example 5.2
1
2
Let ๐ด = (
0
0
−2
1
−2
1
3 −4
4 3
) ∈ โ„‚4×4 , ๐‘“(๐‘ง) = √๐‘ง then compute ๐‘“(๐ด)
3 −4
4 3
1
It is not difficult to obtain σ(๐ด) = {2 (7 + ๐‘–√111),
1
2
(7 − ๐‘–√111), 1, 0}. Now, suppose ๐œ†1 =
1
0
(7 + ๐‘–√111); ๐œ†1ฬ… = 2 (7 − ๐‘–√111); ๐œ†2 = 1; ๐œ†3 = 0; ๐ผ = (
2
0
0
1
1
0
1
0
0
0
0
1
0
Now, using stage 5.2 as in Eq. (5.2) we have:
๐‘“(๐ด) =
๐‘“[๐œ†1 ]
(๐œ†1 − ๐œ†1ฬ… )(๐œ†1 − ๐œ†2 )(๐œ†1 − ๐œ†3 )
(๐ด − ๐œ†1ฬ… I)(๐ด − ๐œ†2 I)(๐ด − ๐œ†3 I)
+
๐‘“[๐œ†1ฬ… ]
(๐ด − ๐œ†1 I)(๐ด − ๐œ†2 I)(๐ด − ๐œ†3 I)
(๐œ†1ฬ… − ๐œ†1 )(๐œ†1ฬ… − ๐œ†2 )(๐œ†1ฬ… − ๐œ†3 )
+
๐‘“[๐œ†2 ]
(๐ด − ๐œ†1 I)(๐ด − ๐œ†1ฬ… I)(๐ด − ๐œ†3 I)
(๐œ†2 − ๐œ†1 )(๐œ†2 − ๐œ†1ฬ… )(๐œ†2 − ๐œ†3 )
9
0
0
), ๐‘“[๐‘ง− ]: = ๐‘[√๐‘ง]
0
1
+
๐‘“[๐œ†3 ]
(๐ด − ๐œ†1 I)(๐ด − ๐œ†1ฬ… I)(๐ด − ๐œ†2 I)
(๐œ†3 − ๐œ†1 )(๐œ†3 − ๐œ†1ฬ… )(๐œ†3 − ๐œ†2 )
1.07675 + 0. ๐‘–
1.46716 + 0. ๐‘–
=(
0.142679 + 0. ๐‘–
−0.400979 + 0. ๐‘–
−0.451189 + 0. ๐‘–
0.760639 + 0. ๐‘–
−0.522528 + 0. ๐‘–
0.61796 + 0. ๐‘–
1.30998 + 0. ๐‘–
−0.0368473 + 0. ๐‘–
1.96089 + 0. ๐‘–
1.26498 + 0. ๐‘–
−0.787252 + 0. ๐‘–
1.72203 + 0. ๐‘–
)
−0.831039 + 0. ๐‘–
1.63445 + 0. ๐‘–
It is known that any K such that K 2 = A, then K is the square root of matrix A. Now we obtain:
1
2
(๐‘“(A)) = (√A) = √A. √A = (
3.08428 × 10−16
−1.088 × 10−16
2
2
−2
1
−2
1
3 −4
4 3
)≅๐ด
3 −4
4 3
Note: By comparing the obtained results with the exact value of ๐ด an error 3.08428 × 10−16 is
obtained. This illustrates the applicability of our method and show that this method gives high
accuracy for approximating matrix functions ๐‘“(๐ด) for square matrices having mixed eigenvalues.
Example 5.3
1 −2 0 1
4 1 2 8
Let ๐ด = 2 3 2 7
0 0 0 3
(0 0 0 4
10
6
5 ∈ โ„‚5×5, ๐‘“(๐‘ง) = √๐‘ง then compute ๐‘“(๐ด)
−4
3)
1
1
2
2
It is not difficult to obtain σ(๐ด)={3 + 4๐‘–; 3 − 4๐‘–; (3 + ๐‘–√7); (3 − ๐‘–√7); 1}
1
1
Suppose: ๐œ†1 = 3 + 4๐‘–; ๐œ†1ฬ… = 3 − 4๐‘–; ๐œ†2 = 2 (3 + ๐‘–√7); ๐œ†ฬ…2 = 2 (3 − ๐‘–√7); ๐œ†3 = 1
1
0
๐ผ= 0
0
(0
0
0
๐‘“[๐‘ง− ]: = ๐‘[√๐‘ง]
0 ,
0
1)
Now, using generalization of stage 5.2 then, we have:
๐‘“(๐ด) =
0
1
0
0
0
0
0
1
0
0
0
0
0
1
0
๐‘“[๐œ†1 ](๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†2 I). (๐ด − ๐œ†ฬ…2 I). (๐ด − ๐œ†3 I)
(๐œ†1 − ๐œ†1ฬ… )(๐œ†1 − ๐œ†2 )(๐œ†1 − ๐œ†ฬ…2 )(๐œ†1 − ๐œ†3 )
+
๐‘“[๐œ†1ฬ… ](๐ด − ๐œ†1 I). (๐ด − ๐œ†2 I). (๐ด − ๐œ†ฬ…2 I). (๐ด − ๐œ†3 I)
(๐œ†1ฬ… − ๐œ†1 )(๐œ†ฬ…1 − ๐œ†2 )(๐œ†1ฬ… − ๐œ†ฬ…2 )(๐œ†1ฬ… − ๐œ†3 )
+
๐‘“[๐œ†2 ](๐ด − ๐œ†1 I)(๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†ฬ…2 I). (๐ด − ๐œ†3 I)
(๐œ†2 − ๐œ†1 )(๐œ†2 − ๐œ†1ฬ… )(๐œ†2 − ๐œ†ฬ…2 )(๐œ†2 − ๐œ†3 )
10
+
๐‘“[๐œ†ฬ…2 ](๐ด − ๐œ†1 I). (๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†2 I). (๐ด − ๐œ†3 I)
(๐œ†ฬ…2 − ๐œ†1 )(๐œ†ฬ…2 − ๐œ†1ฬ… )(๐œ†ฬ…2 − ๐œ†2 )(๐œ†ฬ…2 − ๐œ†3 )
+
๐‘“[๐œ†3 ](๐ด − ๐œ†1 I). (๐ด − ๐œ†1ฬ… I). (๐ด − ๐œ†2 I). (๐ด − ๐œ†ฬ…2 I)
(๐œ†3 − ๐œ†1 )(๐œ†3 − ๐œ†1ฬ… )(๐œ†3 − ๐œ†2 )(๐œ†3 − ๐œ†ฬ…2 )
= {{1.53557, −0.889822 + 5.55112 × 10−17 ๐‘–, 0.267787, −0.0603337 − 8.88178 × 10−16 ๐‘–, 2.89255}
−16
, {1.51186, 1.13389 − 1.11022 × 10
๐‘–, 0.755929,2.12949 − 8.88178 × 10−16 ๐‘–,
0.808917 − 4.44089 × 10−16 ๐‘–}, {−0.0474316,1.40168, 0.976284, 0.805245, 1.61564},
{0, 0, 0, 2 + 2.22045 × 10−16 ๐‘–, −1}, {0, 0, 0, 1, 2.22045 × 10−16 ๐‘–}}
2
2
Since ๐‘“(A) = √A this implies (๐‘“(A)) = (√A) = √A. √A = A then, we test the result. Since,
√A. √A = {{1 + 8.3925 × 10−17 ๐‘–, −2 + 2.46975 × 10−16 ๐‘–, 4.95941 × 10−16 + 4.19625 × 10−17 ๐‘–,
1 − 2.24509 × 10−15 ๐‘–, 10 + 1.97052 × 10−15 ๐‘–}, {4 − 1.6785 × 10−16 ๐‘–, 1 −
1.6785 ×
10−16 ๐‘–, 2 − 8.3925 × 10−17 ๐‘–, 8 − 4.33392 × 10−15 ๐‘–, 6 − 4.13742 × 10−16 ๐‘–},
{2,3 − 1.58251 × 10−16 ๐‘–, 2,7 − 1.02401 × 10−15 ๐‘–, 5 − 2.63727 × 10−16 ๐‘–},
{0, 0, 0,3 + 8.88178 × 10−16 ๐‘–, −4 − 4.44089 × 10−16 ๐‘–},
{0, 0, 0, 4 + 4.44089 × 10−16 ๐‘–, 3 + 8.88178 × 10−16 ๐‘–}}
Remarks:
1- By comparing of the obtained results with the exact value of ๐ด in this example the absolute
error of the real part is approximately 4.95941 × 10−16 and the absolute error of the
imaginary part is about 4.19625 × 10−17 . This illustrates the applicability of our method (the
method in stage 2) as in Eq. (2.4.2) and show that this method gives high accuracy for
approximating matrix functions ๐‘“(๐ด) of square matrices having mixed eigenvalues.
2- All computations in this section are done using MATHEMATICA 6 Program.
6. Extension of Sylvester’s definition for computing matrix functions of square matrices
having repeated real eigenvalues
If we use definition (II) directly for computing matrix functions of square matrices having
real repeated eigenvalues we face a problem in computation because the denominator is equal to
zero, although if we use definition (III) as in Eq. (2.9) can compute ๐‘“(๐ด) but this methods need to
high derivatives in case of repeated real eigenvalues many times as in Eq. (2.9). So we try to get
a new technique to ignore zeroes of denominator and high derivatives in the same time. In this
11
section, we use definition (II) and present a new technique for computing matrix functions of
square matrices having special forms with real repeated eigenvalues. This technique is proposed
after several steps and several cases. Now, we have:
Case 6.1: Let A ∈ โ„‚2×2 be a square matrix having one real eigenvalue repeated twice. We study
it in the following two forms:
Case 6.1a: Let A be a diagonal matrix having one real eigenvalue repeated twice.
Assume: A = (
a 0
) ∈ โ„‚2×2 and ๐‘“(z) is analytic function defined on the domain of the
0 a
spectrum of A then,
๐‘“(a)
0
๐‘“(A) = (
)
0
๐‘“(a)
Case 6.1b: Let A be an upper triangular matrix having one real eigenvalue repeated twice.
a b
) ∈ โ„‚2×2 then, consider eigenvalues having the form λ0 = a, λ1 ≅ a +
0 a
Assume: A = (
θ then, we compute ๐‘“(A) by using extension of Sylvester’s definition as:
๐‘“(A) =
(A − λ1 I)
(A − λ0 I)
๐‘“(λ0 ) +
๐‘“(λ1 )
(λ0 − λ1 )
(λ1 − λ0 )
1 0
a b
a
(
) − (a + θ) (
)
(
0 1 ) ๐‘“(λ ) + ( 0
=( 0 a
0
a − (a + θ)
1 0
b
) − (a) (
)
0 1 ) ๐‘“(λ )
a
1
(a + θ) − a
To show how to determine the value of the parameter θ according to the required accuracy,
consider the case of computing the principle square root of a square matrix ๐ด. Thus, we have:
๐‘“(A) = (√a
0
−b√a b√(a + θ)
+
)
θ
θ
√a
(6.1)
Our problem now is how we find the value of the parameter θ.
Since ๐‘“(A) = √A this implies to (๐‘“(A))2 = (√A)2 = √A. √A = A
−b√a b√(a + θ)
))
√A. √A = ( a 2√a( θ +
θ
0
a
Hence,
−b√a
A = (a 2√a( θ +
0
a
and we can obtain:
b√(a+θ)
θ
)) = A = (a b). This implies to 2b√a [√(a+θ)−√a] = b,
θ
0 a
12
(6.2)
√a(a + θ) − a 1
=
.
θ
2
(6.3)
Case 6.2: Let A ∈ โ„‚3×3 be a square matrix having one real eigenvalue repeated three times. We
study it in two forms:
Case 6.2a: Let A be a square diagonal matrix having one real eigenvalue repeated three times.
a
Suppose: A = (0
0
of A then,
0 0
a 0) and ๐‘“(z) is analytic function defined on the domain of the spectrum
0 a
๐‘“(a)
0
0
๐‘“(a)
0 )
๐‘“(A) = ( 0
0
0
๐‘“(a)
Case 6.2b: Let A be an upper triangular square matrix and having one real eigenvalue repeated
three times
a
Suppose: A = (0
0
b 0
a 0) , ๐‘“(z) = √z then, we compute ๐‘“(A) by using extension of Sylvester’s
0 a
definition.
It is clear that the eigenvalues of A are real and equal to {a, a, a}, which cause a problem for us in
computing matrix function by using Sylvester’s definition directly. To solve this problem we
suppose the following assumptions: λ0 = a, λ1 ≅ a + θ and λ2 ≅ a + 2θ.
For some small
real number θ this will be determined later according the required accuracy of ๐‘“(A). Then, we
have:
๐‘“(A) =
(A − λ1 I)(A − λ2 I)
(A − λ0 I)(A − λ2 I)
(A − λ0 I)(A − λ1 I)
๐‘“(λ0 ) +
๐‘“(λ1 ) +
๐‘“(λ2 )
(λ0 − λ1 )(λ0 − λ2 )
(λ1 − λ0 )(λ1 − λ2 )
(λ2 − λ0 )(λ2 − λ1 )
−1 −1 −θ
.
(0
θ 2θ
0
1 −1 0 b
+ .
(0 0
θ θ
0 0
1 1 0 b
+ . (0 0
2θ θ
0 0
=
√a
=
0
(0
b
0
−2θ
b
0
)
.
(
−θ 0
0
−2θ
0 ) √a
0 −θ
0
0
−2θ
0
−2θ
b
0
0) . ( 0
−2θ
0 ) √a + θ
0
0
0
−2θ
0
−θ b
0
0) . ( 0 −θ 0 ) √a + 2θ
0
0
0 −θ
−3b√a 2b√a + θ b√a + 2θ
+
−
2θ
θ
2θ
√a
0
0
0
√ a)
13
(6.4)
Now, our aim is to find the value of real number θ. Since, ๐‘“(A) = √A this implies that
(๐‘“(A))2 = (√A)2 = √A. √A = A . Hence, we can obtain:
−3b√a 2b√a + θ b√a + 2θ
2 √a (
+
−
)=b
2θ
θ
2θ
This implies
4√a + θ − √a + 2θ − 3√a
1
=
θ
√a
(6.5)
Thus, we can obtain value of real number θ from the following Table:
Suppose the repeated eigenvalues are real number and all equal to the number four (a = 4) then,
Table 6.1 Determination the value of the parameter θ
θ
(4√4 + θ − √4 + 2θ − 3√4)⁄θ
1
0.494782167
0.8
0.496411258
0.6
0.497822309
0.4
0.498951388
0.1
0.499925393
0.001
0.5
exact
1× 10−5 0.5
exact
From Table 6.1 we get on θ ∈ (0 , 1] and gives us a high approximation when θ tends to zero.
Note: if the repeated eigenvalue of a square matrix A are any positive real numbers we will get
on θ ∈ ]0,1] and we have a high approximation and accuracy when θ tends to zero.
Case 6.3: Let A ∈ โ„‚3×3 be a square matrix having only two real repeated eigenvalues.
a
Suppose: A = (0
0
0 0
a 0) , ๐‘“(z) = √z then, we compute ๐‘“(A) by using extension of Sylvester’s
0 b
definition.
Since eigenvalues of A are {a, a, b} so we can’t apply Sylvester’s definition directly, but we can
solve this problem by using the following assumption: λ0 = a, λ1 ≅ a + θ and λ2 = b.
14
For some small real number θ will be determined later according the required approximation and
accuracy of ๐‘“(A). Then as previous cases introduced in this section we can obtain ๐‘“(A) and θ ∈
]0,1] .
Now, from cases 1, 2 and 3 a general method for computing matrix functions of square
matrices having not necessarily distinct real eigenvalues can be introduced using an extension of
Sylvester's definition as in the next method:
Method II: General method for computing matrix functions of square matrices having real
repeated eigenvalues
Let ๐ด ∈ โ„‚๐‘›×๐‘› be a square matrix having real repeated eigenvalues as in spectrum of ๐ด(๐œŽ(๐ด))
of the form ๐œŽ(๐ด) = { ๐œ†1 , ๐œ†1 , ๐œ†1 , โ‹ฏ , ๐œ†1 , ๐œ†1 } where ๐œ†1 is repeated ๐‘› times and ๐‘“(๐‘ง) be a scalar
analytic function defined on the spectrum of ๐ด . Then,
๐‘›
๐‘“(๐ด) = ∑ ∏
๐‘—=1 ๐‘–≠๐‘—
๐ด − ๐œ†๐‘– ๐ผ
๐‘“(๐œ†๐‘— )
๐œ†๐‘— − ๐œ†๐‘–
(6.6)
where eigenvalues takes the form:
๐œ†1 , ๐œ†2 = ๐œ†1 + ๐œƒ, ๐œ†3 = ๐œ†1 + 2๐œƒ, ๐œ†4 = ๐œ†1 + 3๐œƒ, โ‹ฏ , ๐œ†๐‘› = ๐œ†1 + (๐‘› − 1)๐œƒ, ๐œƒ ∈]0,1]
(6.7)
and we get high accuracy for matrix functions ๐‘“(๐ด) when ๐œƒ tends to zero. Also this illustrates
the applicability of our method.
Note: If ๐‘“(z) is any another scalar analytic function defined on the complex plane of the
spectrum of A (σ(A)), we can compute matrix functions of square matrices having real repeated
eigenvalues by using extension of Sylvester’s definition as we introduced it in section 6. Also,
the proposed method II is practical in case of large order block diagonal matrices.
6.1 Numerical examples
In this subsection, we present several numerical examples to support our theoretical results
and illustrate the applicability and accuracy of the presented method. We compute ๐‘“(A) in all
possible cases of repeated real eigenvalues for a square matrix A which introduced in this section
Example 6.1
Suppose A = (
4 0
) , ๐‘“(z) = √z then compute ๐‘“(A).
0 4
It’s clear that eigenvalue of a square matrix A is one real and repeated twice then, by using case
6.1a as in Eq. (6.1) then, we have:
15
2 0
) = √A
0 2
๐‘“(A) = (
Example 6.2
4 2
) , ๐‘“(z) = √z then compute ๐‘“(A).
0 4
Let A = (
It is clear that A is of an upper triangular form having one real eigenvalue repeated twice. Then,
using case 6.1b as in Eq. (6.1) and Eq. (6.3) then, we have:
๐‘“(A) = √A = (2
0
2 √4 + θ − 4
)
θ
2
where θ ∈ ]0, 1] and we have a high accuracy as θ tends to zero.
Since, we know ๐‘“(A) = √A implies that (๐‘“(A))2 = (√A)2 = √A. √A = A . This implies
to
2√4+θ−4
θ
2
= 0.5. this implies ๐‘“(A) = √A = (
0
0.5
)
2
Now, the following Table supports our theoretical method.
Table 6.2: Determination the value of the parameter θ
θ
(2√4 + θ − 4)⁄θ
1
0.472135
0.8
0.477225
0.6
0.482536
0.4
0.488088
0.2
0.4939015
0.1
0.4969134
0.001
0.499968
1× 10−4 0.499996
1× 10−5 0.5
exact
Table 6.2 illustrates numerically our theoretical findings. Also if we compute (๐‘“(A))2 , we get
approximately the exact value of A.
16
Example 6.3
Let A = (
4
0
2
) , ๐‘“(z) = ez then compute ๐‘“(A).
4
It is clear this matrix have one real eigenvalue repeated twice so that we use our technique which
presented in case 6.1b as in Eq. (6.3) then, obtain:
4
๐‘“(A) = (e
0
2e4+θ − 2e4
)
θ
4
e
where θ ∈ ]0, 1] and we have the highest approximation when θ tends to zero.
So in this example we have θ = 1 × 10−5 this implies us:
๐‘“(A) = eA = (
54.598150033144236`
0
109.19684604573375`
)
54.598150033144236`
If we compute ๐‘“(A) = eA using definition (V) as in Eq. (2.12), Newton’s method the same result
of ๐‘“(A) can be obtained using our method when θ tends to zero.
Example 6.4
4 2 0
Suppose A = (0 4 0) , ๐‘“(z) = √z then compute ๐‘“(A).
0 0 4
We solve this example using two different methods as follows:
Method 1: We use our method which proposed in case 6.2b. It is not difficult to show the
eigenvalues of A are {4, 4, 4} (one real eigenvalue repeated three times). Also, the square matrix
is an upper triangular matrix then; we use case 6.2b as in Eq. (6.4) and Eq. (6.5) to compute ๐‘“(A)
as the following:
Let λ0 = 4,
λ1 ≅ 4 + θ,
λ2 ≅ 4 + 2θ where θ ∈ ]0, 1]. Then, follow the same steps as in
case 6.2b now, gets the form:
−6
4√4 + θ
−√4 + 2θ
0
0
0
0
θ
θ
θ
)+(
)+(
2 0
0
0
0
0
0
0 2
0
0
0
0
0
4√4 + θ − √4 + 2θ − 6
2
0
θ
=(
)
0
2
0
0
0
2
2
๐‘“(A) = (
0
0
17
0
)
0
0
where θ ∈ ]0, 1] and we have the high accuracy when θ tends to zero. So, in this example we
have θ = 1 × 10−4 which implies to
2 0.5 0
๐‘“(A) = (0 2 0).
0 0 2
Note: We can see that our method is applicable for computing matrix function ๐‘“(A) for a square
matrix having real repeated eigenvalues. The parameter θ is real number belongs to ]0, 1] in all
cases and we obtain a high accuracy when θ tends to zero.
Method 2: We use the method proposed in definition (V) as in Eq. (2.12). Since, eigenvalues are
one real repeated three times, this implies that σ(A) = {4, 4, 4} and ๐‘“(z) = √z then, using Eq.
(2.12) obtain the form:
๐‘“(A) = ๐‘“(4)I + ๐‘“ฬ (4)(A − 4I) +
๐‘“"(4)
(A − 4I)2
2!
1
Since ๐‘“(z) = √z โŸน ๐‘“´(z) = 2 z −1/2 โŸน ๐‘“"(z) =
๐‘“´(4) = 1/4,
−1 −3/2
z
4
then get the values ๐‘“(4) = 2,
๐‘“"(4) = −1/32
Now, we have:
1 0 0
4 2 0
1 0
1 4 2 0
1
๐‘“(A) = 2I + ((0 4 0) − 4 (0 1 0)) −
((0 4 0) − 4 (0 1
4
64
0 0 4
0 0 1
0 0 4
0 0
1 0 2 0
1 0
= 2I + (0 0 0) − (0
4
64
0 0 0
0
2 0
0 2
0 0) . (0 0
0 0
0 0
0
2
0) = (0
0
0
2
0
0))
1
0.5 0
2 0)
0 2
Remark: comparing the two methods we obtain the same result of matrix function, but in method
2 we need to compute derivatives as will as we need to square the matrix(A − 4I). So the
calculation is more difficult than our method. If the repeated eigenvalues are more than three we
need more difficult derivatives and multiplications (see definition (IV), [3]).
Example 2.5.6
4 0
Let A = (0 4
0 0
0
0) , ๐‘“(z) = √z then compute ๐‘“(A).
3
It is not difficult to see that this matrix having three real eigenvalues where only one of them is
repeated twice. So our technique in case 6.3 and case 6.1a as in Eq. (6.4) are used then, we
compute ๐‘“(A) as follows:
Suppose λ0 = 4,
λ1 = 4 + θ and λ2 = 3. Then, we have:
18
0 0 0
2 0 0
2 0 0
0
0
0
๐‘“(A) = (0 2 0) + 0I + (
) = (0 2 0 ).
0 0 0
0 0 √3
0 0 √3
Note: although this square matrix having repeated real eigenvalues and unrepeated real
eigenvalue, but our proposed method enabled us to compute the matrix functions ๐‘“(๐ด) for this
type of matrices.
Example 6.7
1
0
12 1
3 14
4
5
5
6
Let ๐ด =
6
7
7
6
8
9
9 10
(10 1
0
0
1
6
0
6
9
10
1
2
0
0
0
1
0
9
10
1
2
44
0
0
0
0
1
10
1
2
4
8
0
0
0
0
0
1
2
3
4
8
0
0
0
0
0
0
1
4
6
6
0
0
0
0
0
0
0
1
6
8
0
0
0
0
0
0
0
0
1
8
0
0
0
0
0
and ๐‘“(๐‘ง) = ๐‘’ ๐‘ง then compute ๐‘“(๐ด).
0
0
0
0
1)
The eigenvalues can be obtained ๐œŽ(๐ด) = {1,1,1, โ‹ฏ ,1}.
Using our suggested method (method II) then, as in Eq. (6.7) suppose:
๐œ†1 , ๐œ†2 = ๐œ†1 + ๐œƒ, ๐œ†3 = ๐œ†1 + 2๐œƒ, ๐œ†4 = ๐œ†1 + 3๐œƒ, โ‹ฏ , ๐œ†๐‘› = ๐œ†1 + 9๐œƒ, in this example take ๐œƒ = 0.01
then, we have:
๐œ†1 = 1,
๐œ†2 = 1.01,
๐œ†6 = 1.05,
๐œ†7 = 1.06,
๐œ†3 = 1.02,
๐œ†8 = 1.07,
๐œ†4 = 1.03,
๐œ†9 = 1.08
๐œ†5 = 1.04,
๐œ†10 = 1.09.
By applying method II, we can obtain:
10
๐‘“(๐ด) = ∑ ∏
๐‘—=1 ๐‘–≠๐‘—
=
๐‘“(๐œ†๐‘— )
(๐ด − ๐œ†๐‘– ๐ผ)
๐œ†๐‘— − ๐œ†๐‘–
๐‘“(๐œ†1 )
(๐ด − ๐œ†2 ๐ผ)(๐ด − ๐œ†3 ๐ผ) โ‹ฏ (๐ด − ๐œ†10 ๐ผ)
(๐œ†1 − ๐œ†2 )(๐œ†1 − ๐œ†3 ) โ‹ฏ (๐œ†1 − ๐œ†10 )
+
๐‘“(๐œ†2 )
(๐ด − ๐œ†1 ๐ผ)(๐ด − ๐œ†3 ๐ผ) โ‹ฏ (๐ด − ๐œ†10 ๐ผ)
(๐œ†2 − ๐œ†1 )(๐œ†2 − ๐œ†3 ) โ‹ฏ (๐œ†2 − ๐œ†10 )
+
๐‘“(๐œ†3 )
(๐ด − ๐œ†1 ๐ผ)(๐ด − ๐œ†2 ๐ผ)(๐ด − ๐œ†4 ๐ผ) โ‹ฏ (๐ด − ๐œ†10 ๐ผ)
(๐œ†3 − ๐œ†1 )(๐œ†3 − ๐œ†2 ) โ‹ฏ (๐œ†3 − ๐œ†10 )
+โ‹ฏ+
๐‘“(๐œ†10 )
(๐ด − ๐œ†1 ๐ผ)(๐ด − ๐œ†2 ๐ผ) โ‹ฏ (๐ด − ๐œ†9 ๐ผ).
(๐œ†10 − ๐œ†1 )(๐œ†10 − ๐œ†2 ) โ‹ฏ (๐œ†10 − ๐œ†9 )
Now, using MATEMATICA 6 program in computation then, we have:
๐‘“(๐ด) = {{2.71828, 0. , 0. , 0. , 0. , 0. , 0. , 0. , 0. , 0. },
19
{32.6194, 2.71828, 0. , 0. , 0. , 0. , 0. , 0. , 0. , 0. },
{236.491, 38.0559, 2.71828, 0. , 0. , 0. , 0. , 0. , 0. , 0. },
{573.558, 127.759, 16.3097, 2.71828, 0. , 0. , 0. , 0. , 0. , 0. },
{111.45, 16.3097, 0. , 0. , 2.71828, 0. , 0. , 0. , 0. , 0. },
{2400.27, 618.409, 89.7033, 24.4645, 27.1828, 2.71828, 0. , 0. , 0. , 0. },
{3578.13, 1005.76, 171.252, 51.6474, 29.9011, 5.43656, 2.71828, 0. , 0. ,0. },
{5962. , 1896.07, 361.53, 126.4, 87.8911, 19.028, 10.8731, 2.71828, 0. , 0. },
{8320. ,5291.5,1101.46,424.053,290.854,73.3936, 48.9291,16.3097,2.71828,0. },
{120832. , 13280. , 3949.5, 1598.34, 1045.28, 295.389,212.026, 86.985, 21.7463, 2.71828}}
Note: If we compute ๐‘’ ๐ด using M. Dehghan and M. Hajarian, Taylor approximation, Pade
approximation and Schur method [2, 7] then, we have approximately the same results. Hence we
can see that the values computed using these algorithms and method II are the same. Notice that
because eigenvalues of ๐ด are not necessarily distinct, we can not use Sylvester definition directly.
7. Conclusions
In this paper, we presented extension of Vandermonde matrix for computing matrix
functions of square matrices having pure complex eigenvalues. Also, we generalized LagrangeSylvester’s definition for computing matrix functions of square matrices having pure complex
eigenvalues. Moreover, we presented a new technique for computing the function of matrices
which having mixed (real and complex) eigenvalues or having real repeated eigenvalues. The
new methods are tested on several different problems. The obtained results showed and proved
the proposed approaches are applicable and give high accuracy. These methods and techniques
can also be used for computing matrix functions of block diagonal matrices practically.
References
[1] K.M. Abadir and J.R. Mangnus, Matrix Algebra, Cambridge University Press, New York
2005.
[2] M. Dehghan and M. Hajarian, Determination of a matrix function using the divided difference
method of Newton and the interpolation technique of Hermite, J. Comput. Appl. Math. 231
(2009) 67-81.
[3] M. Dehghan and M. Hajarian, Computing matrix functions using mixed interpolation
methods, Math. Comput. Model. 52 (2010) 826-836.
20
[4] G.H. Goulb and C.F. Van Loan, Matrix Computation, Third Edition, The Johns Hopkins
University press, (1996).
[5] F.R. Gantmacher, The theory of Matrices, volume one, Reprinted by the American
Mathematical Society, (2000).
[6] N.J. Higham, Functions of Matrices: Theory and Computation, SIAM, (2008).
[7] C.B. Moler and C.F. Van Loan, Nineteen dubious ways to compute the exponential of a
matrix, SIAM Rev. 20 (1979) 801-836.
[8] J. Stensby, Function of a matrix, EE4481528, Version 1.0 (2010).
[9] C.F. Van Loan, A study of the matrix exponential, MIMS EPrint (2006). 397, Manchester
Institute for Mathematical Sciences, the University of Manchester, UK, November (2006).
21
Download