This article was downloaded by:[Chinese University of Hong Kong]

advertisement
This article was downloaded by:[Chinese University of Hong Kong]
On: 24 November 2007
Access Details: [subscription number 768616791]
Publisher: Taylor & Francis
Informa Ltd Registered in England and Wales Registered Number: 1072954
Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK
Numerical Functional Analysis and
Optimization
Publication details, including instructions for authors and subscription information:
http://www.informaworld.com/smpp/title~content=t713597287
Newton's Method for Underdetermined Systems of
Equations Under the γ-Condition
Jin-Su He a; Jin-Hua Wang b; Chong Li c
a
Department of Mathematics, Zhejiang Normal University, Jinhua, People's
Republic of China
b
College of Sciences, Zhejiang University of Technology, Hangzhou, People's
Republic of China
c
Department of Mathematics, Zhejiang University, Hangzhou, People's Republic of
China
Online Publication Date: 01 May 2007
To cite this Article: He, Jin-Su, Wang, Jin-Hua and Li, Chong (2007) 'Newton's Method for Underdetermined Systems of
Equations Under the γ-Condition', Numerical Functional Analysis and Optimization, 28:5, 663 - 679
To link to this article: DOI: 10.1080/01630560701348509
URL: http://dx.doi.org/10.1080/01630560701348509
PLEASE SCROLL DOWN FOR ARTICLE
Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf
This article maybe used for research, teaching and private study purposes. Any substantial or systematic reproduction,
re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly
forbidden.
The publisher does not give any warranty express or implied or make any representation that the contents will be
complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses should be
independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings,
demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or
arising out of the use of this material.
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Numerical Functional Analysis and Optimization, 28(5–6):663–679, 2007
Copyright © Taylor & Francis Group, LLC
ISSN: 0163-0563 print/1532-2467 online
DOI: 10.1080/01630560701348509
NEWTON’S METHOD FOR UNDERDETERMINED SYSTEMS
OF EQUATIONS UNDER THE -CONDITION
Jin-Su He Department of Mathematics, Zhejiang Normal University,
Jinhua, People’s Republic of China
Jin-Hua Wang College of Sciences, Zhejiang University of Technology,
Hangzhou, People’s Republic of China
Chong Li Department of Mathematics, Zhejiang University, Hangzhou,
People’s Republic of China
The convergence criterion of Newton’s method for underdetermined system of equations under
the -condition is established and the radius of the convergence ball is obtained. Applications to
analytic operator are provided and some results due to Shub and Smale (SIAM J. Numer. Anal.
1996, 33:128–148) are extended and improved.
Keywords Newton’s method; Smale’s point estimate theory; The -condition;
Underdetermined systems of equations.
AMS Subject Classification 65H10; 49M15.
1. INTRODUCTION
Finding solutions of a nonlinear operator equation
f (x) = 0
(1.1)
in a Banach space is a very general subject that is widely studied in both
theoretical and applied areas of mathematics, where f is a nonlinear
operator from a real or complex Banach space E to another F . When
f is Fréchet differentiable, the most important method to find an
Address correspondence to Chong Li, Department of Mathematics, Zhejiang University,
Hangzhou 310027, People’s Republic of China; E-mail: cli@zju.edu.cn
663
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
664
J.-S. He et al.
approximation solution is Newton’s method. One of the famous results on
Newton’s method is the well-known Kantorovich theorem (cf. [7]), which
guarantees convergence of Newton’s sequence to a solution under very
mild conditions. Another important result concerning Newton’s method
is Smale s point estimate theory (cf. [15–17]). The results on estimates
of the radii of convergence balls of Newton’s method are referred to
[19, 22, 23, 26, 27].
Recent interest is focused on finding zeros of singular nonlinear
systems, see, for example, [4–6, 9, 12, 14, 28]. Let n denote the
n-dimensional Euclidean space and let f : n → m be a nonlinear
Fréchet differentiable mapping with its Fréchet derivative denoted by Df .
In this case when Df (x) is singular for some x, Newton’s method can be
generalized to search for zeros of f , using the Moore–Penrose inverse of
the derivative; that is, the generalized Newton’s method is defined by
xn+1 = Nf (xn ) := xn − Df (xn )† f (xn )
for n = 0, 1 (1.2)
Note that a fixed point x ∗ ∈ n of the Newton operator Nf is a least square
solution of the system f (x) = 0, which, in general, is not necessarily a zero
of f . However, in the case when Df (x ∗ ) is surjective, x ∗ is a zero of f if and
only if it is a fixed point of the Newton operator Nf .
In the case when the initial point x0 is such that Df (x0 ) is surjective or
f has zero as a regular value, the convergence results of Newton’s method
(1.2) are obtained in [14] and have been successfully applied to explore
the complexity theoretic aspects of continuation methods for systems of
polynomial equations (cf. [1, 3, 6, 13, 14]). Here we pay special attention
to the convergence results of Newton’s method (1.2) due to Shub and
Smale in [14]. Suppose that n ≥ m and that x ∈ n is such that Df (x) is
surjective. Define
1
Df (x)† Dk f (x) k−1
,
(f , x) = sup k!
k≥2
(f , x) = Df (x)† f (x)
and
(f , x) = (f , x)(f , x)
Recall from [14] that y is a regular value of f if, for each x ∈ f −1 (y), Df (x)
is surjective. Then the main results about the convergence of Newton’s
method (1.2) in [14] are as follows (cf. [14, Theorems 1.4 and 1.7]).
Theorem 1.1. Let x0 ∈ n be such that Df (x0 ) is surjective. There is a
universal constant 0 approximately equal to 71 with this property: if f , x = x0 , are
as above with
(f , x0 ) < 0 ,
(1.3)
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Newton’s Method for Underdetermined Systems
665
then Newton’s method (1.2) with initial point x0 is well-defined for all n ≥ 0 and
convergent to a zero x ∗ of f , and
2n −1
1
xn+1 − xn ≤
x1 − x0 2
for each n = 0, 1, (1.4)
Theorem 1.2. Let f : n → m have zero as a regular value, and define
= max (f , x)
x∈f −1 (0)
Then there is a universal constant C such that if the distance
d(x0 , f −1 (0)) <
C
,
(1.5)
then Newton’s method (1.2) with initial point x0 is well-defined for all n ≥ 0 and
convergent to a zero x ∗ of f , and (1.4) holds.
Remark 1.3. Define a real-valued function by
√ 2
for each t ∈ 0, 1 −
2
(t ) = 1 − 4t + 2t
2
Then, from the proofs of Theorems 1.1 and 1.2 in [14], it is easy to see
that 0 is indeed equal
to the unique root of the equation 2t = (t )2 in
√
the interval (0, 1 − 22 ), while C is the smallest positive root of the equation
t
= 0 . Thus, 0 = 0130716944 and C = 0069778332 (t )2
The purpose of the current paper is to use the notion of the -condition
for nonlinear operators in Banach spaces, which was first introduced and
explored by Wang in [25] for the study of Smale’s point estimate theory,
to investigate the convergence of Newton’s method (1.2) with initial point
x such that Df (x) is surjective. The generalized Smale’s -criterion and
-criterion of Newton’s method (1.2) are established under the -condition.
In particular, when the results obtained in the current paper are applied to
the special case when the operator f is analytic, Theorems 1.1 and 1.2 are
improved in such a way that the criterions (1.3) and (1.5) in Theorems 1.1
and 1.2 are respectively replaced by the weaker conditions (1.6) and (1.7)
below:
√
13 − 3 17
(f , x0 ) ≤
≈ 0157671
(1.6)
4
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
666
J.-S. He et al.
and
u0
,
d(x0 , f −1 (0)) <
(1.7)
00776121 is the smallest positive root of the equation
where u0 =
√
t
13−3 17
=
.
4
(t )2
2. NOTIONS AND PRELIMINARIES
Let A : n → m be a linear operator (or an m × n matrix). Recall
from [2, 10, 18, 20] (see also [11] for infinite dimensional Banach spaces)
that an operator (or an n × m matrix) A † : m → n is the Moore–Penrose
inverse of A if it satisfies the following four equations:
AA † A = A,
A † AA † = A † ,
(AA † )∗ = AA † , (A † A)∗ = A † A,
where A ∗ denotes the adjoint of A. Let ker A and im A denote the kernel
and image of A, respectively. For a subspace E of n , we use E to denote
the projection onto E . Then it is clear (cf. [2, 10, 11, 18, 20]) that
A † A = ker A⊥
and
AA † = im A (2.1)
In particular, in the case when A is surjective, A † = A ∗ (AA ∗ )−1 and
AA † = Im (2.2)
For x ∈ n and a positive number r , let B(x, r ) and B(x, r ) denote
respectively the open metric ball and the closed metric ball at x with
radius r . We assume throughout the whole paper that n ≥ m and that
f : n → m is a nonlinear operator that has continuous second Fréchet
derivative that is denoted by D2 f . The following definition is taken
from [25].
Definition 2.1. Let r > 0 and > 0. Let x0 ∈ n . Then f is said to satisfy
the -condition at x0 in B(x0 , r ) if Df (x0 ) is surjective and
Df (x0 )† D2 f (x) ≤
2
(1 − x − x0 )3
for each x ∈ B(x0 , r )
(2.3)
Below, we state a lemma that will be used in the remainder of this
paper.
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Newton’s Method for Underdetermined Systems
667
√
Lemma 2.2. Let > 0 and 0 < r < 2−2 2 . Suppose that f satisfies the
-condition at x0 in B(x0 , r ). Then, for each point x ∈ B(x0 , r ), Df (x) is surjective
and
Df (x)† Df (x0 ) ≤
(1 − x − x0 )2
1 − 4x − x0 + 22 x − x0 2
(2.4)
Proof. Because
1
Df (x0 )† (Df (x) − Df (x0 )) = Df (x0 )†
D2 f (x0 + (x − x0 ))(x − x0 )d,
0
it follows from (2.3) that
Df (x0 )† (Df (x) − Df (x0 )) ≤
1
Df (x0 )† D2 f (x0 + (x − x0 ))x − x0 d
0
1
2
x − x0 d
3
0 (1 − x − x0 )
1
−1
=
(1 − x − x0 )2
< 1
≤
(2.5)
Hence, from the Banach lemma, it is seen that (In − Df (x0 )† (Df (x0 ) −
Df (x)))−1 exists and
(In − Df (x0 )† (Df (x0 ) − Df (x)))−1 ≤
(1 − x − x0 )2
1 − 4x − x0 + 22 x − x0 2
(2.6)
By (2.1), one has that
In − Df (x0 )† (Df (x0 ) − Df (x)) = ker Df (x0 ) + Df (x0 )† Df (x)
(2.7)
As
Df (x0 )ker Df (x0 ) = 0
and
Df (x0 )Df (x0 )† = Im
(2.8)
thanks to (2.2), we have that Df (x) = Df (x0 )(ker Df (x0 ) + Df (x0 )† Df (x)).
This implies that Df (x) is surjective because Df (x0 ) is surjective and
(ker Df (x0 ) + Df (x0 )† Df (x)) is invertible thanks to (2.7). Note by (2.7) and
(2.8) that
Df (x)† Df (x0 )(In − Df (x0 )† (Df (x0 ) − Df (x)))
= Df (x)† Df (x) = (ker Df (x))⊥ (2.9)
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
668
J.-S. He et al.
It follows from (2.6) that
Df (x)† Df (x0 ) = (ker Df (x))⊥ (In − Df (x0 )† (Df (x0 ) − Df (x)))−1 ≤ (In − Df (x0 )† (Df (x0 ) − Df (x)))−1 ≤
(1 − x − x0 )2
1 − 4x − x0 + 22 x − x0 2
The proof is complete.
3. GENERALIZED -THEORY
The majorizing function h, which is due to Wang [21, 24], will play a
key role in this section. Let > 0 and > 0. Define
h(t ) = − t +
t 2
1 − t
for each 0 ≤ t <
1
(3.1)
Let tn denote the sequence generated by Newton’s method with initial
value t0 = 0 for h, that is,
tn+1 = tn − h (tn )−1 h(tn )
for each n = 0, 1, (3.2)
Then we have the following proposition, which was proved in [21, 24].
√
Proposition 3.1. Suppose that = ≤ 3 − 2 2. Then the following
assertions hold.
(i) The zeros of h are
√
1 + − (1 + )2 − 8
,
r1 =
4
r2 =
1++
√
(1 + )2 − 8
4
(3.3)
and they satisfy
1
1
1 1
≤ r2 ≤ ≤ r1 ≤ 1 + √ ≤ 1 − √
2
2
2 and convergent to r1 .
(ii) tn is monotonic increasing
√
(iii) If = < 3 − 2 2, then
n √
(1 − q 2 ) (1 + )2 − 8
n
tn+1 − tn =
q 2 −1 n −1
n+1 −1
2
2
2(1 − q
)(1 − q
)
(3.4)
for each n = 0, 1, ,
(3.6)
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Newton’s Method for Underdetermined Systems
669
where
√
1 − − (1 + )2 − 8
q=
√
1 − + (1 + )2 − 8
(3.7)
√
1 + − (1 + )2 − 8
=
√
1 + + (1 + )2 − 8
(3.8)
and
Proposition 3.2 below was known in [21, 24], but a more direct proof
was given in [8].
√
Proposition 3.2. Assume that < 3 − 2 2. Then
n √
(1 − q 2 ) (1 + )2 − 8
≤ 1 for each n = 0, 1, 2(1 − q 2n −1 )(1 − q 2n+1 −1 )
(3.9)
Recall that f has continuous second Fréchet derivative. In the
remainder of this section, let x0 ∈ n be such that Df (x0 ) is surjective.
Define
= Df (x0 )† f (x0 )
and
= Theorem 3.3. Let
√
= ≤ 3 − 2 2
√
1+− (1+)2 −8
. Suppose that f satisfies the -condition at x0 in B(x0 , r1 ).
Let r1 =
4
Then Newton’s method (1.2) with initial point x0 is well-defined and the generated
√
sequence xn converges to a zero of f in B(x0 , r1 ). Moreover, if = < 3 − 2 2,
then
xn+1 − xn ≤ q 2
n −1
x1 − x0 for each n = 0, 1, ,
(3.10)
where q is defined by (3.7).
Proof. We will use mathematical induction to prove that
xn+1 − xn ≤ tn+1 − tn
(3.11)
holds for each n = 0, 1, Granting this, because tn is monotonic
increasing and convergent, xn is a Cauchy sequence and hence converges.
Furthermore, this together with (3.6) and (3.9) implies that (3.10) holds.
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
670
J.-S. He et al.
To verify (3.11), noting that t0 = 0 and t1 = , one has that
x1 − x0 = Df (x0 )† f (x0 ) = ≤ t1 − t0 Therefore, (3.11) holds for n = 0. Assume that (3.11) holds for n =
0, 1, , k − 1. Then, we have
xk − x0 ≤ tk < r1 <
1−
√
2
2
(3.12)
Hence, Df (xk ) is surjective by Lemma 2.2 and xk+1 is well-defined.
Furthermore, by (2.4), we have that
Df (xk )† Df (x0 ) ≤
(1 − xk − x0 )2
1 − 4xk − x0 + 22 xk − x0 2
= −h (xk − x0 )−1 ≤ −h (tk )−1 ,
(3.13)
because −h (t )−1 is monotonic increasing on [0, 1 ). Below, we will prove
that (3.11) holds for n = k. Because xk − xk−1 = −Df (xk−1 )† f (xk−1 ) and
Df (xk−1 )Df (xk−1 )† = Im ,
f (xk ) = f (xk ) − f (xk−1 ) − Df (xk−1 )(xk − xk−1 )
1
=
[Df (xk−1 + (xk − xk−1 )) − Df (xk−1 )](xk − xk−1 )d
0
Hence, by (3.12) and (2.3), one has that
Df (x0 )† f (xk ) = Df (x0 )† (f (xk ) − f (xk−1 ) − Df (xk−1 )(xk − xk−1 ))
1
≤
Df (x0 )† [Df (xk−1 + (xk − xk−1 )) − Df (xk−1 )]xk − xk−1 d
0
≤
1
0
≤
≤
0
Df (x0 )† D2 f (xk−1 + s(xk − xk−1 ))xk − xk−1 2 ds d
1
0
0
1
0
0
2
dsxk − xk−1 2 d
(1 − (xk−1 − x0 + sxk − xk−1 ))3
2
ds(tk − tk−1 )2 d
(1 − (tk−1 + s(tk − tk−1 )))3
= h(tk ) − h(tk−1 ) − h (tk−1 )(tk − tk−1 )
= h(tk ),
(3.14)
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
671
Newton’s Method for Underdetermined Systems
where the last equality holds because −h(tk−1 ) − h (tk−1 )(tk − tk−1 ) = 0 by
(3.2) (with n = k). Because Df (x0 )Df (x0 )† = Im , (3.13) and (3.14) imply
that
xk − xk−1 = Df (xk )† f (xk )
≤ Df (xk )† Df (x0 ) · Df (x0 )† f (xk )
≤ −h (tk )−1 h(tk )
= tk+1 − tk Thus, (3.11) holds for n = k and the proof is complete.
4. GENERALIZED -THEORY
Recall that f is a nonlinear operator that has continuous second
Fréchet derivative. Throughout this section, we assume that x ∗ ∈ n is such
that Df (x ∗ ) is surjective. Recall that
√ 2
2
(t ) = 1 − 4t + 2t for each t ∈ 0, 1 −
2
The following lemma estimates the quantity Df (x0 )† f (x0 ), which will
be used in the proof of the main theorem of this section.
√
Lemma 4.1. Let 0 < r ≤ 2−2 2 and let x0 ∈ B(x ∗ , r ). Suppose that f satisfies
the -condition at x ∗ in B(x ∗ , r ). Then Df (x0 ) is surjective and
Df (x0 )† f (x0 ) ≤
1−u
x0 − x ∗ ,
(u)
(4.1)
where u = x0 − x ∗ .
Proof. By Lemma 2.2, Df (x0 ) is surjective and
(1 − u)2
(u)
(4.2)
x0 − x ∗ (1 − u)
(4.3)
Df (x0 )† Df (x ∗ ) ≤
Below we will show that
Df (x ∗ )† f (x0 ) ≤
Granting this, by (4.2) and Df (x ∗ )Df (x ∗ )† = Im , we have that
Df (x0 )† f (x0 ) ≤ Df (x0 )† Df (x ∗ )Df (x ∗ )† f (x0 ) ≤
1−u
x0 − x ∗ (u)
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
672
J.-S. He et al.
and so (4.1) is seen to hold. Observe that
f (x0 ) = f (x0 ) − f (x ∗ ) − Df (x ∗ )(x0 − x ∗ ) + Df (x ∗ )(x0 − x ∗ )
1
=
[Df (x ∗ + (x0 − x ∗ ))(x0 − x ∗ ) − Df (x ∗ )(x0 − x ∗ )]d
0
+ Df (x ∗ )(x0 − x ∗ )
1 =
D2 f (x ∗ + s(x0 − x ∗ ))(x0 − x ∗ )2 dsd + Df (x ∗ )(x0 − x ∗ )
0
0
(4.4)
Thus, by (2.3),
∗ †
1
Df (x ) f (x0 ) ≤
0
Df (x ∗ )+ D2 f (x ∗ + s(x0 − x ∗ ))(x0 − x ∗ )2 ds d
0
+ Df (x ∗ )† Df (x ∗ ) · x0 − x ∗ 1 2
≤
x0 − x ∗ 2 ds d + x0 − x ∗ ∗ )3
(1
−
sx
−
x
0
0
0
x0 − x ∗ =
1−u
and hence, (4.3) holds.
Let ū0 = 0080851 be the smallest positive root of the equation
√
t
= 3 − 2 2
2
(t )
(4.5)
Recall that u = x0 − x ∗ and = Df (x0 )† f (x0 ). Furthermore, set
=
(u)(1 − u)
and
= √
Theorem 4.2. Let r̄ = 2−2 2 and Suppose that f (x ∗ ) = 0 and f satisfies the
-condition at x ∗ in B(x ∗ , r̄ ). If x0 − x ∗ < ū0 , then the sequence xn generated
by Newton’s method (1.2) with initial point x0 converges to a zero of f . Moreover,
xn+1 − xn ≤ (q )2
where
n −1
x1 − x0 √
1 − − (1 + )2 − 8
q =
√
1 − + (1 + )2 − 8
(4.6)
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
673
Newton’s Method for Underdetermined Systems
Proof. By Lemma 4.1, Df (x0 )† is surjective and
= Df (x0 )† f (x0 ) ≤
1−u
1−uu
x0 − x ∗ =
(u)
(u) (4.7)
Then
= ≤
√
u
ū0
<
=
3
−
2
2
(u)2
(ū0 )2
because the function t →
√
[0, 1 − 22 ). Let
r1
=
t
(t )2
1 + −
(4.8)
is strictly monotonic increasing on
√
(1 + )2 − 8
4
(4.9)
Then by (3.4),
≤
r1
1
1 1
≤ 1+ √ ≤ 1− √
2
2 (4.10)
In order that Theorem 3.3 is applicable, we have to show the assertion:
There exists r ≥ r1 such that f satisfies the -condition at x0 in B(x0 , r ).
For this purpose let
r = r̄ − x0 − x ∗ (4.11)
Then, by (4.10)
√ √ √
2 (u)(1 − u)
2 1
2− 2 u
r =
− ≥ 1−
= 1−
≥ r1 2
2
2 Below we show that f satisfies the -condition at x0 in B(x0 , r ). To do this,
let x ∈ B(x0 , r ). Because f satisfies the -condition at x ∗ in B(x ∗ , r̄ ) and x −
x ∗ ≤ x − x0 + x0 − x ∗ ≤ r̄ thanks to (4.11), we obtain that
Df (x ∗ )† D2 f (x) ≤
2
(1 − x − x ∗ )3
(4.12)
Note that Df (x ∗ )Df (x ∗ )† = Im . Thus, using Lemma 2.2 and (4.12), we
conclude that
Df (x0 )† D2 f (x) ≤ Df (x0 )† Df (x ∗ ) · Df (x ∗ )† D2 f (x)
≤
2
(1 − u)2
(u) (1 − x − x ∗ )3
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
674
J.-S. He et al.
≤
(1 − u)3
2
(u)(1 − u) (1 − (x − x0 + x0 − x ∗ ))3
(1 − u)3
(1 − u − x − x0 )3
2
=
(1 − 1−u x − x0 )3
= 2
≤
2
(1 −
x
(u)(1−u)
− x0 )3
2
,
(1 − x − x0 )3
√
2
because 0 < (t ) < 1 for all t ∈ 0, 1 −
. Therefore, f satisfies the
2
-condition at x0 in B(x0 , r ) and the assertion holds. Thus, we apply
Theorem 3.3 to conclude that the sequence xn generated by Newton’s
method (1.2) with initial point x0 converges to a zero of f , and
=
xn+1 − xn ≤ (q )2
n −1
x1 − x0 ,
n = 0, 1, 2, The proof is complete.
5. APPLICATIONS TO ANALYTIC OPERATORS
Throughout this section, we shall always assume that the nonlinear
operator f : n → m is analytic. Let x ∈ n be such that Df (x) is
surjective. Following [14], we define
1
Df (x)† Dk f (x) k−1
(f , x) = sup
k!
k≥2
(5.1)
Also we adopt the convention that (f , x) = ∞ if Df (x) is not surjective.
Note that this definition is justified and in the case when DX (p) is
surjective, by analyticity, (f , x) is finite.
Let x0 ∈ n be such that Df (x0 ) is surjective. Let
= (f , x0 )
(5.2)
The following lemma shows that an analytic operator satisfies the
-condition at x0 in B(x0 , 1 ), the proof of which is easy and so is omitted
here.
Lemma 5.1. Let 0 < r ≤ 1 . Then f satisfies the -condition at x0 in B(x0 , r ).
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Newton’s Method for Underdetermined Systems
675
To study the computational complexity of Newton’s method for
nonlinear operators in Banach spaces, Smale introduced in [15] the notion
of an approximation zero for Newton’s method in 1981. But it was found
that it did not describe completely the property of quadratic convergence
of Newton’s method and was inconvenient for the application in the study
of the computational complexity. Hence, Smale proposed in [16] two kinds
of the modifications of the notion again in 1986, see also [17]. Here we are
interested in the first one, which was also stated in [14].
Definition 5.2. A point x0 ∈ n such that Df (x0 ) is surjective is called
an approximation zero of f if Newton’s method (1.2) with initial point x0
converges to a zero x ∗ of f and (3.10) holds for q = 12 (x ∗ is called the
associated zero).
By Theorem 3.3 and Lemma 5.1, we have the following corollary that
improves Theorem 1.1 (i.e., Shub and Smale [14, Theorem 1.4]). Recall
that x0 ∈ n is such that Df (x0 ) is surjective and recall also that =
Df (x0 )† f (x0 ) and = with = (f , x0 ).
Corollary 5.3. If
√
13 − 3 17
≈ 0157671,
= ≤
4
then x0 is an approximate zero of f .
Proof. Let
r1 =
1+−
√
(1 + )2 − 8
4
Then by (3.4), r1 < 1 . Therefore, it follows from Lemma 5.1 that f satisfies
the -condition at x0 in B(x0 , r1 ). On the other hand,
√
√
13 − 3 17
≤
< 3 − 2 2
4
Hence, using Theorem 3.3, one has that Newton’s method (1.2) with initial
point x0 is well defined and the generated sequence xn converges to a
zero of f in B(x0 , r1 ). Moreover,
xn+1 − xn ≤ q 2
where
n −1
x1 − x0 ,
√
1 − − (1 + )2 − 8
q=
√
1 − + (1 + )2 − 8
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
676
J.-S. He et al.
Therefore,
we have that q ≤ 12 because
q increases as does on
√
√
13−3 17
13−3 17
is 12 . Thus x0 is an approximate
[0, 4 ] and the value of q at =
4
zero of f .
The following definition is taken from [3].
Definition 5.4. Let f : n → m be an operator with continuous Fréchet
derivative. A point y ∈ m is called a regular value of f if, for each x ∈
f −1 (y), Df (x) is surjective.
√
Recall that (t ) = 1 − 4t + 2t 2 for each t ∈ [0, 1 − 22 ). Let u0 =
00776121 be the smallest positive root of the equation
√
t
13 − 3 17
(5.3)
=
(t )2
4
By Theorem 4.2 and Lemma 5.1, one has the following corollary, which
improves Theorem 1.2 (i.e., Shub and Smale [14, Theorem 1.7]).
Corollary 5.5.
Let f : n → m have zero as a regular value and define
= max (f , x)
x∈f −1 (0)
Let x0 ∈ n satisfy
d(x0 , f −1 (0)) <
u0
(5.4)
Then x0 is an approximate zero of f .
Proof. By (5.4), there exists x ∗ ∈ f −1 (0) such that x0 − x ∗ <
x0 − x ∗ < (fu,x0 ∗ ) . Let
=
(f , x ∗ )
(u)(1 − u)
and
u0
;
hence
= ,
where = Df (x0 )† f (x0 ) and u = (f , x ∗ )x0 − x ∗ . √By Lemma 5.1,
2− 2
). Because u0
f satisfies the (f , x ∗ )-condition at x ∗ in B(x ∗ , 2(f
,x ∗ )
determined by (5.3) is less than u0 given by (4.5), Theorem 4.2 is
applicable. Thus we have that Newton’s method (1.2) with the initial point
x0 is well defined and
xn+1 − xn ≤ q 2
n −1
x1 − x0 ,
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Newton’s Method for Underdetermined Systems
where
677
√
1 − − (1 + )2 − 8
q =
√
1 − + (1 + )2 − 8
By Lemma 4.1,
= DX (p0 )† X (p0 ) ≤
It follows that
1−u
x0 − x ∗ (u)
√
u
u0
13 − 3 17
≤
=
,
≤
(u)2
(u0 )2
4
(5.5)
because the function t → (tt )2 is strictly monotonic increasing on
√
[0, 1 − 22 ). Hence, we have that q ≤ 12 and x0 is an approximate zero of f .
We end this paper with a simple example showing that the convergence
is guaranteed by Corollary 5.3 but not by Theorem 1.1.
Example 5.6. Let 2 be endowed with the l∞ -norm and let f : 2 → be given by
f (x) = 1 + t1 +
3 2
1 3
t2 +
t1 + t23
20
300
for each x = (t1 , t2 ) ∈ 2 Then
t2
t2
3t2
Df (x)w = 1 + 1 w1 +
+ 2 w2 for each w ∈ 2 ,
100
10
100
t2
t1
3
D2 f (x)w y =
w1 y1 +
+
w2 y2 for each w, y ∈ 2 ,
50
10 50
1
1
3
w1 y1 z1 +
w2 y2 z2 for each w, y, z ∈ 2 D f (x)w y z =
50
50
and Dk f (x) = 0 for each k ≥ 4. Take x0 = (0, 0). Then we have that
Df (x0 )† = (1, 0)T . Consequently,
= Df (x0 )† f (x0 ) = 1
and
1 Df (x0 )† D2 f (x0 ) Df (x0 )† D3 f (x0 ) 2
, = max = 015
2
3
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
678
J.-S. He et al.
Hence
√
13 − 3 17
= = 015 <
,
4
and Corollary 5.3 is applicable but not Theorem 1.1 because = 015 >
0 = 0130716944 .
ACKNOWLEDGMENTS
This work was supported in part by the National Natural Science
Foundation of China (grant 10671175) and Program for New Century
Excellent Talents in University.
REFERENCES
1. E. Allgower and K. Georg (1993). Continuation and path following. Acta. Numer. 1–64.
2. A. Ben-Israel and T.N.E. Greville (1974). Generalized Inverses: Theory and Applications. John Wiley
& Sons, New York.
3. L. Blum, F. Cucker, M. Shub, and S. Smale (1997). Complexity and Real Computation. SpringerVerlag, New York.
4. J.P. Dedieu and M. Kim (2002). Newton’s method for analytic systems of equations with
constant rank derivatives. J. Complexity 18:187–209.
5. J.P. Dedieu and M. Shub (2000). Newton’s method for overdetermined systems of equations.
Math. Comput. 69:1099–1115.
6. J.P. Dedieu and M. Shub (2000). Multihomogeneous Newton’s method. Math. Comput. 69:
1071–1098.
7. L.V. Kantorovich and G.P. Akilov (1982). Functional Analysis. Pergamon, Oxford.
8. C. Li and J.H. Wang (2006). Newton’s method on Riemannian manifolds: Smale’s point
estimate theory under the -condition. IMA J. Numer. Anal. 26:228–251.
9. C. Li, W.H. Zhang, and X.Q. Jin (2004). Convergence and uniqueness properties of GaussNewton’s method. Comput. Math. Appl. 47:1057–1067.
10. M.Z. Nashed (1976). Generalized Inverse and Applications. Academic Press, New York.
11. M.Z. Nashed (1987). Inner, outer, and generalized inverses in Banach and Hilbert spaces.
Numer. Funct. Anal. Optim. 9:261–325.
12. M.Z. Nashed and X. Chen (1993). Convergence of Newton-like methods for singular operator
equations using outer inverses. Numer. Math. 66:235–257.
13. M. Shub and S. Smale (1993). Complexity of Bezout’s theorem I: Geometric aspects. J. Am.
Math. Soc. 6:459–501.
14. M. Shub and S. Smale (1996). Complexity of Bezout’s theorem IV: Probability of success,
Extensions. SIAM J. Numer. Anal. 33:128–148.
15. S. Smale (1981). The fundamental theorem of algebra and complexity theory. Bull. AMS.
4:1–36.
16. S. Smale (1986). Newton’s method estimates from data at one point. In: The Merging of
Disciplines: New Directions in Pure, Applied and Computational Mathematics (R. Ewing, K. Gross,
and C. Martin, eds.), Springer, New York, pp. 185–196.
17. S. Smale (1997). Complexity theory and numerical analysis. Acta Numer. 6:523–551.
18. G. Stewart and J. Sun (1990). Matrix Perturbation Theory. Academic Press, New York.
19. J.F. Traub and H. Wozniakowski (1979). Convergence and complexity of Newton’s iteration.
J. Assoc. Comput. Math. 29:250–258.
20. G.R. Wang, Y.M. Wei, and S.Z. Qiao (2004). Generalized Inverses: Theory and Computations. Science
Press, Beijing.
Downloaded By: [Chinese University of Hong Kong] At: 08:26 24 November 2007
Newton’s Method for Underdetermined Systems
679
21. X.H. Wang (1999). Convergence of Newton’s method and inverse function theorem in Banach
space. Math. Comput. 225:169–186.
22. X.H. Wang (2000). Convergence of Newton’s method and uniqueness of the solution of
equations in Banach space. IMA J. Numer. Anal. 20:123–134.
23. X.H. Wang (1980). The convergence on Newton’s method. KeXue TongBao (A Special Issue of
Mathematics, Physics and Chemistry). 25:36–37.
24. X.H. Wang and D.F. Han (1990). On the dominating sequence method in the point estimates
and Smale’s theorem. Scientia Sinica Ser. A. 33:135–144.
25. X.H. Wang and D.F. Han (1997). Criterion and Newton’s method. Chinese J. Numer. Appl.
Math. 19:96–105.
26. X.H. Wang and C. Li (2001). Local and global behaviors for algorithms of solving equations.
Chinese Sci. Bull. 46:441–448.
27. X.H. Wang and C. Li (2003). Convergence of Newton’s method and uniqueness of the solution
of equations in Banach spaces. Acta Math. Sinica English Ser. 19:405–412.
28. X.B. Xu and C. Li (2007). Convergence of Newton’s method for systems of equations with
constant rank derivatives. J. Comput. Math. (in press).
Download