Distributed computation o f fixed points ...

advertisement
Proc. Indian Acad. Sci. (Math. SO.), Vol. 106,No. 3, August 1996,pp. 289-300.
t~ Printed in India
Distributed computation of fixed points of ~ - n o n e x p a n s i v e maps
VIVEK S BORKAR
Department of Computer Science and Automation, Indian Institute of Science,
Bangalore 560012, India
MS received 7 August 1995;revised 3 May 1996
Absln~L The distributed implementationof an algorithm for computing fixed points of an
oo-nonexpansivemap is shownto convergeto the set offixedpointsunderverygeneralconditions.
KeywordL Distributed algorithm;fixedpoint computation; oo-nonexpansivemap; tapering
stepsize;controlled Markov chains.
I. Introduction
Many problems in optimization can be cast as problems of finding a fixed point of
a map F : R d ~ R ~ which is nonexpansive with respect to the oo-norm, or a weighted
version thereof. Examples are algorithms for solving dynamic programming equations
for shortest path problems and Markov decision processes, certain network flow
problems, etc. A standard approach then is to use the iteration
x , +1 = F ( x . )
or its 'relaxed' version
xn+ 1 = (1 - a)xn + aF(x~),
a~(0, 1).
(1)
A comprehensive account of synchronous and asynchronous implementations of these
algorithms and their applications to optimization and numerical analysis appears in
[11 along with an extensive bibliography. A continuous time analog is studied in [4].
This work considers a distributed implementation of a variant of (1). We replace 'a'
by a tapering stepsize {a(n)} c (0, 1) as in stochastic approximation theory, satisfying
for some re(O, 1),
1 >~ a(n + 1)/a(n) --, 1,
~, a(n) = oo,
n
~_, a(n) ~+~ <
oc
(2)
PI
for q >/r. An example is a ( n ) = (n + 2)-1, r = (say) 0-5. Our model of distributed
computation is as follows: We postulate a set-valued random process which selects
at each time a subset of { 1. . . . . d} indicating the indices of the components which
are to be updated. The update uses delayed information regarding other components
with the delays being required to satisfy a mild conditional moment bound. We obtain
a very general convergence theorem under these conditions (Theorem 1.1 below)
which does not require F to be a contraction or a pseudocontraction as in the earlier
literature.
Tapering stepsize as a means for suppressing the effects of delays was proposed in [3]
in the case of algorithms whose continuous limits are differential equations admitting
strict Liapunov functions. Equation (1) need not satisfy this condition. Furthermore,
the model of distributed computing is more elaborate than in the above work.
289
290
Vivek S Borkar
The paper is organized as follows. The remainder of this section formulates the
algorithm and states the main result. The next section studies an associated o.d.e, which
this algorithm tracks in the limit. The third section proves the main result using this
'o.d.e. limit'. The last section gives some examples if nonexpansive maps arising in
numerical analysis and optimization. A forthcoming companion paper studies the
asynchronous version of this algorithm (i.e., one without a 'global clock').
Introduce the norms
Ix;Ip}
=
,
1 ~<p< ~ ,
Ixll~ = max Ixil.
i
for x = [x~ .... , xd]. Let F:Ra--. R d be oo-nonexpansive, i.e.,
I l F ( x ) - f(y)ll~ <<.] : x - yl]oo,
x, Ye R~.
In particular, it is Lipschitz. Let G = {xIF(x) = x} denote the (closed) set of fixed points
of F, assumed nonempty.
Let I = { 1, 2 . . . . . d} and S a collection of nonempty subsets o f / t h a t cover I. Let { Y, }
be an S-valued process and for each n, z~j(n), i r
random variables (delays) taking
values {0, 1..... n}. We set z,(n) = 0 Vi, n. The distributed version of(l) we consider is as
follows: Given X(0), compute X ( n ) = [X 1(n)..... Xd(n)] iteratively by
Xi(n + 1) = Xi(n ) + a(n)[Fi(X 1(n - vii (n)) . . . . . Xa(n - Tin(n)))
-- X,(n)] I { i c Y n }
(3)
for iel, n >~O. Let . ~ n = a ( X ( m ) , Y(m), m <~n, ~ii(m), m < n , i , j e I ) and ~ , , = a ( X ( m ) ,
Y(m), "rij(m), m <~n, i,jeI), n >~O. We shall be making the following key assumptions:
(A1) There exists a fi > 0 such that the following holds. For any A, B e S , the quantity
P(Y.+ 1 = B / Y . = A, fgn)
(4)
is either always zero, a.s., or always exceeds 6, a.s. That is, having picked A at time n,
picking B at the next instant is either improbable or probable with a minimum
conditional probability of 3, regardless of n and the 'history' f~..
Furthermore, if we draw a directed graph with node set S and an edge from A to
B when (4) exceeds di a.s., then this graph is irreducible, i.e., there is a directed path from
any A e S to any BeS.
(A2) There exist b >1r/(1 - r) and C > 0 such that
E[(zlj(n))b/.~,,] <~C
a.s.
Vi,j,n.
(A3) If fi = the integer part ofa(n)'- ~, then ~ is o(n) and moreover, lim supn_~o a(n - h)/
a(n) < c~. (Note that this condition is satisfied by our example a(n) = (n + 2)- 1 with
r = 0"5.)
The condition z,(n) = 0 implies that for updating the ith component, its most recent
value is immediately available. The idea is that each component is computed by
a specific processor, and the different processors communicate with each other in
conformity with (A2). We make the following immediate observation for later use.
Fixed point computation
291
Let zeG. If C. = max.,.<. JiX(n) - z,l~, then by (3) and ce-nonexpansivity F, {C.} is
a nonincreasing sequence. Thus
s u p IIX(n) - zll ~o ~< I I S ( 0 ) - z [ l ~ ,
Our main result is the following:
Theorem 1.1. X (n) ~ G a.s.
The proof will use the fact that (3) asymptotically tracks an o.d.e. The next section
studies this o.d.e.
2. An associated o.d.e.
We start with some notation. For A e S , define Fa(.) = [F~(.) .... Faa(')]: Rd--* R a by
F'~(x) = Fj(x)I {yeA} + x f l {j~A}
for x = [ x t , . . . , x ~ ] . For any Polish space X, let ~ ( X ) denote the Polish space of
probability measures on X with the Prohorov topology. In particular, .~(S) is the space
of probability vectors on S. For # e ~ ( S ) , define Fu: R d --, R d by
FU(.) = ~ # ( A ) F A ( . ) .
Clearly, F A, F ~' are oo-nonexpansive. Given cc> O, say that # e ~ ( S ) is a-thick if
mina#(A) i> a and thick if it is ~-thick for some a > O. The following is easily proved.
Lemma 2.1. G is precisely the set of fixed points o f F ~ for thick # and the intersection of
the sets of fixed points o f F A, A e S (resp., F ~, ile~(S)).
Let U denote the space of.~(S)-valued trajectories fi = {#, t/> 0} with the coarsest
T
topology that renders continuous the maps #-...~ So
f ( t ) # f ( A ) d t for T~>0, A e S ,
f e L 2 [0, T]. U then is compact metrizable. Let us say that fie U is ~t-thick for a given
> 0 if #t is for a.e. t, where the 'a.e.' may be dropped by taking an appropriate version.
Say that it is thick if it is 0t-thick for some ~t > 0.
Lemma 2.2. Let fi" -* fib in U and fi" is a-thick, ~ > O, for n = 1, 2 ..... Then fW is ~-thick.
Proof. For any A e S , t > s >~O, n >~ 1,
f ' #~r(A)dy >~ct(t-- s).
The inequality is preserved in the limit n ~ oo. The rest is easy.
[]
Given fie U, consider the o.d.e.
2(0 = FU'(x(t)) -- x(t), x(O) = x.
(5)
Lemma 2.3. The map (fi, x ) e U x Rd ~ x(')eC([O, oc); R d) defined by (5) is continuous.
Proof. Let (fi",x,)--,(fi%x~). For n/> 1, let x"(') satisfy
.~"(t) = FU,(x"(t)) - x"(t), x"(O) = x,.
292
Vivek S Borkar
Using the Gronwall lemma and Arzela-Ascoli theorem, one verifies that {xn(')} is
relatively compact in C ( [0, ~); Rd). By dropping to a subsequence if necessary, suppose
that xn(.)--.x~
Then x~(0) = x~. Also, for t/> 0, n >/1,
x~(t)=x,+f'o(F~;(x~(s))-F"7(x~(s)))ds
j"
+fto(F#,(x~176176176176
(F"~(x ~ (s)) - F"2 (x ~ (s)))ds
+
0
(6)
The first and the fourth integral go to zero as n -~ oc because xn(.) ~ x ~ (.). (We use the
oo-nonexpansivity of F" in the former case.) So does the second integral in view of our
topology on U. Thus x~(') satisfies
~ ( t ) = F',~ ( x ~ 1 7 6
x~(t), x~(O) = x ~ .
The claim follows.
[]
The main result of this section is the following:
Tl~orem 2.1. Given a thick fie U, the solution x(.) of (5) converges to a point in G which
may depend on the initial condition. Furthermore, t -~ IIx(t) - x* II| is nonincreasing for
any x*~G.
The proof will closely follow that of Theorem 3.1 [4], except for the additional
complications caused by the fact that (5) is nonautonomous. We split the proof into
several lemmas. Let x * e G and/2 be ~t-thick for ct > 0.
Lemma 2.4. The map t--* '~lx(O - x* IFo~is nonincreasimj and hence converges to some a > O.
Proof. A straightforward computation shows that for pc(l, oo)
d
dt [Ix(t) - x* lip = - Ilx(t) - x* lip + [Ix(t) - x* 1191-Pg(t),
where
t4 i=1
~< IIx(t) - x , lipp-a
IlF~'(x(t))-F"'(x*)llv.
Thus for t > s,
,lx(t)-x*llr<~,lx(s)-x*l,+fts(-llx(y)-x*]i,
+ IIFU'(x(y)) - FU'(x *) IIp)dy.
Let p ~ ~ to obtain
,[x(t)-x*ll~<~llx(s)-x*,loo+fts(-,lx(y)-x*,,~o
+ IIF~'(x(Y)) - F~'(x *) II~)dy.
(7)
F i x e d point computation
The claim follows in view of the oo-nonexpansivity of F v.
293
[]
I f a = 0, we are done. Suppose a > 0. Define Ba = {xeRd[ [Ix -- x* I[o~ = a}. Introduce
the terminology: An m-face for m ~< d is a set of the type
{x = [x 1 . . . . . x d] Ix~ ~Ea~., b~.], k <~ m, x~ = c,., k > m},
where {i~ . . . . . ia } is a permutation of { 1.... , d} and b k > a k, c k are scalars. Then Ba is the
union of(d - 1)-faces of the type {xlx~ - x* = a (or - a) and Ixj - x~l < a f o r j :# i}. F o r
any (d - 1)-face of this type (say, H), define G n = { x e H l F ( x ) e H } . Then G u is closed,
possibly empty. Since [Ix(t) - x* li ~ --* a, x(t) ~ Bo. N o w the trajectories x ( t + .), t >1 O,
form a relatively compact set in C([0, ~ ) ; Ra). Thus any limit point ~(.) thereof as t ~
must lie in B,. By L e m m a s 2.2 and 2.3, .~(.) satisfies
~(t) = F ~ ( ~ ( t ) ) - ~(t)
for some ~t-thick/~eU. Let {~(.)} = { ~ ( t ) l t e R + } .
Lemma 2.5. { ~ ( - ) } c ~ H c G n.
Proof. If both sets are empty, there is nothing to prove. Suppose {~(-)} c~H ~ 4- F o r
simplicity, let n = {xlx~ - x* = a, lx i - x*[ ~ a,i > 1}. Suppose {~(t)lt~l-0,A-I} c n .
Then Yq (t) = x* + a, te[O, A], leading to 0 = xl (t) = F~'(~(t)) - xl (t). Thus F~t(~(t)) =
~ ( t ) = x * + a = F l ( ~ ( t ) ) in view of ~t-thickness of/~, for t e [ 0 , A ] . Since F is oo-
nonexpansive and x* is a fixed point of it, we also have
l,F(~(t)) - x* ]l~ ~ [I~7(t) - x* II ~ = a.
Thus we must have F ( g ( t ) ) e H , t e [ 0 , A ] , implying ~ ( t ) e G u. It follows that all
connected segments of {~(.)} c~ H that contain more than one point must be in G u. On
the other hand, those containing a single point must clearly be in the relative b o u n d a r y
OH of H, which is the union of its (d - 2)-faces. Let xe{~(')} c~ c~H. It suffices to show
that F ( x ) e d H . If not, F ( x ) - x
and therefore F * ( x ) - x for any thick ~ would be
transversal to c~H at x. This is not possible because ~(.) is a differentiable trajectory
confined to B a and cannot make 'sharp turns' around corners. This completes the proof.
[]
Consider a fixed (d - 1)-face H o f B a for the time being. Let H = { x l x 1 = x* + al,I
xj - x*l <-<a,j > 1 } for simplicity.
L e m m a 2.6. I f Gu q: qS, the map F :G u -~ H can be extended to an oo-nonexpansive map
F:H--* H which has a f i x e d point s
Proof. The second claim follows from the first by the Brouwer fixed point theorem. Fix
1 < i ~< d and define
gi(x) = inf (Fi(y) + [Ix - yll ~), x ~ n .
yeGB
Then gi(x) <~ Fi(x), x e G u. F o r x, y e G n, ~ - n o n e x p a n s i v i t y of F leads to
Fi(y ) + lix - y I[~ >~ Fi(x ).
294
Vivek S Borkar
Thus 9i(x) >1 Fi(x ), implying 9i = Fi on GH" N o w for x, z e H ,
9~(x) <~ inf (F~(y) + ]IY - z]] ~ + Ilz - xl] ~)
y~Gn
<~adz) + liz - xll~o.
(8)
Similarly, 91(z) ~< Oi(x) + [Iz - x I~. Hence
Igdx) - 0r
-< IIz - x l ~.
Let Fi(x) = max(x* - a, min(0i(x), x* + a)). Then
IFi(x) - Fi(z)I ~< Ilx - z l ' ~ .
Let/~l(x) = x* + a, x E H . Then F(.) = [ F I ( ' ) . . . . .
/Ta(') ] is
the desired map.
[]
Proof of Theorem 2.1. The same a r g u m e n t as in L e m m a 2.6 can be used to extend/~ to
an oc-nonexpansive m a p if: R a ~ R a that restricts to ff on H and to F on u A G a. Define
F A, if" in analogy with F A, F ~ using F in place of F. Then 2(.) satisfies
x(t) = F~,(~(t)) -
~(t).
We conclude that t --, II~(t) - 211~ is nonincreasing in t and hence decreases to b >/0. If
b = 0, we are done. If not, let B b = {x] l]x - 2 [I~o = b}. Then if(t) --* B b. Also, it is clear that
no (d - 1)-face H of B b is c o p l a n a r with H. This a r g u m e n t can n o w be repeated for each
( d - D - f a c e H of Bo that intersects {2(.)}, leading to possibly m o r e I!.l!~-spheres
Br n .... defined analogously to B, such that 2(t)--.B~c~B~c~Br
.... T h e a b o v e
remarks also imply that this intersection is a union of m-faces with m at most (d - 2).
N o w consider a limit point x'(') ofY(t + .), t t> 0, in C([-0, 0o); R a) as t--* 0o. Repeat the
a b o v e a r g u m e n t to conclude that x'(t) converges to a union of m-faces with m at most
(d - 3). Iterating this a r g u m e n t at most d times, we are left with a union offinitely m a n y
points to one of which 2('), x'(.).., and hence x(.) must converge and which then must
be a fixed point of F ~ for some thick/~, hence of F. This completes the proof.
[]
C O R O L L A R Y 2.1
Given ~, b > O, there exist T = T(e, b) > O, rl = rl(e, b) > 0 such that for any solution x(') of
(5) satisfyin# I I X ( 0 ) l l ~ <- K < oo and
(i) fi is b-thick,
(ii) {x(t)lte[O, T ] } n G ~ = ch where G ' = {xlinfr~61 x -Y~l < e } , we have
infllx(t)-yll~infllx(O)-yll~-~/
~G
for
t~>T.
~G
Proof. Suppose that the claim is false. Then there exist x " e R d, fi"e U such that for some
b > 0 , {fi"} are b-thick and x"(.) satisfy (i) ~"(t)=FUT(x"(t))--x*(t), x " ( 0 ) = x ~,
Ix"[I ~ ~< K, (ii) x"(t)ddG ~ for te[O,n] and
infll x~ - yll~o >/ sup infb[x"(t)-y,[~ >1 inf I'x~ - y l ] ~ - 1/n,
~n~G
t~[O,n] y~G
~G
for n 1> 1. By d r o p p i n g to a subsequence if necessary, we m a y then suppose that
Fixed point computation
295
x" --, x ~ r G ', ~" ~ / ~ in U and x"(') ~ x ~ (.) in C( [0, ~ ) ; Rn). By L e m m a s 2.2 and 2.3,/]~
is b-thick and x~
satisfies.
.~oo(t) = F . ? (x~~
- xOO(t), x~(O) = x ~.
Also, infy~ I!x ~ - y IIoo = inf~o fix ~ (t) - y il~, t ~> 0. This contradicts T h e o r e m 2.1, establishing the claim.
[]
F o r ~, q, T > 0, call a trajectory y(.): R + --* R ~ an (~, r/, T)-perturbation of (5) if there
exist 0 = T O < T l < T 2 < . . . in [0, oc) with Tj+ 1 - Tj/> T, such that for some xJ(')
satisfying (5) for some ~-thick/7 ~ in place of/~, we have
sup
fly(t) - xJ(t)l] oo < rl,j >~O.
te[75,Ti+ 2]
C O R O L L A R Y 2.2
F o r any ~, e. > 0, T > 0 as in Corollary 2.1 and 7 > 0 sufficiently small, any (:q 7, T)perturbation y(-) of (5) converges to G '.
Proof In view of Corollary 2.1, this is a straightforward a d a p t a t i o n of T h e o r e m 1, p.
339 of [5].
[]
3. P r o o f of T h e o r e m I.I
As a first step towards establishing that {X(n)} tracks (5) asymptotically, we analyze the
S-valued process { Y, }. In particular, we shall show that it m a y be viewed as a controlled
M a r k o v chain.
F o r AeS, let D A = {BeS[ (4) exceeds 6, a.s.} and VA = {u~(DA)Iu(B) >>.6VB~DA}.
Let V = F I A VA. Define p:S • S x V~]-0, 1] by
p(A. B,/.t) = #a(B),
where/aa is the Ath c o m p o n e n t of/a. Define V-valued r a n d o m variables {Z"} as follows.
The Ath c o m p o n e n t of Z", denoted by Z~t is given by
Z] (B) = P( Y.+ , = B/(9.)I { V. = A)+ 7?AI { Y. r A},
where h~A are fixed elements of VA, A~S. Then (4) equals p(A,B,Z") and {Y.) m a y be
viewed as an S-valued controlled M a r k o v chain with action space V and transition
probability function p. It should be kept in mind, however, that this is purely a technical
convenience and it is in no way implied that {Z"} is actually a control process. In
particular, this allows us to conceive of a 'stationary control policy n' associated with
a m a p ~: S--* V wherein Z" = ~z(Y.), n ~> 0. The latter part of(A1) implies that {Y.} will
be an ergodic M a r k o v chain under any stationary policy zr with a corresponding
unique stationary distribution v.e~(S).
Let t o = 0, t. = Z~, = 1a(m), n i> 1. Define y('): [0, ~ ) ~ S by
y ( t ) = Y.,t.<~t <t.+1,n>~O.
Define/~E U by/a, (A) = I {y(t) = A }, t >i 0 and/2se U, s ~> 0, by /at" - #~+~, t~>0.
296
Vivek S Borkar
Lemma 3.1. There exists an ot > 0 such that any limit point of {/2'} as s ~ oc is a-thick, a.s.
Proof. Let AeS. Then
M.=
~ a(m)[I{Y==A}-- ~.,p(B,A,Z"-x)I{Y,,_x=B}-I
m=l
B
is a zero mean bounded increment martingale with respect to {(#,}, whose quadratic
variation process converges a.s. in view of (2). By Proposition VII-2-3 (c), pp. 149-150,
[6-1, {Mn} converges a.s. For n/> 0, let h(s) = min{m > nl Z~=n+ ta(j) >/s}, s > 0. Then
lim._.~(M~,) - M.) = 0 a.s. and Z ~ . a ( m ) t> s together imply
~-f'2~na(m)l{Y==A} - ~ ' ~ " a ( m ) ~ ' s P ( B ' A ' Z " - t ) I { Y ~ - I = B }
.0
(9)
.atm)
a.s. Define O,,,e~(S x V) by
r
• J)=
~ _ ~ .a(m) I { r., e C, Z " e,l }
~,,a(m)
for C c S, J c V Borel. Then from (9), it follows that a.s., any limit point O of O,,, in
~(S x V) as n ~ oo must satisfy
O({A} x V)=
f ,A,
)dO, AeS.
Thus 9 must be of the form
O({A} x J) = v(A)(p.4(J), AeS, d ~ V Borel,
where A -~ (pa:S ~ ( V ) defines a stationary randomized policy and v the corresponding stationary distribution (see e.g. [2], pp. 55-56). By Lemma 1.2, p. 56 and Lemma 2.1,
p. 60 of [2-1, it follows that
rainy(A) >/rain v,(A) & ct > 0.
A,x
A
Thus
liminf ~ " a ( m ) I { Y~ = A}
>/~ a.s.
,~ ~
~.,~2,,a(m)
From our definition of {/~t}, it then follows that
liminf 1 I ' dY#ty(A)>1eta.s.VAeS.
t--* oo
SJo
Fix a sample point outside the zero probability set on which this claim fails for any
AeS, s > 0 rational. For this sample point, the claim follows easily in view of our
topology on U.
[]
Now rewrite (3) as "
X(n + 1) = X(n) + a(n)(W(n) - X(n))
for appropriately defined W(n)= [Wl(n) ..... Wd(n)] and set i f ( n ) = E [ W ( n ) / ~ , ] ,
n/> 0, the conditioning being componentwise. Write if(n) = [Wl(n) ..... ia(n)].
Fixed point computation
297
Lemma 3.2. There exist K > 0, N ~> 1 such that for n >I N,
IIF~"(X(n))- if(n)I,~ < Ka(n)'.
Proof. W.l.o.g., let X(0) be deterministic. Then VA c I, zeG,
IIF A(X(n))II ~ ~< IIX(n) - z II~ + [IF(z)II o~ V IPzll
~< 3[Iz[l| + lIF(z)ll~ + ~lX(O) ![~o A M <
~.
F o r each i, 1 ~< i ~ d, and c = 1 - r,
[Fr'(X (n)) -- l~i(n)[ ~< E [ [Fr~'(X(n))- W~(n)[I {xq(n) ~< a(n)-C for all i,j}/.~,,]
+ E [ [ F r ' ( ( X ( n ) ) - W~(n)[I{zij(n ) > a(n) -~ for some i,j}/.~,].
The second term is bounded by 2MCa(n) ~ in view of (A2) and the conditional
Chebyshev inequality. Let fi be the integer part of a(n) -~. Since a(n) -~ is o(n), we may
pick n large enough so that n > ~. Then for m ~< t~, (A3) leads to
IX(n) - X ( n - m)ii ~ ~ 2 M
~
a(k) <. I~a(n) 1-~
k=n-t~
for a s u i t a b l e / ( > 0. Thus the first term is b o u n d e d by K,a(n)'. Since b >>.r/(1 - r), the
claim follows.
[]
Let T > 0 . Define T o = O , T , = m i n { t , , I t m > > . T , _ l + T }, n~> 1. Then T , = t,,<,) for
a strictly increasing sequence {re(n)}. Let I, = [T,, 7".+ 1], n/> 0. Define ~"(t), t e l , , by
Y~"(T,) = X(m(n)) and
x"(tm~.~+k. 1) = ~"(t.,~.~+ ~) + F r ...... (~"(t,.~.)+~))
-
Y~'(t,.~.)+k))(t.,r §
1 - t,.r
with linear interpolation on each interval It,.<.) + k, t,.~.~+ k + ~]" Define x(t), t >1O, by
x(t.) = X(n) with linear interpolation on each interval [t., t. +~].
Lemma 3.3.
lim sup Ilx(t) - 2"(t)[I ~o = 0 a.s.
n ~
oo
tfivJ'~
Proof. Let n t> 1. F o r i >>.re(n), we have
x(t,+ 1) = x(t,) + a(i)(Fr'(x(ti)) - x(ti)) + a(i)(lTV(i) -
F~'(x(t,)))
+ a(i)(W(i) - if(i)).
Let l~l m = Y~=oa(i)(W(i)- l?V(i)) and ~ = M i - h4,~,) for i>~ re(n). Then {~rm, ~-m} is
a zero mean bounded increment vector martingale and the quadratic variation process
of each of its c o m p o n e n t martingales converges a.s. by virtue of (2). Thus by Proposition VII-2-3(c), pp. 149-150 of [6], {J~m} converges a.s. Fix a sample point for which
this convergence holds and let 6 > 0. Then
sup ~[~i!1~ < ~5/2
i >ire(n)
for n sufficiently large.
298
Vivek S Borkar
Let .,2i + 1 = x(t i + ~) - r i >>.m(n) with 2,,~,) = X,,{.)(i.e., ~,,~,~_ 1 _a__0). T h e n for i >~ re(n),
-~i + 1 = 2i + a(i)(Fr'(2i) - 2"i) + a(i)(Fr'(s + ~i- 1) - (s + ~i- l )
- FY'(s
+ .~) + a(i)(17V(i)- FV'(x(ti)).
Also
s
~) = s
+ a(i)(Fr'(s
- s
S u b t r a c t i n g a n d using the p r e c e d i n g l e m m a , we have, for n sufficiently large,
[I2i* x - 2"(ti + x ) l[o~ ~< (1 + 2a(i)) I 2i - ~"(ti) II~ + 2a(i) II~ - ~ li o~ + Ka(i) l +'.
By increasing n if necessary, we m a y s u p p o s e t h a t
a(i) 1 +" < 6/2.
i>~n
T h e n using the inequality 1 + 2a(i) ~< exp(2a(i)) a n d iterating, we have for n sufficiently
large,
sup
l[2i-~(ti):lo~<<.2e2Cr+l)(K + T + 1)6.
m(n)<~i<~m(n- 1 )
Also
sup
I1~ - x(t,)II ~ <~ 6/2,
m(nJ<~i~m(n + 1}
for sufficiently large n. Since ~5 > 0 was a r b i t r a r y , the claim follows on o b s e r v i n g that
x('), s
are linearly i n t e r p o l a t e d from their values at {ti}.
[]
N e x t define :~'(t), t e l n , by g"(tm(n~ ) = x(tM~fl and
.~(t) = FYt~
- s
tell.
L e m m a 3.4.
lim sup I ~"(t) -
:~n(t)11 ~ = 0 .
Proof. This follows easily f r o m the G r o n w a l l l e m m a .
[]
Let ~ > 0 be as in L e m m a 3.1.
L e m m a 3.5. Almost surely, the followin9 holds. There exists an or-thick sequence
fi"e U, n >~ O, such that if s
t e l , , is defined by s
= x(t,,tn)) and
.in(t) = F;"(s
-- 2(0, t e l , ,
for n >1 1, then
lim sup IIs
- 2~(t)l[ o~ = O.
Proof. This is i m m e d i a t e f r o m L e m m a s 2.3 a n d 3.1.
[]
P r o o f of T h e o r e m 1.1. Let e > 0. Let b = :~ a b o v e in C o r o l l a r y 2.1 a n d pick T = T(e, ~)
accordingly. Pick "/> 0 as in C o r o l l a r y 2.2. C o m b i n i n g L e m m a s 3.3-3.5, we h a v e
Fixed point computation
lim sup
. ~ ~
299
lid'(t) - x(t)'[ ~ = 0 a.s.
tEln
Thus x(t, + -) is a (ct, T, 7)-perturbation of(5) for sutficiently large n. By Corollary 2.2, it
follows that x(t) ~ G~. Since e > 0 is arbitrary, the claim follows.
[]
Observe that the foregoing can be easily extended to the following relaxation of
the latter half of (A2). The directed graph formed therein need not be irreducible,
but each communicating class in it must correspond to elements of S which together
cover I. Also, extension to nonexpansive F with respect to the weighted w - n o r m
is straighforward.
4. Examples
This section sketches some important instances of fixed point problems for oononexpansive maps. A general reference for these is [1].
(i) Shortest path problems: Given d + l locations {0,1 ..... d} and the distances
{dg~,0 ~< i,j <~d, iv~j} between them, the problem is to find the shortest path from
location i r 0 to location 0. Letting V(i) = the length of the shortest path from i to 0, one
has the dynamic programming equations
V(i)=min(dlo, min(d,j+
V(j))),
l<.i<~d.
j~i.o
Letting V = [V(l) ..... V(d)] T, this has the form V = F(V) for an ~-nonexpansive F.
(ii) Markov decision processes: Consider a controlled Markov chain {X.} on a
finite state space S, with a compact metric action space A and a continuous transition probability function p:S x S x A - * [ 0 , 1 ] . The aim is to choose an A-valued
sequence {Z.} that does not anticipate future, to minimize a suitable total expected
cost. Thus
P( X. +1 = J/X i, Zi, i <~n) = p( X .,j, Z.) Vn.
Let B be a proper subset of S and ae(O, 1). Consider two cost functionals: For
x A),
keC(S
(1) cost up to a first passage time:
E
[
~ k(x.,z.)
_~'=0
]
.
where ~ = min {n/> OIX.eB},
(2) infinite horizon discounted cost:
E[~=oa"k(X.,Z.) ]"
Letting V(i) denote the minimum cost when X o = i, the dynamic programming
equations in the two cases are, respectively,
V(i)= min
u
(k(i,u) + ~ p(i,j,u) V(j)),
j~B
ieS\B,
300
Vivek S Borkar
and
V(i)=
min(k(i,u)+a~p(i,j,u) V(j)),
u
i~S.
j
Both can be cast as fixed point equations V = F (V) for ~ - n o n e x p a n s i v e V.
(iii) Systems of linear equations: The problem of solving a system of linear equations
A x = b can be cast as finding the fixed point of F ( x ) = x - a ( A x - b ) , a e ( O , 1). If
[11 - AII ~ ~< 1, F is ~ - n o n e x p a n s i v e .
(iv) Strictly convex network flow problems: The following problem arises in network
flow optimization:
minimize
ao(fi~)
~
(i,j)EA
subject to
~, f i j {~(i,j'~A}
~, fj,=S, ViEN, b,j<~fo<~coY(i,j)~A,
{jI(.],i)EA}
where alj(- ) are strictly convex. This problem can be cast as that of finding a fixed point
of a pseudononexpansive map, i.e., a m a p F satisfying liE(x) - y II~ ~< IIx - Yll ~ whenever y is a fixed point of F. (See (1), w7.2 for details). O u r analysis applies here as well.
Acknowledgements
This research was supported by a grant from the I S R O - I I S c Space T e c h n o l o g y Cell.
The a u t h o r would like to thank Prof. Vinod S h a r m a for pointing out an error in the
preliminary version of this paper.
References
[1] Bertsekas D P and Tsitsiklis J N, Parallelanddistributedcomputation(Prentice Hall, Englewood Cliffs)
(1989)
1-2] Borkar V S, Topics in Controlled Markov Chains, Pitman Research Notes in Maths, No. 240, Longman
Scientific and Technical, H arlow ( 1991)
I-3] Borkar V S and Phansalkar V V, Managing interprocessor delays in distributed recursive algorithms,
Sadhana19 (1994) 995-1003
[4] Borkar V S and Soumyanath K, A new analog parallel scheme for fixed point computation, Part I:
Theory, preprint (1993)
[5] Hirsch M W, Convergent activation dynamics in continuous time networks, NeuralNetworks2 (1987)
331-349
[6] Neveu J, DiscreteParameterMartinoales(North Holland: Amsterdam) (1975)
Download