A METHOD FOR THE PARAMETRIC CENTER PROBLEM, WITH A STRICTLY MONOTONE POLYNOMINAL-TIME ALGORITHM FOR LINEAR PROGRAMMING by Robert M. Freund and Kok-Choon Tan WP# 2600 was 2100-89 l March, 1989 Abstract Given a system of linear inequalities and equalities Ax b + dt and Mx = g + ht where the right-hand-sides (RHS) are parametrically deformed over the scalar t , the parametric center problem is to trace the parametric family of approximate solutions (t) to the center problems P(t) , where P(t) is the problem: maximize , ln (bi + dit - Aix) subject to Ax <c b + dt i=l and Mx = g + ht. We present an algorithm for tracing the parametric family of solutions (t) over the given range t E [ t . At each iterate of the algorithm, the value of the parameter t is strictly increased and a Newton step is taken. The sequence of values of t exhibit the following geometric rate of change: If tk and t k+ ' are two successive values of the parameter t generated by the algorithm, then either (tk+l - TMIN) (tk - TMIN)(1 + 128m) or (TMAX - tk+1) < (TMAX - t)(1-12 J where TMIN (TMAX) is a lower (upper) bound on the smallest (largest) value of t for which Ax < b + dt, Mx = g + ht has a solution. Thus the iterates exhibit either linear growth away from TMIN or linear convergence toward TMAX , with a rate of change l ~~1 t of 128m , where m is the number of inequality constraints. When applied to the linear programming problem, the algorithm is an O(mL) iteration algorithm for linear programming, that strictly improves the primal objective value at each iteration, and requires no dual feasible solution (or even dual feasibility) to start. After O(mL) iterations, the algorithm either detects primal unboundedness or produces an interior solution that can be rounded to an optimal solution to the linear program. Key Words: Newton step, center, linear program, interior-point algorithm. Introduction Given a system of Ax b and k m linear inequalities in equations in center of the system Rn of the form (A, b, M g) Rn of the form Mx = g, the (analytic) is the optimal solution to the convex program: P: maximize l, n (bi- Aix) i=l s.t. Ax< b Mx = g where (Ai, bi) respectively. ( x e Rn Ax system denotes ith row of A (See Sonnevend [12, 13] . b, Mx = g ) (A, b, M g) , and ith component of b, Assuming is nonemepty and bounded, the center of the denoted , is uniquely defined. The computation of points near the center and their properties are important for interior point algorithms for linear programming and extensions, see Karmarkar [6] ,Renegar [11], Megiddo [8], Kojima et. al. [7], Vaidya [16], Monteiro and Adler [10], Mehrotra and Sun [9], Jarre [5], Barnes et. al. [1], and Todd and Ye [14], among others. Algorithms for finding the center are presented in Censor and Lent [2] , Vaidya [15] , and [4] This study is concerned with the parametric analysis of the family of centers as the right-hand-side varies parametrically. (RHS) (b, g) of the system (A, b, M, g) We define the parametric center problem (PCP) be the problem of tracing the parametric family of optimal solutions the problems: 1 xt to to III P(t): maximize , In (bi + dit - Ai x) i=l s.t. Ax < b + dt Mx= g + ht (where d e Rm interval and te [t, t], h e Rk and are given), as 1t,t t varies over a given are finite or infinite. algorithm for generating a piecewise-linear function the property that x (t) We present an x (t) is an approximation to the center is an approximate solution to P(t), as the approximate path of solutions. t is varied. [ It -4 x (t), Rn with i.e., x (t) We refer to (t) as (The sense of the approximation and its properties are defined in Section 3.) The algorithm starts with an approximate solution center problem t = tk t P(t) at t =t . At iteration is chosen as t = tk+1 ( tk+1) to solve for . where The path by linear interpolation. guaranteed increase in chosen so that either (TMAX - t l ) < l tk > tk R( t) t (tk) . t is The next value of and a Newton step is performed is then extended for t e [tk, tk+ ] The important feature of the algorithm is the t at each iteration. In particular, tk+1 can be (tk+1 - TMIN) > (tk - TMIN)(1 + 12 ) (TMAX - tk)( 1 ) , where (upper) bound on the smallest (largest) value of Ax < b + dt , to the k , the value of P (tk) is and the approximate center for ( t) Mx = g + ht , has a solution. TMIN (TMAX) t is a lower for which Thus the iterate values of demonstrate either geometric growth away from 2 or TNIIN or geometric contraction toward TMAX, with a rate of change of 1E128m (where m is the number of inequality constraints). This algorithm can be applied to solve the linear programming Suppose we wish to solve the linear problem problem in a new way. LP: maximize cT x s.t. Ax<b Then we can use the algorithm for PCP to solve for the path of centers of the system LP(t). maximize E i=l s.t. In (bi - Aix) Ax < b cx = t as t is increased. This yields a new "central-trajectory-following" algorithm for linear programming that differs from other central-trajectory First, it is a strictly monotone algorithm for linear methods in two ways. programming, i.e., an algorithm that strictly increases the objective function value of the primal at each iteration, unlike other central-trajectory-following algorithms. Second, it requires no prior information or bound on the optimal objective value, and will process a linear program that is unbounded in the primal objective value (i.e., dual infeasible), unlike other centraltrajectory methods. However, the complexity of the algorithm is iterations, as opposed to (where O( iM L) O(mL) for most other central-trajectory methods L is the bit-size of the problem instance) and so has an inferior 3 III complexity bound (by ff ) . Perhaps it is the strict monotonicity of the primal objective value in the algorithm that is responsible for the inferior complexity bound. This paper is organized as follows. In Section 2, we present the main results regarding the parametric center problem and we present the algorithm for tracing the approximate parametric path of centers. The remaining three sections are devoted to proofs of the results of Section 2. notation and preliminary results. Section 3 presents Section 4 contains an analysis of one step of the algorithm and presents the results on the use of Newton's method. Section 5 contains the results regarding bounds on feasible values of are generated by the algorithm. 1i 4 t that 2. The Parametric Center Problem Given a system of Ax b m and equations referred to as linear inequalities in Rn of the form Mx = g , the (analytic) center of the system (A, b, M, g) is the optimal solution to the program m P: maximize ln (bi- Aix) i=l s.t. Ax< b Mx = g (see Sonnevend [12, 13] .) Suppose is the unique solution to P . center as the right-hand-side varies parametrically. (RHS) of the system (A, b, M, g) In particular, we are interested in generating the parametric family of optimal solutions P'(t): Our interest lies in tracing the maximize xt to the problems A In (bi + dit - Aix) i=l s.t Ax < b + dt, (2.1) Mx + g + ht. In this section, we present an algorithm for generating a piecewise-linear path of solutions and as t (t) such that (t) is close to xt in a suitable measure, is varied strictly monotonically over a prespecified range. First note that there is no loss of generality in assuming that the equations Mx = g + ht are not present. without loss of generality that kxk and is nonsingular. M = [B, N] To see this, we can assume is a By suitably partitioning 5 kxn matrix where A = [C, D] and B is Ill x = (y,z), we can eliminate the y variables to obtain the equivalent problem m P"(: maximize + n (i dit - Ai z ) i=l s.t. where Az < b A= D - CB-N, b = b - CBg , straightforward to show that solves solves Zt and P"(t) if and only if Yt = B 1 (g + ht - Nt) P'(t) , where d = d-CB-lh. . Itis t = yt, Z We thus can concentrate on the more convenient problem P(t): ln (bi maximize + dit - Aix) (2.2) i=l s.t. Let X = (x Suppose st = xt Ax < b Rn Ax b) is the centerof b + dt - Axt, andlet Because P (t) + dt. and Xt = (x e Rn Ax < b +dt Xt, i.e., t solves P(t). e is a convex program, xt solves P (t) if and only if xt , namely St = b + dt - At > O0 (2.3a) eTS t 1 A = 0 (2.3b) is the vector of ones of appropriate dimension. We assume that our initial value of of Xto = {x Let St = diag(st). the Karush-Kuhn-Tucker (K-K-T) conditions are satisfied at where . R I Ax < b} t is t = 0 , is inonemnlpty and bounded. 6 and the interior In this case, it is straightforward to show that Xt will be bounded for all values of t . T = (t I int Xt * T will be an open interval, and it is ) . Then Let straightforward to extend the analysis in Megiddo [8] to show that the path of parametric centers r: t - t , t G T, Let TMAX = sup (te T) and is continuous and differentiable. TMIN = inf (te T . Itis also t t straightforward to show that if d = Ar for some of translations of Xt' if and only r e R n , in which case the sets Xt by the translation vector We therefore assume throughout this paper that this case, either and TMIN = -° TMAX = + d TMIN > -~ tr . are just a family does not lie in the column range of , or TMAX < + , A . In or both. The algorithm presented in this section will trace a piecewise-linear path (t) measure. that is "close" to the parametric center path At each iteration the value of t least one of two ways, as follows. in a suitable is strictly increased; thus the algorithm is strictly monotone in the parameter iteration, the magnitude of the increase in xt t Suppose t . Furthermore, at each is bounded (from below) in at tk is the current value of t Then at the next iteration the algorithm will produce either a finite lower bound UB TMIN is produced, then tk+1 - tk i.e., LB 1 128m or a finite upper bound tk+' , (UB - tk), (TMAX - t+1) < (1- the new value of 28m geometrically, i.e., (tk+ -TMIN) )(tk 128m 7 If TMAX, or both. If will satisfy decreases geometrically, LB is produced, then t so that (t - (tk- LB) , (1 + t, (TMAX - t) so that 12m)(TMAX - tk). will satisfy (tk+ l - tk) > 1 UB > TMIN) TMIN) grows L II At least one of the above two bounds must be satisfied. Note that in either of the two cases the geometric rate of change of t relative to a bound is at least 1 where m is the number of inequality constraints. 128m ' Given a linear program in the form: maximize LP: x x Ax < b s.t. we can reformulate LP as the following equivalent problem maximize x, t LP: s.t. t Ax < b + Ot cTx = O+ t Thus an algorithm that traces the parametric path of centers to the above system for strictly increasing values of algorithm for solving LP . t will be a strictly monotone The application of the parametric center problem algorithm to linear programming is presented at the end of this section, and is an O(mL) iteration algorithm for linear programming. 2.1 Properties of the Parametric Center For t e (TMIN, TMAX), St = b + dt - Axt, space of that St A , and let i.e., d = St ut + Art indicator function f(t) let ut t be the center of be t he projection of Xt , and let St d ut = [I - St A (AT St2 A) AT St ] St d . for some as n rt E R . f(t) = eT ut , 8 onto the range Then note We now define the path and note immediately that 'S"1 _S t' f(t) = eTut = eTSt (d-Art) = eTSt d , -1 by (2.3). since eTSt A = O, f (t) = e St d . Therefore an alternative equivalent definition of f (t) is is the optimal objective value of the TMAX follows. It is obvious that is as f(t) The motivation for considering the path indicator function following linear programming problem: TMAX = maximize LP°: t x, t Ax- dt < b s.t. Suppose for a given value of eT St (Ax -b)] 1 i < Then for and St = diag(st) . (x, i) to LP , any feasible solution tS = d xt be the Let f(t) < 0 . that ^st = b + dt - Axt, P (t) , let solution to t 1 [eT;t = +dt) At-st (Ax - = 1 [-m + tf(t)] f (t) whereby t- m TMAX f(t) f(t) > 0 Similarly, if Now suppose that for a given value of Let be the center of the system xt' St= diag(b + dt-Axt*) . (t', t*) whereby particular if both that TMIN TMIN X t , say eTSt A = 0 is the center of the system and and TMAX TMAX t ,that f(t) f(t*)=O Ax < b + dt*, and let and are finite. are finite. If is bounded from below (above) in 9 eTSt d = 0, (A, -d)( x, t) < b , from (2.3). = ((x,t)e Rn+ l I Ax - dt < b) X Thus the set Then TMIN > t- isIbounded, and in We say X° T1IN (T MAX) t . is bounded in t is finite, we say Nc)te that if X° is III bounded in t, then f(t*) = e T S d = 0, t- , the value of to be on the upper path if f (t) < 0 , 0. which yields is guaranteed to exist. Returning to the path indicator f (t) t f(t) defined earlier, we define xt and to be on the lower path if The intuition behind this definition is provided in the next two propositions. Proposition 2.1 The path indicator function decreasing for t (TMIN, TMAX) Proof: The K-K-T conditions of P(t) eTSt A = 0 and Let t and respectively. and A unless require that At + st xt be the vector of derivatives of = b + dt. 'xt at t , differentiating the above expressions yields Then differentiating the above expressions yields t Then St-2 A O t and t + t = d. Furthermore, since then f (t) = eTut is strictly f (t) = eT ut = eT ;St d f'(t) = -t St d = -St (A it + t) = -t §2St < 0, st = 0 , in which case out by the assumption that d d = A xt . But this last possibility is ruled does not lie in the column range of 10 A . w Proposition 2.2 (Upper and Lower Paths) (i) and TMAX te R (ii) f(t ) = such that and , f (t) > 0 for all t E (TMIN, t ), f(t) < 0 for all t is finite and TMAX all (iii) are both finite if and only if there exists TMIN (t, TMAX) ; TMIN = f(t) < 0 if and only if -- for t < TMAX ; is finite and TMIN all if and only if f(t) > 0 for TMAX = t > TMIN. These three cases are illustrated in Figure 1, (a), (b) and (c). center of = ((x, t) IAx- dt < b . X and and so TMAX TMIN f(t') = (i) We have seen earlier that if Proof: , That being the case, X are both finite. Conversely, if TMIN are both finite, then the center of X (x*, t) , is the (t, t) then is bounded TMAX and which is the solution to the problem maximize x,t s.t. exists uniquely. and St = . Ax - dt<b The K-K-T conditions for eT St d = 0 , diag (st') , In (bi + dit - Aix) i=1 i.e., f(t )= O, P° require that where st = b + dt* - Ax- The rest follows from Proposition 2.1. 11 eT St A= 0 and II t T..... 'MAA rI , - r rt)< I f(t) = 0 t Fc(4a\ AI I r - MIN x Figure 1 (a) 12 _ - - V t Figure 1 (b) 13 11 , . . i 1 I t i f(t)< 0 i I lMIN Elpx Figure 1 (c) 14 (ii) Suppose is finite and TMAX We have seen earlier that f (t) > 0 implies f(t) < 0 . f() = 0 then But, from (i),if contradiction. then f (I) < 0 . Thus, is finite. TMAX . TMIN is finite. TMIN Conversely, if Consequently, if Let t < TMAX. TMIN = - Thus, is finite, which is a f(i) < 0 for some f(t) < 0 for all t < TMAX t, then follows from (i). TMIN = - The proof of (iii) is similar to (ii). 2.2 Algorithm PCP Before presenting the algorithm for the parametric center problem, we introduce some more notation and the main improvement theorem. X = {x e Rr- ,Ax < b} Recall that and Xt (X e R n IAx = We assume that X is bounded for all b+dt)}. is bounded, and so A t . Let has full column rank and Xt IIv ll denote the Euclidean norm. If M is a symmetric positive-definite matrix, wedenote let where I Y-ZIIM = e intX (y-z)TM(y - z) . Let be given, let = b-Ax, S = diag() . We say that X-X Q() < 1/21 . be thecenterof X, and let Q(x) = ATS2 A, is close to the center of X if The motivation for this criterion of closeness to the center will be apparent from the following theorem, which also serves as a basis for the PCP (Parametric Center Problem) algorithm. 15 III Theorem 2.1 (Improvement Theorem). Let = b-A3Z, xxQ S = diag () , , where <1 Suppose and e int X = (x IAx <b) Q = ATS-2 A . is the center of the system l Suppose (A,b) . Define (projection of S-1 d) = [I - §S- A A S- A r = (AT2'Ay AT 2d (translation) a= 1 801ll11 (step length) Furthermore, define AT S-1] - d a - Sa = diag (Sa) -= (Newton step) (AT §a2 A)l AT§ e (New approximate center) XNEW = x + q + ar Then II NEW- Xia < 1 21 ., where (A, b + ad), and Q = ATSN2EW In this theorem, X. The vectors so that x A where SNEW = diag(b + rd - A NEW). . r are defined and satisfy The increase in defined next, and is a function of reR n, lltll > 0 , is the center of the system is given and is assumed to be close to the center u and d = Ai+ S xa and so a IIu . t from Because S§'d = 'IA + I, t = 0 to d Ar t = a is for any is well-defined. The Newton step 16 of - is then , defined using the modified slacks x, will be suitably close to Nm According to the (a translation vector). (the Newton step) and theorem, is composed, using zNEW and the center of a, Xa This theorem will be proved in Section 4. The increase in is a function of II uiI t . is given as t = 0 from However, the fact that makes good intuitive sense. 1 /si in the weighted norm with weights A, liesveryclosetotherangespaceof is small. Thenchanging the RHS by by a r and then changing the The quantity 1l u i = 1,...,m . Suppose d = Si+ Ar, i.e., ad a is from the column range of d is the minimum (least-square) distance of A, which l Ideally, we would like this increase to be as large as possible, to speed algorithmic convergence. proportional to 1/llU a = d andso IuI is the same as translating RHS by only aS X . Then because Ilul l is small, we can take a big step. If, however, I iUl is large, then "most of lies outside of the range space of d " RHS of a d is shifting the shape of A , so that a change in the X substantially, not just translating the polyhedron. Thus, the step we can take will be smaller. However, as the next theorem indicates, the value of important bounds on the values of show that if a is small, so is Recall that f (t), TMAX TMAX or and TMIN , II UiII also gives and thus will TMIN the path indicator function, gives important information about the boundedness/unboundedness of the path of centers xt in both directions, according to Proposition 2.2. 17 111 Theorem 22 (Bounds on TMhAX and TMTN ). Under the conditions and definitions of Theorem 2.1, (i) if e/lUIl > 120 , then TMIN (ji) if el/IiUll < -1/20 then TMAX < UB then TMIN > LB - and TMAX < (iii) if letu/ 11 111 < 1/20, 22¥m(m+l) + 1 LB 211 lii ll 22Ym(m+1) + 1 2111 illI 1.6m(m-l) + .6 lull B - 1.6¥m(m-1) + .6 . Iill This theorem will be proved in Section 5. Note that either (i), (ii), or (iii) must be satisfied, so that either a finite lower bound on TMIN or a finite upper bound on course of increasing from t t = 0 to TMAX t = a, is produced in the or both are produced. Case (i) corresponds to being (approximately) on the lower path at i.e., f(0)> 0 . path, i.e,. Case (ii) corresponds to being (approximately) on the upper f(0) < 0 close to the center of Case (iii) corresponds to X is increased from f(O)= 0, is so that (, 0) . Note also that these bounds are t t=0, t = 0 one or both bounds is at least to O(m/Iluii) t = cc = 1/8011 Ill, 1 128m Thus, even though the ratio of a Therefore repeated increases in to t using the methodology of Theorem 2.1 will result in either geometric growth 18 t-TmTIN in the quantity with a growth rate of at least geometric contraction in the quantity k)I (1- atleast Prrnacif4an m Suppose tNEW = ' where a, + I or with a contraction rate of TMAX-t , as the next proposition indicates. (-c.mobrir Chana ~~~U V~VII· ~· · ~~ ~-~· · 2 . (1 in . With the notation a and ~-LRPlaZIh+O T",,t nitly if XOLD - X , v} I T.. - nnfikA nr -- tOLD = 0, are defined as in Theorem 2.1, then XNEW either (i) (tNEW- TMIN) (1 + (ii) (TMAX-tNEW) < 1 - 1 m)( tOLD - TMIN) or 12 Im) (TMAX - tOLD) Proof: Suppose a lower bound, either or LB , is generated through Theorem 2.2. Notice that LB < LB if m 2 so in either case, TmNL - (1-6 Wm(m- i) + 6) /1912. (tNEW - TMIN) = tOLD - TMIN tNEW -6 + -TMIN I I -_ 1_6 Thus +1 = 1+ _ 1 128m · A parallel analysis demonstrates (ii) if an upper bound is generated. m Now let us return to our initial interest - to trace the parametric center path xt for the program ranges in an interval P (t) . Suppose we want to trace the path as t e It, t] , where 19 t > t . t Suppose we are given a III point (Such a point can be found by X. that lies close to the center of x using the algorithm in Vaidya [15] or in [4] .) The following algorithm, denoted for Parametric Center PCP Problem, is an iterative algorithm that invokes Theorems 2.1 and 2.2. At The Step 0, initial lower and upper bounds are set to their extreme values. initial value of and the counter is chosen as' t = , t x is set, as is the RHS . F are computed as defined in Theorem 2.1. which is the increase in close to the center of p(tk + a), (t) TMIN -NEw , , i.e., xNEw t Algorithm In Step 2, the constant is then computed; U and a , TMAX xNEW is (tk) = x , using In Step-4, the current are updated, in accordance with Theorem tk+ l 2 t. If so, it stops. If not, it The output of the algorithm is the piecewise-linear path t, namely t , t, ..., tk , ... PCP (Parametric Center Problem) Input: AE Rmxn, Step 0 (Initialization) UB=+oo, tk . approximately solves [tk,tk + a] = [tk, t] and the incremental values of x (t) In Step 1, the values as endpoints and interpolating. and is In Step 3, a piecewise-linear path In Step 5, the algorithm checks if returns to Step 1. Set point, according to Theorem 2.1. (tk ' l ) = XNEW bounds on 2.2. X is defined in the range and t at the iteration, is defined as in Theorem 2.1. t "dose-to-center" The next (for the The current value of number of iterations) is set equal to zero. The current value k b, d LB=-Co, Rm , k=O, t , x0 t ° =t, 20 -=x , RHS=b+dt Step 1 Set (Projection of d) = RHS-A Compute , S = diag(s). u = [I - S-1 A(A T -2 A A AT 1]S-1 d r = (AT - 2 Ar 1 ATS 2 d. Step 2 (Compute Step Length and Compute New Approximate Center) a Set 80 IJUJIl Sa = RHS+axSi-Ax-, Sa = diag(a); . = - (AT XNEW = Step 3 AY AT § e; x+cXr+q. (Extend Piecewise-Linear Path) Set tk+l = t +a . For tk t tk+l ,define Step 4 2 5t) + L a (XNEW - X) - .- (Update Lower and Upper Bounds) (i) if eT/ltI > 1/20 , then LB = max{LB,tk-(22imrnm+l+)/( 21 2 lU I) ' 11 (ii) if eT/l ull < - 1/20 then UB = min(UB, tk +(22 m (m+) +1)/(21 1 1 1u ) ; I < 1/20o ,thenLB = max(LB,t -(1 .6m(m1)+ 6)/11l11 (iii) if eT /I and UB = min (UB, t k +(1.6m (m-1) + .6)/11Ul }). Step5 If tk+ < , set RHS = RHS+da, k = k+1 , = NEW, and go to Step 1. if t k +l t, STOP. 2.3 Algorithmic Performance According to Theorem 2.1, if '; for each k = 1,2, ..., x(tk) x is close to the center of will be close to the center of break points of the piecewise-linear path i parameterized center. Lemma 2.4 x(t) Xt X k . , then Thus the will be near the Furthermore, we will prove in Section 4: For all values of t generated by algorithm [[x(t)- lZ. k near thecenter Xt, Q = ATS 2 A, and st = b+dt-Ait), in the sense that < PCP , 585, (t) is where St = diag(st) . U We are now ready to discuss the performance of the algorithm. According to Proposition 2.3, we obtain at each iteration either a geometric decrease in the gap the gap t - TMIN . TMAX - t at each iteration, or a geometric increase in We thus can measure algorithmic performance according to the change in TMAX - t , or t - TMIN , on whether we are approximating the upper path path (f (t) > 0) , or both. or both, depending (f (t) < 0) , the lower Suppose that in the course of running algorithm 22 PCP, that a lower bound on TMIN is never generated. Then all iterates will satisfy criterion (ii) or (iii) at Step 4, so that all iterates will generate an upper bound, and all iterates will lie approximately on the upper path. Lemma 2.5 (Algorithm Performance Based Only on TMIN = - or if no iterates of the algorithm bound on TMIN , then the sequence of TMAX-tk < (1 1) In particular, if Proof: t values If generate a lower will satisfy (TMAX) . t < TMAX, 128m In((TMAX K = PCP TMAX) - the algorithm will stop after at most ) / (TMAX - i) ) 1 iterations. Under the hypothesis of the lemma, the algorithm must satisfy either r l criterion (ii) or (iii) at Step 4. Thus, by Proposition 2.3, ( 128m) (TMAX - tk). Lemma. Then If 128m In ((TMAX-)/ (TMAx-t))l let K = Kln(1 l • -K( 1 )+ ln (TMAX -1) + ln (TMAX-t) < -(in ((TMAX - t) /(TMAX - Thus, TMAX- < Thus we obtain the geometric decrease of the t < TMAX, ln(TMAX-t ) TMAX - tk+1 tk < TMAX- t, whereby stop. t ))) + n (TMAX-t) tk > t . Thus the algorithm will U 23 ill Suppose instead that none of the iterates of the algorithm generate an upper bound at Step 4. Analogous to Lemma 2.5 we have: Lemma 2.6 (Algorithm Performance Based Only on TMIN) If TMAX = + ° or if none of the iterates of algorithm TMAX at Step 4, then the sequence of tk - TMIN 1+ . PCP generate an upper bound on t values will satisfy ( TMN) In particular, the algorithm will stop after K = r128m in ((t - TMIN)/ (t - TMIN)) iterations. We next examine the case when the algorithm generates both upper and lower bounds. We first need the following result, which will be proved in Section 5. Lemma 2.7 If criterion (ii) of Step 4 of the algorithm PCP is satisfied at iteration k, then in all subsequent iterations, criteria (ii) or (iii) of Step 4 will be satisfied. The significance of Lemma 2.7 is as follows: if at iteration k an upper bound is generated, then an upper bound is generated at every subsequent iteration. Lemma 2.8 algorithm (Algorithm Performance Based on Lower and Upper Bounds) If the PCP generates both lower and upper bounds, then there is some t 24 for all k > j j, the algorithm generates lower bounds only and the algorithm generates upper bounds, and Ui) for all (ii) for all k > j, Furthermore, if K= k such that for iterate k :5j tk-TMIN TMAX-tk 2 (1128m (TMAX - tj) then the algorithm will stop after at most t < TMAX , F256m ln TMAX 2-TMIN (1 + 12m(- TMIN) -128m In (t-TMIN) -128m n (TMAX - l + 2 iterations. Proof: The existence of j is guaranteed by Lemma 2.7. The geometric convergence rates are then a consequence of Proposition 2.3. suppose t < TMAX , and let K be as defined above. note that .5 (In ( - TMIN) + n (TMAX-t)) lnTMAXI- TMIN 2 from the arithmetic-geometric mean inequality. K Thus, 128m In (t- TMIN)+ 128m In (TMAX - t) - 128m In (TMAX - t) - 128m n (t - TMIN) 25 Let Finally, t = t, and lnt- TMIN = 128m - TMIN + 128m1TMAXln TMAX - According to Lemma 2.5, with 128m In TIN t replaced by , tk t after at most iterations. - TMIN Furthermore, according to Lemma 2.6, with after at most t K t replaced by tj+ l , tk t iterations. .- 2.4 A Strictly Monotone Algorithm for Linear Programming that requires O(mL) iterations. Suppose we wish to solve the problem LP: maximize Ax s. t. where t = x, e R n+ l , A b Rmx(n+1), and eliminating one of the x and b e Rm . (n+1) variables Upon setting of , LP is easily transformed to the form maximize x, t LP: t Ax s. t. where x £Rn, A Rm xn , transformation of the data algorithm PCP b E Rm , and the data (A, b, c) . to trace the path b+dt x(t) Ax < b + dt . 26 We can solve b) LP are a linear by using the of center to the parametric problem x, e R n Suppose (x °, t) = (xc, cTx °) and path satisfies the starting criterion of the algorithm Ix -xtIQ < namely A . Q' =AT(S) x(t) for is a given starting point for which '-1 21 where s = b+dtO-Ax ° > 0 Then we can use algorithm objective value of the linear program k = 0, ... ,, LP . S ° = diag(s ° ) PCP I[t, TMAX) = [cTxO, z ' ) where z' t PCP to generate the is the optimal The sequence of values of tk , will be strictly increasing, according to Theorem 2.1, i.e., the objective value will be strictly increasing at each iteration. total number of bits in a binary encoding of LP . Let L be the In order to evaluate the algorithm's complexity, we consider three cases. Case (i): The linear program is unbounded. never generate a finite upper bound on After k = O(mL) iterations, In this case, the algorithm will TMAX , which equals infinity. tk > (t -TMIN) (1 + 2 L , and we can conclude that LP is unbounded. Case (ii): The linear program is bounded and indicator function at t = to , is negative. f(to) , the value of the path This being the case, the algorithm will always generate upper bounds, and after O (mL) (z*-tk) (-t)(1 l 2 ) will exceed is less than 2L, iterations, whereby (tk) canbe rounded to an optimal solution, see Karmarkar [6]. Case (iii): The linear program is bounded and can show as in case (i) that after otherwise the LP k = O(mL) would be unboundled. 27 f(to) > O . In this case, one iterations, that f(tk) < O , for Furthermore, after an additional k = 0 (mL) iterations, we will obtain via case (ii) that we can round to an optimal solution. Thus, after 0 (mL) iterations, we can round to an optimal solution. Note that in either of the three cases, that algorithm process LP after O(mL) PCP will (by detecting unboundedness or producing an optimal solution) iterations. This algorithm falls into the class of central-trajectory based algorithms, but is inferior in that the bound of 0 (mL) iterations is worse than the bound of O(4fiL) iterations for algorithms such as Renegar [11] or Vaidya [16] that trace the (weighted) center of the system Ax cx as b 8 is increased, or to the bound of O (ff L) iterations for algorithms based on barrier penalty methods that trace the solution to m maximize cT x + e n nsi i=l s.t. Ax+s= b s>0, see Monteiro and Adler [10], among others. All three methods follow the same path in their idealized version. Yet the latter two obtain convergence in superior to our algorithm. O (I¥ L) iterations, which is However, these other algorithms do not guarantee strict improvement in the objective value, (but do guarantee strict improvement in the duality gap). In contrast, our algorithm will guarantee strict improvement in the objective function of 28 t )/128m ('-Cc at each iteration. Perhaps it is the implicit imposition of the strict improvement in objective value that increases the iteration bound by a factor of Furthermore, our algorithm does not assume that Instead, our algorithm will detect unboundedness of LP O ( V) . is bounded. LP directly. As a final note, note that our algorithm can be used to mimic Renegar's algorithm [11], tracing the center of as PCP t is increased. Ax b -cx -t Thus, Renegar's set-up is a special case of the problem we are considering. However, we see no way to cast problem (2.2) as a special case of the set-up used by Renegar [11]; we allow all RHS values to vary simultaneously, which is apparently more general than in his work = [11]. The remainder of the paper is devoted to proofs of the results presented in this section. preliminaries. algorithm In Section 3, we present notation and Section 4 contains an analysis of a single step of the PCP , and contains proofs of Theorem 2.1 and Lemma 2.4. Section 5 contains an analysis of bounds generated by the algorithm, and contains proofs of Theorems 2.2 and Lemma 2.7 r 29 111 3. Notation and Preliminary Results In this section we present notation and some preliminary results that will be used in the proofs of Theorems 2.1 and 2.2 and in subsequent analysis. 3.1 Notation an .d Translations v , II v For a vector matrix M, IM I[ i denotes the Euclidean norm, and for a denotes the usual matrix norm, i. e., iIMI = sup IIMvIIl/iivI Note that if If M -' Ilv ll M is a diagonal matrix, lI M ll = max is a positive definite matrix, the = M-norm mill of v is VTMV. PM: = I- M(MTM 1 MT The matrix denotes the orthogonal projection matrix which projects onto the null space of MT . Let Qt (x; u) the negative of the Hessian of the function ft (x; u): = n (bi + tui - Aix) , where u e Rm is a given vector i=l parameter. (. Let (x): = diag(b-Ax) Then Qt (x; u) Q(x) AT - =AT),\ -2 and At(x; u): = diag(b + tu-Ax) . hen (x) A 1I 30 t = 0 , \e denote denote Let xt(u) denote the center of the system denote the center of the system Suppose observe that d = u + Ar Ax < b + tu , and let Ax < b . for some At (x; d) = At (x - tr; u) u Rm and and hence Thus, the difference between modifying the re R. We Qt (x; d) = Qt (x - tr; u) RHS by d and by simply corresponds to a translation of the inequality system by ({x R n I Ax < b+td = x x u tr , in that R n I Ax < b+tu + tr The following Lemma is therefore obvious. Lemma 3.1: Ci) Suppose t (d) = d = u + Ar for some and re R n . Then (u) + tr , (ii) Ilx-xt(d)llqt(Zt(d);d) = II(x-tr)- (iii) max ur Rm t I Ax < b + td for some t(u)IQ(x(u);u) for any x} = max xE R n , t I Ax b + tu for some In the sequel, we shall be working with appropriate choices of and r Rn instead of d . Rm , u For the appropriate where convenient and the context is clear, welet At(x): = At(X;U)x;, Rm As xwe shall see in Section 5, this in fact is central to the construction of the proof of Theorem 2.2. u and (x = A(x)), 31 Xt = Xt(u) , S = b-Ax x . III 1h S = diag (-s) and St = diag(st). St = b + tu - A diag (^) b - AR;, s We also will abbreviate , Qt(x; u) by Qt(x). The next Lemma presents some basic inequalities. It is essentially Proposition 7.2 of [4] with some simple extensions. Lemma 3.2: Q(x) = xe R n Suppose AT S -2 A. Then for any satisfies s = b-A >O and let I- XIIQ() < <, 1 xE R n such that we have (i) (ii) = b-A > 0, I I A - 1 () A (iii) IA- ()A and for any (iv) II IIQ() (R)fl = lS-lslI < 1-81 ()ll = -S I < 1+8 ! VE R n 1 II IIQ(), 1- 8 where Q () = AT-2 A, and (V) 11 VIIQ(l ) < (1+s)11 v IIQ() . 3.2 . Equivalent Measures of Closeness In this study, we measure how close a point the system Ax < b with the norm II-XIIQ(X) 32 x is to the center x We shall also make of use of a different measure of closeness to x that was introduced in [4] , and we will show a certain equivalence of the two measures. Define Q = m AT-2A, y=y(x) = 1 ATS e (3.1a) - and y = y7() Note that Ax b. = / (m - 1)yTQ-l y denote the center of the system ThenQ= Q() . Let Then y(X) = 0 andso y(x)= 0 In [4], the scalar = y(x) is used to measure the closeness of Lemma 3.3 ([4]). Let h > 0 Suppose (3.lb) 1 yT Q y Let to the center x x . Ax < b. be the center of the system be a given parameter. = (X) < gIngh(1 + h)) C (m'- 1) where g(a) = h 2 (1 + ) Then lx-xlQx Proof: Follows from the proof Lemma 7.2 of [4] with weights (1 - h) 2 w = (llm)e . U We shall say that Corollary 3.1: x is approximately centered if y(x) < .0072 . If y(i) < .0072, then 33 ill 'iQ(-) < 1/21 III Proof: Let h = .03 and (x = h) h h 2 (1 + we have IIX-XI2Q(). < m m-1 Substituting for h and ( Then from Lemma 3.3, ?) ((1-hy)2| , and noting m 2 gives the desired result. Therefore, for appropriate values of h , approximately centered implies is close to the center by our criterion, that is 1X- XIQ(x) Lemma 3.4: the system Then < 1/21 . x is h = .03 ) Next, we show the converse implication. I-xIIQ() Suppose (e.g., < < 1/2, where is the center of Ax < b ' = y(x) < a + 2a 62 where .1 2(1 - 6)(1 - 26) Proof: Let = b-Ax II (^1 = and jjX...XjfQ() = b-Ax < From Lemma 2.1 of [ 4], vwith weilghts m m ,i i=1 n s - , i=1 nsi< M 111- 1 1 1-8 From Lemma 3.2, I X....(XIIF 8 1-6 < 1. N = (1/m ) e , 3 (., \here 2(1-ct) 34 a = 8 1-6 < 1 . . On the other hand, from (2.4) of [4], m m lni 1=1 Therefore, Thus, , lni 2 ( mI )(1 + i=1 a = y 2<a+ , where y = y(x) . (1+ - - VT-+72y . a2 2 (1 - a) 7 1+ 2) , where 2 a = 82 -) 2(1-a) . 2(1 - )(1 - 2) Finally, we present some elementary inequalities. Lemma 3.5: Assume m >2 . Let Q, y and Y be defined as in (3.1) . I (i) For all (ii) For all Proof: Note that £> , < 1, 9y< £ implies (m - 1)yT Q - y ? = (m- 1) TQ y 1 - yT Q-l y Since <m m-1 +2 m- 1 To prove (ii), note that (i) (m - 1) yT Q-1 y < £ £ implies whereby yT Q- y = follows immediately. (mn - 1) T Q-1 y I-EC E implies m 35 2 < £/(1 - ) i m-l +92 4. Analysis of One Iteration of the Algorithm In this section, we analyze one iteration of the algorithm and prove the Improvement Theorem (Theorem 2.1) and Lemma 2.4. if I t is small then the two centers x and xt First, we show that are sufficiently close to each other with respect to some appropriate norm (so that Newton's method, when applied, will converge). Theorem 4.1: and let xt Let denote the center of the inequality system u = A-' () u . Proof: denote the center of the inequality system I tl < 1/(76 u) Suppose sufficiently small. note that = tIQ(x) < 1/12 The proof makes use of Lemma 3.3 of the previous section. We want to show that the quantity y Ax < b + tu . Let I - . Then Ax < b AT y(): = I A- Y = 7(x) for the system Ax < (b + tu) We begin by giving expressions for y and () e = 0 (see (2.3)), and so for the system AT At1 () e 1 AT [At1 ()e is Q . First Ax < (b + tu), 1()ei m1 AT At () A- () (t u) -i Also, Q1..= ATAT Qt(,) ()(t = I ) , by definition of AT At- (x) A 36 uG above. and so , yT Q- y = ml ( ()A_ (A T t2()A ) _ A Ta()(t I) -lt because the eliminated matrix is a projection matrix. IIlIlt 112 YT Q-1 y(m - 1) < 1- Thus, from Lemma 3.3 with Corollary 4.1: (1 - 5,775 and y < .0132 . 2) < ( 1 )2 I hy) 2 Under the conditions of Theorem 4.1, I IX Proof: 1 Hence h = 1/18 2h 2 (1 + -X XtQ, .- < < I -11II-lul2 t 112 T Q-l y - Xt II Q, (z < 1/l Follows from Lemma 3.2(iv). . Furthermore, using Lemma 3.2(ii), the following corollary is immediate. Corollarv 4.2: Let d R 2 Suppose IIx-XIIQ(, < 1/21 be given and define U: = P AS' d 37 11 denote the center of the inequality system Let xt Suppose It < X -XtiiQt(t-) < 1/11 Proof: - Let A-1() tlA - ()SuI Then 1 u = Su It I< u = A() u . and Note then that IIA' (x)sl U < 2llull, Thus, 80 11 I < ! 21. Qt(^Xt < 1 8011u11 20 Hence, from Theorem 4.1, we have -Xt and so that from Lemma 3.2(ii). 1l 1/76 b+tg-d ' IIU- 4.1, Ax < 1 761 I 11 llx-tl a(-X)< 1/12 and by Corollary . 1/11 We next use a theorem of Renegar [11] which gives the region and rate of convergence of Newton's method for our problem. Theorem 4.2 (Renegar [11]): xe intXt = ({x Rn Assume Ax < b + t E: = Ix-XtlQt,(x) <I ,1 ,here and x. is the center of the system Ax < (b + tu) . 38 Let Qt t = b+ut-Ax, ATS-2 Q= ATS Then and St = diag (t) -t = - Q 1 () AT t1 e, where A.t 11t-XtlQ,(t) < (1+ E2 e . We are now ready to prove the Improvement Theorem. For the reader's convenience, we restate the theorem before proving it. Theorem 2.1 Suppose Define (Improvement Theorem): e int X = x l Ax < b)} U: = satisfies II - I Qc) < 1/21 (projection of g' d) ; S-1A Sd I r: = (AT S 2 A)l AT S- 2 d a: = (step length) . s801ull s: Further, define (translation vector of system) ; = b + cSu-A, q: = -(A T Sc: = diag(Sa); A) ' AT S e S (Nexw approximate center) x,Ew: = x + q + c.r Then l XNE -(X- IQ (E)V) - (Newton step) ; 1 39 Proof: First note that (dl Q ((d); d) = II~a xa (u) IIQ((u); u) XNEw -a Hence, by Lemma 3.2 (iv) (with where Xa = x+q and and that II Xa - Xa - Qa.() Let inequality system Ax < b +ut W . let u = S, and denote the center of the xt and let u,r, Qt(x) = Qt(x;u) . For all I I I Q(-X)~ Rn ; 76l~lz( for all v 12I~i7675 1lviIQ(^) for all v E R n ; and · (I10 (ii) IV IIQ, rt') < (iii) 1/22 , where .033 < 1/22 as in Theorem 2.1. NEW which, by Theorem 4.2, implies <.1462 Lemma 4.1: Under the conditions of Theorem 2.1, define te [O,c] < = In Lemma 4.1 (iii) below we Qa (Xa) = Q (Xa (u); u) . : = IIX-XalQ() show that, , x = x (d), IIxcallQ(;a) it suffices to show that 8 = 1/22 ), a = Xa (u) u = Su . Thus by Lemma 3.1, d = u + A , where Xt IIK- x tllQ,(^) < .1462 Proof: From Corollary 4.2, iwe have 76 and IJ|2- I|Q(;) < 1/11 (4.1) Thus, by Lemma 3.2(ii) and (iii) , (,)) A,(?)I 11A- 11 11~I and 11 an _ 10 ' j[A.II-:. {\)A(.K)H S 40 (4.2) IlI A () ~~~t A () I1= max i Also, bi - Ai bi + t Ui- Ai max 1 + bi - Ai < 76 t Al()ull IItA- ()u I because I1- 1 76 76 (4.3) 75 from (4.1). Similarly, ()'At ()II = max bi + tUi- Ai i bi -Aix I, = max 1+t ui i I bi- Ai x I < Hence, I VIIQ(X) 77 76 1+11tA x = IA- (X)A v IA • II)- (R1) This proves (i). (4.4) Qt ( x) (X)ALt(t) I A () from (4.2) and (4.4) The proof of (ii) is similar, using (4.2) and (4.3). show (iii), we have by the triangle inequality, -'4 41 A vi Next, to III 11 3z - Xt > IIQ,(^) 1 -1 IQ(X) + 11 l I _ (124t 11 75' (12 i )(7(2 since IjX-X-IIQ(-) <-20'21 IIx-0 Xt x x xt x 0) + 1- IIQ(x) < .1462 1 < from (ii) and (4.1), I from Lemma 3.2(iv). 20 . We are now ready to prove: Lemma 2.4: For all values of Xt , near the center of x (t)- xt Q,((t)) Proof: Let C generated by algorithm t PCP , x(t) is in the sense that .585 a = 1/(801111) as in Theorem 2.1. Lemma 3.2(ii), it suffices to show that Let te [0, ] . By IIx(t)- tIIQ() < 0.369 We have, from the proof of Theorem 2.1, and Lemma 4.1(iii), IiXa xIQa(a) - I| x Hence, IIX- X IQ(x) IQa(a) + I XC 12I7) (~ 1 (x-- I Qa(x,) < .033 +.1462 < 0.18 I(- < 2ii'~7i(I 71 Xa - R I (Ux 1I-,(. 11 75 10 76 < 0.222 42 (by Lemma 4.1(ii)) (by Lemma 4.1(i)) Next, observe that Ix(t) - IlQ (,) = = t (a-x) -(t)-x Therefore, LI (a - x) I Q,( ,) < 0.222(t/a) Thus, by the triangle inequality and Lemma 4.1(iii), I!5(t)-XtIIQt(,) < Ilx(t)-IllQ()1 ) + I1i -tllQ,(2,) < .222 X X x X~~~~~~~ + .1462 < .369. II 5. Lower and Upper Bounds In this section, we analyze the upper and/or lower bounds generated by algorithm PCP , and we will prove Theorem 2.2 and Lemma 2.7. Recall that TMIN:.= inf T TMIN · T = ( t I intX t , 0 We shall derive upper bounds on TMAX and and lower bounds on We shall, for convenience, adopt the following notation and assumptions throughout this section. The current value of X is the center of the system t is t = x - x Q() = iterate x satisfies s = b-Ax > 0 and where TMAX: = supT Ax < b , , and the current < S (s -s = b-Ax, 1/21 S = diag(s) and S = diag () . We decompose d into d = u + AF, where = PS-'AS d and (AT -2 A)AT- 2 d . F By assumption, u u = • 0 . d (5.1) does not lie in the column space of A We shall prove Theorem 2.2 and Lemma 2.7 in this section. begin with three fundamental Lemmas. Lemma 5.1: Proof: and so eTS leT S 1 u - eTu < I l/20 u - eT Ie(S | - =e( - S < is' < 44 ]ul R '.r) · i '1 U(s-s We The first inequality is the Cauchy-Schwartz inequality and the last follows from Lemma 3.2 and the assumption that f() = eT S u Recall that If e T i > IIui/ is the path indicator function at thenfromLemma 5.1, 20 Therefore, the current center eTS' u x is on the lower path and e T' then TMAX . >0. TMIN is - m/(eT S u). is bounded above by However, we typically cannot deduce the exact value of - m/(eT -1 u) eTS' u < -1uil/20 t= 11/20 eTU11-_ bounded below, as we have seen in Section 2, by the bound Similarly, if . IiX-XjQ(i) < 1/21 . from the current iterate. Therefore, we shall later in this section derive an alternate bound using the next Lemma which is a variant of Lemma 7.1 of [4]. Lemma 5.2: Let be the center of the syst,em xe intX = (x Ax<b) the ellipsoid FOUT FOUT: = Then Proof: Let x int X Suppose IIX-xIIQ) < 8 < . Define by: xe R n intX is given such that Ax < b . I IIx-xIIQ() <( +)Vm(m -1) + 68) (5.2) FOUT . be given and let s = b-Ax . Then from properties of the center, (see for example [4] , Theorem 2.1), ilx - 45 II Q(~) < /im (m-l) From III II x - X I Q(I)< (1 + 8)/m (m-i ) Lemma 3.2(v), we also have IIx--IIQ(T) < Ifx-XiIQ(¢) + IIX-RIIQ()-< (1 + 8)Vm (nm- Thus, + . 8 The next Lemma concerns the well-known classical least-square problem or minimum-norm problem. Lemma 5.3: mxn matrix M Given an and an m-vector d , for all scalar t, IIMx-tdjI where t 2 ItIt IPMd for all x E R n , PM = I - M (MT M'MT. We are now ready to prove Theorem 2.2 and Lemma 2.7. We shall prove Theorem 2.2 in two parts. Proposition 5.1: (i) if eTI (ii) Proof: Under the definitions and conditions of Theorem 2.2, [ < -1/20 if eT-/ll then > 1/20 (i) Suppose TMAX TMIN then eT Wllu 1 <- 1/20 . < UB - (22 Vim+1j+ 1)/21 Iull , > LB - (22 fr-im+1 + 1) / 21 I 11 . We first show that 0 is close to the center of the following extended system with one additional variable and one additional constraint: A x b' (5.3) 46 eu S'U where[ ]e A=[0 where 0 and 0 = I 1- eT S 0 < 0. . Note that from the remark following Lemma 5.1, Therefore, for some x) TMAX = SUP {t I Ax < b +dt t = sup t t I Ax t I- x = sup t for some x) b + ut for some x} -1 For system (5.3), define Then, A(x,O) = S O and - t A(x,t): = diag([ ] I SO A , 0) = -1 Therefore, x = le [eT X =([x] 1 A, -eT = -1 u+ is the center of system (5.3) and 0 lI[] -[ l1 Let and 0 1 0 1 [eT, 1],& (R",0) Q(x,t): = e Q(X O) Rn[+1A =II xt[ ] < -'II Q(x) < 1/21 1 I 47 / and (x,t) TA -- 2tA . III FOUT= [ :] E R n+ l [ X] Then, from Lemma 5.2, int'X Furthermore, because = A T S- Q(x, ) = [ ATs -2 A -ATS R 0 < i5(X, 0) 2m 21 m1 + 1 21 ; FOUT -2 = A T -2 u ] u , we have ATS2- A 0o 0 02 +1lu~12 -uT S -2 A 02 + UT S-2u · Thus [tt 0 2(, 0 1 ; Q(i,O) ) > t 2 11 whereby II2 Et] Also, for any + t2(0 2 + II II2) A(x -)112 =1131i ( o0) 0; t 22 m iim +1T + 1 21 Vmim+1 + 1 t< t< 22 2111 II It then follows from Lemma 3.1 that TMAX = SUP ( t Ax - tu < b for some sup {t Ax- tu < b for some x ,Ot < 1) = = sup t fx E intX for some 48 xI x I (since 0 < 0) • (22 x} E FOUT for some 5 max t I [t (since int X FOUT). m (im+)j+ 1)/211ll The proof of (ii) exactly parallels that of (i). Proposition 5.2: (i) TMAX < UB (ii) TMIN LB -( Proof: S eTd/ IJuJi Suppose I Then < 1/20 ( 1.6 m.(m-1) + .6 )/ 1.6/ m (m UI I ) + .6)/ [-ll We shall show that 00] is approximately centered (in the sense of Section 3.2) for the extended system [x] [A, -u] Let Section 3.2. Q, y Let for system (5.4). and y = y(x) Q,y forsystem and y = y(R,O) Ax b be as defined in be the corresponding parameters Observe that = b-A b -[A, -u] Therefore, since (5.4) u = u = > 0 and AT S-1 =0 , r Q=4 -AT L_uT s-2 [A, -ul = 49 Q 0 0 IIi2/m 1 11 and and y AT -u T m Therefore, I [-eT -d/mj yT Q-1y + e T -1 IIX-gXQ() Recalling that Y -I ee S' / 2 < 8 = 1/21 , wecompute conclude from Lemma 3.4 that, 'y < . a+ a = 760 1 and Next from Lemma 3.5(i), Thus, (m -1))yTQyY I + (760+ Im-)lull) - < .0053 2 t Therefore, by Lemma 3.5(ii), h = .41 [x:] < .0735 . and so Taking in Lemma 3.3, we have ] 2r~ <) i - K where xt] Note that ) m- h2(1 + 2) < .36 < (.6)2 , (since (1 - h) 2 is the center of system (5.4) and [ X] Let q < .0054 X: =([ t] 11~5(XO) = IIs- A(x and [A, -u] 50 Q(x,o) =mQ. - x)- tu . m/(m-1) 2) FOUT: = it] IIS-'A(x-x)-t int X Then by Lemma 5;2, [:] e FOUT + .6 (m-) <_. 1.6 By Lemma 5.3, for all since > Itl JJUJI 1.6 Im (m-1) + .6 > jj'A(x-R)-fu-jj Thus = 0. ATS-1 Four. i[ sup (Itl I t T) =sup (ltl [I < max (Itl [x] e < (1.6 xl for some FOUT m (m-1) + .6)/lU x) for some e int X I U This completes the proof. Finally, we Theorem 2.2 follows immediately from Propositions 5.1 and 5.2. prove Lemma 2.7: iteration k , criterion (ii) of If will be satisfied. Proof: Suppose criterion (i) eT1/ll/jll < -1/20 in is satisfied at algorithm PCP of then in all subsequent iterations, Step 4 that Step 4 criteria (ii) iteration k or f(tk) = eTS u < 0 . 51 of It suffices to show will not be satisfied in all subsequent iterations. Lemma 5.1, we have (iii) By Proposition 2.2, By III I f (t) < f (tk) < 0 criterion (i) for all t > tk . Therefore, in all subsequent iterations, will not be satisfied, as it would imply that t > tk . ,. 52 ____11__11_1____11___-.I___-- _ f (t) 0 for some References [1] Barnes, E. R., S. Chopra, and D. L. Jensen (1988). "A polynominal time version of the affine scaling algorithm." Working paper, Graduate School of Business Administration, New York University, New York, NY. [2] Censor, Y. and A. Lent (1987). "Optimization of 'log x' entropy over linear equality constraints." SIAM Journal of Control and Optimization 25, 921-933. [3] Freund, R. M. (1988). "Projective transformations for interior point methods, Part I: Basic theory and linear programming." Working paper OR 17988, Operations Research Center, M.I.T., Cambridge, MA. [4] Freund, R. M. (1988). "Projective transformations for interior point methods, Part II: Analysis of an algorithm for finding the weighted center of a polyhedral system" Working paper OR 180-88, Operations Research Center, M.I.T., Cambridge, MA. [5] Jarre, F. (1988). "On the convergence of the method of centers when applied to convex quadratic programs." Manuscript, Institut fir Angewandte Mathamatik und Statistik, Universitit Wiirzburg, Am Hubland, 8700 Wiirzburg, West Germany. [6] Karmarkar, N. (1984). "A new polynominal time algorithm for linear programming." Combinatorica 4, 373-395. [7] Kojima, M., S. Mizuno, and A. Yoshise (1987). "A primal-dual interior point method for linear programming." Research report No. B-193, Dept. of Information Sciences, Tokyo Institute of Technology, Japan. [8] Megiddo, N. (1986). "Pathways to the optimal set in linear programming." Research report RJ 5295, IBM Almaden Research Center, San Jose, CA. [9] Mehotra, S. and J. Sun (1988). "A method of analytic centers for quadratically constrained convex quadratic programs." Technical report 88-01, Dept. of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL. [10] Monteiro, R. C. and I. Adler (1987). "An O(n 3 L) primal-dual interior point algorithm for linear programming." Report ORC 87-4, Dept. of Industrial Engineering and Operations Research, University of California, Berkeley, CA. [11] Renegar, J. (1988). "A polynominal-time algorithm, based on Newton's method, for linear programming." Mathematical Programming 40, 59-94. III [12] Sonnevend, G. (1985). "An 'analytic' center for polyhedrons and new classes of global algorithms for linear (smooth, convex) programming." Preprint, Dept. of Numerical Analysis, Institute of Mathematics, E6tv6s University,; 1088, Budapest, Muzeum K6rut 6-8. [13] Sonnevend, G. (1985). "A new method for solving a set of linear (convex) inequalities and its applications for identification and optimization." Preprint, Dept. of Numerical Analysis, Institute of Mathematics, E6tv6s University, 1088, Budapest, Muzeum Kbrut 6-8. [14] Todd, M. J. and Y. Ye (1987). "A centered projective algorithm for linear programming." Technical report No. 763, School of Operations Research and Industrial Engineering, Cornell University, Ithaca, NY. [15] Vaidya, P. (1987). "A locally well-behaved potential function and a simple Newton-type method for finding the center of a polytope." AT&T Bell Laboratories, Murray Hill, NJ. [16] Vaidya, P. (1987). "An algorithm for linear programming which requires O(((m+n)n 2 + (m+n) 1 .5 n) L) arithmetic operations." Proceeding of the Nineteenth ACM Symposium on the Theory of Computing, 29-38.