!~~eR~~:Ns.R~~n~~AE ~B ~W~:~~F7 ~i~ i:-

advertisement
:
:a-:S~~~c-~~,~~-
~B
~W~:~~F7
·::;::--:'~-
i:-
;
:r:`:-small_~
:-.:::::-:
~i~
!~~eR~~:Ns.R~~n~~AE ---Da e
;·:.·
.-
":
·1-.
.
:
Ij
.·
·;
:·
- ·:.-·.· ·.
;'·J:
·
·
- .·r-t
I)·
·-:
:.·
'i
.!·
i;·
;·
:'·
:·
·::
-,
..
·.'
i.--,,:-··
-·
r--
b.-
i.
i·
i.
:··--
>
MASSACHUSETTSBl~~~~i~
INSTITUT
.OF~0TcHNOLOGY
'
A METHOD FOR THE PARAMETRIC CENTER
PROBLEM, WITH A STRICTLY MONOTONE
POLYNOMIAL-TIME ALGORITHM FOR
LINEAR PROGRAMMING
by Robert M. Freund
and
Kok-Choon Tan
OR 192-89
March, 1989
v
r
Abstract
Given a system of linear inequalities and equalities Ax
b + dt and
Mx = g + ht where the right-hand-sides (RHS) are parametrically deformed
over the scalar t , the parametric center problem is to trace the parametric
family of approximate solutions
(t) to the center problems P(t) , where
m
P(t)
is the problem:
maximize
ln (bi + dit - Aix) subject to
Ax < b + dt
i=l
and
Mx = g + ht . We present an algorithm for tracing the parametric family
of solutions
(t) over the given range t E [ t, t] . At each iterate of the
algorithm, the value of the parameter t is strictly increased and a Newton step
is taken. The sequence of values of t exhibit the following geometric rate of
change: If tk and tk+1 are two successive values of the parameter t
generated by the algorithm, then either (tk+l - TMIN)
(tk - TMIN)(1 + 121m)
(TMAX - tk ' ) < (TMAX - tk)( 1
) where TMIN (TMAX) is a lower
128m)
(upper) bound on the smallest (largest) value of t for which Ax < b + dt,
Mx = g + ht has a solution. Thus the iterates exhibit either linear growth
away from TMIN or linear convergence toward TMAX , with a rate of change
1
of 128m , where m is the number of inequality constraints.
or
When applied to the linear programming problem, the algorithm is an
O(mL) iteration algorithm for linear programming, that strictly improves the
primal objective value at each iteration, and requires no dual feasible solution (or
even dual feasibility) to start. After O(mL) iterations, the algorithm either
detects primal unboundedness or produces an interior solution that can be
rounded to an optimal solution to the linear program.
Key Words:
Newton step, center, linear program, interior-point algorithm.
I
Introduction
Given a system of
Ax < b
and k
m
linear inequalities in
equations in
center of the system
Rn
of the form
(A, b, M g)
Rn
of the form
Mx = g,
the (analytic)
is the optimal solution to the convex
program:
m
P:
A in (bi - Aix)
maximize
i=l
s.t.
Ax < b
Mx= g
where
(Ai,
bi )
respectively.
denotes
ith
(See Sonnevend [12, 13] .
(x e Rn Ax < b, Mx = g )
system
rowof
(A, b, M g) ,
A
and
ith
component of
b,
Assuming
is nonemepty and bounded, the center of the
denoted
,
is uniquely defined.
The
computation of points near the center and their properties are important for
interior point algorithms for linear programming and extensions, see
Karmarkar [6],Renegar [11], Megiddo [8], Kojima et. al. [7], Vaidya [16],
Monteiro and Adler [10], Mehrotra and Sun [9], Jarre [5], Barnes et. al. [1],
and Todd and Ye [14], among others.
Algorithms for finding the center are
presented in Censor and Lent [2] , Vaidya [15] , and [4]
This study is concerned with the parametric analysis of the family of
centers as the right-hand-side
varies parametrically.
(RHS)
(b, g)
of the system
(A, b, M, g)
We define the parametric center problem
(PCP)
be the problem of tracing the paramitric family of optimal solutions
the problems:
1
xt
to
to
P(t):
, in (bi + dit - Aix)
maximize
i=l
s.t.
Ax < b
+
dt
Mx= g + ht
d e Rm
(where
interval
t E [,t]
and
,
t, t
and
varies over a given
t
are given), as
h e Rk
algorithm for generating a piecewise-linear function
the property that
We present an
are finite or infinite.
x(t): []
is an approximation to the center
x (t)
is an approximate solution to
P(t), as
the approximate path of solutions.
t
is varied.
Rn
-
with
(t)
x (t) , i.e.,
We refer to
(t)
as
(The sense of the approximation and its
properties are defined in Section 3.)
The algorithm starts with an approximate solution
P(t)
center problem
t = tk
t
t = t .
at
t = tk+1
is chosen as
to solve for
x ( tk + ).
where
The path
guaranteed increase in
(TMAX -
t1)
is
P(tk)
tk+1 > t k
( t)
t
and a Newton step is performed
is then extended for
at each iteration.
In particular,
tk)(i
where
-128m)
(upper) bound on the smallest (largest) value of
t
is
The next value of
t e [tk, tk+1 ]
or
Mx = g + ht , has a solution.
TMIN (TMAX)
t
__
is a lower
for which
Thus the iterate values of
demonstrate either geometric growth away from
2
can be
tk+1
(tk+1 - TMIN) > (tk - TMIN)(1 + 1)
< (TMAX -
Ax < b + dt ,
(tk) .
t
The important feature of the algorithm is the
by linear interpolation.
chosen so that either
k , the value of
At iteration
and the approximate center for
to the
( t)
TMIIN
or geometric
contraction toward
TMAX , with a rate of change of
1
128m
(where
m
is
the number of inequality constraints).
This algorithm can be applied to solve the linear programming
problem in a new way.
LP:
Suppose we wish to solve the linear problem
maximize
cT x
s.t.
Ax < b
Then we can use the algorithm for
PCP
to solve for the path of centers of
the system
LP(t):
maximize
I In (bi - Aix)
i=l
s.t.
Ax < b
cx= t
as
t
is increased.
This yields a new "central-trajectory-following"
algorithm for linear programming that differs from other central-trajectory
methods in two ways.
First, it is a strictly monotone algorithm for linear
programming, i.e., an algorithm that strictly increases the objective function
value of the primal at each iteration, unlike other central-trajectory-following
algorithms.
Second, it requires no prior information or bound on the
optimal objective value, and will process a linear program that is unbounded
in the primal objective value (i.e., dual infeasible), unlike other centraltrajectory methods.
However, the complexity of the algorithm is
iterations, as opposed to
(where
O(~:w L)
O(mL)
for most other central-trajectory methods
L is the bit-size of the problem instance) and so has an inferior
3
complexity bound (by
v ff) .
Perhaps it is the strict monotonicity of the
primal objective value in the algorithm that is responsible for the inferior
complexity bound.
This paper is organized as follows.
In Section 2, we present the main
results regarding the parametric center problem and we present the algorithm
for tracing the approximate parametric path of centers. The remaining three
sections are devoted to proofs of the results of Section 2.
notation and preliminary results.
Section 3 presents
Section 4 contains an analysis of one step
of the algorithm and presents. the results on the use of Newton's method.
Section 5 contains the results regarding bounds on feasible values of
are generated by the algorithm.
4
------
t
that
2.
The Parametric Center Problem
Given a system of
Ax < b
P:
Rn
of the form
Mx = g , the (analytic) center of the system
and equations
referred to as
linear inequalities in
m
(A, b, M, g)
is the optimal solution to the program
maximize
A In (bi - Aix)
i=l
s.t.
Ax< b
Mx = g
(see Sonnevend [12, 13] .)
Suppose
x
(RHS)
as the right-hand-side
center
P .
is the unique solution to
varies parametrically.
Our interest lies in tracing the
of the system
(A, b, M, g)
In particular, we are interested in generating the
x t to the problems
parametric family of optimal solutions
m
P'(t):
A in (bi + dit - Aix)
maximize
i=l
(2.1)
Ax < b + dt,
s.t
Mx + g + ht
In this section, we present an algorithm for generating a piecewise-linear path
of solutions
and as
t
x(t)
such that
x(t)
is close to
^xt
in a suitable measure,
is varied strictly monotonically over a prespecified range.
First note that there is no loss of generality in assuming that the
equations
Mx = g + ht
are not present.
without loss of generality that
kxk
and is nonsingular.
lM = [B, N]
To see this, we can assume
is a
By suitably partitioning
5
kxn
matrix where
A = [C, D]
and
B is
x = (y, z) , we can eliminate the
y
variables to obtain the equivalent
problem
m
maximize
, In ( i
+
d-it -
i=l
iz )
s.t.
where
b = b - CB 1g ,
A = D - CB1N ,
straightforward to show that
solves
P'(t),
solves
zt
and d = d - CB-'lh
P"(t)
if and only if
Yt = B 1 (g + ht - N2)
where
.
It is
X"t = (,
We thus can concentrate
on the more convenient problem
P(t):
maximize
,
n (bi + dit - Aix)
(2.2)
i=l
s.t.
Let X
=
Suppose
(x
Ax < b
xt
Rn Ax < b )
is the center of
st = b + dt - Axt,
Because
and let
P (t)
+
dt .
and
Xt = (x E Rn Ax < b + dt ) .
Xt , i.e.,
Xt
solves
P (t) .
St = diag(st)
is a convex program,
xt
solves
P (t)
the Karush-Kuhn-Tucker (K-K-T) conditions are satisfied at
where
e
if and only if
x , namely
st = b + dt - Axt > 0
(2.3a)
eT S t A = 0
(2.3b)
is the vector of ones of appropriate dimension.
We assume that our initial value of
of
Let
Xt = x e R n I Ax<b)
t
is
t = 0 ,
is nonemllpty and bounded.
6
and the interior
In this case, it is
)
straightforward to show that
X,
will be bounded for all values of t .
Then
T
will be an open interval, and it is
T = (t I int Xt * 0} .
Let
straightforward to extend the analysis in Megiddo [8] to show that the path of
parametric centers
Let
1: t -
is continuous and differentiable.
t, t e T ,
TMAX = sup (te T
TMIN = inf (te T) .
and
Itis also
tt
TMAX = +
straightforward to show that
if d = Ar
for some
of translations of
Xt
if and only
TMIN = -°
r e R n , in which case the sets
Xt
by the translation vector
We therefore assume
tr .
are just a family
does not lie in the column range of
d
throughout this paper that
TMIN > -oo , or
this case, either
and
TMAX < +
In
or both.
,
-
A .
The algorithm presented in this section will trace a piecewise-linear
x (t)
path
that is "dose" to the parametric center path
At each iteration the value of
measure.
Furthermore, at each
t .
algorithm is strictly monotone in the parameter
least one of two ways, as follows.
in a suitable
is strictly increased; thus the
t
iteration, the magnitude of the increase in
xt
is bounded (from below) in at
t
Suppose
is the current value of
tk
t
Then at the next iteration the algorithm will produce either a finite lower
bound
UB
LB
is produced, then
tk+1 _ tk >
i.e.,
TMIN
1
128m
or a finite upper bound
tk+ ,
(UB - tk),
(TMAX - tk+l) <(1 -128
will satisfy (t k +l - tk) >
geometrically, i.e., (tk+1
the new value of
so that
1
-TMIN)
(tk - LB) ,
>(I +
If
so that
7
If
will satisfy
decreases geometrically,
LB isproduced,then
(t -
)(tk - TMIN)
12 8mn
orboth.
TMAX,
t,
(TMAX - t)
(TNIAX - tk) .
128m
UB
TMIN)
grows
t+'
At least one of the above two bounds must be satisfied. Note that in either of
the two cases the geometric rate of change of t relative to a bound is at least
1,
where m is the number of inequality constraints.
128m
Given a linear program in the form:
LP:
cT x
maximize
x
s.t.
as the following equivalent problem
LP
we can reformulate
LP:
Ax < b
maximize
x, t
s.t.
t
Ax < b + Ot
cTx
= 0 + t
Thus an algorithm that traces the parametric path of centers to the above
system for strictly increasing values of
algorithm for solving
will be a strictly monotone
t
The application of the parametric center
LP .
problem algorithm to linear programming is presented at the end of this
section, and is an
O(mL)
iteration algorithm for linear programming.
2.1 Properties of the Parametric Center
For
St = b + dt - Axt , and let
space of
that
let
t e (TMIN, TMAX),
St A ,
i.e.,
d = St ut + Art
indicator function
ut
ut
L-
for some
f(t) as
xt
be the center of
be the projection of
A (ASt
rt E R n
f(t) = eT u t ,
8
AThen
A) AA
T
Xt , and let
St d
onto the range
St d
We now define the path
and note immediately that
note
eTSt A =
since
d ,
f(t) = eTut = eTSt t) eTSt=e=
by (2.3).
,
f (t) = eT St d
Therefore an alternative equivalent definition of f (t) is
follows. It is obvious that
is as
f(t)
The motivation for considering the path indicator function
is the optimal objective value of the
TMAX
following linear programming problem:
TMAX = maximize
x, t
LP:
t
Ax - dt < b
s.t.
Suppose for a given value of
st = b + dt - At , and
P (t) , let
solution to
any feasible solution
(x, t) to
-
be the
xt
Then for
St = diag(Ot) .
LP °
-
1^-1
f (t)
f (t) < 0 . Let
that
t
1
f (t)
f(t-
_ I [-m + tf(t)3
f(t)
TMAX < t -
whereby
(
f(t)
f(t) > 0
Similarly, if
Now suppose that for a given value of
Let
be the center of the system
t'
-t. = diag~b + dt~ - Aj;;)
St = diag(b + dt*-At*) .
(t, t*)
whereby
if both
that
TMIN
TMIN
X
and
TMAX
TMAX
t* , that
t- m
f(t)
f(t*) = O
Ax < b + dt* , and let
T--I
e Tt
eTS d = 0
e St A = 0 and
is the center of the system
and
>
TMIN
t , say
(A, -d) (x, t) < b , from (2.3).
X = ((x, t)E Rn+ l I Ax - dt < b
Thus the set
particular
Then
,
are finite.
are finite.
If
is bounded from below (above) in
9
We say
TIIN
t .
is bounded, and in
X
(TMAX)
is bounded in
t
is finite, we say
Note that if
X
is
bounded in
t ,
f (t*) = eT S t. d
then
0,
t ,
the value of
to be on the upper path if f (t) < 0 ,
0 .
which yields
is guaranteed to exist.
Returning to the path indicator
f (t)
t
f(t)
defined earlier, we define
xt
and to be on the lower path if
The intuition behind this definition is provided in the next two
propositions.
Proposition 2.1 The path indicator function
decreasing for
t e (TMIN, TMAX)
Proof: The K-K-T conditions of
eTSt
Let
t
and
respectively.
and
A
it
unless
A
= 0
P(t)
and
require that
At +t
st
and
xt at t ,
Then differentiating the above expressions yields
st St A = 0
t + st = d .
^-1
f (t) =e u = eT
eT St d ,
-2t
f'(t) = -stSt d =-t
St (A t +
st = 0 , in which case
out by the assumption that
d
t)
=
d = A it .
-St
-2
<
t < 0 ,
But this last possibility is ruled
does not lie in the column range of
10
I
= b + dt.
be the vector of derivatives of
Furthermore, since
then
f(t) = eTut is strictly
A .
m
2.2 ODP~ter and
Lower
Paths)
ProDosition
P
e
and
-low
rooir
(i)
(ii)
TMAX
and
t ER
such that
are both finite if and only if there exists
and
f(t) = O,
f(t) > 0
for all
t
f(t) < 0
for all
t e (t, TMAX);
is finite and
TMAX
all
(iii)
TMIN
TMIN
f(t) < 0
if and only if
-_00
=
for
t < TMAX ;
is finite and
TMIN
all
(TMIN, t),
TMAX =
if and only if f (t) > 0
0
for
t > TMIN
These three cases are illustrated in Figure 1, (a), (b) and (c).
Proof:
center of
and so
TMIN
f(t) = O,
(i) We have seen earlier that if
= ((x, t) IAx- dt
X
TMAX
and
b)
then
(t',
That being the case,
X
are both finite. Conversely, if
TMIN
are both finite, then the center
(x*, t*) of X
,
is the
t)
is bounded
TMAX
and
which is the
solution to the problem
maximize
x, t
s.t.
exists uniquely.
and
Ax - dt<b
The K-K-T conditions for
eT St d = 0 I i.e.,
St = diag(st) .
In (bi + dit - Aix)
i=1
f(t*) = 0,
P°
where
require that
st' = b + dt* - Ax'
The rest follows from Proposition 2.1.
11
eT St' A =0
and
t
- ..LfVIAA
-C(
r
.0
t
1\L/
t (t)
/LA\
r i kJ)
I
Ir
-1
.1
MIN
x
Figure I (a)
12
= 0
-
n
U
t
TMAX
(^ ,,t)
X
)
= ( x, t) I Ax-dt b)
x
Figure 1 (b)
13
t
1
f(t)<O
I
'I' MIN
x
Figure 1 (c)
14
_
__
I
(ii)
Suppose
is finite and
TMAX
We have seen earlier that
f(t) > 0 implies
f() < 0 .
f(t) = 0
But, from (i),if
contradiction.
then
Thus,
f(i) < 0 .
is finite.
TMAX
then
.
TMIN
is finite.
Thus,
is finite, which is a
TMIN
Conversely, if
Consequently, if
Let t < TMAX
TMIN = -
f () <
f (t) < 0 for all
for some
t ,
t < TMAX
then
follows from (i).
TMIN = -
The proof of (iii) is similar to (ii).
2.2 Algorithm
PCP
Before presenting the algorithm for the parametric center problem, we
introduce some more notation and the main improvement theorem.
Recall that
and
Xt
X =
x
Rr ,Ax < b}
= (x X Rn
We assume that
X
is bounded for all
Ax < b + dt) .
is bounded, and so
t . Let
Iv
A
has full column rank and
denote the Euclidean norm.
If
M
Xt
is a
symmetric positive-definite matrix,
wedenote
I y-zlIM =
let xe intX
where
(Y-Z)TM(y - z) .
be given, let
= b-Ax,
S = diag(S) . We say that
IIX- XlIQ(x)
<
1/21 .
Let
and let
be the center of
X,
Q(R) = ATS2 A,
is close to the center of
X if
The motivation for this criterion of closeness to the
center will be apparent from the following theorem, which also serves as a
basis for the
PCP
(Parametric Center Problem) algorithm.
15
Let
int X
Suppose x
Theorem 2.1 (Improvement Theorem).
=(X I Ax < b)
, S = diag () , and Q = ATS-2 A . Suppose
where x is the center of the system (A, b) . Define
= b-A
IIx - XIIQ < 211,
[I
=
S-1 A (A T -2 A)- A T -l]
-
(projection of s-' d)
d
r =
(A T -2 A) -1 AT -2 d
(translation)
a=
=11a
8o11~11
(step length)
Furthermore, define
Sa = b+Suca-AZ
!
S = diag ()
= -(AT S§2 A)1 ATS
XNEW = X
(New approximate center)
q + ar
< 211 . where
Then IIXNEW-XaI
= AT--EW2
SNEW A
(A, b + ad), and
Q = A
In this theorem,
x
X .
u
The vectors
so that
+
d = A + Su
,
Xa
is the center of the system
where SNEW = diag(b + ad - A XNEW). -
and
.
r
The increase in
and so
a
t
from
'Ar + u ,
S-ld =
are defined and satisfy
t = 0
I il ·. Because d
•
Ar
to
t = o
is
for any
is vwell-defined. The Newton step
16
of
x
is given and is assumed to be close to the center
defined next, and is a function of
reRn, I1ul > 0
(Newton step)
e
4
is then
defined using the modified slacks
(the Newton step) and
x, q
theorem,
,
and xNE
(a translation vector).
F
will be suitably close to
NEw
is composed, using
,
According to the
the center of
Xa .
This
theorem will be proved in Section 4.
The increase in
t
I iUll ·
is a function of
from
t = 0 is given as
1/llI U l
However, the fact that
makes good intuitive sense.
is the minimum (least-square) distance of
lies very close to the range space of
A ,
is small. Then changing the RHS by
by a r
Il l
and then changing the
i.e.,
d = S+
RHS by only
"most of
d "
lies outside of the range space of
RHS
ad
is shifting the shape of
X
I ill
Ar ,
d
and so If Iu
is the same as translating
a Su .
Then because
I ul
I
is large, then
is small, we can take a big step. If, however,
of
The quantity
i = 1,...,m . Suppose
1/ i,
ad
is
c
from the column range of
d
in the weighted norm with weights
A,
which
,
Ideally, we would like this increase to be as large as
possible, to speed algorithmic convergence.
proportional to
a = 80
X
A , so that a change in the
substantially, not just translating
the polyhedron. Thus, the step we can take will be smaller.
However, as the next theorem indicates, the value of
important bounds on the values of
show that if
a
Recall that
is small, so is
f (t) ,
TMAX
TMAX
or
and
TMIN ,
[I
Iu
also gives
and thus will
TMIN
the path indicator function, gives important
information about the boundedness/unboundedness of the path of centers
xt
in both directions, according to Proposition 2.2.
17
Theorem 2.2 (Bounds on
and
T AX
Under the conditions and
TM!N).
definitions of Theorem 2.1,
(i) if et/llIll > 1/20,
then
(ii) if eTu/IIlUll < -1/20, then
(iii) if
e
/ I1uII
- I < 1/20,
TMIN
LB
TMAX < UB
LB -
then
TMIN
and
TMAX
<
UB -
22Ym(m+1) + 1
2111ll
22Ym(m+1) + 1
2111 llII
1.6fm(m-1) + .6
IIll
1.6Vm(m-1) + .6
.
III"
This theorem will be proved in Section 5.
Note that either (i), (ii), or (iii) must be satisfied, so that either a finite lower
bound on
or a finite upper bound on
TMIN
course of increasing
t
t = 0
from
to
TMAX
t = ca,
is produced in the
or both are produced.
Case (i) corresponds to being (approximately) on the lower path at
i.e.,
f(0) > 0
path, i.e,.
Case (ii) corresponds to being (approximately) on the upper
f(0) < 0
close to the center of
Case (iii) corresponds to
is increased from
f(0) = 0,
so that
(, 0) is
X° .
Note also that these bounds are
t
t=0,
t = 0
one or both bounds is at least
to
0(m/
IiTII)
Thus, even though
the ratio of
a
t =
-= 1/8011llII
1
Therefore repeated increases in
128m
to
t
using the methodology of Theorem 2.1 will result in either geometric growth
18
in the quantity
with a growth rate of at least
t - TMIN ,
geometric contraction in the quantity
at least
(Geometric
Change
\___________
__ ___ in__
Prorosition
11
__ _ ________ 2.3
___
m
2 .
where
tNEW = a,
1
)
I
or
with a contraction rate of
as the next proposition indicates.
(1 - 1m) '
Suppose
TMAX - t ,
(1 +
t
Relative to
TuiNA
1_···_
or
TAY).
· 114-1-r
With the notation
XOLD
a
are defined as in Theorem 2.1, then
and
XNEW
- X,
tOLD = 0
either
(i)
(tNEW - TMIN)>
(ii)
(TMAX- tNEW) < (1
(
+ 1128 m)( tLD- TMIN)
12;8m)(TMAX
, or
- tOLD)
Suppose a lower bound, either
or LB , is generated through
2
Theorem 2.2. Notice that LB < LB if m
so in either case,
Proof:
TMIN
>
LB = -(1.6
(tNEW - TMIN) =
tOLD - TM IN
tNEW
m(m- 1) + .6)/ I11l 2>- 1.6m /I11 u
+ 1
-TMIN
>-(8oI1i
- 1.6m
)1
IIu II
+1 =
Thus
1+
1
128m ·
A parallel analysis demonstrates (ii) if an upper bound is generated.
U
Now let us return to our initial interest - to trace the parametric center
path
xt
for the program
ranges in an interval
P (t) .
t E [t, t]
Suppose we want to trace the path as
where
19
t >t
.
t
Suppose we are given a
point
(Such a point can be found by
X.
that lies close to the center of
x
using the algorithm in Vaidya [15] or in [4] .)
The following algorithm, denoted
for Parametric Center
PCP
Problem, is an iterative algorithm that invokes Theorems 2.1 and 2.2. At
Step 0, initial lower and upper bounds are set to their extreme values.
initial value of
is chosen as
t
t =I ,
and the counter
-x is set, as is the
r
RHS .
are computed as defined in Theorem 2.1.
which is the increase in
close to the center of
P(tk + a) ,
X +
(tk+'l) =
and
bounds on
TMIN
and
returns to Step 1.
x(t)
is then computed;
, i.e.,
xNEW
approximately solves
t e [t k , tk +
TMAX
a ,
is
] = [tk,
tk +l]
,
x(tk) =
using
In Step 4, the current
are updated, in accordance with Theorem
t kf l
t . If so, it stops.
If not, it
The output of the algorithm is the piecewise-linear path
and the incremental values of
t,
namely
PCP (Parametric Center Problem)
Input:
A
Rmxn , b, d
Step 0
(Initialization)
UB=+oo,
and
In Step 3, a piecewise-linear path
Algorithm
Set
tk
xNE-
x-Ew
In Step 5, the algorithm checks if
2.2.
In Step 2, the constant
as endpoints and interpolating.
NEW
is
In Step 1, the values
point,
according to Theorem 2.1.
is defined in the range
x(t)
t
at the iteration, is defined as in Theorem 2.1.
t
"close-to-center"
The next
(for the
The current value of
number of iterations) is set equal to zero.
The current value
k
The
LB=-o,
Rm
t° ,
t
..., t k ,
, t , t , x
k=O,
t ° =t,
20
x=x
,
RHS=b+dt.
Step 1
Set
(Projection of d)
s = RHS-Ax,
Compute
S = diag () .
u = [I - s A (AT -2 AI AT
-1]
S d;
r = (AT S-2 Arl AT S-2 d
Step 2
(Compute Step Length and Compute New Approximate Center)
Set
80 lull'
Sa = RHS +
S
- A
4 = - (AT S 2 A) 1 AT
XNEW =
Step 3
e
+ ar + q
(Extend Piecewise-Linear Path)
Set
tk+l =tk + a.
For
tk< t < tk+ l
Step 4
, Sa = diag(s,);
,
define
t) = - +[(t
](XNEW
-
X)
(Update Lower and Uvrer Bounds)
· ' a
.
(i) if eT U/1III > 1/20,
·* L~~~~~~
*
then
,~~~~~~~~~~~~~~~~~
LB = max LB, tk
21
(22 mm-- im+ 1Y+)/(
1+ )/22 1 111ul)
;
II
(ii) if eT/
< - 1/20, then UB = min UB, tk+(221m(m+l)+ 1)/(211iu
ll)
(iii) if jeTU/[IIj
< 1/20 ,thenLB = max(LB, tk-(1.6/mm-l)+.6)/Iull}
and UB = min (UB, tk +(1.6 Vm (m-1) + .6)/1ull }
Step5
tk + <t,
If
set
RHS = RHS+da ,
k = k+l,
x-
= XNEW,
and go to Step 1.
If tk+1
t,
STOP.
2.3 Algorithmic Performance
According to Theorem 2.1, if
for each
k = 1, 2, ...,
x (tk)
x
is close to the center of
will be close to the center of
break points of the piecewise-linear path
parameterized center.
Lemma 2.4
(t)
X
X t , then
Thus the
"
will be near the
Furthermore, we will prove in Section 4:
For all values of
t
generated by algorithm
near the center
Xt,
inthesensethat
Q = ATSt2 A ,
and
st = b +dt-A(t),
II (t) - x
kQ
tl
PCP ,
.585 ,
(t) is
where
St = diag(st).
.
We are now ready to discuss the performance of the algorithm.
According to Proposition 2.3, we obtain at each iteration either a geometric
decrease in the gap
the gap
t - TMIN .
TMAX - t
at each iteration, or a geometric increase in
We thus can measure algorithmic performance
according to the change in
TMAX - t ,
or
t - TMIN ,
on whether we are approximating the upper path
path
(f(t) > 0) ,
or both.
or both, depending
(f (t) < 0) ,
the lower
Suppose that in the course of running algorithm
22
;
PCP ,
that a lower bound on
is never generated.
TMIN
Then all iterates
will satisfy criterion (ii) or (iii) at Step 4, so that all iterates will generate an
upper bound, and all iterates will lie approximately on the upper path.
Lemma 2.5
(Algorithm Performance Based Only on
or if no iterates of the algorithm
TMIN = -
bound on
then the sequence of
TMIN ,
TMAX- tk < (1i
)(T
In particular, if
K =
t values
generate a lower
will satisfy
t < TMAX ,
the algorithm will stop after at most
t) / (TMAX- t)l1
iterations.
Under the hypothesis of the lemma, the algorithm must satisfy either
criterion (ii) or (iii) at Step 4. Thus, by Proposition 2.3,
(1
-
128 ) (TMAX
Lemma.
Then
If
- tk)
.
t < TMAX,
ln(TMAX-t
K
let
TMAX-
tk
<
K = F128mln ((TMAX-t)/(TMAX-t) )l
) < Kln(1
< -(In
TMAX - tk+ <
Thus we obtain the geometric decrease of the
< -K(i
Thus,
If
MAX-t)
l28m ln ((TMAX-
Proof:
PCP
TMAX)
+ In (TMAx-t)
l
)+
lnn (TMAx-t)
((TMAX - t)/(TMAX-t ))) + In (TMAX - t)
TMAX- t,
whereby
stop.
tk
t.
Thus the algorithm will
U
23
Suppose instead that none of the iterates of the algorithm generate an
upper bound at Step 4.
Analogous to Lemma 2.5 we have:
Lemma 2.6 (Algorithm Performance Based Only on TMTN)
or if none of the iterates of algorithm
TMAX
PCP
at Step 4, then the sequence of
tk-TMiN > 1 + 128m(*t(I
-TMiN
If TMAX = + °
generate an upper bound on
t values will satisfy
)
In particular, the algorithm will stop after
K = r128m n ((- T IN)/TMIN
))1
iterations.
We next examine the case when the algorithm generates both upper and
lower bounds. We first need the following result, which will be proved in
Section 5.
Lemma 2.7
If criterion (ii) of Step 4 of the algorithm
PCP
is satisfied at
iteration k, then in all subsequent iterations, criteria (ii) or (iii) of Step 4 will
be satisfied.
The significance of Lemma 2.7 is as follows: if at iteration
k
an upper
bound is generated, then an upper bound is generated at every subsequent
iteration.
Lemma 2.8
algorithm
(Algorithm Performance Based on Lower and Upper Bounds) If the
PCP
generates both lower and upper bounds, then there is some
24
such that for
iterate
j
for all
k >j
j , the algorithm generates lower bounds only and
the algorithm generates upper bounds, and
k < j,
(i)
for all
(ii)
for all k > j,
Furthermore, if
K=
k
tk -TMIN
TMAX-t
t < TMAX ,
2 5 6 mln(TMAX
-
k
(1 +
1) k(I-TMIN
128m)
< (1
(TMAX-t)
then the algorithm will stop after at most
TMIN) - 128m In (t-TMIN) -128m In (TMAX-t) +2
iterations.
Proof: The existence of
j
is guaranteed by Lemma 2.7.
The geometric
convergence rates are then a consequence of Proposition 2.3.
suppose
t < TMAX ,
and let
K
be as defined above.
note that
.5 (n (t - TMIN)
+
In (TMAX
- t))
< In
TMAX - TMIN
2
from the arithmetic-geometric mean inequality.
K 128m
1
Thus,
n( -TMIN)+ 128m In (TMAX- t)
- 128m In (TMAX -
) - 128m In (t - TMIN)
25
Let
Finally,
t = tj ,
and
- TMIN
TM
128m In tkTIN)
- -t
128 m In TMAX
\TMAX-t)
According to Lemma 2.5, with
128m In t - TMIN
t
replaced by
t >i
after at most
iterations.
t replaced by
Furthermore, according to Lemma 2.6, with
K
after at most
t ,
tj+1 ,
tk
t
iterations.
.
2.4 A Strictly Monotone Algorithm for Linear Programming that requires
O (mL)
iterations.
Suppose we wish to solve the problem
maximize
LP:
A
s. t.
where
t =
x,
T
x
e R n+ 1
, A'ERmx(n+l)
and eliminating one of the
< b
,
b E Rm
and
(n+1)
Upon setting
variables
of
,
LP is
easily transformed to the form
maximize
LP:
x, t
Ax < b + dt
S. t.
where
x £ R n, A E R m xn
transformation of the data
algorithm
PCP
b E R'
, and the data
(A, b, ) .
to trace the path
(t)
Ax < b + dt .
26
(A, b)
We can sol,ve
LP
are a linear
by using the
of center to the parametric problem
x, e R n
Suppose
is a given starting point for which
(x ° , t) = (x °, cTxo)
satisfies the starting criterion of the algorithm
namely
Q° < 2x
-xt°l
and
Q = A' T(S
path
x(t)
for
where
21
A .
so = b+dt
- A
xo > O
Then we can use algorithm
t E [to, TAX)
where
= [cTx,z*)
objective value of the linear program
PCP
LP .
z
PCP
S
= diag(s °)
to generate the
is the optimal
The sequence of values of
tk
k = 0,...,, will be strictly increasing, according to Theorem 2.1, i.e., the
objective value will be strictly increasing at each iteration.
total number of bits in a binary encoding of
LP .
Let
L
be the
In order to evaluate the
algorithm's complexity, we consider three cases.
Case (i): The linear program is unbounded.
never generate a finite upper bound on
After
k = O(mL)
iterations,
2 L , and we can conclude that
In this case, the algorithm will
TMAX , which equals infinity.
t > (to - TMIN)(1 + 128m)
LP
is unbounded.
Case (ii): The linear program is bounded and
indicator function at
t =
to
, is negative.
f(to) , the value of the path
This being the case, the algorithm
will always generate upper bounds, and after O(mL)
(z* - tk) < (z* - to)
is less than
(1 -
will exceed
2
,
iterations,
herebv
xt)
canbe
rounded to an optimal solution, see Karmarkar [6].
Case (iii): The linear program is bounded and
can show as in case (i) that after
otherwise the
LP
k = O(mL)
would be unbounded.
27
f (to) > 0 .
In this case, one
iterations, that
f(tk) < 0
,
for
Furthermore, after an additional
k = O (mL)
iterations, we will obtain via case (ii) that we can round to an
optimal solution.
Thus, after
iterations, we can round to an
O(mL)
optimal solution.
Note that in either of the three cases, that algorithm
process
LP
after O (mL)
PCP
will
(by detecting unboundedness or producing an optimal solution)
iterations.
This algorithm falls into the class of central-trajectory based algorithms,
but is inferior in that the bound of O(mL) iterations is worse than the bound
of O(lfi-L) iterations for algorithms such as Renegar [11] or Vaidya [16] that
trace the (weighted) center of the system
Ax < b
cx 2 6
as 8
O (I
is increased, or to the bound of
iterations for algorithms
L)
based on barrier penalty methods that trace the solution to
m
maximize
cT x + £
n si
i=l
s.t.
Ax+s = b
,
s>O
see Monteiro and Adler [10], among others.
All three methods follow the same path in their idealized version.
Yet the latter two obtain convergence in
superior to our algorithm.
O (Vi L)
iterations, which is
However, these other algorithms do not
guarantee strict improvement in the objective value, (but do guarantee strict
improvement in the duality gap).
In contrast, our algorithm will guarantee
strict improvement in the objective function of
28
(z' -cT X)/128m
at each
iteration.
Perhaps it is the implicit imposition of the strict improvement in
objective value that increases the iteration bound by a factor of
Furthermore, our algorithm does not assume that
Instead, our algorithm will detect unboundedness of
LP
O ( ff ) .
is bounded.
LP
directly.
As a final note, note that our algorithm can be used to mimic
Renegar's algorithm [11], tracing the center of
Ax
<b
-cx < -t
as
PCP
t
is increased.
Thus, Renegar's set-up is a special case of the problem
we are considering.
However, we see no way to cast problem (2.2) as a
special case of the set-up used by Renegar [11]; we allow all
RHS
values to
vary simultaneously, which is apparently more general than in his work
[11].
The remainder of the paper is devoted to proofs of the results
presented in this section.
preliminaries.
algorithm
In Section 3, we present notation and
Section 4 contains an analysis of a single step of the
PCP , and contains proofs of Theorem 2.1 and Lemma 2.4.
Section 5 contains an analysis of bounds generated by the algorithm, and
contains proofs of Theorems 2.2 and Lemma 2.7
29
Notation and Preliminary Results
3.
In this section we present notation and some preliminary results that
will be used in the proofs of Theorems 2.1 and 2.2 and in subsequent analysis.
Notation and Translations
3.1
For a vector
matrix
1IMl
v ,
I NI I
M ,
IlviI
denotes the Euclidean norm, and for a
denotes the usual matrix norm, i. e.,
= sup 11Mv1l/IllVI
v•:0
Note that if
If
M
M
{]M1
is a diagonal matrix,
= max
miil
i
is a positive definite matrix, the
M-norm
of
v
is
Iv IIM = vTMv
PM: = I-M (TM)
The matrix
1
denotes the orthogonal projection
MT
matrix which projects onto the null space of
MT
Let
Qt(x; u)
the negative of the Hessian of the function
m
ft(x; u): = X n (bi + tui - Aix),
where
u e Rm
is a given vector
i=1
parameter.
Let
A (x): = diag(b- Ax)
T-
Then
and
Qt (x; u) = A T A t (; tu) \
Q(x) = AT
At (x; u): = diag (b + tu - Ax) .
When
-2 (x) A .
30
t = 0 ,
e denote
denote
Let
t (u)
Ax < b + tu , and let
denote the center of the system
denote the center of the system
Suppose
d = u + Ar
Ax < b .
for some
At (x; d) = At (x - tr; u)
observe that
Rm
u
and
r
Rn
RHS
by
d
and by
u
tr , in that
simply corresponds to a translation of the inequality system by
R n I Ax < b+td} = (x E Rn
We
Qt (x; d) = Qt (x - tr; u)
and hence
Thus, the difference between modifying the
({x
x
b + tu} + tr
Ax
The following Lemma is therefore obvious.
Lemma 3.1:
(i)
Suppose
t (d) =
(ii) xx
- Xt
(iii) max
d = u + Ar
for some
u E Rm
and r E R n . Then
t (u) + tr ,
(d)|
Qt(Zt(d);d) =
t I Ax
b + td
(x -
tr) - Xt (u) Qt(Xt(u);u) for any
for some
x
= max
t
xE R n ,
Ax< b + tu for some
In the sequel, we shall be working with appropriate choices of
and
rER
instead of
d .
,
u E Rm
As we shall see in Section 5, this in fact is
central to the construction of the proof of Theorem 2.2.
u E Rm
For the appropriate
where convenient and the context is clear,
we let At(x): = At(x; u),
A ()
and
= A((),
31
St =
t (U) ,
= b-Ax ,
x
.l
S = diag (-)
and
s
b
diag () , st = b + tu - A ,
A-X
St = diag(st) .
We also will abbreviate
Qt (x; u) by Qt (x).
The next Lemma presents some basic inequalities. It is essentially
Proposition 7.2 of [4] with some simple extensions.
Q(x) = AT
E Rn
Suppose
Lemma 3.2:
-2
satisfies
and let
xe R n such that IIx-IIQ(5)
Then for any
A.
s = b - Ax > O
< 1,
we have
(i)
> 0 ,
= b-A
(ii)
II A- 1 (x) A (z) = IS-
(iii)
lIA-
1 ()A
and for any
(iv)
=
()I
v
I I IQ() <
1sI Sll
1
1-6
<1+6
,
Rn
1
1- 8
lv
(V) 11
VIIQ(X) < (1+a)l
3.2
<
HQ(-)
where
= AT -2 A , and
Q ()
VIQ(x)
.
Eauivalent Measures of Closeness
In this study, we measure how close a point
thesystem
Ax < b
with the norm
I IX -
IIQ()
32
_
x
is to the center
x
We shall also make
of
use of a different measure of closeness to
x
that was introduced in [4 , and
we will show a certain equivalence of the two measures.
Define
Q = 1 ATs-2 A, y=y(R) =
m
m AT S e
(3.1a)
(m - )yT Q y
and y = y(x)=
(3.lb)
1 -yT Q y
Note that
Q
Ax < b.
Then
Let
Q(X)
=
y(R) = 0
denote the center of the system
andso
is used to measure the closeness of
Lemma 3.3
Let
([4]).
h > 0
Proof:
I
-x
x
In [4], the scalar
to the center
7 = Y(x)
x .
be the center of the system
Ax < b .
be a given parameter.
Suppose Y = Y(X)
Then
Let
y())= 0 .
l
<
g
2 Q()
in ( +h))
h
(m - l)
where g(c) = 1- (c +
h 2 (1 +
-(xal)2+
2)
(1 - hy) 2
Follows from the proof Lemma 7.2 of [4] with weights
w = (1/m)e.
.
We shall say that
Corollary 3.1:
If
x
is approximately centered if
y(x) < .0072
)/2.
then
33
il X-
y(x) < .0072
Q(x) < 1/21 .
Proof:
Let
h = .03
and
IIX-XIJ2 Q(X) <
we have
Substituting for
ca =
m
m-1
h and 7y,
In (1 + h)
Then from Lemma 3.3,
h
(1 - hy) 2
and noting
m >2
gives the desired result.
Therefore, for appropriate values of
h ,
approximately centered implies
is close to the center by our criterion,
IIX
thatis
Q()
-
Lemma 3.4:
the system
<
1/21 .
Suppose
x
(e.g.,
Next, we show the converse implication.
X X1IQ(X) < 6 < 1/2
where
x
is the center of
Ax < b.
Then
=
(x) < a + 2f2 ,
Proof:
Let
= b - Ax
From Lemma=
of
and
62
2(1 - ) (1 - 26)
xwhere
s = b-Ax
x-xQ(2.1
From Lemma 2.1 of [ 4], with weights
I
1-6
,,
From Lemma 3.2,
II-x
iln Si
In 'S-
M--~
U
i=l
Z
34
iQ(X) < 61
-
< 1.
1-6
= (1/mn)e ,
m
i=1
X is
h = .03 )
I
where
-
1-6
<
1 .
.
On the other hand, from (2.4) of [4],
m
2 (mml1Ž
in si
i=
7=
(x) .
i=l
Therefore,
Thus,
where
-,J,/
+ 2)
2
a =(a
>
(1 + -
a= 2(1-c) -
Y< a +
where
,
2y).
a =
a2
2(1 - a)
62
2 (1 -
.
)(1 - 26)'
Finally, we present some elementary inequalities.
Lemma 3.5:
(i)
For all
(ii)
For all
Proof:
Since
Assume
Note that
m-1 +9
2
m
2
> 0,
F <
Let
Q, y
< £
,
implies
and Y be defined as in (3.1) .
(m - 1)yT Q - y
yT Q-1 V I F implies
(m -
- 1) yT Q-1 V
_ (m
9
.
whereby
To prove (ii), note that
(i)
£
< £ /( 1-)
yT Q-1 y =
1 - yT Q-1 y
m-1'
<
m-1 +y
follows immediately.
(m - 1) T Q- y < e
implies
£
U
-1)
35
2
.
Analysis of One Iteration of the Algorithm
4.
In this section, we analyze one iteration of the algorithm and prove the
First, we show that
Improvement Theorem (Theorem 2.1) and Lemma 2.4.
if
Itl
is small then the two centers
^x
and
are sufficiently close to
xt
each other with respect to some appropriate norm (so that Newton's method,
when applied, will converge).
Theorem 4.1:
denote the center of the inequality system
and let
xt
u^
() u .
=
-
b
Ax < b + tu .
Let
Itl < 1/(76 11 1) .
Suppose
11-
Then
XtQ
t()
< 1/12
The proof makes use of Lemma 3.3 of the previous section. We want
Proof:
= y(R)
to show that the quantity
note that
T
A
y
ATA
(^) AT
y()A:
-1
() e = 0
Ie
(2)e
-AT
Ax < (b + tu)
y
and
(see (2.3)), and so for the system
=
T A-l
[A
e
'
is
Q .
First
Ax < (b + tu),
- ( x) e
(XAT(x)(t u)
TA-1 1 xA-,
= 1 AT
Also,
for the system
We begin by giving expressions for
sufficiently small.
=
Ax
denote the center of the inequality system
Let
LQQ
( Q= ,)
Q
Alson
At
(x)(t u),
by definition of
AT At 2 (X,)A
n-M
36
u- above.
and so ,
yT Q-1 y =
l (t )T At () A (AT At () A) T A
AZ ()(t
because the eliminated matrix is a projection matrix.
y =
(X)
yT Q-1 y (m - 1)
1
It
-
1~tQ1-Corollary 4.1:
1
5,775
Hence
and
y < .0132 .
2h2 (1 + 2)
.
'12J
(1 - hy)2
Under the conditions of Theorem 4.1,
) < 1/I1
- Xt
Follows from Lemma 3.2(iv).
.
Furthermore, using Lemma 3.2(ii), the following corollary is
immediate.
Corollary 4.2:
Let
d
Rm
2
h = 1/18
IIX-
Proof:
<
1-lu]
1 - yT Q-1 y
Thus, from Lemma 3.3 with
- 112
) < 1 ||t-1
Suppose
I -XI Q)
be given and define
<
1/21
u: = PA
37
Sd .
Let
xt
denote the center of the inequality system
Suppose It
1
Then
8011u11ll
t A- 1 (x)Su
Ax < b + t S u .
< 1/76
and
X XtIIQ,(^,~t < 1/11
Proof:
Let
and
u = Su
u^ = A-1(x)S
,
u = A () u .
Note then that
so that
Iu
from Lemma 3.2(ii).
It[ <
A-1(x)S
l
201
u-11,
Thus,
1
< 21 .
80
from
Theorem
we
4.1, have 20
Hence, from Theorem 4.1, we have
I
1
<_
1
8011u11 - 7611 uII
-X tIlIQt(X) < 1/12 ,
llx-xtllQ,(^X,) < 1/11
4.1,
and by Corollary
.
We next use a theorem of Renegar [11] which gives the region and rate
of convergence of Newton's method for our problem.
Theorem 4.2
(Renegar [11]):
x E int Xt = {(x e R n
£
=
x -
Assume
Ax < b + tul
t IQt (t) < 1 w here
X.
and
is the center of the system Ax < (b + tu).
38
Let
,
st = b+ut-A
Qt = AT
Then
2
and
St = diag (t)
xt = x-Qt
1
where
( x ) AT -St ee ,, where
A.
|xt-xtIQ
(1 + 6)2
t (Xt)
.
-E
We are now ready to prove the Improvement Theorem.
For the
reader's convenience, we restate the theorem before proving it.
Theorem 2.1
(Improvement Theorem):
1.~~~~~~~~~~~~
Suppose xe int X = x I Ax<b}
Define
U: = P-S-1A
satisfies
1
Further, define
8011
=
I
XEW -
;
S, : = diag(s) );
(A T S--2 A) -1 ATS
IQa(Ew)
1/21
(step length) .
e
(Ne-wton step);
(Nelw approximate center) .
x-\w: = x + q + c.r
Then
<
(translation vector of system) ;
so,: = b + cc S u - A
-:
|Q(X)
(projection of S'-1 d)
S-'d
r: = (AT S-2 A)-1 A T S-2 d
a: =
IX -
1'/Il
39
Proof: First note that d = u + Ar,
I
(dl Qa ((d);
NEw -
and
Xa = X (U)
that
Q ()
II xa -
x
(d),
Qa (x)
< 1/22,
.
Lemma 4.1: Under the conditions of Theorem 2.1, define
Let
inequality system
Ax
]
tE [O,
where
which, by Theorem 4.2, implies
< .033 < 1/22
as in Theorem 2.1.
= XNEW
In Lemma 4.1 (iii) below we
(u); u) .
= Qa (
Qa (ca)
X xa
-
E: = I X- XaIIQ(() < .1462,
show that,
(u) I lQ (x(u); u)
it suffices to show that
6 = 1/22 ),
and
a
Hence, by Lemma 3.2 (iv) (with
+
where Xa =
IIxa -
d) =
u = Su . Thus by Lemma 3.1,
where
u = Su,
let
b +ut
and let
and
u,r,
denote the center of the
xt
Qt(x) = Qt(x;u)
For all
I
(i)
1II VQ(X)
- 10 76
(ii)
I vIIQt( )
-
(iii)
I
x
- t
12~
IQ,(~'
lvllQ(xt)
i7 I v I IQ(;)
;
for all
v E R
for all
v E R n ;and
• .1462
Proof: From Corollary 4.2, wve have
lt
A- (x)u
176 andc
< 76
1 - X; 1I()
<
1/11
(4.1)
Thus, by Lemma 3.2(ii) andl (iii)
.... )
III )
IA1 (xt)At ()
12
11
and
A ( ) Al (
40
< 11
10) '
(4.2)
Also,
A-t ()A
bi - Ai x
max
i
bi + t ui- Ai
()=
max
i
I
+t+ t
u)
bi- Ai _
1-l
because
I t A- ()
1
I < 76
75
t A- () u l
< 1
u
from (4.1).
76 '
(4.3)
Similarly,
max bi + tui-Ai
i
bi -Ai X
= max
I +t
i
<
Hence,
II ll IQ()
1
u
bi - Ai
tA-l(X)ull
< 727
(4.4)
76
= l A-1(x) A vI
A 1-)a ()At(
K
A 1 (X)A (t) l
Z77
)(i.)HV uQ(i)
76 101
This proves (i).
I
""
I AI-1
~ (Xt)t
from (4.2) and (4.4) .
The proof of (ii) is similar, using (4.2) and (4.3).
show (iii), we have by the triangle inequality,
41
Avii
Next, to
I1R-Rt1IQ'(^XJ < 11x -
x
Xt +11---tllQ,(^)
x
x
xt
IIQ,(^)
+ l1,
11
11 ) 75)• I-xIIQ(1)
· i111
Ix--x11Q(X)
since
201
775
+
<
211-xIQ(-)
-20
-
<
11
<
from (ii) and (4.1),
.1462
1
20 '
U
from Lemma 3.2(iv).
We are now ready to prove:
Lemma 2.4:
For all values of
near the center of
t
generated by algorithm
PCP ,
x(t) is
in the sense that
Xt ,
I x (t) - t I (,(t))
t
< .585
Proof:
Let
as in Theorem 2.1.
ox = 1/(8011U I)
I x (t) -
Lemma 3.2(ii), it suffices to show that
Xt
te [0, c] .
Let
IIQ ()
<
By
0.369
We have, from the proof of Theorem 2.1, and Lemma 4.1(iii),
| Xa
X I Q (a)
Hence,
IXa-
I
lXIIQ(X)
Z'I| IQ(x) + IIx - RC IIQa(^a) < .033 +.1462 < 0.18
<
1
<
75
(by Lemma 4.1(ii))
xa
72 10 7611
< 0.222
42
(
(by Lemmna 4.1(i))
Next, observe that
x (t)-
(t)-- X Q((a -
= t (xa-x)
O~
x)
l lQ (x,)
<
Therefore,
0.222(t/a)
Thus, by the triangle inequality and Lemma 4.1(iii),
IIx(t)-
t IQt ( t) < Il(t)-xQllQ,(,) + Ix -tlQt(Q,)
X
< .222 + .1462
X~~~~~~~~~
< .369.
5.
Lower and Upper Bounds
In this section, we analyze the upper and/or lower bounds generated by
PCP , and we will prove Theorem 2.2 and Lemma 2.7.
algorithm
Recall that
T = It
I intXt
0}
!
TMAX: = sup T
TMIN: = inf T . We shall derive upper bounds on
TMIN ·
iterate x satisfies
x
The current value of
. We decompose
d = u + A , where S-1 u
0 .
is
t= 0 ,
and the current
Ax < b ,
< 1/21
, S = diag(s) and
= b- A
d into
u =S-l'AS
d
and
A) 1 lAT s-2 d
2
(5.1)
By assumption,
u
t
= b-Ax > 0 and lX-XlIQ(X) = II(^ -)II
is the center of the system
S = diag()
r = (AT
and lower bounds on
We shall, for convenience, adopt the following notation and assumptions
throughout this section.
wvhere
TMAX
and
d
does not lie in the column space of
A
and so
We shall prove Theorem 2.2 and Lemma 2.7 in this section.
begin with three fundamental Lemmas.
Lemma 5.1:
eT S
Proof:
eTS
u - eTu
<
-1 U
111/20
II
IeT(S1 _
u
S)
- ;nS-S) U1
<
<
44
is-s)
(1:' 2)
Sull
i Il
We
The first inequality is the Cauchy-Schwartz inequality and the last follows
I x - x Q()
from Lemma 3.2 and the assumption that
Recall that
If
f(O) = eTS u
Therefore, the current center
x
1/21
is on the lower path and
eT
TMIN
_11 u 11/20
then
TMAX
is
- m/ (eT S-1 u)
is bounded above by
However, we typically cannot deduce the exact value of
- m/(T Su)
eT S u
<
t=0.
Ilull/ 2 0 >0.
eTS u 2 eTu-
bounded below, as we have seen in Section 2, by the bound
Similarly, if
.
is the path indicator function at
/20 then from Lemma 5.1,
eTu > u[
<
from the current iterate.
Therefore, we shall later in this section
derive an alternate bound using the next Lemma which is a variant of
Lemma 7.1 of [4].
Lemma 5.2:
x
Let
be the center of the system
intX = (x I Ax<b)
the ellipsoid
FOUT
FOUT: =
Then
Proof:
Let x E int X
IIX-XIIQ(X)
is given such that
Suppose
< s < 1.
Define
by:
x E Rn
intX
Ax < b .
I
J1X- 1 1 Q(X)
(1
+ 6) Vm (m-1) + }
(5.2)
FOUT .
be given and let s = b-Ax . Then from properties of
the center, (see for example [4] , Theorem 2.1),
45
Ix-Xl Q()
<
m (m-1) . From
Lemma 3.2(v), we also have
IIX-
IIQ()
<
IX-X IIQ(X)
+
(1 + ) m (m-)
I x- XIIQ(X)
<
II -
< (1 +
IIQ(X)
)/m
(m-l)
Thus,
+
.
6
The next Lemma concerns the well-known classical least-square
problem or minimum-norm problem.
Lemma 5.3:
Given an
mxn matrix M
and an
m-vector d ,
for all
scalar t,
IIMx-tdII
where
2
It IIPMdII
for all x
R ,
PM = I-M(MTM MT
We are now ready to prove Theorem 2.2 and Lemma 2.7.
We shall
prove Theorem 2.2 in two parts.
Proposition 5.1:
(i) if eT
(ii)
Proof:
if
(i)
eT
Under the definitions and conditions of Theorem 2.2,
l ull < - 1/20
/
II >
Suppose
TMAX < UB
then
TMIN 2 LB
then
/20
< -1/20
eTull l
.
(22
u
irnm+1Y+ 1)/21 11
-(22imm+1
We first show that
+ 1)/ 21 11 I .
E
is
close to the center of the following extended system with one additional
variable and one additional constraint:
A t
<
(5.3)
l j
46
A
A =
where
A
-u
0
0
I
= eTS' l
and
Note that from the remark following Lemma 5.1,
0<0
TMAX = SUP ( t I Ax < b + dt
for some x)
sup t I Ax < b + ut
for some x}
=
u .
Therefore,
t
I
t s|
<[ b
for some x
1
For system (5.3), define
A(x,t): = diag([l b
1
Then,
A(2, ) = 0 S
[eT, 1
Therefore,
X
{[]
I
A(, 0)=
[ T 1-O
(, 0) A = [eT S-1 A, -
[
and
[
Q(x,t): =
and
0 1
^-1
eT S
+
= 0
is the center of system (5.3) and
2]
x.
O
[o] -KoJ
Let
- xlt ])
E
=
IIx-x
I
Q(x,o)
Rn+l A[
t
<i'
b
I
47
Q(X)
I1
< 1/21
and
AT --
A
(2t)
(x,t)A .
E R n+
FOUT =
Then, from Lemma 5.2,
Furthermore, because
int X
-AT
S A
-U T g`2 A
-
0'
-
IQX,
- , 0)
< 22
21
m (m+1}
+
12
FOUT.
0 = 0ASAT -
T -2
Q(X,O) =
[ :]
l
S-2 u
= A
AT S-
K
2 u
2 + uT -- 2
, we have
ATS -2 A
0
0
02+llull 2
Thus
ttJ
12(Xo
-[
+ t2 (02+ I1UIl2)
= S A(x-)
)
> t211ll1
e
Also, for any
FouT ,
2
t[ -
[o
Q(X,O)
22 m (+
1+
21
22 Vm (m+1Y + 1
whereby
21 I1 II
It then follows from Lemma 3.1 that
TMAX =
UP
(
t
=sup(t
Psut
Ax - tu < b
for some
x
I Ax - tu < b for some x , Ot <1 )
t
int
X
for some
48
x
I
(since
< 0)
1
:5 max
<
(22
ttl [
1
FoUT
x}
for some
(since int X
FOUT)
m (m+1) + 1)/2111U11
.
The proof of (ii) exactly parallels that of (i).
Proposition 5.2:
<
(i)
TMAX
(ii)
>
TMIN LB
Proof:
Ie T u I/I Il
Suppose
UB -
<
/2o
Then
(1.6 /m (m-l) + .6 )/111
I
= -(1.6
We shall show that
m(m-1) + .6)/ 1u l
is approximately centered (in the sense of
0
Section 3.2) for the extended system
[A, -u
Let Q,y
Section 3.2.
Let
for system (5.4).
and
Y = y()
Q,y
and
(5.4)
for system
=
(, 0)
be as defined in
be the corresponding parameters
= b-Ax = s > 0
U_
S'
= -1
S
uU
and
AT S-Iu = 0
r
ja~= I
Ax < b
Observe that
b [A, -u]
Therefore, since
x
Q
0
-2[A, -u]
L
49
I
0
Il]2/m
I
and
and
y
S§- e
Ym
-U/M
-eT
Therefore,
-TQ-l
I x-
Recalling that
yTQ~y + (eT )2/m!lIll 2
x I
(-)
< 6 = 1/21,
2
.
< (a+2
<
++f
1
Therefore, by Lemma 3.5(ii),
h = .41
a =
1
760
and
Next from Lemma 3.5(i),
Thus,
'Y
(m-l)y Q
2a.
Y < a+
conclude from Lemma 3.4 that,
(m-l) yT Q-1 y <
wecompute
(m-l
)+M
IeTu II i 2
1
1
2
< .0054
< .0053
and so
y < .0735.
Taking
in Lemma 3.3, we have
h2 (1 + 2)
xl
*1
(
0
:
where
x]
M 1
0)
.36 < (.6)2 , (since
2
(1 -h)
is the center of system (5.4) and
m/(m-l)
Q (,0) = mQ
Note
that
Note that
Let
[ ]-
X: = [x
t
[o]
[Ax
=
u]
i
It i
S'
A(x - x)- tu I .
b• and
I
50
I
-
<
2)
FOUT: =
tt
I1S-' A (x - )- t
Then by Lemma 5.2,
it]
E FOUT
1.6 /m (m-1) + .6
By Lemma 5.3, for all
FOUT .
int X
I<
m (m-1)+ .6 t jj-S 1A(x-R-t-ujj z! I
, 1.6
since
IJ
Thus
ATS-1 U = 0
sup (ItI ti T/ =
sup
(It I X I E- int X
< max (Iti
x]
for some
x
for some
E FOUT
< ( 1.6 /m (m-1) + .6)/1l
x
II
This completes the proof.
.
Theorem 2.2 follows immediately from Propositions 5.1 and 5.2.
Finally, we
prove
Lemma 2.7:
iteration
k ,
criterion (ii)
If
Step 4
will be satisfied.
Proof:
Suppose
eT u / l
criterion (i)
Lemma 5.1, we have
< -1/20
in
is satisfied at
algorithm PCP
of
then in all subsequent iterations,
Step 4
that
of
criteria (ii)
iteration k .
or
51
of
It suffices to show
will not be satisfied in all subsequent iterations.
f (tk) = eT S-1 u < 0 .
(iii)
By Proposition 2.2,
By
f (t) < f (tk) <
criterion (i)
for all
t
tk .
Therefore, in all subsequent iterations,
will not be satisfied, as it would imply that
f (t)
0
for some
·
t > tk .
52
References
[1]
Barnes, E. R., S. Chopra, and D. L. Jensen (1988). "A polynomial time version
of the affine scaling algorithm." Working paper, Graduate School of
Business Administration, New York University, New York, NY.
[2]
Censor, Y. and A. Lent (1987). "Optimization of 'log x' entropy over linear
equality constraints." SIAM Journal of Control and Optimization 25,
921-933.
[3]
Freund, R. M. (1988). "Projective transformations for interior point methods,
Part I: Basic theory and linear programming." Working paper OR 17988, Operations Research Center, M.I.T., Cambridge, MA.
[4]
Freund, R. M. (1988). "Projective transformations for interior point methods,
Part II: Analysis of an algorithm for finding the weighted center of a
polyhedral system" Working paper OR 180-88, Operations Research
Center, M.I.T., Cambridge, MA.
[5]
Jarre, F. (1988). "On the
convex quadratic
Mathamatik und
Wiirzburg, West
[6]
Karmarkar, N. (1984). "A new polynomial time algorithm for linear
programming." Combinatorica 4, 373-395.
[7]
Kojima, M., S. Mizuno, and A. Yoshise (1987). "A primal-dual interior point
method for linear programming." Research report No. B-193, Dept. of
Information Sciences, Tokyo Institute of Technology, Japan.
[8]
Megiddo, N. (1986). "Pathways to the optimal set in linear programming."
Research report RJ 5295, IBM Almaden Research Center, San Jose, CA.
[9]
Mehotra, S. and J. Sun (1988). "A method of analytic centers for quadratically
constrained convex quadratic programs." Technical report 88-01, Dept.
of Industrial Engineering and Management Sciences, Northwestern
University, Evanston, IL.
convergence of the method of centers when applied to
programs." Manuscript, Institut fiiur Angewandte
Statistik, Universitit Wiirzburg, Am Hubland, 8700
Germany.
[10]
Monteiro, R. C. and I. Adler (1987). "An O(n 3 L) primal-dual interior point
algorithm for linear programming." Report ORC 87-4, Dept. of
Industrial Engineering and Operations Research, University of
California, Berkeley, CA.
[11]
Renegar, J. (1988). "A polynomial-time algorithm, based on Newton's
method, for linear programming." Mathematical Programming
40, 59-94.
[12]
Sonnevend, G. (1985). "An 'analytic' center for polyhedrons and new classes
of global algorithms for linear (smooth, convex) programming."
Preprint, Dept. of Numerical Analysis, Institute of Mathematics, E6tv6s
University, 1088, Budapest, Muzeum K6rut 6-8.
[13]
Sonnevend, G. (1985). "A new method for solving a set of linear (convex)
inequalities and its applications for identification and optimization."
Preprint, Dept. of Numerical Analysis, Institute of Mathematics, Edtv6s
University, 1088, Budapest, Muzeum K6rut 6-8.
[14]
Todd, M. J. and Y. Ye (1987). "A centered projective algorithm for linear
programming." Technical report No. 763, School of Operations
Research and Industrial Engineering, Cornell University, Ithaca, NY.
[15]
Vaidya, P. (1987). "A locally well-behaved potential function and a simple
Newton-type method for finding the center of a polytope." AT&T Bell
Laboratories, Murray Hill, NJ.
[16]
Vaidya, P. (1987). "An algorithm for linear programming which requires
O(((m+n)n 2 + (m+n) 1 .5 n) L) arithmetic operations." Proceeding of the
Nineteenth ACM Symposium on the Theory of Computing, 29-38.
b0
I
Download