.. 0 ff-,0-"0-:'0 : it~~~~~~~~....fff0 -

advertisement
':·
-;
#
;~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~·
ZOM~~~~~~~~~BW~~~~~~~~~~~q
~~--(IF~~~BB~~~NSw~
:.
~P~::~::-:
_.*-;
'
i~~~~~~~~~~~~~~~~~~~-
u: · · ~"'8 ~
'
.
.
0
.-
:''
a~_:::jo%
:
,
.
....
'
.
,..
:
.
.=
....
. - '
.:
.......-: .,.~
...
.; ' ;.:
.
'
:':..::.
0 ff-,0-"0-:'0 :
-
:
: -:-:
....: 0
':
::
. .
!
"
·
..
~.' ...
~
".;
:
soft_-
11:
--.
1::
";
"
-I:-i·~I
"
......-.., .......-
:'
.
',
....
:-;~:~l~_;-r:-:
0- X:
·
.-
'--'L~..'-'" - -~
~:~ ,~--:'
07 0
0
:.-"'77:'Sk:~.g:"-.p'
~:-:
.....
"':
. ,'.,:
-:
.......
~
~
~
'::
....
:- -
..
'
" · ''
...
0
S"~ 0' '"0 '';":jb : '·f0 ;:" ; ;, 0 'i-.;i
~ 0~ ~~~~~~~~~~~~~~~~~
:'!""
.'7',
-
a~;::;~,-war:·ib.
it~~~~~~~~....fff0
- .
:;...
-
'
,
,
.
...
:
MASSACHUS E.TTS.. NSTIT
. . . . . . .
'" '-;;:'"
:~: ..................
'
~~
~goa
"- ' - : - ''''" & - , , :!- '. :
":
T
" '''
'
7
LARGE SCALE GEOMETRIC LOCATION PROBLEMS II:
STOCHASTIC LIMIT PROPERTIES
by
Mordecai Haimovich*
OR 131-85
*
Revised October 1985
University of Chicago, Graduate School of Business
Abstract
Given a set of N points (customers) in the plane, the K-median problem
is concerned with finding a set of K points (centers) so as to minimize the
sum of distances between customers and their closest center. Suppose that the
customers are independent random samples from a distribution with bounded support. This paper shows that as N and K tend to infinity while K/N tends
to a constant a
(possibly zero), the value functional is almost surely
proportional to NK-1 /2 with a proportionality constant a1/2(ac). It also
(i) shows that,as a function of a, (ca) is convex; and (ii) provides an
approximation of 6(a) in the neighborhood of a = 0 and a = 1. As a
approaches 0, the problem coincides asympotically with an underlying
deterministic continuous problem.
outline
1.
Introduction
2.
Asymptotic Continuity (K/N + 0)
3.
Asymptotic Separability (K/N - a > 0)
4.
Conclusions
5.
References
Acknowledgements:
This research was supported by Grants ECS-7926625 and ECS-8316224 from the
Systems Theory and Operations Research Division of the National Science Foundation.
The author is indebted to Professor Thomas L. Magnanti for his encouragement and for many helpful suggestions.
__
-_
_
1.
INTRODUCTION
Let
X = {Xl,X2 ,...xN}
be a set of points in the plane, frequently re-
ferred to as customers or demand points.
The K-median problem is the
problem of finding another set of points
C = {Cl,C 2 ,...,CK},
referred to
as centers (or medians), that minimizes the total sum of distances between
customers and their closest center.
Let
DK(X) = Min ( Z
C
x£X
min
cC
x - c
)
(.)
be the optimal value (or cost) functional of the problem.
two variants of the problem:
We will consider
the restricted version in which centers can
be located only at customer locations (i.e.,
which proposes no such constraint.
CcX) and the free version
Since many of the results and derivations
in this work are not version specific, we will use the same notation for both
versions.
Moreover, our assertions and derivations apply to both, unless
explicitly indicated otherwise.
While simple to state, the K-median problem seems rather hard to solve,
especially for large values of
N
and
K.
In fact, Papadimitriou [Pa]
showed that the problem is NP-complete (in the terminology of contemporary
computational complexity theory [GJI), which while short of proof, provides
strong evidence of the problem's (asymptotic) computational intractibility.
This fact gives rise to research into the design and analysis of computationally
efficient approximation algorithms (heuristics).
While to the best of our
knowledge no heuristics with a guaranteed percentage error (i.e.,
-optimal) are
known for this problem, it is still likely that in practice certain heuristics
This deviates from our notations in [Ha] where D
is defined on measures and
according to which we would have DK( E 6 )rather than D (X), where
x
is
the unit mass (measure) at
x.
X
K
x
_^___^W___nllllllla
-2-
will perform well in most situations, or on the average.
To check such
assertions, one can use empirical tests, or instead use the standard
mathematical tool concerning this type of assertions, i.e., probability
theory.
The prototype of the probabilistic approach is Karp's analysis of
partitioning algorithms for the Travelling Salesman problem [Ka], which
is based on the
asymptotic order of the optimal value for problems
defined on N cities that are chosen independently aid randomly from a fixed
planar region.
This asymptotic growth rate which implies (and is in turn
implied by) the asymptotic optimality of partitioning heuristics, was first
established by Beardwood Halton and Hammersley
BHH].
NK-1
development, Fisher and Hochbaum [FH] established an
order for the K-median problem for
N
In a subsequent
/2
asymptotic
customers that again are independent
random points of a fixed planar region.
Their analysis supports the asymptotic
optimality of a certain aggregation heuristic when K grows no faster than log N.
The present study was motivated by the desire to sharpen the results
in [FH] as well as to extend the range of the analysis to encompass arbitrary (relative) growth rates of K.
We show that the asymptotic be-
havior of the problem manifests two different convergence modes.
The
first one, associated with what we call asymptotic continuity, occurs whenever K/N tends to 0 as N tends to infinity, i.e.,K = o(N).
In this case,
we show that the problem almost surely converges (in terms of value and the
optimality of solutions) to an underlying continuous deterministic problem,
and if K itself tends to infinity, then the problem almost surely follows
the associated asymptotic behavior of the underlying deterministic continuous problem, as analyzed in [Hati
and [HM1].
Accordingly, the optimal
I_-I_
YC
CII-·-
-I-
_
_
I
-3-
2
value almost surely tends2 to
('2/3
3/2
-1/2
(6)(f
d>)
NK/
where
(6)0O.377 is
the expected distance between a random point of a unit area regular
hexagon and its center, and
f
is the density of the absolutely continuous
part of the common probability distribution of the customers.
Moreover,
one can use asymptotic approximations for the solution of the underlying
deterministic problem based on the asymptotic formula for aggregate center
allocation as given in [Ha]
as indicated in [HM1].
and the asymptotic hexagonal partitioning property
Somewhat weaker results of this nature have been inde-
pendently established by Papadimitriou [Pa] whose analysis is for the uniform
distribution and requires the stronger assumption K = o(N/logN).
Zemel [Z]
has extended Papadimitriou's analysis to the K-center problem which is obtained by replacing the operator
Ez X
in (1.1) by
Max x.
The second convergence mode, associated with what we call asymptotic
separability, occurs whenever K/N tends to a constant a > 0 as N tends to
infinity.
In this case we show that for the uniform distribution on the unit
square the optimal value almost surely tends 2 to
function of a alone.
We also establish bounds for
close approximations of it when
(a)V where
B(a),
(a) is a
as well as very
is in the neighborhood of 0 or of 1.
This
convergence result implies (and is implied by) the asymptotic optimality
of partitioning procedures of the type employed by Karp [Ka].
While these
results are similar in nature to those of Beardwood et al. [BHH] for the
TSP and other problems as discussed in Subsection 3.5, the techniques in
[BHH]
are not adequate for the K-median problem.
Independently, Steele [SI] and
Hochbaum and Steele [HS] have obtained results on the free and restricted
2More precisely, the ratio between the optimal value and the given expression tends to 1.
-4versions of the K-median problem.
Our analysis provides a sharper rate of
convergence for the deviation probabilities (that is, for any given
the probability that the ratio between the optimal value ad
from
(ac) by more than
VI
> 0
differs
), which, incidently, answers a question posed
by Hochbaum 3 about a stronger mode of convergence.
We also evaluate the
limiting behavior of the asymptotic constant
as
1.
3(a)
a
approaches 0 or
In addition, our approach has the potential of being extended to non-
uniform distributions.
We discuss and derive asymptotic continuity in Section 2 and asymptotic
separability in Section 3.
In contrast to the expository style in [Ha],
which considered the K-median problem through a set of formal properties
it satisfies,
the treatment here, though generalizable to other situations,
is far more concrete.
The probabilistic apparatus we use is rather elementary:
Tchebychev's
inequality as applied to second or fourth order moments coupled with the
Borel-Cantelli's Lemma, in a manner not dissimilar to the standard textbook
derivation of the strong law of large number for independent identically
distributed random variables with finite fourth moments.
these rudimentary tools can be quite powerful.
3
Private communication.
As we will see,
-52.
Asymptotic Continuity (K/N
o)
Large scale K-median problems where the number of customers Der facility is very high, i.e.,
K = o(N)
can be approximated by aggregation that
aggregates/unifies customers that are close together, or by "continuation,"
i.e., the use of a smooth continuous distribution to represent the essentially discrete demand.
While Fisher and Hochbaum
approach, we will use the second.
FH] take the first
This section is devoted to establishing
the asymptotic accuracy of the continuous deterministic approximation (and
thus also the asymptotic optimality of heuristics based on it)
large-scale discrete problems with
2.1
for random
K = o(N).
Results
Let
square
p
2
S = [0, 11
and let
f
of independent p-distributed random points.
set of the first
TcR
N
points.
X (T) = 1 if
,
with support in the unit
be the density function of its absolutely
continuous part which we assume is not null.
for all
R2
be a Borel probability measure on
Let
Xi £
6X
2
T
Let
Let
X 1, X2,... be a sequence
XN = {X1,XX2 ...
N
be the point unit mass at
6X (T) = 0
and
} be the
X
1'
otherwise).
(i.e.,
Follow-
ing [Ha], we will define the value function for a finite-positive regular
Borel measure
w
with a bounded support as
DK()
where
H mi
ICI is the cardinality of
min
x-c
dw
C; the restricted version restricts
C
to
N
N
Consequently, DK(X )=DK( E 6X ) (which is an abuse of
i
i=l
N
1
notation). Now, PN
N
E
X
is the empirical probability measure
i=l
i
determined by the first N points of the sequence. By linearity of
the support of
w.
integration we have
-6-
DK(PN) =
DK(XN)
PN
As is well-known (see [Ch] for example),
almost surely (a.s.) as
g: Rd
R2
-
R, f gdp n
+
N
o (i.e., for every bounded continous function
-
Weak convergence of Borel measures on
f gdp a.s.).
is in turn known (see [Du]) to imply uniform weak convergence relative
to any uniformly bounded and equicontinuous family of functions
Sup If gdp N - f gdp - 0 a.s.).
g£F
Observing that the family F
DK(XN
1
show that
may change with
)
(i.e.,
N,
and
DK(PN)
o,
N
definitely as
K =
i.e.,
by
K
N).
min IxcEC
bounded on
as
c,
S
ICI = K - 1
Since
DK(p)
+ m
(where
increases in-
is
O(K
) (see [FH] or
KIDK(pN) - DK(p)! + 0 a.s. as
K = o(N) (i.e., when
K/N - 0
F = {g: g(x) =
is neither equicontinuous nor uniformly
mere weak convergence of
and
N
K
Note, however, that the family of functions
ca).
K
+ 0
In the case where
N + oo, which we shall prove for the case
-
I
our notation will suppress,
,
[Hal) a meaningful result here will be
N
DK(P)
however, this result becomes meaningless as both
converge to G.
DK(p)
defined by
= ID(PN) -
DK(P )
though, the indexing of
as
F
F = {g: g(x) = Mlin flx - cll,
cCC
can easily
S,
one
on
bounded
uniformly
and
, CcS} is equicontinuous
C #
K
p
weakly converges to
PN
to
p
is not sufficient
What is needed is convergence of the empirical
for proving this result.
measure in a stronger sense as established by Sublemma 2' in the proof of
Lemma 2 below.
The main results of this section are summarized by
THEOREM 1:
(i)
If
K(N)
N
O
as
N
+
(i.e.,
K = o(N),
then
-1~
I
Iu··
I
I
-7-
DK(X )
N
-
K
=
DK(P)
0
almost surely
N-to
(ii)
e
Let
<
Any algorithm which for some
0.
>
solutions for demand p
b-optimal
can be modified by reallocating an arbitrarily
(though fixed) fraction of the centers4 to yield solutions that are
small
almost surely asymptotically c-optimal for the demand
If in addition
(iii)
K
where
finds
6(6) =
K(N) + O
lim K
D(X
N-N
K
1
1
(- + 4
n 3)-
=
as
(6)(
N -+ o,
then
2/3
D3/2
f2/3 d)3/2
Z
i=l
X
almost surely
23
Remarks:
(1)
The reallocation of a small fraction of the centers, referred to
in the statement of part (ii) above, consists of placing an arbitrarily
small fraction, say 0.01, of the centers according to a uniform grid (e.g.
a square grid) before proceeding to solve the 0.99K-median problem for demand
p.
This device which is essential to our proof can be avoided, however, if
is absolutely continuous with a density
Sup
Sf(x) < m; e.g., if
just the square
(2)
S,
p
f
is uniform on
satisfying,
S.
can serve as the support of
Inf
p
Sf(x) > 0,
(Any polygon, rather than
p).
6(6) as given above in (ii) is the expected distance between a
(uniformly distributed) random point of a regular hexagon of unit area and
its center.
4
For the restricted version there is an additional modification involving the
replacement of solution centers for demand p, by the X.'s that are closest
to them.
J
--
11
11~-
--
I~-
-8-
(3)
(X1 C X2
While the theorem is stated for a sequence of nested demands
C ...) (it is an "incremental model" in Weide's [We] terminology),
the proof below does not make any use of the associated dependency between
N
DK(X
)
s.
Let
= K' DK(XN
DEV
)
- K~ DK(p).
Almost sure convergence in the
subsequent proofs follows always from the Borel-Cantelli Lemma (i.e., uses
N Prob( DEVN
> e) <
for any
X
individual distributions of the
> 0) and thus depends exclusively on the
DK(XN)s.
Consequently the theorem actually
holds for arbitrary (non-nested) sequences of problem instances, each of
which is generated by independent random points.
In particular, it holds for
the case where the instances in the sequence are generated independently (the
"independent model" in Weide's [We] terminology).
This mode of convergence
which by Borel-Cantelli's Lemma (both halves) is equivalent to summability
(over the whole sequence) of probabilities of
deviation from the limit,
is known also as "complete convergence" [HR].
The rest of this section is devoted to proving Theorem 1.
The
technicalities, while not entirely trivial, may be too specifically tailored
to be of general interest, and the reader may wish to proceed directly to
Section 3.
2.2
Derivation
.otational Conventions:(1)
We need a notation for
the value of suboptimal solutions.
,et, ten,
D(X,C)
=
Z
xEX
i
ccC
-
;;lip
D(p,Cq
min
c!
x-cll
1_11__11
___1__*
-9-
(2)
In order to enhance clarity, we will append the argument "(c)"
, can be thought
to random entities when appearing in "pointwise" statements.
of as an element
£ Q where (,
F, P) is some implicit underlying proba-
bility space that we will not otherwise invoke.
(3)
For a random variable Y,
2(y)
(Y) is
Note a (Y) ~ (a2 ()) 2 '
its fourth central moment.
LEMMA 1 (Free version only):
For any sequence {K(N)}N>l'
K2
K
D (XN)
lim sup [N
N-
is its variance while
pK(
)
] < 0 almost surely.
Proof:
Fix
0 < 6 < 1.
If
K < 1/6, let
K = K
and let
centers solving the K-median problem with demand
let
K = K -
r6/-K1 2
in the partition of
and let
S
into
K-median problem with demand
1K D(XN(w))
-
be the set of
p; otherwise, if
K < 1/6,
consists of the centers of the subsquares
CK
[1 2
p.
CK
subsquares and of the centers solving the
Now,
KK(P) = UN(w) + VN(W ) + WK(N)
(L.1)
+ ZKN)
where
UN(w) = K
VN(w)
DK(XN(w))
KN
D(XN(w),C)
K= D(XN(w),CK) - KD(pCK
)
(L1.2)
WK(N) = K(p,CK) - KD_(p)
ZK(N)
K'D(p) - K
K(P)
Z~~K(N
__1111_
_
II__·_^
1_
-10-
UN(w) < 0 for all w, since (by definition) DK(XN(w))
WK(N)
D(XN(w),CK).
0 since the set of centers that (for demand p) yield
~~~~K(N)
Also,
cost D(p)
K
is a subset of CK.
Now VN(w) may be rewritten as
VN(w) =
where
Yj,K(N)(w) = K(min
Y
(
w)
=l YjK(N)
JIXj(w) - c
(L1.3)
- D(p,CK))
cCK
The Y
K(N)' j
1,2,...,N are independent identically distributed zero
=
mean random variables.
Also all the moments of
point in the support of
in
CK,
and thus
p
is at distance
1/2
Y
1
< K
j,K '
-
are finite since every
at most from some center
1
<
2' [,/l
[
Yj k
V
Now emulating the standard proof of the strong law of large numbers
for variables with finite fourth order moments, we first note that
a (VN )
and thus for any
Prob (IVN)
£
N4 [No(
Z
)
+ 6( )c
2
(Y
1)]
= (N2 )
1,K
> 0 we obtain (using Tchebychev's inequality) that
) = O(N
>
(YK
) and thus that
Prob(IVNI > E)
<
N=l
which, using the Borel Cantelli Lemma, implies that lim
N-o
surely.
VN = 0 almost
-YI111111--
P-ll
1I^I
·._._
-11-
It follows, then, from (L1.1) that,
lim sup(W- D(XN)
KDK(p))
Finally, noting that for
K
1/6,
li
sup ZK
K
=
K
K
a.s.
K
K
-iK - 2K
1
1 - 26 '
<
we have
(1- 26 ) K2 D (p) - K2 DK(p)
(N)
|
and recalling (Theorem 1 in [Ha])
if
K >- 1/6
if
K < 1/6
the convergence of KDK(P) when K
A,
we deduce that ZK(N) can be made arbitrarily small by choosing 6 small
enough, which concludes the proof.
LEMMA 2:
If K(N)
- N
0 as N
lim inf[K
Proof:
(i.e.,K = o(N)) then,
DK(XN) - KK(p)]
O0almost surely.
Similarly to the proof of Lemma 1, fix 0 < 6 < 1.
CK,N()
If
K < 1/6,
let
be a set of centers solving the K-median problem for the first
customers,
let
D
CK,N(W)
X1(X), X2 (w),
... XN()
and
let
K = K; otherwise, , if
include also the centers of the partition of
congruent subsquares, and let
K = K + r
S
into
N
K - 1/6,
r/i2
1 2.
Now,
Kn D
K(
(w)
)
FN(w) + GN()
+
(N)
L2.1)
I--·_I_---__III__I___
Il^ll-___C ^___
-12-
where
N
= K- DK(X (w)) - K
EN()
FN()
=
FN(W)
D(N(w),C
)
NK D((w),CKN(w)) - KD(p,CK (w))
K,N
D(
(w),CK,
N
(L2.2)
K D(p,CK N(w)) - KD(p)
GN(w)
- K K(p)
HK(N)= KIDK(P)
By definition,
EN(X)
possible to show that
K(N)
ZK(N))
O
HK(N)
and
GN( )
O0 for all
.
Also it is
(as it was in the proof of Lemma 1 for
can be made arbitrarily close to
0
by choosing a sufficiently
small 6.
The proof will be complete, then, if we show that
lim inf FN
N-*co
>
0
(L2.3)
a.s.
To prove (L2.3) consider first, for M > 0, the partition of R into
MK
MK
MS
I[J4/H12 subsquares, S1 , S , ..., S
th
i
MK
subsquare S
M
let pK
,
tomer will lie in S
let yMK be the center of the
be the probability that any particular cus-
(i.e.,
i
MK
2
i
= f
dp), and let NM N(w) denote the
iS
L
(N )
actual number of customers, out of the first N, that are in S
K½'
N
D((),C
M(w))=
KN N
min
j=l CCK N(W)
KI 1 2r1r1
N
i=l
Now,
IXj(w) - c
I·~icl
U£M
MK
Ml'
mintlX.(w)
- yi
M
+ Yi
-cli
j:Xj ()S M
_-111
-
_
_I
__I
II_
I__Y
I__IX
-
-13-
-il1 2 N'N(w)
> K
2
i=1
I
i=l
-
cl
-2
i
IlyMK - clI
yi
N N min
,N
CCK
,N()
T1
41
2
CCKN(W)
N
N(w)
> K
MK
Iy
(min
1
1
2M
-
and similarly,
KD(p,CNK( )
I
i=l
K
jMK
Si
lix - cl dp
min
CaCN,K(W)
1
k[,,
2
K
P.
I
i=l
MK
i
min
-cfl
M
CCN,K(w)
Thus,
NM.'
I
FN(w) > K
(w)
(
N
N
i=l
Now for 0 <
P.
' )
C£CK,
_4
IlyMX - clI
min
(L2.4)
M
N
< 1,define
IM'&(w) = {1 < i < [f/-F1
2
(1 -
PMK
) - 1) PMK min
Ilyi
1
CKN
cCKN
cli
N
:
·N
N_
(L2.5)
and,thus,
FN(w) > K %
((1 iWIN
- K
iM
(W)
NF
isN
()
pMK min
I yM K - cI
- 49
(L2.6)
M
CK,N
_
_1__
__1_
11
· 1
--
---
I
I
_
·
I
-14-
Noting that
MK
min
M1y
C
i
N
cCK~N
C '
CK
FNM) >-c-&
1
~-
1
1
26
=1
and recalling that
Z P
i
1,
we have
Choosing
Mi
( IY
- &Ni
(
1
-
(L2.7)
)
sufficiently small and M sufficiently large can make the
first and third terms on the right hand side of (L2.7) arbitrarily small.
To complete the proof, then, it is enough to prove:
Sublemma 2':
If
lim K(N) = o
N
lim (
i
N-xo i I
then, for every 0 < £ < 1 and M > 0,
MK )=
0
(L2.8)
a.s.
N
Proof (of sublemma):
Let N be a Poisson distributed random variable with mean N.
Consider
IN
(w) =
1 <i [
N N~~~1
12 : N M(w) < NPMK(1 1
N M is the number of
where N'
customers out of the first
)}
(w) that lie in
i
We first prove that (L2.8) holds if the summation is over IM'(w)
N
rather
than over
over IM'(w)
rather
than
I N (w)
Let
Let
+
Claim:
for some
> 1 + 4- > 1
NMW). be the probability that
i
N,M,
Qi
1
i
TN
N
(w).
NpMK(N)
zN
1
_
for all N and i
.4.1111-1
IIIP----L-IIIII
-II_-------
I
-
---
-15-
t(1-&)X
N,M,
Qi
Proof:
1
n
-
e
MK
= NP.
where
n-
1
n=n=
O
< e-xe/P
pl (l-s)xA
(e e
(1i
pi
=
for any p > 1
)
(Note that
2
eAX/P pl(l-)AJ = [1 + - + (A/p)
2!
p
(1-)A
t
I
x t(1-E)AI
A2
2!
Now let p = 1 +
* I(1-a)A!+
/2 then,
1
1-1/p
-
1+/2
(1 +
p(1-)
e
e
1
< (1 +
4 )
i
M
)
1+a
2
1+/2e > es /4 >
> I +-
-
pMK (N)
N,M,a
Now let Yi
1I(l+a
e
/2(1-a)
-NP.
QNiM,
1
1+£/2
e
(1 + S/Z)
a2
Thus
1lL(1-a)A]
+ ... ]p
+
(1
+
E
4
MK
(QED Claim)
)
if i
-M,if
i
I
(W)
otherwise
___LIIO__IUIUULIYLIILI^
-_
11111
1^._11_1--
-16-
P
Obviously,
i£I
MK
)
YN M
I
'
(w)
and thus for any
i=l
(w)
E(yN,M,
E(
M,& pi
icl
N
MK
I,']
I
i=l
<
i=1
2
-NP.
Q
P.
1
MK
p
-NP M
+
MK
P.1
1
i:i
MK
+
MK> z
i N('-1)
N(O-1)
< N(e-
1)
+
(
I
-NPi
1
PMK
P.))e-
z
i =N(6-1)
fVW)- Z
+ e-z
e -z
as N -
- 1)
N(
NP.
1) implies
(Note that Pi >-
1
e-Z
< e
.
.)
Taking z large enough we get, then
limE(
PMK) = 0 .
N->o:
iI N'
Letting a
denote the
of yM,
1
(i = 1, 2, ..., [~]Z2)
th
(L2.9)
central moment we have, using the independence
z,
-17-
I2
[VR 2 N,M)<[
a4(
MP
K) =
N
4(
YiM
)
3(
i=l
i=l
1N
<3(
3(i=l
E
i=1
2
E[(yN,M,)2])
E
o2(yN,M,e))2 +
i
'
+
E [ (yN
I
u 4 (yNM,')
=1
i=l
) ]4
i=l
NpMK
M
-NP MNp
.
(p M K ) 20
<3(
1
)2
+
i=l
i=l
(pMK) 4o
1
i=1
MK
_
-NP.
=3(
M -5/6 (] pMK
i:P. <N / 6 5
1I
2
+
1
6
3 pMN
i:P
>N-5/
NpMK
+ ( pMK< -5/6 (pMK)4e
1
i:P. <N
1
-NP.
+
1
<3([
Recalling that
fV1
2
]12N
(PMK)
1
5/6
-5/6
PM.
i:1
-N1/6 )2 + ([
3(
-10/61
- 10/
+ 6
= 1 +
MK
1 2 0 /6 +
2N-20/6
+
4
0
1 )
N1 /
is negligible) and that
2/4 > 1 (thus,
= 0(K) = o(N), we conclude that
20
10
4
4(
-ME
M
PMK )
) +
0(N
(N
6)
0(N-4/3)
iNi
By Tchebychev's inequality we have, then, for every
Y4(
E8 p. MK
Prob(
i,1
N
P.I)I >
E(
<
IM e
> 0,
MK
P1 )
= 0(N - 4/3)
04
ieN
Consequently, this probability is summable over
N = 1, 2, ...
thus the Borel-Cantelli lemma and (L2.9) imply that
and
(L2.10)
-18MK
PK converges almost surely to O, which is the limit of its expeciINI
tation.
It remains to extend the result to the case where the number of
customers is N rather than a Poisson random variable with mean N.
PMK
8IN
N
M,
(w)
For every
MX
=
EM/2
i&
N
P
(W)
1
I
E(W)-
isI 'N)
(L2. 11)
P i
M/2
-IN
(w
M, &
-=M)&/2
IN (w) - IN'
(w), we have
i
N (W)
<
(1
-
)PMKN
<
(1
-
/2)PMKN
N.'
(w)
and therefore,
pMKN <
2i
MN(w) - NM,N()
i
1
Substituting, then, in (L2.11) we get
iN
,£N
N
'(w)
1
<
(/2
M
i
MK
M
Pi
(w)
z
- )1/2
isIN
EN
ii
M/2
()-I
,/2(w)
N
-
1
2N(w) -N
N
(L2.12)
We have just proven that the first term on the right hand side of (L2.12)
converges almost surely to
0.
That the second term
converges almost surely to 0 is well-known.
> 6) <
for all 6 > 0.
2
(
N
-)
Prob(NN N
N=1
This, rather than just a.s. convergence, is
(In fact,
needed here to justify our claim in Remark 2 regarding the applicability of the results to problem sequences of arbitrary dependence
-19-
structure.
The proof of this is straight forward.)
is complete, and so is the proof of Lemma 2.
The proof of the sublemma
a
We are ready to proceed now with the proof of the Theorem:
Proof (of Theorem 1):
(i)
(ii)
Consider first the free version.
This result is an immediate corollary of Lemmas 1 and 2.
First assume an algorithm yielding optimal solutions for the problem
with demand
p,
=
i.e.
.
Note that the proof of Lemma 1 is
essentially based on a heuristic in which
[-]2
centers are
= 6K
placed on a square grid while the remaining centers solve the
(K -
[/-i]2) -median problem for demand
p.
It is shown there that
the difference between the cost attained by this heuristic (when
divided by
NK-Z)
and
KiDK(p)
is almost surely asymptotically
upper bounded by a number that tends to
0
as
6 + O.
Since Lemma 2
shows that almost surely we cannot achieve a cost asymptotically
better than
KDK(p),
it follows that for any given
e > 0
we can
find
6 > 0
so that the heuristic is almost surely asymptotically
E-optimal.
Using
for demand
p,
for any given
(iii)
-optimal (
> 0),
rather than optimal solutions,
in the proof of Lemma 1, yields a similar consequence
e
>
.
This result follows from (i) and Corollary 1.2 in [HM1].
We turn now to the restricted version of the K-median problem, i.e., the
centers must be a subset of the
Xj's.
(For demand
p,
version requires that the centers lie in the support of
the restricted
p).
The arguments
above do not apply since Lemma 1 was restricted to the free version.
We show
below (Lemma 1R) that Lemma 1 is valid also in the restricted version under
the additional condition that
K = o(N).
_11 1_1_
-20-
(i) follows from Lemmas 1R and 2;
Analogously to the previous case then:
(ii) follows from a heuristic implicit in the proof of Lemma 1R and (iii)
follows from (i),
LEMMA 1R:
from Corollary C2 of [Ha] and Corollary 1.2 of [HM1].
K(N)/N + 0
If
N +
as
,
then the assertion of Lemma 1 remains
valid for the restricted version of the K-median problem.
Proof:
0 < 6 < 1.
Fix
There are sets
U
and
V
both finite unions of rectangles (with sides parallel
to the coordinate axes) so that
p({x
({x e u:
where
f(x) > 6}) < 6
S - U:
(L1R.1)
(L1R.2)
f(x) < 6}) < 66
ps(S - V) < 6
(L1R.3)
i(V) < 2
(L1R.4)
denotes the singular part of the measure
ps
This follows from the
p.
regularity of finite Borel measures in Euclidean space (for an explicit argument
of [Ha, appendix A]).
see, for example, the proof of the "claim" in Lemma A
Consider now the partitioning of
S
into
[/i12
congruent subsquares (which we
will refer to as "primary" subsquares) and define the following subsets of
U
(i)
U,
the union of subsquares contained
(ii)
U,
the union of subsquares intersecting
U
V,
the union of subsquares intersecting
V
W,
the union of subsquares with more than
(iii)
(iv)
X 1 (w), X2 (w), ...,
S:
in
N/(63 K)
customers (out of
XN(w))
_^___·_1_11_
1_1__1_11__1·1___11_II
-21-
Let
25
be the total number of rectangles in the sets
2 6-13, [then for all
K > L
(U
U
and let
L
- U) + (V
- V) < 66
(L1R.5)
U
and
V
are less
and thus the number of subsquares intersecting their boundaries is less
4%/(1/r[/rK1) + 4
than
V
we have,]
(This follows from the fact that the combined perimeters of
than 4
and
4W(/[K1 + 1)
/i-K1
2
= 4
+ 1)
([/i-K1
and their total area is less than
which is smaller than
66
for
K > L = 25926-13)
and
thus, in particular
p(U - U) < 66
It follows from (L1R.A) and (L1R.5) that for
(62 + 266) [/-K12
63 K
are no more than
V
or
U - U
subsquares in
V
subsquares in
and
W.
(L1R.6)
K > L,
(U - U).
there are less than
It is also clear that there
For each of the subsquares in
we consider an additional partitioning into
r1/612
congruent
(secondary) subsquares.
The total number of subsquares (both primary and
secondary) is less than
[/ -l2
close to
36K,
Now, for
are possible for
K
L
let
= K
6
(Tighter bounds, arbitrarily
small enough).
and let
CK
be a set of
solving the restricted K-median problem for demand
K - 17 [,/i12
and let
CK
primary subsquares.
center in CK
where for
Let
CKN()
by a customer location
K > L
CK,N()
Xi(w)
p
K
points (centers)
p; otherwise, if K > L,
be the union of a set of
R-median problem (either version) for demand
[ 12
+ 63 K) [1/612
+ ((62 + 26 6)f/-1
< (1 + 2(1 + 62) 62 [1/612) r/-K12 < 17 [/-K 12
W,
R
let
points solving the
and the set of centers of the
be the set obtained by replacing each
(1
i
N) that is closest to it,
is augmented by additional customer locations so that
each secondary subsquare that contains a customer location contains also a point
-22-
of
CK,N(w).
By our construction
CKN()
consists of no more than
K
customer
locations and is thus a feasible (suboptimal) solution to the restricted K-median
problem for
N
customers located at
Xl(w), X2 (w),
..., XN(w).
Now, the
expression in the assertion of Lemma 1 can be decomposed as follows:
K[N DK(XN(w)) - DK(p)
]
= EN(W) + FN( ) + GN(W) + HK(
(L1R.7)
where:
EN(w) = - [DK(XN(w)) - D(XN(w), CK N(M))
KN(W)
= - [D(XN(X),
FN()
N
CKN(w) - D(XN(w), CK)]
(L1R.8)
GN(W)
= K½[k D(XN(w), CK) - D(p,
HK(N) = K[D(p, CK) - DK(P)
Note the
DK
and
D
]
CK)
< K[DR(p) - DK(P) ]
in the above expressions correspond to values of the
restricted version of the K-median and
K-median
problems, respectively.
To show that the upper limit of the left hand side of (L1R.7) is almost
surely
0, it is enough to demonstrate it for each of its components in (L1R.8) or
at least to show that their upper limits can be made arbitrarily small by
sufficiently small
By definition
6.
EN(w)
0
proof of the same fact for
the convergence of
VN
KDK(p)
can show (as we did for
for all
lim GN = 0
(a.s.)
K +
(Theorem 1 and Appendix B of [Ha]) we
in the proof of Lemma 1) that
arbitrarily small by choosing sufficiently small
is almost surely
0(6).
HK(N)
can be made
6.
To conclude the proof we will show that the upper limit of
N + a,
following the
(see (L1.2)) in the proof of Lemma 1. Recalling
when
ZK(N)
w.
FN,
as
-23-
K
For
L
we have by the triangle inequality,
K()
FN( ) N < N N.
min
- max
IIx -
cl I
x XN(w)
cECK
< Li oN (w)
(L1R.9)
where
PN(W)
min
max
xEXN(w)
CECUKMC K
For
IIx - c II
K > M, we consider separately the partial customer populations:
xN,
1
= xN n (W u V u (U - U))
XN'2 = X
n (S - U - V)
XN,3 = XN n (U - W)
Clearly,
XN = XN, 1
where for
u XN, 2 U XN,3
and
FN(w)
+ FN,2()
FNi()
(L1R. 10)
+ FN,3(W)
i = 1, 2, 3
K)
=
N [D(XN'i(W), CKN(W)) - D(Xi (),
FN,i(W)
CK)J
Clearly, for all
FN l(W) K0
FNi l() <N
.
1
1
r A-K I
rl1/SI]
(L1R. 11)
and
F,
(W)
'
.
XN'
2 ()(I
1
F/i I
N,2()I
1
N
L2
(L1R.12)
Before we bound
FN 3
note that the inclusion of the primary subsquare
centers in CK
implies that no customer is further than
nearest center
in CK.
1/(iri/1-K)
We can thus assert that every center
customers outside a circle of radius
1/(/r/6K1])
customers in no more than 9 subsquares.
from the
ccCK
serves no
around it, and thus serves
Replacing
ceCK
by the customer closest
to it might (by the triangle inequality) cost the customers served by
additional distance of up to
Let
'..,
Q 1, Q2 '
rN i(W)
lIx - clI,
but no more than
1/(/[
1).
xsX (w)
be the list of the primary subsquares and define
Q[/12
= min
minN
c an
max
lx - II ,
min
1/(/2 [/6K]
} .
(L1R.13)
ceCK Qi xEX (w)
rN,i(w)
is a bound to the increase in distance, due to replacing
CK,N(w),
for any customer served by a center in Qi.
customers in U - W (i.e.,
by
All the centers serving
XN,3 customers) lie in U. The number of
XN,3
customers served by all the centers in any given subsquare
most
9
3-
CK
Qi
is at
From all that we may conclude that:
6K
K
FN,3(W)
<N
F
½
N
9 3- rN
N,i()
-
i:QicU
< 9F
-5/2
1
r
N,i)
i:QicU
< 11
-5/2 i
~r
N
(W2
M
½
(L1R.14)
,
iQiCU
where
t is the number of subsquares
(and since
/K
5 for
Qi
in V, and since
K > L = 25i 2 S- 13 ) we have
9I-
t
9
[/i-K12
< 11.
-25-
From (L1R.9) - (L1R.12) and (L1R.14) we have
FN()
N( W) +6
0
N
N,2w) + 6 5/2
X
rN, i(w)2
(L1R.15)
i:Q.cU
Note now that
any
e > 0
center
UK<LC K
there is a
c
uK LCK
for all such
c
is a finite subset of the support of
8 > 0
such that a circle of radius
has probability (p-measure) greater than
Prob(minN
lix - cll > ) < (1 - )N
p.
For
around any
0
, and thus
from which we conclude
xeX
that
Prob(pN > c) <
and thus summing over
N
L(L - 1)(1 -
)N
and employing the Borel-Cantelli lemma we have
PN + O
a.s.
(L1R.16)
By the strong law of large numbers we know that
1IxN,2
But, letting
1
Pc
singular parts of
+ Pc ({x
=
IXN n (S - U - V)j + p (S - U - V) a.s.
and
p
p,
denote, respectively, the absolutely continuous and
we have
p(S - U - V) = p({x
S - U - V:
f(x) < 6}) + ps({x e S - U - V:
S - U - V:
(L1R.17a)
f(x) < 6})
f(x)
6})
which by
(L1R.1) and (L1R.3) yields
p(S - U - V) < 6 + 6 + 6 = 36 .
Finally, note that each subsquare
around which we have a circle of radius
customers.
Qi c U
rN i(w)
The intersection of this circle with
side no shorter than
r
(L1R.17b)
contains a center from
CK
that is empty of
Qi
contains a square with
(w) (consider the square, with sides parallel to
,i
the coordinate axes, that is inscribed by this circle, then at least one of
-
,/
-26-
its corners and thus one of its quadrant subsquares must be inside
otherwise
rN, i
(L1R.13).
Consequently
must be larger than
U
1/i(2 [/r/1)
Qi'
since
in contradiction to
contains a disjoint union of no more than
empty (of customers) squares with cumulative area
rN
[/i12
()2.
Now,
i:QicU
if, as in Lemma 2, we use a refined partition into
arbitrarily large (but fixed)
M,
[A-il2
subsquares for
then we can by employing sublemma 2',
show
that the probability (i.e. p-measure) of the empty squares converges almost
surely to
0
v({x
f(x) < 6}) < 266 ,
U:
and thus, noting that by (L1R.2) and (L1R.6)
we may conclude that
lim sup
N
r
2 < 266
rN
26
a.s.
a.s.
i:QicU
and thus that
lim sup
65/2
(
rrNi 2
2
a.s.
(L1R.18)
i:QicU
Plugging (L1R.16), (L1R.17a, b) and (L1R.18) in (L1R.15) yields
lim sup FN
and since
6
0(6)
a.s.
(L1R.19)
can be made arbitrarily small the proof is complete.
Note, again that as in the rest of this paper, complete convergence is
O
essentially implicit in all the a.s. convergence statements in the proof
above.
-27-
3.
Asymptotic Separability (K/N + a > 0)
The analysis of the previous section and the resulting asymptotic
continuity apply only when
K = o(N).
In this section we introduce another
regime of asymptotic behavior, one that is associated with
N/K
does not necessarily approach
+o
K -+
(where
as in the previous section).
This
type of behavior which we will call separability, corresponds to the fact
that when
K
is large, then the optimal solution (and the value) may be
approximated by the solutions to (and the sum of the values of) the components in a partition of the problem, according to some geographic lines.
More specifically, suppose we have a collection of customers uniformly
scattered in a square, or for that matter suppose we have a uniform demand
distribution in a square, and consider a partition of the square into two
congruent rectanges.
Then as
K
grows, the optimal solution can be
approximated by the sum of the separate solutions for K/2-median problems
in the rectangles.
Similarly, if we partition the region into
gruent components, then as
K/L
grows (note
L
may change with
L
conK), the
overall solution and value can be approximated by the sum of the solutions of the components.
This of course suggests the applicability of a
partitioning ("divide and conquer") heuristic as the one introduced by
Karp [Ka] for the Travelling Salesman Problem (TSP).
a problem with
K
medians, we solve
L
problems with
Instead of solving
K/L
medians.
While the separability increases (i.e., the error decreases) as
so does the computational effort.
computational effort.
Once a
K/L
K/L
grows,
One must trade off the accuracy and the
was chosen, the relative error of the
heuristic (i.e., the ratio between the error and the optimal value) does
not depend on
K.
All these features of separability/partitioning are
-28analogous to those explored by Karp for the TSP.
Separability can be
established for the deterministic continuous version of the problem, and
is implicit in the proof of Theorem 1 of [Ha]
(specifically consult Lemma 4
and the preliminary discussion preceding the theorem).
Using the deter-
ministic seDarability and the asymptotic continuity results of Section 2,
one can easily establish separability for the case where
K + c
N/K
+ o
and
simultaneously.
In this section we are interested in establishing asymptotic sepa-
rability for the case where N/K
thus N
oXas well).
1/a for some a > 0 and K -,
X
(and
'For this case, we cannot use the deterministic
separability and we prove stochastic limit results that establish almost
sure asymptotic separability (and thus support the viability of the
partitioning approach).
3.1
Results
As in Section 2,
XN
will denote the first
N
points in an infinite
sequence of independent uniformly distributed random points of the unit
square.
The following theorem summarizes the results of this section.
THEOREM 2:
(i)
If
(N)
a >
as N - -,
there is a non-random function
lim N-1/2 DK(XN)
N-+O
where
then
= B(c)
: (0,1] -+ [0,)
so that,
almost surely
is convex, monotonely decreasing and has the following bounds and
asymptotics
lim ~c)= 1
-*+ 0
where
(a)
a-
(1 +
o~~~~~~~
n3)
2
(T2.1
(T2.1)
-29-
lim
a-*l
1*
)_
)
where
1(a)
1
-erf(ii"n(l/(2
-)))
(a
2a-1
2
In the free version, (a) <
In the restricted version
(1/)
n(/(2a-l))
(T3.1)
0 (a) for all
B(c)
>
2 erf (/n(T/7))
-acx(l/n)Zn(l/1) for all a .
(Here erf(x)
(ii)
into
2
For any
e-
dt.)
> 0,there is an
subsquares where
M > 0
such that partitioning the square
=Lv7T] and solving a separate
problem in each is almost surely asymptotically
K/ 2-median
-optimal.
Remarks:
(1)
Note that here as in the previous section, the a.s. convergence
always follows from the application of Borel-Cantelli Lemma to the whole
sequence; that is, from summability of the deviation probabilities (Prob
(IN-1/ 2 DK(XN)-0(a)I>e)) over N=1,2,3,..., and thus is essentially complete convergence.
(2)
It is quite conceivable that via a Lagrangian technique as used
in Subsection 3.6 and 3.7, one can obtain a non-uniform counterpart of
the theorem.
The rest of this section (Subsection 3.2-3.7) is devoted to the proof
of these results.
The techniques are more interesting than those of
Section 2 and the discussion (mostly Subsection 3.5) contains some
methodological and comparative remarks.
-30-
3.2
A Poisson Window Model
Consider a unit density Poisson point process in the plane.
Let
N (i,j) denote the number of points occurring in the square S (i,j) =
[(i-1)7im, im)x[(j-l)ii,
jm) and for m
1/a let Z (i,j) denote the value
>
of the Lamj-median problem on the points in
S (i,j).
sionally suppress the (1,1) in this notation, i.e., N
2
For m,n > l/a, let Z - L[nmJ; thus [O,Zv~m)
C[O,i)2
For i=j=l, we will occa- N (l,l), Z
- Z (1,1).
C[O,(+l)m) 2.
2
Zn ,the value of the Lan]-median problem on [O, n) ,is clearly not higher
than the cost attained by the crude partitioning heuristic that assigns
2
the points in [O,vn)
2
separate
to the combined centers of the (exact) solutions of
jm]-median problems, one on each of the squares S (ij)
i,j=l, ...,Z. That is
Z
<
Z
Z (i,j) + 3m(N-Nn
(3.2.1)
where the right most term corresponds to the fact that each point in
[O, (+l)))
[0,,)
2
.
< 3; of some center in
is within a distance of 2/2/
For the same reason,Z
< 3
n
N
n
and thus (substituting 1/a and
m for m and n respectively):
(3.2.2)
Zm < (3/vra)Nm
The basic convergence result here is
LEMMA 3:
(i)
lim
n-o
There is a constant
E(I
Z
nn
> 0 such that
- BI) =
This can be easily extended to L
(L1 - convergence)5
convergence for all q>l.
-31-
(ii)
Z
Prob( Z >
integer n>l/a
n n-
+ £)< co
for all
£
> 0
Remark:
The constant
To make this dependence
is, of course, dependent on a.
explicit when necessary, we will denote it
3(a).
Proof:
For notational simplicity, let V (i,j)
m
W
-
-
(i),
m m
V Z
n
n n
and
(N -N 2 ), (3.2.1) can be restated as
n n
2
2m
1
V <
Z V(i,j) + W
nn
2
m
n
2
ij=1
By the independence and stationarity of the Poisson process, and by
the translation invariance of the value functional, Dk, it follows that.
the V (i,j)'s are independent identically distributed random variables,
m
which according to (3.2.2) are dominated by Poisson random variables and
thus have finite fourth moments.
Following the standard derivation of
the strong law of large numbers for the case of finite fourth moments 6, we
have for all
£
> 0,Prob
I2
is summable over n=1,2,....
respectively,
-
n
4
Vm(ij)-E V i>)=O( _)=O(n
2
Now the mean and variance of W
(n-k2m) = O(n
1/
) and (_
n
)
3/2
for all
) which again is summable over n=1,2,.,.
are,
>0,Prob (W >)=
n
. We conclude, then, that
>0
Z
Prob(V
n>l/a
> E Vm + s)< o
or, using the Borel-Cantelli Lemma, that
6
) which
(n- 2m)=O(n - 3 / 2 ) which
implies, using Tchebychev's inequality, that for all
O(n
2
There is an example in the proof of Lemma 1.
(L3.1)
-32-
lim sup V
By (3.2.2),
m
< E V
< (
Z )
=E(
E V
n-
im
for all m > 1/c
a.s.
m
(m2 + m) < 9 (1-ka).
m
such that EVn -
by Lemma 4,to follow, there is a finite constant
i.e., part (i) of Lemma 3 is established.
E V +,
n
Consequently,
+O
Since this result implies that
we have, recalling (L3.1), part (ii) as well.
Li
< E V
almost surely for
LEMMA 4: Let suPn E(Vn ) < o and let lim sup V
1
all m > 1.- Then V n- converges in L
I<(EIVn
Since IE Vn<EIVn
Proof:
n-+
to some constant S.
1/2
) , the first part of the hypothesis implies
that supnlE VnI< - and thus that lim inf E Vn is a finite constant, say B.
n
inf E V = 0 and by the second
Then by definition lim
n
n
Now let Vn = Vn-S.
n--_ +
part of the hypothesis lim sup V n ' 0 a.s. Let Vn
Then, almost
= max(0, V ).
n
surely
0 < lim sup V+ = lim sup(max(O,Vn))=max(O, lim sup Vn) < 0
n
n
i.e.,
'
+ 0 almost surely and noting that E[(V+)
n
<E(V ) and that the
n
Finally, note that
latter are uniformly bounded in n,we get E Vn + 07.
IV I = 2V
n
- V , and thus
n
lim nsup E(IVnl)
= 2 lir
n
7
For simplicity, let U
2
Prob (T )E(U ITn)<supEU
2
-+
sup E(V+) - lim inf E V = 20-0 = 0
n
n
n
n
- Vand let T
nM T
M.
n
clearly implies li
sup E U <£.
n
S.
p
T
1/2
/
Thus, E(UnlT )<[E((unlTn)]/2 < [M/Prob(Tn)]
and therefore EU < £ + Prob(T )E(U
--
denote the event Un >
<
2
2
n
IT ) < £ + Prob(T )1/2 M1/2, and Prob (T )--O
n
n
-33-
3.3
A Conversion Based on Asymptotic Continuity in Center Density
The following "continuity" property converts the results from the
convenient "fixed ratio" (i.e.,
K = [an]) and Poisson window model
to the case where the ratio just converges to a constant and where the
demand points are the first n points of an infinite sequence of independent, uniformly distributed points in [0,1) 2 .
Recall our notation, in
which X n is this set of points.
LEMMA 5:
Let K(n) be a sequence of integers satisfying
lim K(n)/n =
>0.
Then,
(i)
lim
En-1/2DK(Xn)_-
= 0
n-
(ii)
where
Proof:
Z
Prob(n-1/2DK(Xn) >
n>l/2c
B+ ) <
for all
>0
B(=3(a)) is the same constant as in Lemma 3.
Note that in general,
K -K >n -n >0 implies D
(X
s1
) < D
nn
(X
<2
)
(L5.1)
n
(This follows from the fact that solving the K2 median problem on X 22
1
and then assigning all the n-n 2 points of Xn 2-X
as centers, yields a
n
cost equal to DK (X ), but which by definition cannot be lower than the
n
2
optimal value of the K-median (K>K2+nl-n2) problem on X .)
Now fix
>0; since K/n-a as n-~,
we may for asymptotic considerations
assume that K>(a-e)n and thus that K-[(a-3)nl>2en.
(1-2)n < N(l-)n
(i.e., if 2n > n-N(
Therefore if,
- n
> 0) then we also have K-[(a-3)n>n-N(
which by (L5.1) implies that
N
(L5.2)
)>0
-34-
As in Subsection 3.2, we assume that {Nt}t>l is a family of Poisson random
variables with
ENt=t.
The probability that (L5.3) will be violated is, then, lower than
the probability of (L5.2) being violated, which is
stant
>1.
0(
-n
) for some con-
(The proof of this last fact is rather straightforward, and
is essentially equivalent to the proof of the "claim" in Sublemma 2' in
the proof of Lemma 2).
Now,
n
/DK(Xn) -(c)
= n
[DK(Xn) - D[(a_3£)nX
N(1-)n
(
-3)
1-n
r(a-3K nI
(+ £) 1/2
+ [(l-)1/2 B(i__
(
)
-
)n-3
3( )]
(
)
D
(L5.4)
The probability that the first term on the right hand side of (L5.4) is
positive is
0(0
-n
), which is summable over
n=1,2, ....
part of the expected value of this term is at most
The positive
O(n /2-
n
) (since
when positive, the expression within the brackets is clearly bounded by
n).
For the second term in the right hand side of (L5.4), note that
(l-E)n/
D( _3)n
(X(1-)n)
1
(-3n
n Zn , where n' = (l-C)n
has a distribution identical to that of
and where the center density is
a'
=
1-
Hence, by Lemma 3(i) the expected value of the expression within the brackets
of this term converges to 0, while the probability that it is at least
for any
1
> 0
is summable over n'=1,2,... .
1
Noting, however, that this
summability followed from an O(n'-3/2) (which is equivalent to 0(n-3/2))
-35-
bound on the probability, we deduce summability over n as well.
tends to 0 so does the third
is continuous by Lemma 6 below, as
since
Finally,
term in the right hand side of (L5.4).
We may conclude, then, that
E(n /2 DK(Xn)-3(c))
lim
O0
(L5.5)
n-o
max (O,a)) and
(where a
00
-1/2
n
DK(X ) >
Z Prob(n
n=l
+ £1) <
()
(L5.6)
n
To show that the expected value of the negative part of 1/DK(Xn
) -(()
converges to 0, we repeat the same analysis, this time comparing DK(X)
N
K
As above, we get that the probability that
to D (a+3 ) 1 (X (l+)n).
N
nI (X (l+£)n)
D [(a+3E)
(+3)n]
will be violated is
(e-n) for some
follows step by step.
K
< DK(Xn
)
> 1, and the rest of the analysis
(The only difference being that we do not get an
inverted inequality counterpart to (L5.6) since there is no such counterpart in Lemma 3 itself.
We will get it however later in Lemma 10 below).
8 is obviously, non-increasing in a.
LEMMA 6:
Proof:
We furthermore have
B is convex (and thus also continuous) in a
(a>0)
First, we refine our notation to include a as an index
(a
superscript), e.g.,Za (rather than Z ) is the value of the [n]-median
problem
in
on
the
points
[
For any positive a and a
problem on the points in [0,n)
.
For any positive a1 and 0.29 we have
D
-36-
1/2(a
4n
2
<
n
(1,1) + Z (1,2) + Z2 (2,1) + Z 2 (2,2)
n
n
n
Dividing by 4n and taking expectations we get in the limit using
Lemma 3(i)
1
2
2
)
1
) <
(P(a)+(ac)+ (a2)+(C
which implies the convexity of
3.4
=
) )2
1
((a
+
)1
(a 2) )
DO
3.
More About the Asymptotic Constants
(a)
We next derive further information regarding bounds and asymptotically
tight approximations of the function
: (0,1] + [0,o).
LEMMA 7:
(i)
(ii)
lim
a-0
()
(a )
o(c
=
where
1
lim
-
(iii)
For the free median version,
(iv)
For the restricted median version,
1
J3J3
1
where 81 (a) = 1 erf (Zn(/(2
>
n3
3 4
(a)
(~
a-*l (X 3I1
()
/2 [(1 +
( a)
erf(/Zn(l/))
(ca)
))
2a-1
o (a)
for all
a.
for all a.
Remarks:
- Another lower bound to
and Hochbaum [FH].
(a) may be obtained from the analysis of Fisher
-37-
-
Our notation does not distinguish between the free and restricted versions of the problem, the latter having a slightly higher value of
Parts (i)-(ii) are valid, though, for both versions.
.
Parts (iii)-(iv)
may be true in a non (version) specific form, but proving this may be
rather involved.
-
Note that the cost of restricted version is less than twice the cost of
the free version ([HM2],
Lemma 6) and thus (iii) and (iv) imply looser
bounds for the version that is not specified in the assertion.
-
Part (ii)
(and the monotonicity of
) clearly implies that
is
a < 1 (both versions)
strictly positive for
Proof:
(i)
Note first that following the derivation of (3.2.2) we have
8
1/2
N
-1/2
K
N
DK(X ) < 3NK
.
Therefore, N
DK(X ) is uniformly bounded and the
a.s. convergence in Theorem 1 (iii) implies
L1
convergence and for uniform
distribution on [0,1) 2, we get
lim
N-
N
oN
E(DK(XN)
K
=
(6)
+ 3x4))3
for all sequences {K(N)}N, 1 such that Ko, but K/N-+
Consider now any positive sequence
let
19'
Oa2
...
(i.e., N/K-o)
converging to 0 and
2' ... be the associated asymptotic constants (i.e.,
By virtue of Lemma 5(i), for each
for all
1,
(L7.1)
j
there is a constant
M
j=3(aoj)).
such that
N >_M
1/2 IN-
3
8partitioning [0,1) 2 into
2 E(D
(XN))-
[L.N1j
Sj
<
/j
(L7.2)
K1/2] 2 subsquares and assigning a center in each
subsquare (containing at least one customer) guarantees, since all customers
K1/2 -1
1/2 -1
are within v/7 [/
j
of a center, a cost of no more than V'2 N LK /
2
T NK-1/2 < 3 NK-1/2
-38Now let
N
= 1, N
1
j
= max (Mj, [Nj
1
/ajl),
and
This construction ensures that both
j=2,3,....
infinity while
K./N.
tends to 0.
= [ajNjN
JJ
K
Nj
and
for
increase to
K
Consequently (T,7.1) implies that
1/2
K.
N.1/
NJ
E(DK
(X
) tends to
(6)
N.
K.
'
3
3
Q(6) as well.
(iii)
and,
p
is uniform
K
1/2
(see Corollary (1.2) of [HM1]).
(iv)
DK(P)
approaches
tends to
(6)
as
K
To avoid needless complications, in parts (iv) and (ii),
to center distance.
N -[an]
n
[HM1].
we sacrifice
It is convenient to consider
is the (L1 ) limit of
B
the Poisson window model.
approaches
It also follows from Theorem 2 of
some rigor by omitting unnecessary details.
the
Bj
This result is an immediate consequence of Lemma 1, Lemma 5, and the
fact that if
+oo
therefore, by (L7.2), a.
-
n
Z , the average customer
n
This average is clearly improved if we assume that all
customers that are not centers themselves have their closest
neighbor as a center, and if we further assume that these customers are those
with the smaller "closest neighbor distances"
(CNDs from now on).
With this
assumption, the average distance is a truncated average of the CNDs where
the highest [anJ or (asymptotically speaking) the upper
a
fraction is
eliminated from the summation.
r
Now for an arbitrary point, the probability of its CND being more than
2
is e
(which is the probability of having no occurrences within a
circle of radius
r).
The probability distribution function of the CND,
2
then, is
Now let
F(r) = l-e7
R
2
and the corresponding density is
v(l/)n(l/).
Then
e-
f(r) = 2rer
= a; loosely speaking, an a
of the Poisson points have a CND larger than
R.
.
fraction
The truncated average is,
R
2
then, (asymptotically) equal to the truncated expectation
f r(2rre- r
)dr
0
which after little work gives the asserted bound.
-39(ii)
Classify all possible service sets (i.e., sets of customers sharing
a common center) according to their size, as singletons, doublets, triplets
etc.
Let
fl,f
,f2
3,...
denote the fraction of customers belonging to each
category, then, clearly
fl +f
+f
+f... = 1
(L7.3)
1
fl + 2 f+
It is easy to verify that
1
3f
=
+ ...
c
fl = 2a-1, f2 = 2(1-a), f3 = f4 = 0
is a possible
distribution.
The value of the best solution with this choice is just the sum of the
shortest
(1-a)N
change argument).
first
intercustomer distances (as can be seen by a simple interThis value can be approximated by half the sum of the
2(1-a)N CNDs, the reason being that if
R
= /(l/T,).n(l/(2o-l)T, then
while the probability that a random customer will have a closest neighbor
-7TR 2
within distance R1 is l-e
= 1-(2a-1) = 2(1-c), and the probability that
-7TR 2
-TR2
it will have two or more neighbors within this distance is l-e
- e
rR
= 1-(2a-l)(
rule).
+
n(1/(2a-l)))
which is
o(l-ca) for a-1 (using L'hospital's
In other words, when we list the 2(1-)N customers with smallest CND,
and draw an edge for each closest neighbor relation, we will have close to
(1-ca)N disjoint edges, the total length of which is no greater than the best
value for
f
2a-1, f2 = 2(1-x).
The non disjoint edges can be separated at
negligible cost using remaining singletons.
As seen by the argument used in (iv), the (asymptotic) average customer
to center distance in such a solution is equal to the truncated expectation
12
for
1
r(2
Tre )dr
which after little work yield the expression asserted
1. (The "1/2" reflects the fact that when summing the CNDs each doublet
is counted twice.)
-40-
To complete the proof,it is enough to show that any solutions with
fj # 0 j > 3 are asymptotically inferior to the one just described and
evaluated.
The cost of a j-"multiplet" (whether in the free or the restricted
version of the problem) is at least the diameter of the multiplet and
thus at least as large as the (j-l) th closest neighbor distance (denoted
CND(j-1)) from two of its members.
2
gj l(r) = 2r(rr
)j-le
.
The probability density of CND(j-1) is
As in (iv), we can obtain a lower bound (on
the asymptotic average customer to center distance) by taking truncated
expectations.
is smaller than
in
a,
Let
Rj_ 1
Rj_1
be such that the probability that
is
CND(j-1)
fj/j
(which is, recall, the part of j-multiplets
Rj(r)dr = f/j. A lower
the overall center fraction) i.e.,f
0
j_
l(r)dr
1
bound on the contribution of these multiplets to the (asymptotic)
average is
Rj-lrg
(r)dr.
(for all j > 2), and thus R
Note that as
1
tends to 1, all f
tends to 0 as well.
tend to 0
We may use a first order
Taylor approximation of the integrands gjl(r) and rgjl(r) for small r.
Evaluating the first integral to solve for Rj_ 1 in terms of f
stituting in the second integral yields
Cjf
1+1/(2j-2) (where
a constant) as the associated truncated expectation.
and subCj
is
An overall lower
bound will be obtained by minimizing jE 2 Cjfj 1/(2j2) subject to (L7.3)
Z (l-l/j)f.=l-a. It is not hard to see that
j>2
as (l-a) tends to 0 the minimizing fraction distribution satisfies f2 Z 1-a
or, equivalently, subject to
Z f. = o(l-a). Therefore, asymptotically we cannot improve over
j'_3 J
a solution based on singletons and doublets alone, since as shown
and
above this type of solution asymptotically achieves the lower bound.
a
-41-
3.5
Methodological Remarks
While we have established L
- convergence, and thus also conver-
gence in probability, in Lemmas 3 and 5, we have not yet obtained a.s. convergence or specific rates of convergence of the deviation probabilities.
More precisely, we have obtained bounds on the upper deviation probabilities 9
which are summable over n,but have not obtained any bounds for the lower
.
deviation probabilities
This asymmetry is due to the fact that the re-
sults were based on the subadditive relationship (3.2.1).
If we had a
reversed "superadditive" inequality of that type, then we would have similar
bounds on the lower deviation probabilities.
Both sub and super additive
relationships arise in many geometric graph optimization problems, including
the Travelling Salesman problem, Steiner's Tree
Spanning Tree problem.
roblem, and the Minimum
And indeed they were used by Beardwood Halton and
Hammersley [BHE] to obtain a.s. convergence and by Karp [Ka]
relative error in partitioning algorithms.
to bound the
Steele [S2] refined the BHH
convergence result by establishing (via completely different methods) the
summability of the (absolute) deviation probabilities (complete convergence)l
9Prob(n- /2DK(Xn) >
+£)
1(Prob(n-1/2D(Xn) < 8-)
11
The convergence rate of the deviation probabilities established in [S2]
is better than what is needed for summability. Such convergence rate can
be also derived given two sided inequalities by the standard "even order moment/Tchebyclhev inquality" technique using fourth order moments of sums of
independent-variables, as used in Lemma 3 above to bound the upper deviation
probabilities.
-42-
Unfortunately, it seems hard to establish a superadditivity inequality
here, noting that unlike these geometric graph optimization problems, here
the overall solution in disjoint regions of the plane interacts not only
through the arcs crossing their boundaries, but also through the allocation
of the centers, and this interaction cannot be bounded by simple local
(along the boundaries) perturbation arguments.
We will overcome this diffi-
culty by the standard trick of optimization theory in such situations.
Namely, we will add Lagrange multipliers to remove the constraint on the
overall number of centers.
We will use, then, the X-cost median problem
which is concerned with the evaluation of the functional
FL(X)
min (DK(X) +
K
XK)
(3.5.1)
For this functional, we will establish both super and sub additive
relationships and then derive summable bounds for the absolute deviation
probabilities.
We will then use the convexity of
correspondence between the limits of
DK(Xn)
and
(ao)to establish the
FX(X n ) (this is-
essentially an asymptotic "strong duality" relationship between them.)
To get the sought after summability of the lower deviation probabilities
in our problem.
Finally, before proceeding with the proof, we want to point out that
to prove a.s. convergence for either the "Poisson window" or the "partial
uniform sequence" models one can still manage using subadditivity alone
using the beautiful subadditive ergodic theory of Kingman [Ki].
And in-
deed, Steele [S1], Hochbaum and Steele [HS] and the present author (in
independent unpublished work), have done so.
These proofs did not
-43-
establish summability of the deviation probabilities for the full sequence,
but rather for a subsequence.
The resulting a.s. convergence is then ex-
tended, using monotonicity or continuity arguments, to the rest of the
sequence.
3.6
The
-Cost Median Problem
Consider the X-cost median problem as defined in (3.5.1).
it first in the Poisson window model.
We analyze
Using a definition similar to
that we introduced in Section 3.2, define
YX
n
Z
as the value of the X-cost
median problem on the occurrences of a unit density Poisson process in
the square [0, v')
Observe that this definition is equivalent to
.
Y
n
= min{Z
n
+
(cn)} .
(3.6.1)
As in the proof of Lemma 6, we have refined the notation of
the index a since the latter is a variable.
(3.6.1) for any
X > 0, an
a>l may be possible when
to include
Note that in the solution to
is always an integer.
N ,
Z
Note also that values of
the number of points in [O,v)
, exceeds n,
its expectation (and X is sufficiently small).
Adapting the "Z (i,j)"
m
denote the value of the
translates of [0,
notation of Section 3.2, we use
Y (ij)
m
to
X-cost median problem in the corresponding
/2) . Recall also that
Ln7- ij.
Like the development
of (3.2.1), we get the following subadditive relationship.
Yn
n
Z
Ym(i,j)
+ X(Nn-Ng2
n 'mm )
m
(3.6.1)
Here, the right most term is obtained by assigning a center to each point
in [O,V)
- [0,Qm)
.
For a similar reason, Y
n
'N
n
.XN
-44-
Unlike the previous case, we now have a suDeradditive relationship
as well
Y (i,j) - 4nm
YA > E
m
n -. .
1,3=1
1 /4
-
(3.6.2)
N m
n
Verification of (3.6.2):
It is enough to verify that the terms subtracted from the right hand
required if we
side represent a bound on the additional cost (beyond Yn)
n
want to prohibit customers from crossing the boundaries of the S (i,j)
squares.
(This is a "partitioning cost").
To establish this result, add
centers, tentatively on the partition boundaries, spaced evenly at intervals
of length m
/4.
We will consider them as double centers, one for each side
of the boundary, and pay according X for each.
There are 2
partitioning
lines each of length /n, thus the total number of such centers is
2-2Qrn/m-
/
1
4nm -
/
.
The additional cost due to centers, then, is
the first term subtracted on the right hand side of (3.6.2).
Now in the
free version of the problem customers that crossed boundaries can switch
1 -1/4
to the closest of these centers with an added cost of less than 1 m
2
(which can be clearly obtained if at the point of crossing the boundary
the customer rather than continuing to the center on the other side turns
and goes to the closest of the centers that we installed on the boundary).
Now for the restricted version keep in mind the same reassignment of
crossover customers, but move each of the boundary centers from the
boundary to that of the customers (re) assigned to it which is closest
to the boundary.
This customer (as are the rest of the customers as-
asigned to this boundary center) is within
2
-m
from the boundary
center in a direction parallel to the boundary and thus within m/4 in that
direction from any of its other "companion" customers.
In the direction
-45-
orthogonal to the boundary, however, it is even closer to the other
customers than was the boundary center.
any crossover customer is at most m
more than N
-1/.4
.
Therefore, the cost of switching-for
Noting that there are certainly no
such customers, we get the second subtraction on the right
hand side of (3.6.2).
With (3.6.1) and (3.6.2) in place,we are ready to state.
LEMMA 8:
(i)
There is a constant y > 0 (dependent on X) such that,
lim
El 1 YX
Y- = 0
n-woo
co1
Z Prob(
n=l
(ii)
n
X
Y
n
y- > £)
<
X
for all
>
0
Proof:
(i)
(ii)
The summability of the upper deviation probabilities,
Prob(
Y
-
n
follows exactly as (i) of Lemma 3.
-
n
>
), follow also exactly (ii) of Lemma 3.
summability of the lower deviation probabilities, Prob (
-
n
For the
Y
n
- y < -
),
we also proceed similarly, the only difference being that the remainder
term (the subtractions in (3.6.2) divided by n) converges
O(m
/4
this time to
) rather than to 0 as the analogous term (W ) did in the proof of
Lemma 3.
Letting m grow indefinitely (as we do anyhow) nullifies this
remainder.
FI
Remark:
Note that by continuity arguments, one can, by analogy to the development of Lemma 5, convert this result to the "uniform independent sequence"
model.
12
Note also that the superadditivity enables us to extend even to
In the appropriate sense of summability of deviation probabilities.
-46-
the nonuniform case, where if
f
is the density of the absolutely
continuous part, we obtain, similarly to results in [BHH] and [S1], con-
vergence to
3.7
f
f1/2 dl rather than to y.
The Last Nail
To transform the convergence result in Lemma 8 (ii) from the
-cost
median problem back into a convergence result for the a-density median
problem, we need to show that for every given a we have a X such that
the asymptotic center density associated with the
will converge to a while
n
Y
n
-cost median problem
- Xa will converge to
. Using the current
vocabulary of constrained optimization theory, this result is classified
as strong durality.
side of an
(That is, for every
iven value of the right hand
nequality/equality constraint in the problem, there exists a
value for a Lagrange multiplier so that the solution of the Lagran-ean
problem coincides with that of the original problem for the secified
value of the right hand side).
The standard proof of such property when it exists is by showing
that the optimal value is a convex function of the right hand side of the
constraint.
Here, indeed, our proof is based on the convexity of
(the asymptotic minimal value) in a (the normalized number of centers
allowed) which was established in Lemma 6.
To make use of the relations
between the asymptotic constants,we first have to insure the commutativity
between the limit operator and the minimization operator.
LEMMA 9:
Proof:
y(X) = min(B(a) +
a)
By definition (3.6.1) we have, dividing by n,
1 YX <
Za +
n n - n
n
a
for all
a
-47Taking expectations and using Lemmas 3(i) and 5(i), we get in the limit
(as n tends to infinity) y(X) < B(a) + Xa for all a and thus
y(X)
min(3(a) + Xa)
.
To get the converse is a bit more involved.
for i = 0,1,2,...,Z.
n
(L9.1)
For
Then for all n,
Y
min ( Z
nn
n
>
min ((ai)
i
+
)
1
--
i
+ Xai)-z
1
Zn
-B
The first inequality follows from the fact that if a
minimizing
nn
Z
+ h,
then
n
j1
1 acj
z
+ XI =
n n
j-1
1
n
cxj.
Z
n
< a
a
(i)
*
-
(L9.2)
is the value of a
for some j 13 and thus by the
*
non-increasing monotonicity of Z
>
1, let a. = i/k
>
n
X
+ Xj
in
Z
n
>
n
, implying-
n
Za
n
+
a
Taking the expectation of both sides in (L9.2) and letting n tend to
infinity, we get in the limit (by Lemmas 8(i) and 3(i) and by virtue of
the finite number of c.i's)
y()
and letting
>
+ Xa )-X
i
i
Q
min ((a.)
i
>
min ((a)
-
+ Xa) -
-*owe get the converse of (L9.1), completing the proof.
At last, we get to
00
LEMMA 10:
Proof:
Z Prob (
n=l
Z
n(a)
- £) <
<
for all
By (3.6.1) we have for all X
1
Z
n
>
1 Y
n n
Xa
13We exclude the very marginal possibility of
We exclude the very marginal possibility of
everyw.There
cx> 1.
> 0
o
-48-
and since by Lemma 8,Z Prob (1 Y < y(X) - ) <
n n
n
Z Prob ( Z < y(X) - X - )
n n
n
<
for all
>O, we have
for all
>O .
(L10.1)
Now by Lemma 9,
y(X) = min(3(9) +
e)
14
of 6(a) at a yields (since
Choosing X as any subgradient
min(B(e) + Xt)
e
is convex)
= 6(a) + Xa
i.e.,
y(X) - Xa
= S(a)
D
which when substituted in (LlO.1) completes the proof.
To complete the proof of part (i) of Theorem 2, note that the proof of
Lemma 5 will convert this last result into the uniform independent sequence
model, used in the assertion of the Theorem.
For the associated Poisson window model, part (ii) of the Theorem is
a rather straightforward consequence of Lemmas 3 and 10.
The result can
then be extended to the "uniform independent sequence" case by again
using the arguments of (the "continuity") Lemma 5.
(a) is most probably differentiable but we have not proved it.
-494.
Conclusion
We have found that in randomly generated large scale K-median problems
we have two types of ergodic (or "large numbers") convergence processes.
These processes referred to as asymptotic continuity and asymptotic
separability enhance the analysis of the problems in two ways.
First they
permit us to predict aggregate features of the solution,- like the optimal
value and the (aggregate) distribution of centers.
Second, they enhance the
computational tractibility of such problems by suppressing the significance
of detailed information about customer locations (in the case of asymptotic
continuity) and by suppressing the significance of interactions between elements of the solution in spatially separated regions (in the ease of asympotic
separability).
So one is tempted to conclude here (as in other instances
of probabilistic analysis) that the benefits of an exact combinatorial
optimization model for very large problems are limited, as the large
numbers act to suppress the problem's combinatorial elements.
___IIIIIII__IIII__
-LI-·------
I·
-50-
5.
References:
[BHH]
Beardwood, J., Halton, J.H. and J.M. Hammersly (1959). "The
Shortest Path through Many Points" Proc. Cambridge Philos. Soc.
55, 299-327.
[Ch]
Chung, K.L. (1974).
Press, New York.
[Du]
Dudley, R.M. (1977). Lecture Notes for Course 18.175, Department
of Mathematics, M.I.T., Cambridge, MA.
[FH]
Fisher, M.L. and D.S. Hochbaum. (1980). "Probabilistic Analysis
of the K-median Problem", Math. of Oper. Res. 5,1 27-34.
[GJ]
Garey, M.R. and D.S. Johnson. (1979).
bility, W.H. Freeman, San Francisco.
[Ha]
Haimovich, M. (1985). "Large Scale Geometric Location Problems I:
Working paper OR-130-85,
Deterministic Limit Properties",
Operations Research Center, M.I.T., January.
[HM1]
"Extremum Properties of
and T. L. Magnanti (1985).
Hexagonal Partitioning and the Uniform Distribution in Euclidean
Location". Working Paper OR-132-85, Operations Research Center,
M.I.T., January.
[HM2]
and T. L. Magnanti (1985). "Location Games and Bounds
for Median Problems". Working Paper OR-133-85, Operations Research
Center, M.I.T., January.
A Course in Probability Theory, Academic
Computers and Intracta-
[HS]
Hochbaum, D. and J.M. Steele. (1982). "Steinhaus' Geometric
Location Problem for Random Samples in the Plane", Adv. Appl.
Prob. 14, 56-67.
[HR]
Hsu, P.L. and H. Robbins. (1947). "Complete Convergence and the
Law of Large Numbers", Proc. Nat. Acad. Sci. U.S.A. 33, 25-31.
[Ka]
Karp, R.M. (1977). "Probabilistic Analysis of Partitioning
Algorithms for the Travelling Salesman Problem in the Plane",
Math. of Oper. Res. 2,209-224.
[Ki]
Kingman, J.F.C. (1976). "Subadditive Processes", in Lecture Notes
in Mathematics 539, Springer-Verlag, New York.
[Pa]
Papadimitriou, C.H. (1980). "Worst Case and Probabilistic
Analysis of Geometric Location Problem", SIAI J. Comput.,
10, 542-557.
[S1]
Steele, J.M. (1981). "Subadditive Euclidean Functionals and Nonlinear Growth in Geometric Probability", Ann. Probability, 9,
365-376.
Ily·lllll---
LIIIII1- ·-·--·11_-_--
-51-
[S2]
(1979). "Complete Convergence of Short Paths and
Karp's Algorithm for the TSP", Mathematics of Oerations Research 6,
374-378.
[We]
Weide, B.W. (1978). Statistical Methods in Algorithm Design and
Analysis. Thesis, Department of Computer Science, Carnegie-Mellon
University.
[Z]
Zemel, E.(1985).
"Probabilistic Analysis of Geometric Location Problems",
SIAM J. on Discrete and Algebraic Methods 6, 189-200.
Download