Document 11045827

advertisement
(liBSASZES)
«A
K ^'*.—-^ J
cv 1%^
DEV.'EY
HD28
.M414
no.
Heavy
Traffic
Analysis
Stochastic
Inventory-Routing
Martin
I.
the
of
Dynamic
Problem
Reiman, Rodrigo Rubio, and
Lawrence M. Wein
#3881-96-MSA
February,
1996
^ASs,
r^t'^-l^S
APR
X 1
^996
Heavy
Dynamic Stochastic
Inventory-Routing Problem
Traffic Analysis of the
Martin
AT&T
I.
Reiman
Bell Laboratories
Murray
NJ 07974
Hill,
Rodrigo Ruhio
Operations Research Center, M.I.T.
Cambridge,
MA
02139
Lawrence M. Wein
Sloan School of Management, M.I.T.
Cambridge,
MA
02139
ABSTRACT
We
analyze three queueing control problems that model a dynamic stochastic
tribution system, where a single capacitated vehicle serves a finite
make-to-stock fashion. The objective
is
to
in
of retailers in a
each of these vehicle routing and inventory problems
minimize the long run average inventory (holding and backordering) and transporta-
tion cost.
In all three problems, the controller dynamically specifies
the warehouse should idle or embark with a
travel along a prespecified
how many
(TSP) tour
units to deliver to each retailer.
The
By assuming
whether a vehicle at
problem, the vehicle must
and the controller dynamically decides
In the second problem, the vehicle delivers
and the controller decides which
third problem allows the additional
shipping options.
In the first
full load.
of all retailers,
entire load to one retailer (direct shipping)
next.
number
dis-
dynamic choice between the
an
retailer to visit
TSP and
direct
that the system operates under heavy traffic conditions,
we
approximate these queueing control problems by diffusion control problems, which are explicitly solved in
case.
the fixed route problems, and numerically solved in the dynamic routing
Simulation experiments confirm that the heavy
rate over a broad range of problem parameters.
about the behavior of
this
Our
traffic
approximations are quite accu-
results lead to
complex system.
February 1996
some new observations
Introduction
1
A
prototypical example of the inventory-routing problem (IRP)
large oil
company
as
it
distributes gasoline to
its
is
the challenge faced by a
various gas stations: several warehouses,
which hold inventory of a particular item (gasoline), serve a
set of retailers (stations) in a
make-to-stock fashion; arriving customers (automobiles) consume the product at these
and a
sites,
fleet of finite
capacity vehicles (tanker trucks)
from the warehouse to the various
The management
are many-fold
warehouses and
and operation of such a system
Traditionally, a hierarchical decomposition of the
used to allow for a solvable model at each of the levels
strategic level, the
used to transport the product
retailers.
decisions involved in the design
and complex.
is
managers of
retail
Simchi-Levi 1992).
is
At the
system must determine the location and number of
this
retailers, as well as
(e.g.,
problem
the assignment of retailers to warehouses. At a tactical
they must decide on the number of vehicles to operate, and possibly on the assignment
level,
of vehicles to service districts.
At the operational
send a particular vehicle out or
let it idle,
which of the
how much
retailers should each vehicle visit,
deliver to each of the retailers on
its
level
the decisions include: whether to
of the capacity of the vehicle to use,
and how much of
load should a vehicle
its
route.
At the tactical and operational
levels,
the essence of the
IRP
is
the tradeoff between
inventory costs and transportation costs: in order to reduce inventory levels at the retail
sites
without affecting the service
level,
more frequent replenishment
thereby increasing the transportation costs. In
to a lesser extent, vehicle travel times)
such cases, a stochastic model
this
is
is
many
applications, customer
field of
operations research
is
sometimes
in
criticized
Ackoff 1987).
(e.g.,
terexample to this perception: while heuristics
some spectacularly
In
required to accurately capture the inventory costs, and in
have lagged behind theoretical progress
led to
demand (and
subject to considerable stochastic variation.
paper we focus on the operational aspects of the IRP
The
deliveries are required,
a dynamic stochastic setting.
because
The IRP
real- world applications
is
an important coun-
for this notoriously difficult
problem have
successful industrial applications at both the operational (e.g., the
1
Edelman Prize winning work
of Bell et
1983, Golden et
al.
1988) levels, a concomitant mathematical theory for the
IRP
al.
in
1984) and tactical (Larson
a dynamic and stochastic en-
vironment has not been forthcoming. Federgruen and Zipkin (1984) analyze a single-period
IRP with stochastic demand, and Dror and
Ball (1987) develop a heuristic technique to
reduce the long run average problem to a single period problem. Recent studies that con-
IRP
sider the operational aspects of the stochcistic
include Trudeau and Dror (1992),
who
develop heuristics for the case of an external supplier, where retailer inventories are only
observable at delivery times; Minkoff (1993),
Markov
decision
itinerary
model that dispatches
who
constructs a decomposition heuristic for a
vehicles on a prespecified set of itineraries,
where each
characterized by an inventory allocation to a subset of customers; and
is
Schwarz and Ward (1995), who develop myopic
static
and dynamic strategies
Kumar,
for allocating
the contents of a vehicle to the various retailers on a predetermined tour. Chan, Federgruen
and Simchi-Levi's (1994) probabilistic analysis of random instances of the deterministic IRP
is
useful for addressing tactical
aspects of the
IRP with
and strategic
stochastic
issues,
but has no bearing on the operational
demand.
Our system model has one warehouse and one capacitated
assume that the higher
level decisions
An ample amount
available at the warehouse, and the cost of holding this inventory
demand and
and the objective
may
is
differ
by
is
to
retailer)
vehicle travel times are
and operating the
send the vehicle out with a
less
than one, then an
Two
routing,
when a
We
is
not included in the model.
random, unsatisfied demand
if
when the
full
load or
One
vehicle.
vehicle
let
is
is
backordered,
of the crucial decisions in our
problem
at the warehouse, the controller can either
the vehicle
sit idle;
because we employ a long run
the vehicle does not idle and the "traffic intensity" of the system
infinite
amount
is
of retailer inventory will build up over the long run.
types of IRP's are analyzed: the
dynamic routing.
is
of inventory
minimize costs due to holding and backordering inventory (cost rates
the vehicle idling policy:
average cost criterion,
we effectively
have been made to assign a single warehouse and a
single vehicle to serve all retailers in a particular region.
Retailer
vehicle; hence,
first
assumes fixed routing and the second allows
consider two variants of the fixed routing IRP: in the
vehicle leaves the warehouse
it
IRP with TSP
uses a pre-optimized tour of the
m
retailers,
which we
TSP
refer to as the
idling policy, the controller
decision
is
(travelling salesman
must decide how many units
based on the current inventory
The second
of units in the vehicle.
problem) tour. In addition to the vehicle
levels at all retailers
variant
time the vehicle leaves the warehouse
it
is
the
IRP with
TSP
and
this
and on the remaining number
direct shipping] in this case, each
delivers all of its contents to a single retailer,
the controller dynamically specifies which retailer to
routing, the controller decides, based
to deliver to each retailer,
In the
visit next.
upon the current inventory
and
IRP with dynamic
levels,
whether to use a
tour or direct shipping.
By
TSP
only allowing
we avoid an
tours or direct shipping,
assault on the combi-
embedded routing problem, and model these problems
natorial aspects of the
as queueing
control problems. Since these control problems appear to be analytically intractable, heavy
traffic analysis is
employed
in order to
nontrivial limiting control problem,
of the time to
is
make
further progress.
we assume that the
To obtain an
server (vehicle)
meet average demand, the vehicle capacity
is
conditions are stated
theorems
in
sition in the
more
precisely later in the paper.
is
and
must be busy most
large, the tour
long and nearly deterministic, and the vehicle operating cost
interesting
completion time
large; these
heavy
Guided by the heavy
traffic
traffic limit
Coffman, Puhalskii and Reiman (1995a,b), we uncover a time scale decompo-
heavy
traffic limit;
however, no weak convergence proofs are provided because
they would be very demanding and would distract us from our primary objectives:
gain insights into the nature of the optimal policies for the IRP, and
(ii)
(i)
to
to develop effective
pohcies for the operation- of these systems. Under the traditional heavy traffic normalization,
where time and inventory are compressed by a factor
going to
infinity),
of
n and
y/n, respectively (and
the (embedded or averaged) total retailer inventory process
proximated by a one-dimensional reflected Brownian motion on the interval
the control parameter
idling policy.
The
w
drift
— 00, tf], where
represents an aggregate base stock level that dictates the vehicle
is
TSP
being used, and in the dynamic routing problem the controller
dynamically switches between two Brownian motions (one
its
is
well ap-
and variance of the Brownian motion depend on whether the
or direct shipping policy
ping), each possessing
(
is
n
own
drift,
for
TSP and
variance and cost structure.
If
one for direct ship-
we take the heavy
traffic
normalization and slow
time
fcister
scale.
A
down time by a
factor of ^/n, then a fluid limit
is
obtained on this
deterministic analysis at this time scale allows us to optimize over the
other operational decisions.
Our
results are quite explicit; the derived controls for the fixed routing
given in closed form or
in
that
of the optimal policy for the diffusion control
IRP with dynamic
routing,
we analyze a
cleiss
problem
of triple threshold policies
conjectured to contain the optimal policy, and numerically compute the optimal
is
policy to the diffusion control problem.
the accuracy of the heavy
A
traffic analysis
computational study
and allows us
importance of the various operational decisions
dynamic
allocation,
In §2
and
§3,
The performance
routing
is
TSP
vs. direct
of these
analyzed in
§5.
two routing schemes
is
dynamic
compared
Our computational study
summary
carried out that confirms
the vehicle idling policy, static vs.
(e.g.,
shipping, fixed vs.
is
to obtain insights into the relative
we analyze the IRP with TSP routing and
concluding remarks, including a
routing).
direct shipping, respectively.
in §4
and the IRP with dynamic
§7 contains
described in §6.
is
some
of our key findings.
The IRP with TSP Routing
2
2.1
Problem Formulation
Consider a system where a single vehicle with capacity
product to
m
at the central
geographically dispersed retailers.
warehouse at no
a make-to-stock fashion, and
When
the vehicle
is
cost.
demand
full
is
inlinitc
used to distribute a standard
supply of the product
is
is
kept
retailer inventories in
that cannot be served immediately
load and then visits
retailers are visited could
V
Customers arc served from the
before returning empty. Alternatively, the vehicle
which
An
operating the following policy
(indexed as station 0) with a
in
either
terms of parameters that are solutions to certain equations. Because
we cannot characterize the form
associated with the
IRPs are
is
backordered.
used: the vehicle leaves the warehouse
all
may
the retailers in a predefined sequence
idle at the depot.
be arbitrary, we assume that
it
is
Though the order
the solution to the
TSP
implied TSP, and refer henceforth to this service scheme as the
we assume
generality,
in the
=
i
that retailers are indexed from
through
1
TSP
tour.
Two
sources of variability are considered: customer
l,...,m, customer demand at retailer
process {Di{t)^t
>
0} with rate
Without
m according to their position
demand and
denoted by D{t)
= ^,
= ^,
and A
Di{t),
coefficient of variation c^- (variance of the
demand
the total
A, is
of this paper the index runs over the set of retailers {1,2,...,
Our
otherwise.)
cesses; see §6 of
i
and j
is
rate. (In all
m}, unless
given by
samples of the random variable
iid
coefficient of variation
demand streams and
The sequence
(1984) for details.
cf^
{i,j
run from
of each other.
to m).
[0, i]
summations
explicitly indicated
of travel times
between
which has mean
Tij,
in
compound renewal
results easily generalize to cases with correlated
Reiman
For
travel times.
interdemand time divided by the square of the mean). The cumulative total demand
is
loss of
occurs according to an independent renewal
i
and squared
A,-
policy.
Oij
pro-
facilities
and squared
These travel times are independent of the
Keeping with the convention
in the literature,
we assume
that pickup and delivery of units occur instantaneously; in practice, load/unload times tend
to
be dwarfed by the travel times. (Although non-zero load/unload times can be incorpo-
rated in a straightforward manner, the analysis becomes
cloud the basic issues). Hence, the
the
TSP
tour are given by 0t
spectively,
=
E.To'
where the subscript "T"
coefficient of variation of the tour
denote the counting process for
continuously active in
is
^.,.+i
is
+
^mo and
mnemonic
for
idle; (ii)
The busy/idle
while the vehicle
control
be interrupted, and so the sequence
ini {t
I
TSP. For
later use,
=
we
s^/^j, and
t
re-
define the squared
let
{^^(i),^
>
assuming the vehicle
two operating control decisions remain:
is
0}
is
STiBxit))
>
k}.
The
r^.,
is
busy,
how much
(i)
whether
of the load to leave
expressed in terms of the cumulative process Bj{t),
which represents the amount of time the vehicle
=
would
4 = ZfJ^' Ol^+Aj+i + ^moC^o,
tour completions up to time
fixed, only
the vehicle should be busy or
Tfc
inclusion
variance of the total time required to complete
completion time as Cj
TSP
its
[0,t].
Because the route
at each retailer.
mean and
more tedious and
k
=
is
1,2,
..
busy
.
in [0,<].
We
do not allow tours to
of tour completion epochs
delivery allocations are
is
given by
modeled by the m-dimensional
amount
control process L,{t), which represents the cumulative
time
nominal delivery
We
let
developments,
In anticipation of future
t.
=
Vi
size for retailer
A,V/A
for all
V among
allocation process
defined by
and a dynamic allocation process
V^,
=
L,{t)
-
V,ST{BT{t)) for
>
t
amount dehvered during the current
deliveries to retailer
up
i
we need only
to time
cycle for retailer
ef (0)
=
{^^)
>
^i
ej{t)
>
ef(rjt_i)
=
eriTk)
"-"
where the superscripts
The number
XI,
Qi(0-
at time
If
t
=
(1)
size over past tours,
Because the tour completion
(.J{t)
to determine the total
we assume
J2i ^Ji^) represents
the total amount
that the vehicle leaves the warehouse
(^~)
i,
only
for
(2)
if
retailer
<G
is
visited at time
is
andall
(r;t_i,rfc)
is
(3)
/,
i,
(4)
and
(5)
0,
(6)
and "+" denote the times just before and after an epoch.
if
this quantity
denoted by Qi{t), and the total inventory
we assume
that Q,{0)
=
(J
(wliich
is
without
is
negative) at
at the retailers
is
Q{t)
loss of generality, since a long
=
run
being used) then the current inventory Q,{t) equals the cumulative
minus the cumulative demand, which by
Q,it)
i
of units in inventory (or backordered
average cost criterion
deliveries
for all
= V
iT{T^)
i
load
load and returns empty, the dynamic load allocation process must satisfy
^i
retailer
The
Notice that deviations from the nominal allocation
t.
delivered during the current cycle. Because
full
i.
specify the value of
cancel out across the retailers and the process er(i)
with a
tf{t).
0,
which represents the cumulative deviations from the nominal delivery
history can be observed,
up to
us express this control in terms of a
the retailers according to their relative demands.
eJit)
plus the
i
so that the nominal delivery size corresponds to allocating
i,
the vehicle capacity
is
denoted by
i,
let
delivered to retailer
= V,STiBT{t))-D,{t) +
eJ{t)
(1)
for
given by
is
/
=
l,...,m,
t
>
0.
(7)
Define the cumulative vehicle idle time process I{t) by
I{t)
=t-
so that the control policy Bj{t),tJ{t)
BT^
Our
must
>
t
0,
satisfy
are nonanticipating with respect to Q,
e,
in
(9)
Bt
is
nondecreasing and continuous with -Sj(O)
/
is
nondecreasing with 7(0) ==0.
The
=
(10)
0,
(11)
cost,
travel cost rate per unit time, which includes vehicle depreciation, fuel
is r.
Note that these costs can be combined because we are ignoring the
load/unload times (only the driver, but not the vehicle,
ing).
(8)
objective function includes transportation costs and inventory holding and back-
ordering costs.
and driver
Brit) for
is
busy while loading and unload-
Inventory costs are assumed to be piecewise-linear, with the holding cost rate (per unit
inventory per unit time) at retailer
i
denoted by
and the backorder cost rate by
h^
Because travel costs are incurred whenever the vehicle
6,-.
busy, the travel cost rate r can be
is
equivalently treated as a reward for exerting idleness. Hence the problem reduces to finding
a control poHcy fB7'(i), ef (i)j to minimize
limsup
—E
T— CO T
subject to (2)
-
(11),
The dynamic
[
where the
Y.ih^{Q^{t)]''
"-|-"
"-"
h,{Q,[t)]-)dt
for the
-
rI{T)
(12)
denote the positive and negative parts.
stochastic IRP, as formulated in (2)
Even under Markovian assumptions
is
and
-f
underlying
-
(12), does not
random
seem
to be tractable.
processes, the control space
enormous and the state space has m-\-2 dimensions: the inventory/backorder
retailer
and the location and
the problem,
2.2
We
we analyze
Heavy
it
Traffic
begin our heavy
traffic
total contents of the vehicle.
when
To gain
level at
each
further understanding of
the system operates in the heavy traffic regime.
Normalizations
development by centering the service completion and demand
processes; define the centered processes «Sr(i)
=
Sxit)
—
Oj^t and T)[t)
=
D[t)
—
Xt.
It is
convenient to define the process
"
'^^^^
we
refer to this quantity as the
(^ " ^J
V^5T(5r(/))
-
netput process, although
it
'
to the netput processes constructed in the
networks
retailers
(e.g.,
Peterson 1991).
Summing
and substituting the relevant
"^
heavy
V(t)-
(13)
does not correspond precisely
traffic analysis of
conventional queueing
the inventory evolution equations (7) over
definitions yields
Q{t}=x{t)-^m + eT{t).
The heavy
(14)
may be found
approximation for the IRP
traffic
as the
of a sequence of systems indexed by the heavy traffic parameter n.
will
are non-traditional,
we index
off
This indexing
(as
will
quantities with
is
n
(in
Even though no weak
an appropriate place) to make the scalings
be confined to this subsection;
the index, with the understanding that
associated value of n.
Hmit (as n -^ oo)
be undertaken here, because some of the scalings that we introduce
convergence proofs
clear.
all
we
analysis are independent of n.
according to standard heavy
of as a large integer (e.g., 100) but
recommendations that emerge from our heavy
The parameter n
traffic
paper we leave
are considering a single system that has an
The parameter n can be thought
typically the case) the policy
for the rest of the
is
traffic
used to normalize the various processes
Bj
undergoes
9^,
y^
(15)
conventions (notice that only the process
a "fluid" scaling):
W}-Ht)
=
9i:^
W^-\t)
for alii,
'
^/^
\/n
processes ly'"' and
V'*"'
Y,w!'^\t) =
y
5n<) =
'
'
(16)
yjn
yjn
mi) = V^,
The
™
=
and
4"'(')=^£^.
(17)
represent the normalized inventory and idleness, respectively; to
reduce the amount of notation, the normalized versions of the remaining processes contain
8
a "hat".
Employing these
and
scalings in (13)
(14) yields expressions for the uormalized
netput process
'T
and the normalized inventory process
y(n)
=
H/(")(0
To obtain a
parameters
- -^Y(-\t) +
x^-\t)
problem
nontrivial control
in
F^"'
(19)
we normalize the system
traffic,
The demand and inventory
in a particular fashion.
and the other parameters are normalized
scaled,
heavy
e?)(0.
cost parameters are not
as follows:
= ^^,
(20)
n{n)
op =
%.
(21)
/T/(n)
(n)
\
^\^J^B
/'r
^*"'
(n)
>
(22)
0'
T
4(n)
=
^f^c\[n),
(23)
r'"'
=
—
(24)
n
We
assume that
all
and positive hmits
the quantities on the
as
specify, in a unified
n
^ oo.
manner
traffic
conditions and they
via the heavy traffic parameter n, the relative magnitudes of
These conditions are more extensive than those enforced
queueing systems, and therefore warrant some discussion. Since the natural
definition of the traffic intensity
traffic condition,
Now we
side of definitions (20)-(24) converge to finite
Equations (20)-(24) are the iieavy
the various system parameters.
in traditional
left
is
pj
=
X9j-/V, condition (22)
which requires that pj be close
to,
but
less
is
the "traditional" heavy
than, unity.
turn to conditions (20)-(21). Because the state space
is
compressed by a factor
of y/n in the heavy traffic normalization, the vehicle capacity in terms of scaled inventory
units
is
V*"'/\/"'-
Hence,
would reduce to a variant
if
V*"'
was 0(1)
it
would vanish
of the multiclass make-to-stock
Although such a model would be tractable, a
9
limit that
in the limit,
queue analyzed
and our system
in
Wein
(1992).
employs infinitesimal vehicle
sizes
to capture the essence of the behavior of the original system.
fails
condition (20), so that
in
Therefore,
we enforce
0[yjn) and the bulkiness of the retailer deliveries
V'*"' is
the limit. However, since the
demand
rate A^"'
lengths according to (21) to ensure that the ratio
is
unsealed,
we need
K^"V^r converges
retained
is
to also scale the tour
to a finite
and positive
limit.
Turning to
note that since the vehicle capacity
(2.3),
is
0(\/n),
if
the Cj(n)
is
not
scaled then a standard calculation shows that the variance term for the normalized netput
process x^"'
is
0{n) and hence approaches
c^{n)6j{n), by (21
)
and (23) we obtain
enforcing condition (23),
in contrast, this
way
we assume
Sj'{n)
is
to
on a two-dimensional
y/n.
Finally,
relative
travel times
=
=
Thus, by
9j'C^{n).
is
0{y/n);
were simply multiplied by y/n. One
travel time of the tour
is
the
and
we need
is
map
sum
with a fine grid, in such a
likely
of y/n
iid finite
way
that the tour
is
problematic
be independent and the necessary data would
not pursued here; see Rubio (1995) for further details.
to normalize the cost parameters to account for distortions in the
magnitudes of the transportation and inventory costs that
traffic scaling.
n times
^/nsj[n), where Sj-{n)
grid points. However, this modeling artifice
(because adjacent travel times would not
collect)
if
assume that the
passes through approximately
be tedious to
Since 5j(n)
traffic limit.
This construction could arise by superimposing the warehouse and
variance travel times.
retailer locations
=
heavy
that the variance of the tour completion time
quantity would be 0{n}
to achieve (23)
infinity in the
The appropriate
scaling
is
result
from the heavy
to allow the travel cost rate r^"' to be approximately
larger than the inventory cost rates, as in condition (24); see
Rubio
for a detailed
explanation.
In
summary, the heavy
traffic
conditions assume that the vehicle must be busy the
great majority of the time to meet average
demand, the
vehicle capacity
must be
large, the
tour completion time must be large and nearly deterministic, and the travel cost rate must
be very large relative to the inventory cost
rates.
The computational study
that our results are rather insensitive to these conditions.
10
in
§6 reveals
System Behavior
2.3
Heavy
in
Traffic
This subsection considers the limiting behavior of equations (18)-(19). Following Harrison
(1988),
we
replace Bjit), which
for this substitution
is
is
the fluid scaled busy time process, by pjt; the justification
that any pohcy that does not utilize the vehicle for a fraction pj of
the time over a sufficiently long time interval will generate extremely large inventory costs.
In addition,
we consider the normalized netput
Without some embedding
because
exist
it
X{t)
With
is
at tour
completion epochs.
would not
by 0(1) on a time of length 0(l/>/n).
The
thus defined as
= ^{-0--^)
+ y^TipTTk-i) -
^^-^
'DiT,_,) for
t
€ [n-uTk)
.
standard tools of weak convergence (the functional central hmit
this definition the
theorem
embedded
or averaging the limit of the normalized netput process
varies (after normalization)
we consider
process
process
for renewal processes, the
random time change theorem and the continuous mapping
theorem; see Billingsley 1968) can be used to show that the normahzed netput process
embedded
drift
at tour completion epochs
pt and variance
Now we
=
cry
A(c^
is
well
time units under the heavy
of size 0(1)
whenever a delivery occurs and
traffic
m— dimensional normalized
However,
if
we
start with the
then a fluid scaling
\/n.
At
this faster
ual inventory levels
is
time
scale, the
move
at delivery epochs.
value,
and equals zero
is
with
at the
end of the
0{\/n) by (21), a tour takes only 0(l/\/n)
heavy
traffic
is
compressed by the factor
hmit. Consequently, neither tj{t)
inventory process converge to a limit in the usual sense.
traffic
normalization and expand time by a factor of y/n,
obtained, where both time and space are compressed by the factor
jumps
This
heavy
is
normahzation (where time
n); hence, tours occur instantaneously in the
nor the
X
Vc^-).
In addition, because the tour length
cycle.
approximated by a Brownian motion
turn our attention to the process ey. This process equals zero at tour com-
and has jumps
pletion epochs
+
x
Brownian motion
X
remains constant, and the individ-
in a deterministic fashion, decreasing at a finite rate
The
process tj traverses through
many
between the
tours before
X
changes
at each tour completion epoch.
similar to the state of affairs in the heavy traffic results of CofFman, Puhalskii
11
and
Reiriiaii.
In their
exhaustive polHng system, the total queue length process behaves as a
one-dimensional diffusion under the slow time scale associated with the heavy
and the individual queues move as a
fluid limit.
(HTAP)
traffic scaling,
under the faster time scale associated with the
fluid
This time scale decomposition gives
rise to
a heavy
traffic
averaging principle
that implies the following: for purposes of calculating performance measures for the
individual queues, one can analyze the deterministic fluid cycle for each fixed value of the
diffusion process.
and
in
in
the polling system
the IRP. First, the fluid trajectories are different. In the polling problem, the fluid
paths look
down
There are four key differences between the IITAP
like
those for the economic production quantity
at a finite rate. In the IRP, the paths look like those
(EOQ) model:
they go
down
at finite rate but
go up
(EPQ) model: they go up and
from the economic order quantity
in
jumps
at delivery epochs.
The
second key difference relates to the issue of "control". In the polling system the exhaustive
discipline guarantees that
whenever the server switches from a queue, that queue
is
empty.
This exerts a type of control that keeps the multidimensional process well behaved. There
is
no such natural mechanism
in the
IRP; we must introduce a dynamic allocation scheme
to keep the multidimensional process well behaved.
emerged
scale decomposition in the polling system
traffic
normalization, whereas in the
This gives
rise to
IRP
it
the fourth key difference
(This
is
done below.) Third, the time
as a consequence of the standard
heavy
from the scaling assumptions (20)-(23).
follows
— the proof of the HTAP. In the polling context
a difficult proof involving a threshold queue was needed.
For the
IRP the proof
follows
from assumptions (20)-(23) and the properties of the dynamic control (we do not, however,
provide the proof).
2.4
The Limiting Control Problem
As described above, the
scales.
On
analysis of our limiting control problem decomposes onto two time
the slow time scale associated with the diffusion limit,
effects of the controlled allocation process er
is
and choose the vehicle
generated by the normalized cumulative idleness
12
)
.
we can average out the
idling policy.
which we as.sume
is
This policy
nondecreasing and
right continuous.
=
Let Z{t)
—
X{t)
g-Y{t); this
is
the process that would be obtained
one were to observe the total inventory only at tour completion epochs.
process as the total
At the
embedded inventory
faster time scale
process, to differentiate
where the
embedded inventory
total
the individual inventories behave as a fluid,
it
we must
from
is
We
if
refer to this
W.
fixed at Z{t)
find the optimal allocation
=
x and
pohcy that
minimizes inventory costs per unit time. The limit cycle associated with an allocation policy
can be viewed as a closed
to the
m— dimensional
problem of optimally placing a deterministic cycle
inventory cost per unit time that
We
routing:
Z{t)
=
X,
R™.
in
placement
find the optimal cycle
(i)
and
its
Z{t)
=
x.
IRP with TSP
level
choose the nondecreasing
(ii)
to minimize
limsup
T—oo
—E
subject
to
cycle placement problem
control problem for
for the
when
embedded inventory
for a given total
corresponding inventory cost rate g{x); and
Y
Let g{x) represent the
achieved by optimally locating a cycle
is
can now state the limiting stochastic control problem
right continuous process
The
path, and the optimal allocation policy reduces
/ 9iZit))dt-rY(T)
(25)
Jo
J
Z{t)
=
X{t)
-
—V y'(0-
(26)
a nonhnear program and problem (25)- (26)
is
Brownian motion; these two problems are solved
a singular
is
in the next
two sub-
sections.
2.5
Optimal Cycle Placement and Dynamic Allocation
To optimally place the
Wein (1995)
limit cycle,
for the stochastic
and denote the individual
we
follow the approach used in Markowitz,
economic
lot
fluid inventory levels
used to denote quantities introduced for the
in
many ways and we
scheduling problem (ELSP). Let us
choose to specify
it
by the vector
fix
Z{t)
=
x,
=
Qi{y/nt)l s/n (a "bar" will be
The
cycle placement can be defined
by Wi{t)
fluid limit).
Reiman and
(xi, X2,
.
,
.
.
x^), where x^ represents
the lowest point during the cycle of Wi[t).
The
choose (xj,
choice of optimal
.
,
.
.
(.Tj,
.
.
,
.
Xm)
is
a constrained optimization problem:
Xm) to minimize the inventory cost rate subject
13
we want
to
to consistency with the total
WAt)
-^^^,
A
Figure
1:
Fluid inventory evolution at retailer
embedded inventory
level.
be described below.
We
To
inventory cost rate will
come from the averaging
TSP
of the inter-site
and
i
mean
we need
to introduce
path between any two
travel times
=
Of-^^
some new
sites i,j
G
0, 1,
is
model by
Ofj^^
time over a cycle so that the vehicle leaves the warehouse at
to its corresponding cycle placement value x,
=
1,
.
.
,
.
.
.
,m by
compressed by a factor of ^yn
define the corresponding travel times for the fluid
z
.
m. Summing these inventory
i
by (see Figure
9j]^^; in
levels over all retailers,
=
Oj^^^ /y/n.
=
0,
If
we measure
then VV,(0)
=
VV',(0)
x,
+
is
ventory cost rate.
solution.
to retailer
size
is
The
Long term
i
V,
=
.
,Xm) satisfying
.
bcisic intuition of
^(dof^
Vi
=
Vi/y/n.
for
we obtain
(27)
(27),
we want
to determine the associated in-
the time scale decomposition appears to provide the
stability requires that, in the long run, the average
per cycle be
given by
.
related
i
t
(xi,
terms
YliZi ^k.k+i
^x. = x-X:A.^T/^
Given a vector
total
in the fluid limit,
=
1)
and the
Denote the mean
notation.
these quantities are defined by Of^^^
6,j,
Since time
0.
principle to
deal with the consistency issue.
first
level x,
travel time aiong' the
>
during a nominal allocation cycle.
establish the relationship between the cycle placement variables i,
embedded inventory
for j
The
i
A,K/A; under the diffusion and
Viewing the
fluid
14
amount deUvered
fluid scalings, this delivery
inventory of retailer
i
in isolation
with a
we
delivery of Vi on each tour cycle,
delivery, decreasing at a constant rate until
Figure
The inventory
1).
considering the normalized inventory process
Although
[xi, Xi-\-Vi].
this
Simply delivering
VF, to
retailer
be uniformly distributed on the interval
Vi to retailer
i
on every
is
simple allocation scheme, the
drift of
VJOf
the inventory "arrival" rate
rate,
visit will result in
an
infinite long
this,
finite.
run average cost
note that under this
Wi{t) does not depend on Wi{t) and equals zero, since
equals the
demand
rate A^
when pj =
with
I]
<
/^j
generated with the effective arrival rate being prVilOj. With a zero
is
grow
central limit theorem arguments indicate that the inventory or backlog will
fact, similar
turns out that
it
needed to keep the long run average cost
because this allocation leads to a null recurrent process. To see
similar result
can then be calculated by
i
approach provides the correct inventory cost
a dynamic (state-dependent) delivery size
immediately after
V^
reached just prior to the next delivery (see
is
a-,
component associated with
cost
+
see a fluid starting out at Xi
arguments can be used to explain some numerical
results in
1
a
drift,
as \/t. In
Federgruen and
Katalan (1994) and Wein, where a state-independent policy performs poorly
a stochastic
in
setting.
A
at the
simple dynamic allocation policy escapes this
warehouse as
follows.
Given a
We determine delivery sizes
difficulty.
fluid inventory level [w\^
.
.
.
,to„)
when the
at the warehouse, the fluid limit of the inventory immediately before delivery
If
possible,
we would
like to deliver
(/,
=
Xi
+
Vi
—
Wi
-f
A,^^"^^ to retailer
the fluid inventory level immediately after dehvery to x,
then this delivery allocation
have
d^
Yl
>
V. This
is
is
feasible. If d^
<
for
some
+
i,
li d,
V,.
i,
>
in
—
is
—TSP
XiOQ^
.
order to bring
for
then, since Yli di
Wi
is
vehicle
=
?'
=
1,
.
.
.
,m
V, we must
a transient state for the fluid limit; within a finite
number
of
{i:d,>0}
cycles
we
will
have
d,
>
for all
i.
This transient interval has no
average inventory cost, and an averaging principle
will
when
between
Xi
and
Xi
+
The average
Z{t)
=
x, W{{t)
on the long run
hold under this dynamic allocation
scheme. In summary, the essence of the averaging principle here
allocation scheme,
effect
can be treated as
is
if it
that,
is
under
this
dynamic
uniformly distributed
Vi.
inventory cost per unit time
divided by the corresponding cycle length.
The
15
is
equal to the cost incurred over a cycle
cost at retailer
i
may be obtained by
simple
geometric arguments
for
any cycle placement
When
i^.
the cycle placement
is
sufficiently
high (low) so that the inventory remains positive (negative) for the duration of the cycle, the
cost
simply the holding (backordering) rate multiplied by the absolute value of
is
which
is
When
the average inventory level over a cycle.
i,
+
K72,
the inventory changes sign during the
cycle the total holding (backordering) cost over a cycle equals the area of one of the triangles
above (below) the time axis multiplied by
when there
length.
is
a sign change we
In the
heavy
sum
traffic limit,
amount demanded per
which
cycle,
/t,
To obtain the time average inventory
(6,).
cost
the areas of these two triangles and divide by the cycle
amount
the
of fluid delivered per cycle, V, equals the
X6t] hence,
is
we
set the cycle length in the fluid
model
equal to V/X, rather than Oj. In summary, we have the following expression for retailer
+ ^)
hi{xi
9i{xi)
=
!^x]
+
2V,
<
Notice that
problem
is
S',(x,) is
a convex function of
to minimize
Let us
make
conventions he
=
h
H,
iixi>0
h,x,
+ f^ \i-V,<x,<0
+ ^)
6.(.r.
With equation
x,.
hi
and
bp
=
b
=
form solution to the cycle placement problem
b,
min,
is
in
h, for all
where
x.
Not
i,
=
£
—
rn
1
1.
The solution
Region
1:
surprisingly, g{x)
x<ar =
j: X^'' -
Oi
+
i
Vi
for
hi
16
I
^p,
is
L ^K,
+
0,
= -
A
closed-
The
solution
cost as a function of the
quadratic with linear edges in the
is
placement problem
i
X,
allowed.
for further details.
and g{x), the inventory
to the cycle
\s
variables; readers are referred to
inventory level x.
Proposition
p
and define the labeling
found by using constraint (27) to turn the
Markowitz, Reiman and Wcin
yields the vector of optimal placements x'
embedded inventory
>
6;,
problem into one of unconstrained optimization over
total
(28) in hand, the cycle placement
^,(x,) subject to (27).
min,
an analogous optimization
(28)
.
<-V;
ifx,
the innocuous assumption that
=
i:
hi
^v
g{x)
= -bx +
ai,
nTSP
Region
,
aT<x<h = Y.
2:
l^{b + h,Y
Iv-.tV
>^^~^l''^
E r^^'
+
'
i
X:
=
«2
=
7,\1^
2
V
+
h^
,
i
h.
0,
/?,
%r
03
bih,Vj
04
bi
Region
x
3:
x]
>
-\-
hi
/Sj,
= -r^nr^
Oj +
Z^''
^^
^'
ft,
g[x)
a^
=
/ix
+
a,5,
= -hTx.'eir
+
Oi
Z^
^ -y.hM
2
^
^
In the region
where the
cycle holds (backorders)
so,
is
much
most of the inventory
'
-E
2 ^
^!'
6,
~
+
f^'
site
(6),
depends upon two
and
its
factors:
nominal delivery
particular retailer represents).
greater (smaller) than zero, the optimal
at retailer i (p),
the difference between
size (or equivalently, the
In the region
where the
cycle placement at each retailer varies linearly with the
It is
worth noting that,
for the
where
its
cheapest to do
is
it
The exact
demand
proportion of
is
when
hi
that the
close to zero, the
embedded inventory
(i.e.,
level for
holding (backordering) cost
total inventory
symmetric cost case
17
^-
/i,
while the cycle for the rest of the retailers remains close to zero.
each
h
total inventory
'
=
in the system.
h and
b,
—
b for
all
=
2
1, ...
,
m), the more natural solution of
{x-ZkhO^r)
also optimal.
is
Optimal Base Stock Level
2.6
Now
that g{x)
known, we can proceed with the solution to the one-dimensional Brownian
is
The
control problem.
Proposition
some
following proposition
The optimal solution
2.
is
proved
Appendix A
in
to (25)-(26) is Y'^t)
=
of Rubio.
suPq^^^^{X{s)
—
z^}'^ for
base stock level Zj.
Hence, the optimal solution
rier 2j,
(
&T<x<^T
if
is
the local time of the Brownian motion at the bar-
and the optimally controlled process
— oo,2j]
Z
a reflected Brownian motion
is
The remainder
(see §2.2 of Harrison 1985 for a definition).
(RBM) on
of this subsection
is
devoted to the derivation of Zj, which can be found by using two well known facts regarding
an
RBM
df^j/V
on (— oo,z].
>
for fij
First, for
which
0,
Y
independent of the base stock
is
portation cost does not affect the selection of
limsup-r^^
^E
[Jq
The steady
if
X
>
2,
where
z,
Hence, the trans-
level z.
and the problem
=
simplifies to
minimizing
g{Z{t))dt] subject to (26).
state density for
{/j
we have \\mt^oot~^Ex[Y{t)]
defined in Proposition 2
—
2p.'r/crj
>
0.
Z
is
given by pz{x)
=
j/je"^'^"^'
\{
x
<
z
and pz{x)
=
Therefore, the optimal base stock level can be found by
minimizing
= n{-bx^a,)vTe^''^'=-'Ux+j\a2X^ +
Ft{z)
r [hx + a5)C'Te^''^'~'^dx
+
a^x^a^)uTe^-'^''"^dx
z
>
/Sj,
{a2X^
+
a:iX
for
and
(29)
Jpr
Ft{z)
= n{-bx +
ai)C'Te^^^'-'Ux
+r
J—oo
for
Or <
z
<
fir-
The constants
in (29)
is
linear
a4)he^''^'~'^dx
and (30) have the same definitions as
that while the optimal base stock level Zj always satisfies zj
fact that g{x)
+
(30)
Jaf
and has a negative slope
18
for
x
<
qj),
> aj
it
(this
is
in §2.5.
easily seen
Note
by the
need not be larger than 0t-
The
following proposition
and then taking the
Proposition
first
The value that minimizes Ft{z)
3.
\b
vj
is
derived by using integration by parts on (29) and (30),
two derivatives of Fxi^) with respect to
h
otherwise, z^
is
+
\
(
yri^T
^3-^f
+
is
a unique
(31) that satisfies zf
2.7
4>/5t;
if
+ as-'^ =
^
2a2ZT
is
Ft{zx)
In this subsection
the fact that Ft{z)
optimum base
>
/?x or
The Proposed
=
We
is
hzj
+
0.
(32)
05 if Zj
is,
a solution Zy to (32) that
we map the
satisfies
IRP with TSP
idle,
routing.
The
zf
is
is
size of the
proposed policy
/?y,
is
is
but not both.
control concerns two decisions:
best delayed until the vehicle arrives at the
how much
site.
full load,
Let
to
is
correspond to
and consider the epoch
i.
The mapping from heavy
At time t~ the
,
.
.
,Qm{l-7))i
traffic solution to
straightforward: the proposed pohcy attempts to track the heavy traffic
solution (in particular, the optimal cycle placement) as closely as possible.
to be addressed
of the load to
given by the inventory levels at the retailers, iQi{t~),.
remaining load, L{t7).
retailers
first.
the point in time just before the vehicle arrives at retailer
state of the system
and the
<
and how to assign the load among the
the epoch at which the vehicle leaves the warehouse with a
which
=
solution of the approximating heavy traffic control problem
address the load allocations
leave at each retailer
to,
^j, and Fr{zj)
either there exists a solution Zj to
Since the system evolves dynamically in time, the decision of
>
>
convex and continuously difFerentiable)
stock level; that
whether the vehicle should be busy or
t~
(31)
Policy
into a policy for the original
during a tour.
+ aT
I
^4 otherwise.
One can show (from
that there
—
the solution to
Furthermore, the predicted optimal cost
+
is
Q^r)
h J ye^TWr-oiT)
2a2^_0rizT-&T)
^2{^t)^
—
z.
that the heavy traffic solution
19
is
The key
issue
expressed in terms of normalized space
and time and
terms of the total embedded inventory process, whereas the proposed policy
in
must be expressed
terms oi {Qi{t~),
in
,Qm{t~),L{t'~)).
...
Recall that equation (27) relates the scaled cycle placement vector x, and the normalized total
each retailer
=
'^'"^
embedded inventory
<
=
q
If
time <~, where
i.
taken when
Because
level.
y/nx and the unsealed cycle placement vector
Summing
over
by the heavy
under a deterministic evolution
this relation
i,
=
<o
+
+
qj
we
all retailers,
<0)
then
over the
arrives at retailer
it
Therefore, the retailer inventories relate to the cycle
^oi'^^-
=
averaging principle,
traffic
for the retailer inventories
the vehicle leaves the warehouse at time
placement parameters by Qj{t~)
j
corresponding embedded inventory
l-''^'
\/nxi. In keeping with the behavior predicted
course of a cycle.
at
is
to reverse the scalings in the solution to the heavy traffic control problem, let us
we develop
i
Since the load allocation decision
x.
establish a relationship between the current total system
first
= ^jQji^T)
define the unsealed
9i
we
reached,
is
inventory Q(ti')
we need
=
embedded inventory Z{t)
^j&Jf^
for j
>
i
and Qj{t~)
=
+
qj
V^
—
^j^Ji^^ for
get
Qit-)
= V.+E<l:^
(33)
J
where
T]i
=
12j<i{Vj
~
^j^Jf^)
+
J2j>i
Making the substitutions qi/y/n =
x,,
'^
^j^If^
q/y/n
=
^ri
epoch locator constant
x and Ojf^ I \/n
=
for retailer
i.
Ojf^ into (27) yields the
unsealed version of constraint (27),
E9. =
9-L^.<—
(34)
t
t
Using equations (33) and (34), we can express the total inventory at time
vehicle
was
^o (i-c,
when the
at the warehouse) as a translation of the inventory vector at time t~:
Q{to)
=
q
= EQAt7) -
V.
+
J2^Mr
f«^
^
=
0,...,m.
(35)
This equation maps the current inventory levels Qj{t~) into the one-dimensional quantity q
that
is
required to interpret the heavy traffic results. In particular, for a given value of q
can find q'
=
we
y/nx', the corresponding optimal (unsealed) cycle placeincnl parameters by
reversing the normalizations
in
Proposition
1.
As
20
before, this
is
done by
letting q,/\/n
=
a:,-,
x, Vif \pn
=
terms of the state
q:
=
ql y/n
and Off^/ \/n
Vi,
Region
1:
Region
Region
g
=
< ar =
Off^ to obtain the following expressioiTs
E -^>C " E rVT:^'
^^^^
«.-
=
-^Kfor.^,,
(37)
€
=
9-E^.C + E^V;;
(38)
2:
3:
oj <
9
<
q,
/^r
0,-
,;
=
+
< fr =
E ^.C " E f^'^'
/ij-
,-EvS- + E^^'..
(44)
n.
can now use these results to determine the delivery
size at retailer
deterministic inventory evolution for the optimal cycle placement, Qi{tf)
level at retailer
i
just after the delivery
is
made)
Q,{tt)
If
we
deliver
Qi{t~)
+
di;
(fj
(39)
(42)
which are independent of the scaling parameter
We
for q* in
units to retailer
i
=
q*
(i.e.
i.
Under the
the inventory
satisfies
+
(45)
K-.
then the actual inventory after the delivery
equating this quantity to (45) yields the desired delivery
is
made
is
size,
d,=q^ + V,-Q,{t-).
(46)
Because we cannot allocate more than the available load and do not want to make negative
deliveries, the
proposed delivery
0?*
=
max[c?„0]
size
+
is
given by
min[0,L(<,") -(/,] for
21
z
=
1,2,
.
.
.
,m -
1.
(47)
Finally, to
guarantee that the vehicle returns to the warehouse empty, we set
=
d'm
We
could
at retailer
under
its
ill
(48)
principle allow negative deliveries, as long as there
and the
i
L{t-J-
total
total capacity.
amount
of load the vehicle carries as
it
is
inventory available
leaves this retailer
is
kept
However, because the vehicle returns empty, the items accrued from
negative deliveries would most likely be shifted to the last few retailers of the tour, which will
not necessarily bring the state of the system closer to the optimal cycle; hence, we disallow
negative deliveries.
To
dynamic delivery allocations are derived by the following
recapitulate, the proposed
procedure:
(i)
observe the current inventory levels {Q\{t~)
unsealed embedded inventory q via (35),
placement parameters
q',
and
(iii)
.
.
.
and compute the
^Q,ri{iT))
use (36)-(44) to derive the optimal unsealed cycle
(ii)
observe the current remaining load L{t~) and compute
the proposed delivery size via (46)-(48).
We now
the vehicle
turn our attention to the busy/idle policy, which has decision epochs
at the warehouse.
is
total inventory level IZjQjit)
otherwise,
it
is
base stock level solely
=
is
in
^
hi
+ aT
1
Ii
j
\e''T(PT-o:T)
qj
=
the
\/n^f)
-
if
qT>0T;
(49)
1
the solution to
Furthermore, the predicted optimal cost
—
level
if
Proposition 3 yields the optimal unsealed
^e-.r(,r-r) + 2a2qT +
^V('7f)
new tour
terms of the original problem parameters:
in
J>+
otherwise, q^
a
in time, the vehicle starts
below the unsealed aggregate base stock
Reversing the normalizations
idles.
qr
At these points
when
<^2(?f)^
+ ^^^r +
'^'1
otherwise.
counterparts of the ones defined
is
03
-
given by Friqj-)
The constants
in
earlier:
Ut
=
—=
2(1
-ptW
xerici
22
+ vc^y
=
0.
h(jj
(50)
+
05
if
q^
>
(3t,
and by
these expressions are the unsealed
«.
=
4E^-i:A,<'H^iE^
Z^
We
oz
t
I
2
^
2
'
'
^
6,
+
and
/li
have completely characterized a dynamic control policy that depends exclusively
on the original system parameters. The policy
two controllable aspects of the
specifies the
the aggregate base stock level defined in (49)-(50) determines the vehicle idhng
system:
pohcy, and the delivery sizes d* defined
in (47)- (48) characterize
the allocation of units to
retailers.
The IRP with Direct Shipping
3
The only
difference between the direct shipping (DS) case
scheme used when the vehicle
is
operating.
We retain
all
TSP
and the
case
is
the routing
notation from §2, occasionally using
the subscript "D" (for direct shipping) in place of the subscript "T" (for TSP). Moreover,
since the procedure
is
very similar in both cases,
we omit
nearly
case, describing only the distinctive aspects of the analysis.
between
DS and TSP
is
that the
consequences, one of which
is
DS
all
of the details for the
The most
DS
significant difference
case does not have a cyclic structure. This has several
that the results are not as theoretically solid as in the
TSP
case.
3.1
In the
Heavy
DS
Traffic Analysis
case, the vehicle always leaves the
and returns empty, so that every time a
V
units.
nominal
As
before,
policy.
it is
retail site
is
full
load, visits a single retailer
visited its inventory level increases
by
convenient to express the dynamic allocation as deviations from a
The nominal
policy
the nominal policy an amount Vi
denote the number of
warehouse with a
DS
is
deliveries
we consider
is
not achievable:
delivered to retailer
made by a
23
i
in
we assume that under
every delivery.
We
continuously active vehicle during
let
Soit)
[0,i].
We
then
be defined by the analog of equation
let cf^{t)
Then
=
c/j(0
is
i.i.d.,
structure
or
amount
DS
TSP
and each tour
results in the delivery of
as a
Markov
same reason that
would be
The DS
case would have a cychc
Neither of these can be used
chain).
fixed delivery sizes could not
infinite over the long run.
A dynamic
be used
A,/A for retailer
i.
The
traffic intensity for
use the same heavy
traffic
demand
the
is
thus pp
normalizations as in §2.2.
<
< m, and
2
the
does not
must be
= 2^^ KOoi/V.
The heavy
conditions
traffic
are given by (20)-(24), with (21) and (23) understood to hold, respectively, for
1
i
at all retailers, this fraction
DS system
in
in
policy, described
used. For this policy the fraction of total shipments that go to retailer
vary over times of order n. To satisfy average
We
units.
V
TSP
the sequence of retailers visited followed a cyclic pattern (such as a polling table),
if
case: inventory costs
is
Z)'s.
once 7"s are replaced by D's. The problem
case lacks the natural cyclic structure of the TSP. Tour times in the
case, for exactly the
in §3.2,
obtained by replacing 7"s by
delivered during the current trip. These processes
(7) still holds as well,
had a regenerative structure (such
the
1
thus nearly identical to (2)-(12).
The DS
are
the total
is
Equation
satisfy (2)-(6).
formulation
ef (0
IC,
(i
% and
Cq,-,
(22) replaced by
^(^-')>»-
'- =
As mentioned above, the DS case does not have the
''"
TSP
cyclic structure of the
case.
Thus we
cannot use functional central limit theorems based on renewal processes to prove convergence
to a
Brownian motion.
5£j, is
given by
were chosen
in
sj)
an
=
A~^
i.i.d.
times that a retailer
In the
is
DS
case the parameter corresponding to
^ his expression would clearly
J^i ^i^oi^oi-
manner.
It
can be shown to hold
for
Sj- in
arise
if
the
TSP
case,
the next retailer
any policy where the fraction of
visited does not vary over times of order
n using the
Random Time
Change Theorem.
The
lack of a cyclic structure
netput process. Indeed,
we
will
if
makes
we embed
at
impossible to consider an
embedded normalized
epochs during which the vehicle
not obtain a meaningful process.
process to obtain a meaningful limit.
it
is
at the warehouse,
VVc must thus average the normalized netput
By arguments
24
similar to those in §2.3, this averaged
normalized netput process
is
well
approximated by a Brownian motion
X
with
drift /i^
and
variance
The averaged
total inventory process
m
Now we
face the
slow
down time by
=
is
defined by
-
X{t)
problem of optimally placing the
facilitate this optimization.
We
=
60, \\
3,
=
A2
2
(52)
.
Again we
limit cycles for the deterministic evolution of the
In contrast to the
V =
Y{t)
a factor of ^/n and turn to the fluid model.
retailer inventories in R"*.
that
^\
TSP
case,
we introduce an approximation
motivate this approximation by use of an example. Suppose
and each
was exactly
retailer
six
time units away from the
warehouse. Then a continually busy vehicle using the polling table (12121) could
average, retailer
1
to
visit,
on
every 20 time units and retailer 2 every 30 time units, thereby satisfying
average demand. Although optimally placing a limit cycle for a small polling table such as
this
one
is
manageable, the optimization problem gets unwieldy very quickly as the
the table grows.
make a
Our approximation assumes the
delivery to retailer
example, we assume that
i
size of
existence of an idealized policy that would
every Vj \i time units in the fluid model; in the context of our
retailer 1 (2) receives a
dehvery exactly every 20 (30) time units,
even though the (12121) polling table cannot achieve such perfect regularity
in
the fluid
model.
Hence, our approach
scale,
and then track
is
to optimally place an idealized fluid cycle at the fast time
this cycle as closely as possible
with our proposed policy.
deliveries are perfectly regular in the idealized cycle, the use of this
Because
approximation causes
us to underestimate the inventory cost incurred over a cycle; however, simulation results
in §6.1
show that the heavy
traffic analysis
incorporating this approximation appears to be
very accurate, at least for the five-retailer cases considered there. Moreover, the use of an
idealized cycle allows us to avoid the task of determining the actual behavior of the individual
inventory levels on the fluid time scale. This task
of a cyclic structure
it
appears to be extremely
25
is
more than
difficult.
just tedious:
due to the lack
Now we
placement by the vector
(xi,X2.,.
.
.
are similar to the
we
paths pictured
deliver a full load of
V /X,
which equals
The next
T,
TSP
and the averaged
in
retailer
Z{t)
=
i
sum
Under the
i.
Figure
units
is
idealized policy, the fluid inventories
the only differences are the delivery size (in
1;
on each
a retailer) and the visit frequency,
visit to
to establish the relationship
=
total inventory level Z{t)
between individual and total inventory
equals the
between the cycle placement variables
x.
The
levels takes the
constraint related to consistency
form that the averaged total inventory
of the average individual inventory levels.
over an idealized cycle
X, the cycle
define the cycle
order to maintain a balanced flow.
in
step
V
still
,Xm), where x, represents the lowest point during the
cycle of the fluid inventory level at retailer
this case
We
turn to the optimal placement of the idealized cycle.
is
x,
+ V /2.
The average
fluid inventory at
Hence, when the averaged total inventory
placement parameter must satisfy
E-.=--'fBecause the
gi{xi)
and
TSP
is
(53),
it
is
V
under the
is
replaced by V.
fluid delivery size equals
given as in (28), except that
V,
clear that the optimal cycle
case, except that
we
The computation
replace
K
by
V
(53)
placement
and
J2i
DS
is
case, the inventory cost function
Comparing constraints
precisely the solution given in the
^i^of^ by mV/2.
of the optimal vehicle idling policy
is
identical to that in §2, except
that the parameter of the exponential stationary density of the
Hence, Proposition 3 characterizes the optimal base stock
replacing
i>t,
and with the substitutions described above
the constants
3.2
a,
and the thresholds a and
RBM
level for the
in the
is
i>d
DS
=
'Ifio/f^o-
case, with i>D
corresponding definitions of
$.
The Proposed Policy
The mapping from heavy
the
(27)
TSP
case.
We
traffic solution to
proposed policy uses the same philosophy as
begin by establishing a relationship between the total inventory
in
in
the
system at the current decision epoch and the cycle placement parameters. Although there
are several possible ways to do this,
we keep
track of the vector process (rj(0,.
26
.
.
,rm(0)i
which
time of the most recent
specifies the
current time by
retailer
The
i.
parameter
qi
=
then
<,
—
t
ri{t)
each retailer. Hence,
visit to
inventory at retailer
at time
i
relates to the unsealed cycle
t
—
ri{t))
<
=
V- Ut - r,{t)).
+
q,
(54)
expect that Qt{t)
in
>
=
thereby contradicting the definition of
level,
+
Qi
Aj^oi,
we modify equation
this modification leads to considerable
Summing
the simulation study.
u{t)
J2i T^SuX
[V
—
it
which would make the cycle placement of the
0,
the current inventory
ri{t)), XiOoi];
placement
\/n Xi via
Since stochastic effects can lead to unusually long intervisit periods,
Xi{t
we denote the
represents the elapsed time since the vehicle last visited
Qi{t)
V—
if
\i{t
—
over
all retailers
Combining
r,(i)), Aj^Oi]
(54) to Qi{t)
improvements
we obtain
YliQi
than
Because one would
qi
in
possible to have
retailer higher
g,.
=
is
+ max[V —
A,(i
—
system performance
=
Q{i)
~~
u{t),
where
equation with the unsealed version
this
of (53) yields
1
= T.Qdt)-u{t) + '^.
which relates the unsealed averaged inventory
total inventory in the
normalizations,
vector q*
=
DS
r,
q (this quantity represents the unsealed average
Reversing the heavy
case) to the current inventory level.
we obtain the
y/nx* given
(55)
following formulas for the unsealed optimal cycle placement
q:
.
Region
1:
< Qp =
g
jrK-^
-V
}^
= -
+ h,
V
6, +
b
+
b
,
+
,
0,
i
q,
hi
,
+
mV
"1^'
2
h,
.
tor
I
^p,
tit
2
Region
^d <
2:
9
b,
,^p
<
/?D
=
+
h,'
+ ^T'
^E mT
k+
2
h,
i
hi
1
Region
3:
mV
2a7V (
.
«7
traffic
=
+
bt
f^
W E
(3d
<
2
\
ysr^h,-hk\
h
bk + ^A;
^
fc
1
q,
27
\-^
OA:
-t-
,
which are independent of the scaling factor
We
and
u{t),
n.
use this cycle placement as an "ideal" m-dimensional inventory state for a given q
and choose the next
retailer so as to bring the current inventory vector as close as
possible to this ideal state. Let
be the time epoch at which the vehicle
^o
is
ready to depart
from the warehouse, and consider the inventory evolution over the next delivery
a deterministic inventory evolution, the vehicle
next) at time
i,
=
^o
+
reach retailer
will
q:
+V
Q'^itf)
(if it
is
made
optimal cycle
to retailer
i
is
given by
and
= max
+
[q*
V- X^ito +
-
0o.
r,{to)), q]
where the maximization makes the adjustment
+
A,(^o,
+
for long intervisit
for ;
^o,)]
under the deterministic evolution the actual inventory vector
to retailer
is
given by Qx[it)
=
Qi[i-o)
+ V—
and Qj{tt)
X^Oq,
^
i,
times as discussed above.
In contrast,
i
Under
chooses to go there
In the deterministic tour corresponding to the
^oi-
placement, the retailer inventory levels right after a delivery
Qntf) =
i
trip.
=
Qj{to)
—
after a delivery
X-jOq,
for j
^
i.
Therefore, the resulting Euclidean distance between the ideal and actual inventory vectors
after a delivery to retailer
the vehicle to retailer
k,
Finally, as in the
i
is
A{i)
=
where k
TSP
= y Hj
argmin,
~
Q'ji^t)]
is
The proposed
control sends
A(?').
case, the busy/idle control
The only added complexity
traffic results.
[Qjiif)
that for the
is
DS
a direct unsealing of the heavy
case the vehicle visits the ware-
house after every delivery and so has many possible idling decision epochs. By equation
our proposed policy
idles
a vehicle at the warehouse whenever
Q{t)
where
tions,
Wq =
y/n
we have
z'^ is
(55),
-
u{t)
+
—
>
the unsealed idling threshold.
w'o,
Making the
suitable scaling substitu-
that
\ ( ^d{0d - cud)
b-V h) \e''D(PD-aD) - 1
f'
=
Wn
'D
In
I'D
28
+ Qo
if
w])
>
i3d;
(56)
otherwise,
w^
is
the solution to
1
-e
-.n(wn-an)
-i'c(tu£)-a£i)
vd
+
.
1
__
r.
*
.
X
and Fj:){w^)
=
=
^^
,
'^
aj{w*p)
.
+ a^Wp +
ug otherwise.
The constants
m
—
—
k + k
h,
2Var (y:
,^
where (ciDi/^D^Oy) are defined
4
in
in these
equations are
+
\
and
2
h,
y,-^
)
'
kh.
2^b, +
h,'
the unsealed cycle placement formulas above.
Comparison of TSP and DS Routing
In this section
we compare the
relative
performance of the two fixed routing schemes (TSP
and DS). The predicted cost functions Ft, Fq derived
tory
.
w},>f3o,
2
mV\
h,
b,
,
if
= '1-Pd)XV
vd
/
(57)
given by
is
v^(hi-h)^
,^y y ^,
+ -^h,--Y:^-^-^
hwh-h^
,
o.
hi
i
Furthermore, the predicted optimal cost
FD{w},)
"^'^
T.V+ ^Er^ =
-^-T:-^
ud
k +
component
of the system cost.
and by C(3?) the
total cost for the
in §2
and §3 represent only the inven-
Denote a generic fixed routing scheme by
system under
this
3J
G {TSP, DS},
scheme. The total system cost
is
ob-
tained by adding the transportation cost (or equivalently, subtract the idleness reward) to
the inventory cost; that
cost
is
is,
we
set C(9?)
= F^{w^) — r{l — p^n).
independent of the stochastic nature of the system.
A
crucial observation
is
that po
<
pr] this fact (see
consequence of the triangle inequality. Although
important implications.
TSP
Notice that the transportation
policy.
This
is
First, the
DS
Rubio
for a proof)
trivial to prove, this inequality
is
a simple
has several
policy achieves lower transportation costs than the
quite interesting since, at
first
glance, one might expect the converse
to hold. However, minimization of the steady state transportation cost in the
29
IRP context
is
amount
equivalent to maximizing the
full
of items delivered per unit time traveled; hence,
load direct shipping provides the highest transportation efficiency of any fixed routing
scheme. More importantly,
until
pt
=
1
and po <
I;
TSP
be stable while the
for
that
any given problem instance, the demand rate can be increased
is,
there exist
some demand
We
policy would not.
<
1
it is
DS
form of stable dynamic allocation
is
The remainder
in
will
used
dominate
TSP
DS
case.
in
the
I
is
a.
necessary
not sufficient. In particular,
is
the absence of adequate dynamic
mind that while the
r
and
b.
/Sj
—
>
as long as
1
performance of the
some
DS and
However, readers should keep
qualitative statements below are true, these results are not exact,
because our calculation of F£){wq)
is
the true heavy traffic cost under the
TSP
routing as
of this section investigates the relative
schemes as a function of the cost parameters
preferred to the
in
it
<
possible to accumulate inventory at one retail site while backorders grow
without bound at another. Hence
TSP
but
keep the total inventory stable but,
will
load allocation,
3J
DS poHcy would
where the
should note that p^
condition for stability of any fixed routing scheme
having p^
levels
policy
if
approximate and represents a
DS
policy.
the transportation cost
may be found
pp < pj implies that DS
inequality
is
underestimate of
is
DS
high enough. In particular, the
> {Ft{wj) — Fd{w}j))/{pt —
policy achieves a lower overall cost for any r
value of this threshold cost
The
slight
po)- While the
numerically for any particular problem instance,
a more precise characterization requires a better understanding of the relationship between
the inventory costs in both systems. Unfortunately,
performance comparisons
levels,
and
it is
hard to make simple inventory cost
for the different routing schemes, primarily
and hence the predicted inventory
costs, are not in closed
because the base stock
form (see equations (50)
(57)).
To study the
relative inventory cost
performance of the
TSP and DS
consider the case where the inventory costs at the retailers are symmetric
hi
=
b for all i)
and
incretising in b/h,
increased above
These
b
becomes
large.
Because the value of
one expects that there
them
critical values
exist
some
ii^j
and
xr'p in
critical values br-ibo
(while leaving h fixed) the optimal base stock
is
schemes,
(i.e.,
(49)
/),
=
and
such that
let
h
us
and
(56)
is
6
is
if
given in closed form.
indeed exist and, for the case of symmetric costs, have the following
30
closed form expressions:
lyxVe
e^TV
>
For b
will
In
^
6
_
and
bo
=
'i/Dmye"^'"^
^I'om V
h
1
— Fd{wq)
ma.x{bx,bD}, the inventory cost difference Ft(wj)
n
As
i/j'V
GO the value of (58)
is
^i/omV
o^T^
—
+
+
1
In
utV
—
2
+
UDmV
l^D
dominated by the term
be the same as the sign of uq
_
—
[uq-^
can be expressed as
[m-\)V^
(58)
i^£,^)ln(l
+
whose sign
b/h)^
vx- Define the critical value
l/rp
e""^^
K
=
exp[.T]
cost
and only
e^.
if
distribution of the
(Tq
> aj
times
both
is
Then
ud
—
fiD
for b
>
>
(.^OmV
-l\
vprnV
^D-'T
'2{yT
J
max{67',6£),6c}, the
where v^
0,
is
=
TSP
DS dominates
more frequent
in
counterintuitive:
deliveries to each retailer,
terms of inventory
cost.
However,
it
5
5.1
in
achieves the lower inventory
Because pn < pT, the condition
we
high backorder case. Moreover,
for large
if
distance travelled must be 0{n~^'^)]
actually have cr^
since the
TSP
policy
=
Uj.
makes smaller and
might be expected to outperform the
backorder penalties, the
efficiency over the long run via its smaller drift,
much time
3J.
in the
mean
then the difference in
somewhat
-h,
i^d)
policy to be preferred. For the case of deterministic travel
aj, and so
finite
is
DS pohcy
-
— \)V
the exponential parameter for the steady state
see equation (62). Hence, in the heavy traffic limit,
This result
i/£)UT{m
associated with routing scheme
required for the
and ht are
/
\
J
i't
RBM
follows that a\)
it
l\ "T-'D
vjV
where
if
—
and causes the
TSP
DS scheme
policy sacrifices
total inventory to
spend too
the expensive backorder regions.
The IRP with Dynamic Routing
Formulation of the Limiting Control Problem
Consider now a situation where, once the vehicle
on either a
full
load
TSP
is
loaded at the warehouse,
it
can embark
tour or a direct shipment to some retailer. All other aspects of the
31
problem
Ccises.
Because the formulation of the dynamic problem
problems described
fixed routing
Rubio
U
earlier,
=
Qi{0)
Qi{t)
is
as in the fixed routing
a natural extension of the two
we only sketch the argument and
much
a detailed treatment; furthermore,
for
same
sources of uncertainty, cost structure) remain the
(e.g.
refer readers to
of the earlier notation will be reused.
then the system state equations are given by
=
V,ST{BT{t))
and the cumulative
idle
+
V,SD{BD{t))
lime process
is
=
I{t)
D,{i)
t
—
+
ef (0
Brit)
—
+
ej{t)
Boit)
for
for
>
t
>
t
sents the cumulative deviation from the nominal allocation over past
plus the
amount
The
first
delivered over the current cycle/trip at retailer
step in the development of a heavy traffic approximation
denote the cumulative fraction of busy time that the
netput for the dynamic routing IRP system
= (^^^ ~
'^^^^^
+
2T^\Af
^^^
-
DS
+
cf (f) repre-
cycles/DS trips
i.
the influence of the routing control on the total netput. Let 6{t)
^^^^
TSP
Notice that,
0.
according to the previous definitions of the delivery allocation controls, cj{t)
(59)
0,
=
is
to characterize
+
Bu{i)l{Bj[t)
service has been used.
Boit))
Then the
is
aJ
i
Notice that this equation reduces to (13) when
+ V [Sr{BT{t)) +
TSP
routing
is
Su[Bo{i))]
always used.
-
V{t).
Summing
(60)
the
equations in (59) over the retailers and substituting the relevant definitions into (60), we
obtain the following expression for the total inventory in terms of the netput and controls:
Q{t)
where
t[t)
=
er(i)
The heavy
problems.
DS
+
=
X{i)
- (^(1 -
+ 2^^I.go/ ^^))
<^(0)
+
(^^^
'^^)'
(.o{t)-
traffic
conditions are a union of the conditions for the two fixed routing
Conditions (22) and (51) require the
policies to
^(^^
approach one
because the condition
/^d
>
in
traffic intensity
the limit; however,
now we do not
ensures that a stable control
is
under both the TSP and
require
i)ossible.
{.ij
to be positive,
Hence,
in
terms of
the problem data, the retailers are required to be located fairly close together relative to
their distance to the warehouse.
^7-2^0.
More
precisely, the average travel times
^^/1\
32
(.^^^.^^1,
,^i^...^,„.
must
satisfy
(62)
One consequence
appearing
are equal
only
in (61)
and the
of the control
of equation (62)
by 0(n~^/^); hence,
differ
V/Ox and AV'/(?^j
that the quantities
is
in the
term
coefficient in front of the idleness
pohcy employed. Nonetheless, we
will
heavy
traffic
in (61) is
Aj^oj)
hmit, these two terms
a constant and independent
maintain equation (61) as
is,
and view
the time-dependent coefficient as a refinement of the heavy traffic Hmit.
Let
3?(i)
G {TSP,DS} denote the routing mode that
control problem.
We want
used at time
is
to consider intervals of time during
in the limiting
t
which one of the two controls
is
used exclusively, and approximate the resulting normalized netput process by a diffusion with
control-dependent
and variance.
drift
definitions of the netput process that are
TSP
netput process in the
of
embedding
in the
TSP
netput process from the
DS
case was
TSP
approximated
we need
in
to reconcile the different
the two cases:
made
for convenience.
DS
an averaged netput process x[t). This translation
=
x{t)
-f
77
constant will be used to translate the normalized total inventory process for the
well,
and can be calculated
Xj
developed
in §2.5.
inventory level we are seeking. Then, since the average of the
sum
of the averages,
with (27) yields x
=
x
- E.
we have x = ^i{xi + VJ2) =
A,^^^"^
+
V
.
The same
TSP
case as
between total embedded
in that context using the relationship
inventory x and cycle placement variables
equal to the
The
case.
Thus we translate the embedded
which needs to be determined: \(<)
?7,
in the
embedded
the
case forced us to use averaging there, while the choice
case, x{t)-> to
consists of adding a constant,
this
and the averaged netput process
case,
lack of a cyclic structure in the
do
In order to
Let x denote the average
sum
of the inventory levels
Ylt
+ V fi- Combining
-^i
is
this
V/2, so that
=
^-E^^^or"
(63)
i
The
role of this translation constant in the
implementation of the proposed policy
is
described
in the next subsection.
By
the arguments used in the
averaged total netput process x
control-dependent
^(^(^^)
drift
= [,r
is
TSP and DS
well
cases,
we can deduce that the normalized
approximated by a diffusion process X{t,^{t)) with
and variance given by
If
m
= TSP
^"^
33
"^^(^^^
=
14
if
m
= TSP
'
respectively.
In other words, the routing control switches the
netput of the system from
one Brownian motion to another. Furthermore, these two Brownian motions have the same
parameters as the diffusion limits of the corresponding fixed routing
(Tq
and
o-j.
By equation
ccises.
only differ by 0{n~^^'^); once again, we retain this refinement of the heavy
(62),
traffic
limit.
Because the normalized idleness process Y{t)
we
zero,
e{t)
=
traffic
are free to choose ^{t) for
e{nt)/y/n,
let 6{t)
=
times
all
t,
will
only be exerted on a set of measure
not just the nonidling times.
6{nt) be the fraction of busy time devoted to
we
let
the heavy
in
system, and define the averaged total inventory level for the system as
then we obtain the same time scale decomposition as before: under the
fixed
DS
If
and the individual
The exact
fluid levels {Wi{t),
evolution of {Wi{t),
.
,
.
.
cycle placement and the routing
dynamic routing IRP
into:
Wm{t))
scheme
.
,
.
is
Wm{t)) move deterministically
at
a
finite rate.
determined by the averaged inventory
in use at
given Z{t)
(i)
.
fluid scaling, Z{t) is
=
time
t.
We may
x and routing
mode
level,
the
therefore
decompose the
^{t) G
{TSP,DS}, use
the results in §2 and §3 to determine the optimal cycle placement and the corresponding
inventory cost function
Y
is
^3e(j)(a;); (ii)
choose the nonanticipating control (y'(f),3?(i)) (where
nondecreasing and right continuous) to minimize
limsup
T— CO
—E
i
rg^(,){Z{t,^{t)))dt-rY{T)
(65)
•/O
subject to (64).
5.2
The
Optimization of a Triple Threshold Policy
diffusion control
problem (64)-(65) appears to be
coefficient in front of the control Y(t) in (64)
the control-dependent cost function .g»(x)
section to niimrrically
compute the
is
difficult to tackle for
depends upon the routing control ^{t), and
very complex.
An
solution to (64)-(65). Here,
the following triple threshold policy that
is
two reasons: the
algorithm
we
is
used
the next
specialize our analysis to
characterized by the parameters ^i
31
in
<
^2
^
^3^
the
vehicle
is
busy whenever the
total averaged inventory Z{t)
while busy, the vehicle uses the
DS mode whenever
Z{t)
<
Zi
TSP
Our
diffusion control
the ELSP, the
is
in
goal in this subsection
ELSP
of the triple threshold
policy corresponds to using large lot sizes
to using small lot sizes: large lot sizes
vehicle here) in a
more
Z{t)
>
Zs;
to find the
is
with setup times
Markowitz,
in
that the most general form of the optimal policy to the
problem (64)-(65)
DS
when
idles
{zi,Z2.,Zs).
Motivated by our analysis of the stochastic
Reiman and Wein, we conjecture
and
23,
routing scheme whenever Z{t) € [^1,22)? and uses the
or Z{t) G [22,23).
optimal values for the parameters
<
form described above. In terms of
and the
TSP scheme corresponds
and the DS policy both use the server (which
efficient fashion.
The most
general form of the
Markowitz, Reiman and Wein corresponds to the
ELSP
is
the
solution derived
Figure 5 of
triple threshold policy (see
that paper), and in some cases (see Figure 6 of that paper) the solution could be described
with fewer thresholds.
We conjecture that
a double threshold policy with 22
the
we
TSP
poficy {zi
=
—00,
22
=
=
-^3,
the
IRP
solution
either a triple threshold policy,
is
or a single threshold policy, which could
23) or the
DS
policy (21
=
22
=
23 or 21
=
=
22
be either
—00). In §7
describe the rationale behind our conjecture.
Our
analysis of the triple threshold policy requires knowledge of the stationary distri-
bution of the controlled reflected diffusion process and the long run expected average idleness
rate.
The
stationary distribution 7r(x) must satisfy the following system of differential equa-
tions (see Karlin
and Taylor 1981):
^^^(a:)-/x(x)—
a'^{x)
d
-j-T^[x)
z
-
7r(.T)
=
x<23,
ii{x)w{x)
=
x
=
ax
(66)
(67)
23,
where the dilFusion parameters are given by
K-) = l'^
fiD
[
if-e[2„22)
it
X < zi OT x e
^^^
[22,23)
a^x) =
\i
[
35
afj
if-^[-i'-^)
\t
x
<
zi
ov
X e [22,23)
The only continuous
density that satisfies (66) and (67)
7r(a;)
=
i
.
is
k^he^''^^-''''
if
x G (21,22]
k-^C'DC^'^^^-''^
if
a;
G
(68)
,
(22,23]
w here
Ux
k,=
^
^
i>x(l
+
e^T{'2-zi)gUD(z3-:2)
-
e'^r{z2-2i)) 4- j)£,(e'^r(z2-zi)
_
and
1)
j>^e''7-(22-Zl)gi>D(23-22)
A;.-,=
i>T{l
this
is
+
e''^(^2-2i)e^D(23-22)
—
-E
lim
(-.00
t
and taking the
first
_
1)'
-
<
—
>
00 we have that
Y{t)
lim -E[Z{t,^{t))].
(69)
term on the RIIS of (69) corresponds to the asymptotic growth rate of the netput
process under the proposed policy.
time that the vehicle uses the
DS
If
we
let S
=
=
=
r
K[x)dx
J — CO
(1
-
6)fiT
+
SuD.
is
(70)
t
the definition of the triple threshold policy,
6
//m(_oo6(i) denote the long run fraction of
routing policy, then the long run growth rate
lim -E[X{t,^{t))]
i—'OO
left
limit as
!^^-^<+^(^^TO
t
lim -E[X{t,^{t))]
The
i>p(e''r(22-2i)
turn to the expected idleness rate. Taking expectations on both sides of (64),
rearranging terms, dividing through by
By
_(-
the stationary density for the total inventory process.
Now we
The
e''T(22-2i))
+
/''
K{x)dx
it
follows that
=
k^
+
^-,(1
-
e''^^''^-''')).
J Z2
side of (69) can be expressed as
)'-<''^S£)^'^^
36
(71)
Finally, according to the steady state distribution for Z{t) in (68), z
=
limt_oo E[Z{t)] exists
Hence, the second term on the right side of (69) vanishes in the
and
is
this
term and substituting (70) and (71) into
finite.
y
=
We may now
Canceling
(69) gives
V^[i-
;un-^Elnt)] =
limit.
,i_„2j;_,,,„,^,,,^
write an expression for the steady state cost of the
)
(72)
IRP under
threshold dynamic routing policy as a function of the control parameters.
the triple
The problem
is
hence reduced to finding {zl,Z2,zl) that minimize
/•'2
/'I
F{zi,Z2,Z3)=
gD{x)Tr{x)dx
J — (X>
+
/"ZS
gT{x)-K{x)dx
+
J Zl
Unfortunately, F(zi, 22,23)
is
gD{x)Tr{x)dx
I
-
ry.
(73)
J Z1
rather complicated and a closed form solution for the
optimal control parameters does not seem possible. Furthermore, an explicit expression for
F(2i, 22,23)
not available in general because
is
between [aT^f^r) and
(cidiI^d),
its
exact form depends on the relationship
which are the parameters that define the three characteristic
regions for the cycle placement and inventory cost solutions.
parameters, one
5.3
The
=
get either 13d
The Proposed
first
step in
dynamic IRP
Wi
may
y/n
Zi
is
> ^t
or f^o
<
By
considering different problem
Pt-
Policy
mapping the
solution of the diffusion control problem into a policy for the
to obtain unsealed threshold levels.
If
we
and make the parameter scaling substitutions
define the unsealed thresholds as
as in the fixed routing cases, then
the scaling factor n cancels out of the expression for
F{wi,W2,uJ3)
and what remains
is
= \/nF(^=., --=,—=),
\/n Wn Jn
(74)
a function only of the original system parameters. In §6.2, the thresholds
minimizing (74) are determined numerically, and we
refer to these
cost-minimizing thresholds
as {wl,wl,wX).
We now
describe
mode, the DS mode or
how
these thresholds dictate whether the vehicle employs the
idles;
this decision
is
37
made
at the
epochs when the vehicle
TSP
is
at
the warehouse. Because of the translation of the embedded total inventory process into the
averaged total inventory process
shipping option.
If
DS was most
inventory q in (55) and compare
the TSP/DS/idle policy.
and
in (63),
it
TSP
If
we must keep
recently used then
track of the most recently employed
we
calculate the unsealed averaged
to the unsealed threshold levels {wl,W2,w'^) to determine
was most recently used then we define q by combining (35)
(63):
9
= E^.(^r)-'/. + EvJ''' +
(75)
'7,
w here
rj
is
= ^f,= ^-~'£X,el'''
(76)
.
the unsealed translation constant. In this case the TSP/DS/idling decision
by comparing q
in (75) to
the unsealed threshold levels {w'[,W2.,w^).
allocation decisions (which retailer to visit next in the
to each retailer in the
we now use the value
TSP
DS
case and
TSP
determined
Finally, the detailed
how many
case) are determined exactly as in Sections 2
of q in (75) in the
is
and
units to deliver
3,
except that
case.
Computational Results
6
This section contains a series of computational experiments aimed at assessing the accuracy
of the
heavy
traffic analysis
and determining what aspects of the control policy are most
important for good system performance. The computational study
the fixed routing
6.1
have
IRP
simulation experiments performed
five retailers
r equal to zero
We
and Poisson demand processes.
and concentrate on the inventory
in this
cost.
fixed so that Aj
=
A/5, A2
=
A/10, A3
=
A/10,A4
38
=
subsection consider systems that
also set the transportation cost rate
The
obtain different utilization rates; however, the fraction of
is
described in two parts:
IRP and the dynamic routing IRP.
Fixed Routing
The Monte Carlo
is
total arrival rate A
demand
A/5 and A5
is
varied to
represented by retailer
=
2A/5.
The
i
travel time
random
109t
The mean
variables T,j are iid second order Erlang.
= V
travel times are adjusted so that
always holds; this allows us to consider several vehicle sizes while maintaining the
traffic intensity at
We
=
pT
O.IA.
perform four simulation experiments aimed at various aspects of system perfor-
mance.
Experiment
The
1:
obtained under the
TSP
first
set of simulation runs quantifies the cost
policy by recalculating the cycle placement at each retailer, as
opposed to determining these values only once per cycle
We
warehouse).
times ^01
=
^so
let bi
=
=
b
=
^r/lO, ^12
5, hi
=
=
h
=
^34
^23
with a pentagon structure, where the
and the warehouse
is
located
=
l
=
for
^45
midway between
of three vehicle sizes (100,10,5)
i
=
=
1, ...
(i.e.,
,5
first
more with
is
at the
and consider the mean
stations
1
and
travel
5.
which are generated by the different combinations
and three
traffic intensities (0.5,0.7,0.9);
For
all
notice that
cases with px
three replications of 36,000 time units (starting with an
discarding the
vehicle
placed at the vertices of a pentagon,
of these scenarios grossly violate the heavy traffic conditions.
three
when the
^t/5- These travel times are consistent
five retailers are
We consider nine different scenarios,
we simulated
improvement
=
<
0.7,
empty system and
2000 time units) with cycle placement recalculation at the
calculation only at the warehouse; for the px
some
retailers
and
0.9 instances, the length of
each replication was increased to 240,000 time units (discarding the
first
20,000 time units).
This simulation design was used throughout our study and allowed us to keep the standard
deviation of the cost estirhate under
Table
1
summarizes the
1%
of
its
mean.
results of the experiment.
when
the increase in the average inventory cost
The
entries in the table represent
delivery sizes are calculated only at the
warehouse, and not adjusted over the course of the tour. As predicted by heavy
the advantage obtained by recalculation at the retailers becomes quite small
intensity
is
high (about
1% when pr =
with small vehicle sizes at lower
0.9).
when the
traffic
However, the recalculation advantage increases
traffic intensities.
Kumar, Schwarz and Ward, who
traffic theory,
These observations complement those
in
focus on this issue (calculating delivery allocations once
per cycle or at each retailer) using a
much
different model. All subsequent
39
TSP
simulations
,'Pd=0.63
/ yPD=0.45
1/
rpD=0-8I
10%
Figure
2: Sensitivity of
inventory cost to base stock level.
44
A
=
9.0
to (64)-(65). In three of the 36 cases the
tion algoritlim
poHcy generated by the Markov chain approxima-
of the triple threshold form; in the remaining 33 cases, the solution
is
is
a
degenerate form of the triple threshold policy that can be described by one or two thresholds.
Table 5 specifies the proposed triple threshold policies
for the three cases,
scheme
is
used
terms of the unsealed inventory)
(in
DS
along with the proportion of time that
is
used.
As expected, the
TSP
an interval containing zero.
in
The Analytical Double Threshold
Because the numerical results
Policy.
in
33 of the
36 cases can be described by one or two thresholds, and because the Markov chain procedure
only offers an approximate solution to problem (64)-(65)
independent of the heavy
traffic
parameter
[^i, 23)
and
idles
when
equal to Z3 in the results
Z{t)
>
we turn
n),
The parameters
23.
in §5.2; in particular,
<
Z{t)
Zj
the solution
is
not
an analytical derivation of the
to
DS when
double threshold policy, whore the vehicle uses
^(0 ^
(in particular,
Zi,
and
the function
F
uses
TSP
routing
by setting
Zj are derived
in (74)
when
22
must be minimized.
For the symmetric cost, wedge topology cases, this function can take eight different possible
functional forms, and a steepest descent
this function; see
Appendix B
riie results for
policy
is
optimal
of 6'
is
degenerate
in 6 cases.
in
of
Rubio
method was used
for
complete
the 36 cases are presented
22 of the 36 cases, with the
In the remaining 14 cases
minimum
of
details.
Table
in
to find the global
TSP
6.
The
derived double threshold
and DS
policy optimal in 16 cases
where two thresholds are required, the value
often very close to zero, suggesting that the
TSP
policy
is
close to optimal.
\\ liilc
results are not reported here, similar findings continued to hold for different values of the
backordering-to-holding cost ratio b/h, as well as for wider or narrower wedges
(i.e.,
for other
values of 601 /0 12)-
For the 33 cases where the Markov chain procedure generated a single or double
threshold solution, these solutions matched the analytical solutions
hence, the
Markov chain
solutions are not included hrrr.
Markov chain procedure generated a
the
Markov chain
triple threshold policy,
in
Table 6 very
closely;
For the three cases where the
we used simulation
to
compare
policy in Table 5 to the analytically derived policy in Table 6. Table 7 shows
that the double threshold solution out pci forms the policy generated
46
i)y
the
Markov chain
—
V=
\ =
r
to deliver full loads to either a single retailer (direct shipping or
(TSP)
tour,
we avoid the combinatorial complexities
DS) or along a prespecified
inherent in the problem and maintain a
sharp focus on the crucial tradeoff between inventory costs and transportation costs that
at the heart of the IK P.
problem
offers
Our modeling
dynamic stochastic IRP
of the
lies
as a queueing control
a new perspective on the problem: rather than view the IRP as a variant of
the vehicle routing problem,
we
see
it
as a variant of a production/inventory control
problem
(where the capacitated vehicle plays the role of the production system); a^ such, this paper
is
a natural descendant of Wein and Markowitz, Rciman and Wein, which consider more
conventional production/inventory control problems.
By assuming that
the system
we approximate the queueing
TSP
is
operating in the (suitably defined) heavy
traffic
control problem by a diffusion control problem.
regime,
When
only
tours are allowed, this modeling approach, together with the application of a heavy
time scale decomposition, allows us to
traffic
fully characterize the solution to the diffusion
By assuming
control problem, thereby generating an operating policy for the original system.
the existence of a fixed sequence of retailer visits than can achieve constant inter- deli very
times to each retailer
in
the fluid limit, we perform a similar analysis for the
control policy in both cases
is
which
which
specifies
how many
retailer to visit next in the
sit idle
or set out with a
full
load,
is
diffusion control
analyzed.
The
problem
is
and a dynamic allocation
units to leave off at each retailer under a
DS
TSP
scheme, and
scheme.
We also consider the case where dynamic route selection
The
case.
characterized by a vehicle idling policy, which dictates whether
a vehicle at the warehouse should
policy,
DS
(either
TSP or DS)
is
allowed.
solved numerically and a class of triple threshold policies
Finally, a series of simulation studies
is
performed to complement the heavy
traffic analysis.
Our key
•
findings can be
The inventory component
summarized
as follows:
of the total long run average cost
characteristics of the system, while the transportation
scheme
is
determined solely from
first
moment
50
depends on the stochastic
component
information.
for
a fixed routing
•
The
vehicle idhng
idles at the
poHcy
is
characterized by an aggregate base stock level: the vehicle
warehouse whenever the
total retailer inventory exceeds a certain threshold
The value of the optimal base stock
level.
level
is
independent of the transportation cost
Although the existing IRP literature does not typically address the vehicle
rate.
issue,
our simulation results show that system performance
value of the base stock level, deteriorating rapidly
is
idling
quite sensitive to the
when the base stock
level differs
from
the optimal value by more than the vehicle capacity. Moreover, simulation results also
confirm that the system cost under our derived base stock levels are typically within
several percent of the cost achieved by the best (found by exhaustive search using
simulation) base stock level, unless the heavy
equals 0.5 and vehicle capacity
(e.g., traffic intensity
•
The
traffic
allocation of load
among
the retailers
is
conditions are grossly violated
is less
than or equal to 10
units).
dictated by the desire to concentrate most
of the total inventory (backorders) at the site with the smallest holding (backorder)
cost rate.
•
Dynamic
(i.e.,
state-dependent or closed-loop) delivery allocations greatly outperform
their static (state-independent or open-loop) counterparts in a stochastic environment.
In fact, central limit theorem arguments indicate that static delivery allocations cause
the absolute value of the inventory levels to grow as the square root of time, thereby
leading to unbounded costs over the long run.
•
The
TSP
relative
tour, as
advantage of recalculating the load allocation at each
opposed to setting
utilization increases,
•
The
once at the beginning of each tour, decreases as
in the
heavy
traffic limit.
policy that achieves the lowest transportation cost
largest
amount per
fore, direct
fact
and vanishes
it
is
shipping
is
the one that delivers the
unit time travelled (subject to meeting average
is
retailer within a
demand). There-
the most transportation-efficient routing scheme; although this
a trivial consequence of the triangle inequality,
it is
perhaps our most important
observation. This fact helps highlight the basic cost tradeoff in the IRP:
51
DS
leads to
TSP
smaller transportation costs, hut
may
deliveries,
routing, by
demand
for
is,
where the
rates to a level
TSP
routing scheme has a traffic
DS scheme
Moreover, our analysis highlights the danger
in
minimizes current inventory costs
(for
has an intensity
less
in-
than one.
employing a myopic policy that always
example, have each outgoing vehicle satisfy
backorders as possible, regardless of location); such a policy, which
in spirit
has
any given problem instance, one can
tensity greater than or equal to one and the
many
DS
lead to smaller inventory costs. This result also implies that
a larger stability region than TSP; that
increase the
making smaller and more frequent
is
«is
similar
(and consequence) to a production lot-sizing policy that frequently breaks a
setup in order to satisfy backordered demand, can easily become unstable.
•
Heavy
traffic analysis
backorder costs,
•
DS
shows that
will
costs
policies.
cost-symmetric systems with sufficiently high
be preferred to
Simulation results show that there
DS and TSP
for
is
Although the
TSP
routing.
often a large difference in performance between the
traffic intensity,
backorder costs and transportation
play a significant role, the topology probably plays the largest role in the
all
relative attractiveness of each policy. For systems with relatively high loads,
that
TSP
could only be a desirable alternative
when the tour
is
it
appears
wedge-shaped, as
is
often the case in practice.
•
The heavy
traffic analysis
TSP
or fixed
schemes.
accurately predicts the relative cost of using the fixed
DS
Hence, our procedure can be used as an aid in higher level
decisions, as discussed at the end of this paper.
• If
one can dynamically choose between the
the most general form of the solution
Wi
it
<
is
W2
in
<
Wj-.
if
the interval
where W3
is
is
DS and TSP
then
TSP
the idling threshold, then
is
is
is
than wi then
preferred.
the absolute value of the total retailer inventory
52
less
preferred and
DS
we conjecture
that
a triple threshold policy characterized by
the total retailer inventory
[101,102)
options,
is
if it is
Our
in
DS
is
preferred,
the interval
rationale
is
large then the routing
if
[102,^^3)1
as follows:
if
scheme may
have
little effect
DS may
on the rate at which inventory costs are incurred.
be preferable because
DS
-efficiency of
incurs smaller transportation costs. In addition, the
it
has a tendency to increase the total inventory, level relative to
the diffusion control problem the
and
DS
so
zero, as
in
is
it
will help to
TSP
option has a larger drift than the
decrease future backorders. However,
is
much
when the
TSP
(in
option),
less
than
total inventory
the interval [iui,W2), which should contain zero in the nondegenerate case, the
TSP
lead to less backorders and smaller inventory costs,
making
the more attractive alternative. Finally, because the effective penalty for using the
inefficient
TSP
when the
policy decreases
most cases the optimal solution
pohcy, where W2
ELSP
We
=
This state of
W3.
correspond to
DS
is
large,
we
believe that in
be no more complex than a double threshold
affairs
is
somewhat analogous
to the stochastic
(TSP).
computed the numerical
to the
will
total inventory
with setup times analyzed in Markowitz, Reiman and Wein, where large (small)
lot sizes
•
DS
be even more attractive when the total inventory
will
frequent deliveries of
it
In tlicse cases,
dynamic IRP
for a
solution to the diffusion control problem corresponding
number
of instances,
and the
results
were consistent with
our conjectures: the most general optimal policy was of the triple threshold form, and
in
most cases a degenerate form of the pohcy was optimal: either the
{wi
=
i«2
= —00
or
lOi
double threshold policy
we
=
W2
(102
=
=
did not find a numerically
W3), the fixed
TSP
case
{ivi
=
fixed
=
—00, W2
DS
case
w^) or the
^3)- Moreover, in our limited simulation experiments,
computed
triple threshold policy that
outperformed
the analytically derived double threshold policy (although we did not search beyond
the computed triple threshold values).
simulation,
of the
we
By performing many exhaustive
also found that the best double threshold policy differed
two fixed routing
this range, the best
policies in only a
TSP and DS
searches using
from the better
narrow range of system parameter space;
policies achieve fairly similar
in
performance. Hence,
coupling this observation with a previous one suggests that finding the best fixed
route policy
is
very important while allowing for dynamic routing provides a
53
much
less substantial benefit; this
is
particularly true in light of the increased complexity of
implementing a dynamic routing scheme.
In
summary, the important operational
level,
levers for the
IRP iiulude the aggregate base stock
the dynamic allocation policy and the choice of fixed routing scheme, but not the dy-
namic routing
policy.
Moreover, these key decisions are interrelated and a unified stochastic
control model, such as the one considered here,
required for achieving reliable system
is
performance.
Two
topics for future research naturally
dynamic routing scheme
so as to allow
K
come
TSP
Although
tour through xetailers
in
2
1,
and
The
different types of routes
to consider cyclic routes that use a combination of
of a
to mind.
3,
DS and TSP
first
(where
(e.g.,
is
to extend the
K
>
2)
and/or
a cycle could consist
followed by a direct shipment to retailer 2).
theory these extensions could be incorporated and system improvements could
be achieved, the analysis would be tedious and
is
it
doubtful that any additional insights
would be found.
Perhaps the most
fruitful area for future research
would be to develop the necessary
steps for a hierarchical approach to the general (multi-vehicle, multi-warehouse) IRP; such
an approach would be similar
Levi, but
the vehicle routing analysis performed by Simchi-
in spirit to
would also incorporate the inventory cost component. Our
any system given a particular assignment
policies provide estimates for the operating cost for
of retailers to vehicles
and vehicles
results for fixed route
to warehouses.
Motivated by our observation that the
best fixed route policy performs nearly as well as the best dynamic policy over a broad
range of parameters, the
optimization algorithm
first level
(e.g.
up
in
the hierarchy could implrmont
a k-opt algorithm as used
in
some interexchange
the detenninistic vehicle routing
literature) to find the best such route. Higher levels in the hierarchy could then be used to
select the best possible
assignment of vehicles and
to have in the system.
At an even higher
number and
level,
location of warehouses.
54
retailers,
and the
total
number
of vehicles
these results could be used to decide on the
Acknowledgment
This research was supported by a grant from the Leaders for Manufacturing Program at
MIT, NSF grant DDM-9057297 and EPSRC grant GR/J71786. The
Statistical
Laboratory at the University of Cambridge
work was carried
out.
55
last
author thanks the
for its hospitality
while some of this
References
Ackoff, R. L. 1987.
OR, A Post Mortem. Operations Research
M. Dalberto, M.
35, 471-474.
Bell,
W.,
P.
Prutzman. Improving the Distribution of Industrial Gases with an On-Line Comput-
.J.
L.
erized Routing
and Scheduling Optimizer. Interfaces 13,
L.
M.
A., A. Federgrucn
Algorithms
tical
for
Macj and
4-23.
Convergence of Probability Measures.
Billingsley, P. 1968.
Chan,
Fisher, A. Greenfield, R. Jaikumar, P. Kedia, R. G.
.Joliii
Wiley and Sons,
New
York.
and D. Simchi-Levi. 1994. Probabilistic Analyses and PracGraduate School of Business, Columbia
Inventory-Routing Models.
University.
Coffman, E. G.
Jr.,
Switchover Times:
A. A. Puhalskii and
M.
Reiman. 1995a. Polling Systems with Zero
I.
A Heavy- Traffic Averaging
Principle.
Annals of Applied Probability
5,
681-719.
Coffman, E.G., A. A. Puhalskii and M.
Times:
Dror,
A
I.
Reiman. 1995b. Polling Systems with Switchover
Heavy-Traffic Averaging Principle.
M. and M.
Ball.
AT&T
Bell Laboratories,
Murray
Hill,
NJ.
1987. Inventory Routing: Reduction from an .Annual to Short Period
Problem. Naval. Res. Log. Quarter/y 34, 891-905.
Federgruen, A. and Z. Katalan.
1994.
The
Stochastic Economic Lot Scheduling Problem:
Cyclic Base-Stock Policies with Idle Times. Graduate School of Business, Columbia University,
New
York,
NY. To appear
in
Management
Federgruen, A. and D. Simchi-Levi.
1992.
Inventory-Routing Problems. Handbooks
in
Science.
Analytical Analysis of Vehicle Routing and
Operations Research an Management Science,
56
M.
the volume on Networks and Distribution.
Nemhauser,
Ball,
T. Magnanti, C.
Monma
and G.
eds.
Federgruen, A. and
P. Zipkin.
A Combined
1984.
Vehicle Routing and Inventory Allocation
Problem. Operations Research. 32, 1019-1037.
Golden, B.L. and A. A. Assad (Editors).
1988.
Vehicle Routing:
Methods and
Studies.
North-Holland Publishers.
Harrison, J.M. 1988. Brownian Models of Queueing Networks with Heterogeneous Customer
In Stochastic Differential Systems, Stochastic Control
Populations.
tions,
IMA Volume
10,
W. Fleming and
P.L. Lions (eds.).
Theory and AppHca-
Springer- Verlag,
New
York,
147-186.
Harrison, J.M. 1985. Brownian Motion and Stochastic
New
York.
Iglehart, D. L.
in
Flow Systems. John Wiley and Sons,
and W. Whitt. 1970. Multiple Channel Queues
Applied Probability.
Karlin
S.
Press, Inc.
Kumar,
in
Heavy
Traffic
I.
Advances
2, 150-177.
and H. M. Taylor.
A
1981.
Second Course
in Stochastic Processes.
Academic
San Diego, CA.
Schwarz and
Ward.
1995.
Route Using a Dynamic Inventory-allocation
Policy.
A., L. B.
Kushner, H.
J.
1977. Probability
Elliptic Equations.
Academic
J.
E.
Methods
Press,
New
for
Risk-Pooling Along a Fixed Delivery
Management Science 41, 344-362.
Approximations
York,
in Stochastic
Control and for
NY.
Kushner, H.J. and P.G. Dupuis. 1992. Numerical Methods for Stochastic Control Problems
in
Continuous Time. Springer- Verlag.
New
York, NY.
57
Larson, R. C. 1988. Transporting Sludge lu the 106-Mile Site:
for Fleet Sizing
An Inventory/Routing
iModel
and Logistics System Design. Transportation Science 22, 186-198.
Markowitz, D. M., M. L Reiman and
Heavy
Scheduling Problem:
L.
M. Wein.
Traffic Analysis of
199-5.
Dynamic
The
Stochastic Economic Lot
Cyclic Policies.
Sloan School of
Management, MIT, Cambridge, MA.
A Markov
MinkofF, A. S. 1993.
Decision Model and Decomposition Heuristic for
Dynamic
Vehicle Dispatching. Operations Research 41, 77-90.
Peterson,
W.
P. 1991.
A Heavy
Traffic Limit
Theorem
for
Networks of Queues with Multiple
Customer Types. Mathematics of Operations Research 16, 90-118.
Reiman, M. L 1984. Open Queueing Networks
in
Heavy
Traffic.
Mathematics of Operations
Research 9, 441-458.
Rubio, R. 1995. Dynamic-Stochastic Vehicle Routing and Inventory Problem. Ph.D. Thesis,
Operations Research Center, MIT, Cambridge,
Hierarchical Planning for ProbabiHstic Distribution Systems in Eu-
Simchi-Levi, D. 1992.
clidean Spaces.
Trudeau,
P.
and Route
MA.
Management Science 38,
198-211.
and M. Dror. 1992. Stochastic Inventory Routing: Route Design with Stockouts
Failures.
Wein, L.M. 1992.
Transportation Science 26, 171-184.
Dynamic Scheduling
of a Multiclass Make-to-Stock Queue.
Research. 40, 724-735.
5 7
03
10
1
58
Operations
Date
B/^EMEHJ
^^LL-
Lib-26-67
=
MIT LIBRARIES
3 9080 00987 9013
Download