Using a mathematical programming model to examine the marginal price

Int. J. Production Economics 131 (2011) 383–391
Contents lists available at ScienceDirect
Int. J. Production Economics
journal homepage: www.elsevier.com/locate/ijpe
Using a mathematical programming model to examine the marginal price
of capacitated resources
Ali Kefeli, Reha Uzsoy , Yahya Fathi, Michael Kay
Edward P. Fitts Department of Industrial and Systems Engineering, North Carolina State University, Campus Box 7906, Raleigh, NC 27695-7906, USA
a r t i c l e in fo
abstract
Article history:
Received 13 August 2008
Accepted 29 March 2010
Available online 14 April 2010
Accurate information on dual prices of capacitated resources is of interest in a number of applications,
such as cost allocation and pricing. To gain insight we focus on the dual prices of capacity and demand
in a single-stage single-product production-inventory system, and discuss their interpretation. In
particular, we examine the behavior of two different production planning models: a conventional
linear programming model and a nonlinear model that captures queuing behavior at resources in an
aggregate manner using nonlinear clearing functions. The classical linear programming formulation
consistently underestimates the dual price of capacity due to its failure to capture the effects of
queuing. The clearing function formulation, in contrast, produces positive dual prices even when
utilization is below one and exhibits more realistic behavior, such as holding finished inventory at
utilization levels below one.
& 2010 Elsevier B.V. All rights reserved.
Keywords:
Production planning
Linear programming
Clearing functions
Dual prices
Marginal prices
1. Introduction
While linear programming (LP) models have been used
extensively for production planning problems since the 1950s
(e.g., 1974; Missbauer and Uzsoy, in press), their limitations have
been increasingly discussed in recent years. Since these
approaches do not explicitly model work-in-process (WIP)
inventories, as opposed to finished intermediate inventories
between stations, they assume that resources can operate at full
utilization while maintaining fixed cycle times, which contradicts
both empirical observations and predictions of queuing models.
Another anomaly is that a resource can have a nonzero dual price
only when the associated capacity constraint is tight or, in
queuing terminology, utilization of that resource reaches one.
However, it is well known from industrial experience and queuing
models that manufacturing performance measures such as the
expected WIP and cycle time increase nonlinearly with utilization.
This suggests that with an appropriate objective function,
additional capacity can improve system performance even when
utilization is less than one, implying a nonzero dual price for the
resource at lower utilization levels. Since most production
systems are governed to a large degree by queuing behavior, this
inconsistency between the LP models used for capacity allocation
among products and the queuing models used to describe
Corresponding author. Tel.: + 1 919 513 1681; fax: + 1 919 515 5281.
E-mail addresses: akefeli@ncsu.edu (A. Kefeli), ruzsoy@ncsu.edu (R. Uzsoy),
fathi@ncsu.edu (Y. Fathi), kay@ncsu.edu (M. Kay).
0925-5273/$ - see front matter & 2010 Elsevier B.V. All rights reserved.
doi:10.1016/j.ijpe.2010.03.018
steady-state behavior poses an interesting research direction that
has not been explored extensively.
In this paper we examine the behavior of an alternative
mathematical programming model based on nonlinear clearing
functions (CFs) suggested by Karmarkar (1989) in a simple singlestage single-product production-inventory system. While models
of this type have been successfully applied to multi-product
multi-stage systems (e.g., Asmundsson et al., 2006, 2009), we
focus on a single-stage single-product system in order to maintain
the paper’s focus on insights into the different dual prices
obtained by the two models.
Our results show that the CF model explicitly captures the
nonlinear relationships between performance measures and
system workload, providing a more accurate representation of
actual capabilities of the production system. We derive
closed-form expressions for the dual prices of capacity and
demand and illustrate qualitative differences in both production
plans and dual prices between those from the CF model and the
conventional LP model with numerical examples. We conclude
the paper with a summary and some directions for future work.
2. Previous related work
The use of mathematical programming models for production
planning has a long history, dating back to the seminal work of
Modigliani and Hohn (1955). The majority of these are LP
formulations, where the planning horizon is divided into discrete
time periods and two principal sets of constraints are used to
384
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
represent the operation of the production system. One set of
constraints represents conservation of material flow into and out
of finished goods inventories (FGIs) at the end of each planning
period, assuming that production and demand rates are uniform
within each period. A second family of constraints limits the total
amount of work a resource, such as a machine, can perform in a
period. A wide range of such models has been presented in the
literature over the years (e.g., Hackman and Leachman, 1989;
Johnson and Montgomery, 1974; Voss and Woodruff, 2003).
However, these LP models have a number of drawbacks. Since
they assume all production rates are constant in a planning
period, they do not model WIP at all, and do not assign it a cost in
the objective function. Hence they allow resources to change their
production rates with complete flexibility as long as utilization is
less than one. For the same reason, when backlogging is not
allowed, these models will hold FGI only when demand exceeds
resource capacity in some period. Although queuing models and
industrial experience suggest that when capacity is fully utilized,
very high cycle times and WIP levels will result, the LP models
assume that resources can maintain a fixed cycle time independent of utilization. Finally, they will only permit nonzero dual
prices only for capacity constraints when that constraint is tight
at optimality, implying a utilization level of one for that resource
during that period.
The marginal or dual price of a production resource with limited
capacity, which represents the potential change in objective function
as additional capacity becomes available, is of interest in several
situations. One example is in short-term scheduling and dispatching,
where a number of authors have proposed dispatching rules based
on cost–benefit ratios, where the value of processing a job at a given
time is contrasted with the marginal price of the resource consumed
by scheduling it at that time (e.g., Morton et al., 1988). Yet another
related area is in the estimation of setup costs. It can be argued that
many of the costs related to setup changes, such as manpower, are
fixed relative to the decision whether or not to set up at a particular
point in time. In many cases, the principal cost of the setup is the lost
revenue due to output not being produced during the setup. If
utilization is low and demand can still be met when the setup is
performed, the lost revenue, and thus cost of the setup, will be zero.
However, if utilization is high the opportunity cost of a long setup
can be quite substantial. Another potential use of the marginal price
of capacity has been suggested in the accounting literature
(e.g., Kaplan and Thompson, 1971), where overhead costs are
allocated among products based on their consumption of capacity at
key resources.
The elegant duality theory associated with LPs (e.g., Bazaraa
et al., 2004) directly yields the marginal price of a capacitated
resource in a planning period as the optimal value of the dual
variable associated with that capacity constraint. However, by the
LP optimality conditions, a dual variable will only take a nonzero
dual value when the associated capacity constraint is tight at
optimality, i.e. when the resource utilization is equal to one. This
suggests that there is no benefit to the objective function from
adding capacity to resources that are not fully utilized. However,
this contradicts observations from industrial practice, simulation models, and queuing models of manufacturing systems
(e.g., Buzacott and Shanthikumar, 1993; Hopp and Spearman,
2001). All these suggest that performance measures of a
production system, especially WIP levels and cycle times, increase
nonlinearly with resource utilization, and that significant
increases in these quantities can be observed even at utilization levels substantially below one. Since many manufacturing
resources such as machines exhibit queuing behavior, this inconsistency between the production planning models used for
capacity allocation over time and the queuing models representing the behavior of resources is troubling.
The queuing results would appear to suggest that even at
utilization levels below one, adding capacity to the system ought
to result in improved performance due to reduced WIP levels and
cycle times under at least some conditions. This, in turn, implies
that we ought to observe nonzero dual prices on capacity
constraints at utilization levels less than one. A number of
authors (Graves, 1986; Karmarkar, 1989; Srinivasan et al., 1988)
have proposed mathematical programming models for resource
allocation that exhibit such behavior.
Several alternative approaches to estimating the dual prices of
production resources have been proposed in the literature.
Morton et al. (1988) propose a dispatching rule for job shop
scheduling based on a cost–benefit ratio, where the cost of a unit
of machine time at a particular point in time is estimated as the
increase in tardiness penalties caused by a differential delay in
machine availability. The latter estimates are obtained by
simulating the shop to the end of the current busy period, the
time at which the machine will be idle. Roundy et al. (1991)
propose an alternative approach, where an aggregate planning
model is used to estimate costs of performing a particular
operation at a particular point in time; the dispatching module
then dispatches jobs based on these costs. The costs are estimated
as values of the Lagrange multipliers associated with machine
capacity constraints in each time interval. Hoitomt et al. (1993)
present a similar formulation for job shop scheduling, where time
intervals are reduced in length, giving a dual price for each
individual time unit. These approaches focus on estimating the
dual price of capacity, noting that the dual price of a resource will
depend on the state of the system at that point in time.
Another approach to estimating the dual price of capacity has
been through queuing models, yielding an average price over a
long period in which the system is in steady state. Banker et al.
(1986) use an M/G/1 queuing model of a production system with
lot sizing to examine the effects of adding new products to an
existing facility, focusing on the costs of holding WIP. They study
the marginal benefits of adding capacity and show that it is
beneficial to add capacity at all levels of utilization, but that the
benefit decreases as utilization decreases. This, in our context,
corresponds to a dual price of capacity that increases with
utilization. Morton and Singh (1988) approach the same problem
using a variation of the busy period methodology from Morton
et al. (1988), and show that under steady state assumptions they
obtain the same dual prices for capacity as Banker et al. (1986).
It is important to bear in mind that the nature of a dual price will
depend on the objective function or performance measure of the
model from which it is derived. The short-term dispatching and
scheduling approaches outlined above, as well as the steady-state
queuing models, do not consider the effects of FGI that is held due to
insufficient capacity to meet demand in a specific time period. The
LP models of production planning, on the other hand, explicitly
consider the possibility of meeting future demand by building FGI in
earlier periods, but do not consider the effects of resource utilization
on queue lengths within the production system. Clearly both types
of costs, those due to WIP accumulating in the system as well as FGI
that is held to address temporary capacity shortfalls, are relevant,
and hence a model that integrates both would appear to be
desirable. In this paper we explore this issue for a single-product
single-stage production-inventory system, which has not been done
before to the best of our knowledge.
3. Classical LP formulation
In this section we present the classical LP production planning
model we use as a benchmark. For simplicity of exposition and
without loss of generality, we shall assume that all production
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
initiated in a planning period can be completed in that period; for
a treatment of the case where cycle times may exceed one period
see Hackman and Leachman (1989). We shall adopt the simple
objective function of minimizing the total production and
inventory holding cost over a planning horizon of T periods. We
shall denote the demand in period t by Dt, and the available
number of machine hours in period t by Ct. Since we have only a
single product, we assume without loss of generality that each
unit of the product requires 1 h of machine time. If we define Xt to
be the amount of production in period t and It to be the amount of
FGI on hand at the end of period t, we obtain the following
well-known formulation (e.g., Johnson and Montgomery, 1974):
T
X
minimize
ðpt Xt þht It Þ
ð1Þ
t¼1
subject to
Xt r Ct
8t
ð3Þ
8t
ð4Þ
where pt and ht denote the unit cost coefficients of production and
FGI holding. We can rewrite this formulation to eliminate the
inventory variables by noting that
It ¼ I0 þ
t
X
t
X
Xt t¼1
Dt
t¼1
where I0 is the ending inventory at time 0. Making this
substitution for It in the objective function and discarding
constants that are independent of the decision variables yields
the following aggregate formulation:
"
!#
T
T
X
X
Xt pt þ
ht
minimize
ð5Þ
t¼t
t¼1
subject to
t
X
Xt Z
t
X
Dt I0
t¼1
t¼1
Xt r Ct
8t
Xt Z 0
8t
CFs, proposed by Graves (1986), Karmarkar (1989), and
Srinivasan et al. (1988), express the expected throughput of a
capacitated resource over a given period of time as a function of
workload of the resource over that period, which, in turn, is
defined by the amount of work available for the resource to
process in that period. For now we shall use the term ‘‘WIP’’ and
the generic variable W to denote any reasonable measure of the
WIP inventory level, whether carried over from a previous period
or released during the current one.
To motivate the use of a nonlinear CF, it is helpful to begin with
a single resource that can be modelled as a G/G/1 queuing system
in steady state. The expected number in system, or, equivalently,
the expected WIP, for a single server is given by Medhi (1991) as
ðst Þ
ðca2 þcs2 Þ r2
þr
2
ð1rÞ
ð9Þ
ð2Þ
8t
Xt ,It Z0
4. Clearing function formulation
W¼
It ¼ It1 þ Xt Dt
385
8t
ðgt Þ
ð6Þ
ð7Þ
ð8Þ
The absolute value of the dual price of constraint (7)
represents the rate of reduction in the objective function with
added capacity in period t.
Some observations on this model are in order. The model does
not represent WIP in any form; material is released to production
and emerges into FGI in the same period. In addition, and of more
interest to this paper, by the complementary slackness conditions
for LP optimality, capacity will only have a nonzero dual price in a
given period if constraint (7) is satisfied at equality. As Banker
et al. (1986), Srinivasan et al. (1988), and others have pointed out,
this is not an accurate representation of the actual situation for
resources governed by queuing behavior, and arises from the LP
model’s failure to represent WIP correctly. As pointed out by
Missbauer and Uzsoy (in press), this problem remains even when
lead times in excess of one period are modelled; in period t,
only the portion of the WIP that was released in period t–L is
available for processing, where L is an exogenous parameter. The
formulation in the following section addresses these issues.
where ca and cs denote the coefficients of variation of interarrival
and service times, respectively, and r is the utilization of the
server. Setting c ¼ ðca2 þ cs2 Þ/2 and rearranging (9) we obtain a
quadratic in W, whose positive root yields the desired r value.
Solving for r with c 41, we obtain
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ðW þ1Þ2 þ 4Wðc2 1ÞðW þ 1Þ
ð10Þ
r¼
2ðc2 1Þ
which has the desired concave form. When 0rco1, the other root
of the quadratic will always give positive values for r. When c¼1,
(9) simplifies to yield r ¼W/(1+W) again of the desired concave
form. If we use utilization as a surrogate for output, we see that for a
fixed c value, utilization, and hence throughput, increases with WIP
but at a declining rate. Utilization, and hence output, decreases with
c due to variability in service and arrival rates.
Several authors discuss the relationship between throughput and
WIP levels in the context of queuing analysis, where the quantities
studied are long-run steady-state expected throughput and WIP
levels. Agnew (1976) studies this type of behavior in the context of
optimal control policies. Spearman (1991) presents an analytic
congestion model for closed production systems with increasing
failure rate processing time distributions that describes the relationship between throughput and WIP. Hopp and Spearman (2001)
provide a number of illustrations of CFs for a variety of systems.
Srinivasan et al. (1988) derive the CF for a closed queuing network
with a product form solution. In addition to the queuing approaches
outlined above, clearing functions can also be estimated using
industrial data, or data obtained from a simulation model of the
system under study, as suggested by Karmarkar (1989), Asmundsson et al. (2006), and Srinivasan et al. (1988). These methods involve
collecting observations of average resource workload and throughput and then fitting a clearing function to these data.
Different methods of generating CFs for optimization models
have different advantages and disadvantages. A CF estimated by a
steady-state queuing model assumes at least approximate steady
state behavior of the queue being modelled in a planning period,
which may be problematic in systems with long cycle times at
individual resources. On the other hand, while CF estimation
through simulation and industrial data do not require steady state
assumptions, they are only as valid as the simulation model or data
they are derived from and will require updating as products or
processes change. The development of methods for deriving and
validating non-steady state clearing functions remains an active area
of research (Asmundsson et al., 2009; Selc- uk et al., 2007; Riaño,
2003). An extensive discussion of the issues related to clearing
functions and their use in production planning models is given by
Pahl et al. (2005) and Missbauer and Uzsoy (in press). In this paper
386
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
we shall assume that an appropriate clearing function can be
obtained using techniques suggested in these references, and focus
on the behavior of the resulting planning models.
Our starting point is the single-product single-stage model of
Karmarkar (1989). Let us define the following decision variables:
Xt ¼ number of units of the product produced in period t,
Rt ¼ number of units of the product released into the stage at
the beginning of period t,
Wt ¼number of units of the product in work in process (WIP)
inventory at the end of period t,
It ¼ number of units of the product in finished goods inventory
(FGI) at the end of period t.
Let f(.) denote the clearing function and Dt the demand for the
product (in units) in period t. Then Karmarkar’s formulation is as
follows:
minimize
T
X
Zt ¼ 1
8t
ð17Þ
Although these additional variables and constraints may seem
redundant, they help us to better characterize the dual prices of
capacity in the CF formulation. Following Asmundsson et al.
(2006) and Missbauer (2002), we approximate the CF f(.) using
outer linearization and incorporate (16) and (17) into our
formulation, yielding the following LP:
T
X
minimize
ðrt Rt þ wt Wt þ pt Xt þ ht It Þ
ð18Þ
t¼1
subject to
ðrt Rt þwt Wt þ pt Xt þht It Þ
ð11Þ
t¼1
subject to
Wt ¼ Wt1 þ Rt Xt
It ¼ It1 þXt Dt
Wt ¼ Wt1 þRt Xt
8t
ð12Þ
It ¼ It1 þ Xt Dt
8t
ð13Þ
Xt rf ðWt1 þRt Þ
8t
ð14Þ
Rt ,Wt ,Xt ,It Z 0
In a multi-product formulation, these variables are defined for
each product and period to allocate the available output of a
production resource among different products that use it. By
definition, the summation of these variables over all products for
a given resource should be equal to 1. Since we have a single
product, we add one additional constraint to our formulation:
ð15Þ
where rt, wt, pt, and ht denote the unit cost coefficients of raw material
releases, WIP holding, production, and FGI holding, respectively, and T
is the final period in the planning horizon. We follow Karmarkar in
writing our CF as a function of the sum Wt 1 +Rt, the resource load for
period t, or, in other words, the total amount of work that becomes
available for processing during the period. Since the formulation
distinguishes between WIP and FGI, flow conservation constraints
(12) and (13) are required for both quantities, which serve distinct
purposes. In periods when demand cannot be satisfied due to limited
capacity, it can be met with FGI carried over from previous periods.
WIP, on the other hand, accumulates in front of the resources as jobs
in queue and in process. This approach differs from traditional LP
models since it links the expected throughput of the resource in
period t to the resource load (incoming WIP and new releases) in that
period.
The solution to the CF formulation provides the decision maker
with a time-phased release plan, i.e. how many new jobs to
release to the production resource in each period, specified by the
decision variables Rt. We assume that job releases are under
complete control of the decision maker and that raw material
availability is not a constraint. This allows us to study the dual
prices of capacity in isolation from the effects of external factors,
allowing more intuitive results to be obtained. Another contrast to
the LP formulations such as those of Hackman and Leachman
(1989) is that lead times do not appear in the formulation;
releases and lead times are jointly optimized, allowing lead times
to vary over the planning horizon.
At this point, in order to obtain a more complete characterization
of the dual prices, we introduce an additional set of decision
variables to our formulation, motivated by the multi-product CF
model of Asmundsson et al. (2006). We define
Zt ¼fraction of the maximum output in period t, as defined by
the clearing function, allocated to the product,
and modify (14) to include these decision variables as follows:
Xt rZt f ðWt1 þ Rt Þ
8t
ð16Þ
ð19Þ
8t
ð20Þ
Xt r ac ðWt1 þ Rt Þ þ Zt bc 8c; t
ð21Þ
Zt ¼ 1
ð22Þ
8t
Rt ,Wt ,Xt ,It ,Zt Z 0
8t
8t
8t
ð23Þ
where ac is the slope and bc the intercept of segment c of the
piecewise linearized CF. The first linear segment (c¼1) will have a
slope of one (a1 ¼1) since at low utilization all work in the system
can be completed within a planning period, and an intercept of
zero (b1 ¼0), since production cannot take place without some
WIP being present. The last linear segment (c¼n) will have a slope
of zero (an ¼ 0) since increasing the load of the resource will
not increase output. To capture the concavity of the clearing
function, successive segments will have strictly increasing intercepts ðb1 o b2 o b3 o o bn Þ and strictly decreasing slopes
ða1 4 a2 4 a3 4 4 an Þ as seen in Fig. 1.
Eliminating the Wt and It variables in a similar fashion to that
used for the LP model in the previous section, making appropriate
substitutions in the objective function and capacity constraints,
and defining boundary conditions, we obtain the following
formulation that will be the basis of our analysis:
! #
"
!
T
T
T
X
X
X
minimize
rt þ
wt Rt þ pt þ
ðht wt Þ Xt
ð24Þ
t¼t
t¼1
t¼t
subject to
t
X
t¼1
t
X
Xt Z
Dt I0
8t
ðgt Þ
ð25Þ
t¼1
t
X
Xt þ
t¼1
t
X
Rt Z W0
8t
ðGt Þ
ð26Þ
t¼1
Output
(Xt)
2
n
1
n
f(x)
2
1
Resource Load
(Wt-1 + Rt)
Fig. 1. A typical CF and outer linearized segments.
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
t1
X
ac
Xt Xt þ ac
t¼1
t
X
Rt Z ac W0 Zt bc
ðsct Þ
ð27Þ
t¼1
X1 þ ac R1 Z ac W0 Z1 bc
Zt ¼ 1
8c,t a1
8t
Rt ,Xt ,Zt Z 0
ðsc1 Þ
8c
ð28Þ
ðlt Þ
ð29Þ
8t
ð30Þ
where the dual prices associated with each constraint set are
given in parentheses. We now proceed to analyze the dual of this
formulation.
by K ¼ fs, s þ1, s þ2, . . . ,s0 g such that Ws 1 ¼0, Wt 40 for all tAK,
and Ws0 + 1 ¼0.
We first note, by the complementary slackness condition of LP
optimality, that Wt 40 implies that Gt ¼0 for all tAK in the
optimal dual solution. Assume that for all tAK we have Rt 40,
which in turn implies that Xt 40, i.e. if we release any work in
period t, then there must be some production in that period, since
otherwise we can release the work in a later period and save the
WIP holding cost. Under these assumptions, constraints (32) and
(34) (and constraint (33), if TAK) are satisfied as equalities for all
tAK, i.e.
T
X
T
X
gt t¼t
¼ pt þ
ð31Þ
gt t¼t
T
X
c
ðht wt Þ
t ¼ tþ1 c
8t aT
ðXt Þ
ð32Þ
X
scT r pT þ hT wT ðXT Þ
ð33Þ
c
T
X
Gt þ
t¼t
lt þ
X
T
X
t¼t c
t¼t
ac sct rrt þ
c
b sct r0 8t ðZt Þ
wt
8t
ðRt Þ
ð34Þ
lt free 8t
ð38Þ
T X
X
T
X
t¼t c
t¼t
wt
8t A K
ð39Þ
ð40Þ
We now use these relationships to derive expressions for the
optimal values of sct, gt, and lt in a congested interval. For brevity
P
let Ft ¼ cacsct and rewrite (39) as
T
X
T
X
Gt þ
Ft ¼ rt þ
t¼t
T
X
wt
8t A K
ð41Þ
t¼t
Writing (41) for successive periods from s0 to s and solving
recursively yields
8t A K\fs0 g
Ft ¼ ðrt rt þ 1 Þ þ wt
ð35Þ
Under time stationary
pt ¼ p 8tÞ this simplifies to
ð36Þ
Ft ¼ w 8t A K\fs0 g
ð37Þ
To derive the values of gt, we return to (38) with the additional
assumption that Rs0 + 1 40. We can then write (39) for period t+ 1
and substitute that in (38) to obtain
!
T
T
T
T
X
X
X
X
X
gt Gt sct rt þ 1 þ
wt Gt
c
gt , Gt , sct Z0 8t
8t A K
ac sct ¼ rt þ
Gt þ
t ¼ s0 þ 1
T X
X
ðht wt Þ
ac sct
t ¼ tþ1 c
c
t¼t
gT GT c
If TAK the corresponding expression for t ¼T is
X
gT scT r pT þhT wT
T
X
X
X
Gt sct ac sct r pt
T
X
t¼t
þ
T
X
t ¼ s0 þ 1
subject to
T
X
T
X
T
X
X
sct t¼t
c
t¼1
t¼1
X
Gt t ¼ s0 þ 1
5. Analysis of the CF formulation
The dual of this formulation (24)–(30) is given by
"
!
#
T
t
X
X
X
minimize
Dt I0 gt W0 Gt ac W0 sct þ lt
387
Note that any production plan will consist of congested periods
where Wt 40, non-congested periods where Wt ¼0 and Xt 40, and
idle periods where Wt ¼0 and Xt ¼0. Since there is no work release,
production, or WIP in idle periods, they are not of great interest. In
non-congested periods, production takes place but no WIP is
carried from one period to the other. This is possible if the system
operates at low utilization such that all work released into the
system is processed in the same period, implying that the first
segment (c ¼1) of the CF is tight (see Fig. 1). In congested periods,
on the other hand, all work available to the resource cannot be
completed in one period due to congestion, and some is carried
over to the next period as WIP. This means the system is operating
at higher utilization and a segment of the CF with index c 41 is
tight. (In certain extreme cases, where demand falls very sharply
in a short period of time, it is possible to have a congested period t
with Xt ¼0 due to the presence of high WIP levels that was needed
to meet high demand in earlier periods but for which demand no
longer exists. In the remainder of this paper we shall assume
demand is sufficiently regular to avoid this extreme behavior.)
We will assume W0 ¼0 throughout our analysis since feasibility can be ensured by an appropriate I0. As stated earlier, our
objective is to explore the behavior of dual prices in the context
of this LP model. To this end, we first examine the behavior of
the system in a congested interval, defined as a collection of
consecutive congested periods, starting with a period s and
ending with a period s0 4s. We denote such a congested interval
t¼t
¼ pt þ
ðrt ¼ r, wt ¼ w, ht ¼ h, and
ð42Þ
c
t ¼ s0 þ 1
T
X
costs
ðht wt Þ
t ¼ tþ1
t ¼ s0 þ 1
8t A K
t¼t
T
X
t¼t
gt ¼
X
T
X
c
t¼t
sct þ pt þ rt þ 1 wt þ
ht
8t A K
ð43Þ
Following the same recursive approach for (43) as we used for
(41) we obtain
X
X
gt ¼ ht þ
sct sc,ðt þ 1Þ þ ðpt pt þ 1 Þðwt wt þ 1 Þ
c
c
þðrt þ 1 rt þ 2 Þ
8t A K\fs0 g
which under time stationary costs simplifies to
X
X
gt ¼ h þ
sct sc,ðt þ 1Þ 8t A K\fs0 g
c
ð44Þ
c
We now concentrate on (42) and (44). To obtain more general
forms of these relationships, we note that in an optimal solution,
for a given period t, no more than two segments of the CF can be
tight. For the time being, without loss of generality, let two
segments c*(t)oc*(t)+ 1 of the CF be tight in the optimal solution
for a given period t. By complementary slackness, the dual prices
388
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
of segments c that are not tight in an optimal solution in period t
must be equal to zero, i.e. sct ¼ 0 8c ac ðtÞ and c ac ðtÞ þ 1.
Hence
X
Ft ¼
ac sct ¼ ac ðtÞ sc ðtÞ,t þ ac ðtÞ þ 1 sc ðtÞ þ 1,t
ð45Þ
c
X
sct ¼ sc ðtÞ,t þ sc ðtÞ þ 1,t
ð46Þ
c
If we also assume that there are two tight segments of the CF
in period t + 1 we obtain the following general forms of (42) and
(44):
Ft ¼ w ¼ ac ðtÞ sc ðtÞ,t þ ac ðtÞ þ 1 sc ðtÞ þ 1,t
ð47Þ
gt ¼ h þ½ðsc ðtÞ,t þ sc ðtÞ þ 1,t Þðsc ðt þ 1Þ,t þ 1 þ sc ðt þ 1Þ þ 1,t þ 1 Þ
ð48Þ
Recall that (47) and (48) both hold for a period t such that
t A K\fs0 g, i.e. with Wt 40 by definition and with the assumption
Rt 40, which implies Xt 40. Expression (48) also requires
Rs0 + 1 40.
If we further assume positive FGI carried over from period t to
period t+ 1, i.e. It 40, by complementary slackness applied to (25),
we will have gt ¼ 0. Thus (48) will reduce to
h ¼ ðsc ðt þ 1Þ,t þ 1 þ sc ðt þ 1Þ þ 1,t þ 1 Þðsc ðtÞ,t þ sc ðtÞ þ 1,t Þ
t A K\fs0 g
ð49Þ
This implies that as long as work is released and FGI is carried
in consecutive periods in a congested interval, the sum of the
nonzero dual prices of capacity increases by h for each
consecutive time period the system holds FGI.
It is clear that sct is related to the dual price of capacity.
However, the fact that two segments of the CF may be tight in a
given period renders an intuitive expression hard to obtain. We
now argue that the dual variables lt are a more informative
estimate of the dual price of capacity in our CF formulation. An
optimal solution to our CF formulation will have at most two
segments of the CF tight, yielding two different sct values for a
given period. While these values are governed by (42), they do not
provide a single number that can be used as the dual price of
capacity. However, the extra decision variables Zt in Eq. (16)
multiply the output corresponding to the resource load; therefore
an arbitrarily small increase in the right hand side of (29) will
increase our capacity as well for all linear segments related to that
period. Therefore, the dual variable associated with (29), lt,
provides one unique number, which represents the dual price of
capacity in a given period for the CF model. To derive values for lt,
note that in any given optimal solution we need to have Zt ¼1.
Therefore, by complementary slackness applied to (35) we have
X c
ð50Þ
b sct ¼ lt
c
This expression defines one dual price of capacity for each
period in a clearly structured fashion that incorporates all positive
sct values. An added advantage of (50) is that it does not require
any assumptions and is valid for any given optimum solution.
To sum up our findings so far, the dual price of capacity in a
congested interval is jointly defined by expressions (47), (49), and
(50). On reflection, this is intuitive—the dual prices depend on the
shape of the clearing function embodied in the slopes ac and
intercepts bc of the linear segments, the holding cost w of WIP,
and the holding cost h of FGI. Suppose we have a demand spike in
some period that exceeds the capacity of the system. Since
backlogging is not allowed, both the CF and the LP models will
produce earlier and carry inventory in the periods prior to the
spike. If the system operates at high enough utilization to create
a congested interval, our assumptions for (49) hold and the
marginal benefit of an additional unit of capacity in a period
increases. Additional capacity becomes more and more desirable,
i.e. has a higher dual price, as we get closer to the demand spike;
an additional unit of capacity later in the congested period will
save us from having to hold inventory in the first period of the
congested interval and carry it until the current period.
In contrast to the LP model, however, the decision to hold
FGI in the CF model is linked to the holding cost of WIP. The LP
model will not hold FGI until capacity in a period is fully utilized
since WIP costs are not modelled. In the CF model, however, the
amount of WIP required to maintain high levels of utilization will
eventually require very high levels of additional WIP for very little
increase in output. In this situation, eventually the cost of
additional WIP will exceed the cost of carrying FGI for several
periods. Thus the CF model may hold FGI at utilization levels less
than one.
We now examine the behavior of the system in non-congested
periods.
6. Non-congested periods
So far, we have studied congested periods, where we have
Wt 4 0 and Gt ¼ 0. For completeness, we will now consider noncongested periods (see Table 1), where the system works just in
time—utilization is low, so work is released and processed into
output in the same period without WIP being carried between
periods.
Using (21) for simplicity of exposition, we consider a noncongested period with demand dt. Since backlogging is not
allowed, Xt ¼dt. By the definition of a non-congested period, only
the first segment of the CF is tight, yielding
Xt rWt1 þ Rt
8t=
2K
ð51Þ
and an optimal solution will give Wt ¼Wt 1 ¼ It ¼0 and
Rt ¼Rt +Wt 1 ¼Xt ¼dt. Hence this is a highly degenerate solution,
where all four constraints (25), (26), (27), and (29) are tight. In
this situation the critical binding constraint is (25), the demand
constraint, since until demand increases to the point that the
second segment of the CF becomes tight, there is no benefit to
additional capacity or releases. This fact is clearly reflected in
(50)—since in a non-congested period only first segment of the
CF is tight we have sct ¼ 0; 8c a1 and b1 ¼ 0. This clearly
implies lt ¼0. Therefore, the non-congested periods do not yield
any interesting insight on the dual price of capacity, following the
behavior of the LP model when demand is below capacity and no
inventory is required in the future.
7. Numerical examples
In order to illustrate the differences in the dual information
obtained from the LP and the CF models under different
conditions, we examine a numerical example with one product,
one resource, and a planning horizon of T¼50 periods. We
consider the demand scenario shown in Fig. 2, where demand
remains below the theoretical capacity in the first 30 periods, and
then increases to exceed capacity several times in the latter part
Table 1
Different cases and corresponding variable relationships.
Idle periods
Non-congested periods
Congested periods
Utilization
Wt
Xt
Rt
It
0
Low
High
¼0
¼0
40
¼0
40
40
¼0
40
Z0
¼0
40
Z0
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
Fig. 2. Demand data for numerical examples.
389
Fig. 4. Dual prices from Scenarios 1 and 2.
Table 2
Parameters of linear functions approximating CFs.
CF segment c
1
2
3
4
5
6
7
8
Scenario 2
Scenario 3
Scenario 4
ac
bc
ac
bc
ac
bc
1
0.5
0.1
0
0
10
12
15
1
0.44
0.11
0
0
1.67
6.67
15
1
0.74
0.51
0.33
0.18
0.08
0.02
0
0
0.31
1.23
2.76
4.9
7.65
11.02
15
Fig. 3. CFs used in numerical examples.
of the horizon. In all cases we assume costs and the CF are
constant over time.
We solve four different scenarios and compare the dual price
of capacity obtained from each. All scenarios assume the same
demand pattern and theoretical capacity of ct ¼15 units per
period, but differ in the way capacity constraints are implemented. In Scenario 1 we make use of the classical LP formulation with
a fixed capacity of 15 units as seen in Fig. 2. In the remaining three
scenarios we use the CF formulation with different settings.
Scenario 2 uses a CF with 4 segments, whereas Scenario 3 utilizes
the same number of segments to represent a more congested
resource (a CF that is more gradual). Scenario 4 uses the same CF
as Scenario 3, but uses 8 linear segments, making our outer
linearization a more exact representation of the underlying
nonlinear concave function. Table 2 lists the CF parameters and
Fig. 3 illustrates the three different CFs used.
We solved each scenario using the ILOG OPL (2007) environment. The dual prices obtained are plotted in Figs. 4 and 5. For the
classical LP model, we report st and for CF models we report lt.
Since in the first half of the horizon the demand never reaches
the maximum possible output in a period, the conventional LP
model in Scenario 1 yields dual prices of zero for capacity in all
Fig. 5. Dual prices from Scenarios 3 and 4.
periods in this portion of the horizon. In contrast, the dual prices
yielded by all CF model Scenarios 1–3 are positive whenever
production takes place, and increase or decrease based on
resource utilization even though utilization is well below one.
This suggests that at utilization levels below one, the conventional
LP representation of capacity provides limited dual information.
In Fig. 4 we observe that during periods 27–39 and 40–47,
when utilization is high, CF dual prices are consistently higher
than those from the LP model when FGI is present. In these
periods the capacity constraint of the LP model is tight, and in
Scenario 2, we use the dual prices corresponding to the segment
c¼2 being tight. It is interesting, however, that the dual prices
from the two scenarios coincide quite closely. At first sight, this
appears anomalous; at high utilization we would expect long
queues to form, and given the ability of CF models to capture this
effect, the dual price of capacity to be considerably higher in CF
models. In fact, a careful investigation of Fig. 5 reveals that as we
add more segments to the piecewise linearization (Scenario 3
versus Scenario 4) the CF model becomes more sensitive to the
queuing behavior and dual prices start to climb. These observations suggest that Scenario 2 underestimates the dual price of
capacity in regions of high utilization due to the piecewise linear
approximation and limited number of segments.
Examining the output from the first and second scenarios in
Fig. 6 we see that the LP model runs longer at full capacity, while
the CF model elects to hold FGI and run below full capacity in
periods 28–33 and 40–48. This latter behavior is frequently
observed in real systems, where full utilization is never achieved
but inventories are frequently present.
These observations suggest that in order to obtain more
accurate dual prices, the piecewise linearization of the CF must be
more detailed in the region corresponding to higher utilization
levels. For intuition, we can consider expression (47) for the dual
prices when a single linear segment is tight at optimality. Since
the slope ac ðtÞ of the clearing function is in the denominator,
small changes in this parameter, especially as it approaches zero
with higher values of Wt 1 +Rt, will lead to large changes in the
390
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
Fig. 6. Production (Xt) comparison of Scenarios 1 and 2.
dual price. Hence the use of a limited number of linear segments
will lead to underestimation of the dual price of capacity at high
utilization levels, despite the fact that previous studies have not
shown this to be a major issue when CF models are used purely
for production planning.
8. Managerial insights
Our results suggest that in congested periods, the dual price of
capacity in the CF model depends on the shape of the CF and the
holding costs of WIP and FGI in a tightly coupled manner.
The shape of the CF in our model embodies the characteristics of
the production resource under consideration, effectively determining how much WIP is required for a given output level.
Clearly, the dual price of capacity increases as the resource
utilization increases, since both b and 1/a increase with utilization. Moreover, the dual price of capacity increases linearly when
FGI is held in congested periods. In non-congested periods, where
resource utilization is low, the solution is degenerate, and the CF
model does not yield any interesting insights on dual behavior.
Relative to the body of work summarized earlier that utilizes
dual prices of capacity in decision-making, the main feature of our
paper is to point out the deficiencies of traditional LP models in
estimating the value of capacity. We further show that CF
formulations provide richer dual information that is consistent
with queuing models and industrial experience. Therefore,
decision makers should be aware of the problems associated
with certain production planning models that do not capture
queuing effects. This is of particular interest in applications where
dual prices of resources may be informative, such as estimation of
setup costs, dispatching, and overhead allocation mentioned
earlier.
9. Conclusions
We have presented a comparison of the dual prices for
capacity yielded by a conventional LP model and a model based
on clearing functions under some simplifying assumptions. The
dual prices generated by the CF model have the desirable property
that positive dual prices are obtained at utilization levels less than
one, while the conventional model yields zero prices. On the other
hand, the use of piecewise linear approximation to the clearing
function can result in underestimation of true prices at high
utilization levels. Our results also show that the CF model exhibits
more realistic behavior than the classical LP model, holding FGI at
utilization levels of less than one.
The results in this paper show that the CF model effectively
addresses several of the conceptual difficulties associated
with classical LP models of production facilities. Extensive
experimental studies in multi-stage, multi-product environments,
i.e. Asmundsson et al. (2006) and Missbauer (2002), have shown
that when the CF is fitted appropriately, the CF models yield
significantly more realistic results than classical LP models. In
more complex production-inventory systems with multiple
stages and multiple products, multiple interactions between
WIP, demand, CF shape, and finished goods inventories between
production stages render concise analytical expressions for dual
prices hard to obtain, although numerical values of dual prices are
straightforward. A systematic study of the relationships between
prices obtained from our CF model and those obtained by other
methods, such as the queuing-based approaches of Banker et al.
(1986) and Morton and Singh (1988), is also an interesting
direction for future work.
Acknowledgements
The research of Reha Uzsoy was partially supported by the
National Science Foundation under Grant number DMI-0559136
and a research grant from the Intel Research Council. The research
of Yahya Fathi was partially supported by the National Science
Foundation under Grant number DMI-0321635.
References
Agnew, C., 1976. Dynamic modeling and control of some congestion prone
systems. Operations Research 24 (3), 400–419.
Asmundsson, J.M., Rardin, R.L., Turkseven, C.H., Uzsoy, R., 2009. Production
planning models with resources subject to congestion. Naval Research
Logistics 56, 142–157.
Asmundsson, J.M., Rardin, R.L., Uzsoy, R., 2006. Tractable nonlinear production
planning models for semiconductor wafer fabrication facilities. IEEE Transactions on Semiconductor Manufacturing 19, 95–111.
Banker, R.D., Datar, S., Kekre, S., 1986. Relevant costs, congestion and stochasticity
in production environments. Journal of Accounting and Economics 10,
171–197.
Bazaraa, M.S., Jarvis, J., Sherali, H.D., 2004. Linear Programming and Network
Flows. John Wiley, New York.
Buzacott, J.A., Shanthikumar, J.G., 1993. Stochastic Models of Manufacturing
Systems. Prentice-Hall, Englewood Cliffs, NJ.
Graves, S.C., 1986. A tactical planning model for a job shop. Operations Research
34, 533–552.
Hackman, S.T., Leachman, R.C., 1989. A general framework for modeling
production. Management Science 35, 478–495.
Hoitomt, D., Luh, P.B., Pattipati, K.R., 1993. A practical approach to job shop
scheduling problems. IEEE Transactions on Robotics and Automation 9 (1),
1–13.
Hopp, W.J., Spearman, M.L., 2001. Factory Physics: Foundations of Manufacturing
Management. Irwin/McGraw-Hill, Boston.
ILOG, 2007. ILOG OPL-CPLEX Development System. Retrieved January 12, 2008
from /http://www.ilog.com/products/oplstudio/S.
Johnson, L.A., Montgomery, D.C., 1974. Operations Research in Production
Planning, Scheduling and Inventory Control. John Wiley, New York.
Kaplan, R.S., Thompson, G.L., 1971. Overhead allocation via mathematical
programming models. The Accounting Review 46 (2), 352–364.
Karmarkar, U.S., 1989. Capacity loading and release planning with Work-inProgress (WIP) and lead-times. Journal of Manufacturing and Operations
Management 2, 105–123.
Medhi, J., 1991. Stochastic Models in Queuing Theory. Academic Press.
Missbauer, H., 2002. Aggregate order release planning for time-varying demand.
International Journal of Production Research 40, 688–718.
Missbauer, H., Uzsoy, R. Optimization models for production planning. In: Kempf,
K.G., Keskinocak, P.,Uzsoy, R. (Eds.), Planning Production and Inventories in the
Extended Enterprise: A State of the Art Handbook. Springer, Norwell, MA, in
press.
Modigliani, F., Hohn, F.E., 1955. Production planning over time and the nature of
the expectation and planning horizon. Econometrica 23 (1), 46–66.
Morton, T.E., Lawrence, S.R., Rajagopalan, S., Kekre, S., 1988. SCHED-STAR: a pricebased shop scheduling module. Journal of Manufacturing and Operations
Management 1, 131–181.
Morton, T.E., Singh, M.R., 1988. Implicit costs and prices for resources with busy
periods. Journal of Manufacturing and Operations Management 1,
305–332.
Pahl, J., Voss, S., Woodruff, D.L., 2005. Production planning with load dependent
lead times. 4OR: A Quarterly Journal of Operations Research 3, 257–302.
A. Kefeli et al. / Int. J. Production Economics 131 (2011) 383–391
Riaño, G., 2003. Transient Behavior of Stochastic Networks: Application to
Production Planning with Load-Dependent Lead Times. School of Industrial
and Systems Engineering, Georgia Institute of Technoogy, Atlanta, GA.
Roundy, R.O., Maxwell, W.L., Herer, Y.T., Tayur, S.R., Getzler, A.W., 1991. A pricedirected approach to real-time scheduling of production operations. IIE
Transactions 23 (2), 149–160.
Selc- uk, B., Fransoo, J.C., de Kok, A.G., 2007. Work in process clearing in supply
chain operations planning. IIE Transactions 40, 206–220.
391
Spearman, M.L., 1991. An analytic congestion model for closed production systems
with IFR processing times. Management Science 37 (8), 1015–1029.
Srinivasan, A., Carey, M., Morton, T.E., 1988. Resource Pricing and Aggregate
Scheduling in Manufacturing Systems. Graduate School of Industrial Administration, Carnegie-Mellon University, Pittsburgh, PA.
Voss, S., Woodruff, D.L., 2003. Introduction to Computational Optimization
Models for Production Planning in a Supply Chain. Springer, Berlin,
New York.