Evaluating Plans through Restrictiveness and Resource Strength Federico Pecora , Amedeo Cesta

Evaluating Plans through Restrictiveness and Resource Strength
Federico Pecora∗ , Amedeo Cesta
Institute for Cognitive Science and Technology
Italian National Research Council
name.surname@istc.cnr.it
Abstract
The planning and scheduling framework which is the object
of this paper consists in a loosely-coupled integration which
is achieved by cascading a classical planner and a general
purpose scheduler. The output of the planning phase yields a
partially ordered plan which is then integrated with time and
resource constraints to produce a scheduling problem. The
research described in this paper is aimed at understanding the
structural trademarks of the causal knowledge that a scheduling tool can inherit from STRIPS-based reasoners. Specifically, we analyze quality of the plans in terms of two wellknown properties of scheduling problems, namely the Restrictiveness and the Resource Strength. To this end, we describe a set of experiments carried out on a series of state-ofthe-art planners aimed at assessing the bias of different planning strategies in the context of the loosely-coupled framework.
Introduction
In recent years, increasing attention has been dedicated to
integrating Planning and Scheduling (P&S), with the aim
of extending the reach of automated solving to combinatorial optimization problems which require both causal and
time/resource reasoning. Integrated P&S frameworks are
typically obtained following two approaches. On one hand,
integrated P&S architectures can be designed using a “contextual” approach, so called because time/resources and the
causal sub-problem are dealt with contextually. This can be
achieved, as for instance in (Gerevini, Saetti, & Serina 2003;
Smith & Weld 1999; Currie & Tate 1991; Ghallab & Laruelle 1994), by expanding a strictly causal solving technique
with algorithms and data structures capable of dealing with
time and/or resources. Other contextual solvers adopt the
opposite strategy, namely starting from a scheduling-centric
approach and extending it with limited causal reasoning capabilities (Cesta, Fratini, & Oddi 2004; Jonsson et al. 2000;
Muscettola et al. 1992). The main drawback of contextual P&S integration strategies is that typically the “enhanced” system does not aquire a full set of capabilities.
∗
Also affiliated with the the Department of Systems and Informatics (DIS) of the University of Rome “La Sapienza”, Rome, Italy
— pecora@dis.uniroma1.it.
c 2005, American Association for Artificial IntelliCopyright °
gence (www.aaai.org). All rights reserved.
This is the case, for instance, of planners which support
PDDL2.1/2.2 (Fox & Long 2003; Edelkamp & Hoffmann
2004), an extension of the standard Planning Domain Definition Language (PDDL (Ghallab et al. 1998)) for modeling
actions with durations, a characteristic which does not alone
afford the versatility of common scheduling problem definition languages.
On the other hand, so-called “coupled” P&S consists
in linking separate implementations of the two solving
paradigms in order to achieve a bi-modular solver, in which
each component deals with the respective sub-problem. This
approach clearly presents some major difficulties since it
requires the definition of forms of information sharing between the two subsystems. Moreover, this type of integration can be achieved by a strong coupling of the two solving
components (i.e., in which every decision taken by one component is “propagated” by the other component), or, on the
other end of the spectrum, by loosely-coupling the two solving sub-systems (i.e., the planner produces a plan and the
scheduler then enforces the time and resource related constraints upon this plan). While a strongly-coupled architecture may seem to be more realistic, there is increasing evidence that often a loose-coupling of solving components is
more applicable to solve certain classes of real-world problems1 .
The P&S framework which is the object of our work consists in a loosely-coupled integration which is achieved by
cascading a classical planner and a general purpose scheduler. The output of the planning phase yields a partially ordered plan which is then integrated with time and resource
constraints. The resulting scheduling problem is then given
to the scheduler for resolution. The research described in
this paper stems from the necessity for an in-depth comprehension of some fundamental aspects related to this form
of integration. In particular, we address the issue of understanding the structural trademarks of the causal knowledge
1
This is the case, for instance, in military planning, where highranking officers take high-level planning decisions, disregarding
issues such as resource allocation or specific synchronization constraints, while the more scheduling-related decisions which must
be taken in order to make a plan operational occur afterwards.
An interesting discussion on this topic was brought up during the
ICAPS Workshop on Integrating Planning into Scheduling, June
2004.
that a scheduling tool can inherit from STRIPS-based reasoners. Thus, we set out to measure the structural properties
of the scheduling problems yielded from the planning phase
in the light of the subsequent scheduling. The present analysis extends the results obtained in (Pecora, Rasconi, & Cesta
2004; Cesta, Pecora, & Rasconi 2004), in which we focused on the bias produced by different planning approaches
(heuristic search and planning graph based) in the light of
makespan-optimizing scheduling. In this paper, our aim is
to assess the bias of the planning phase on two well-known
properties of scheduling problems, namely the Restrictiveness and the Resource Strength. These two structural properties of scheduling problems are commonly used2 as control parameters for the random generation of Resource Constrained Project Scheduling Problems with minimum and
maximum time lags (RCPSP/max) (Brucker et al. 1998).
To this end, we describe a set of experiments carried out
on a series of state-of-the-art planners aimed at generating
scheduling problems which exhibit varying degrees of these
two parameters.
This paper is organized as follows. We first present the
general experimental setup within which our analysis is carried out. To this end, we first describe more precisely the
loosely-coupled P&S framework (an architecture similar to
R EAL P LAN -MS (Srivastava 2000)) which our work focuses
on, as well as the benchmark set on which our analysis is
performed. We then describe the details of the structural
properties we wish to measure, while the section that follows
describes an extention to one of the planners analyzed in the
experiments aimed at increasing the detail of our observations. The main results of this paper consist in analyzing
and comparing the performance in terms of the measures introduced earlier of four state-of-the-art planners. The results
are then put together and further analyzed, and we conclude
with a summary of our findings as well as an outlook for
future work.
Experimental Setup
In an integrated P&S context, time and resource constraints
as well as causal dependencies are contemplated in the initial
problem definition. Every operator is inherently associated
to a time duration and requires a certain quantity of consumable multi-capacity resources that ensure its executability.
The general schema we use in this investigation is as follows (see figure 1). A causal model of the environment is
given as input to a planner, i.e., the domain representation
and the problem definition, both expressed in a STRIPS-like
formalism. This model does not contemplate time and resource related constraints, which are accommodated after
the planning procedure has taken place. In order to produce a problem specification which can be reasoned upon by
the scheduling procedure, the Partial Order Plan (POP) produced by the planner is integrated with time and resource
related information by means of a plan adaptation procedure. This procedure produces a minimal-constrained deordering (Bäckström 1998) of the POP and integrates it with
2
See for instance (Schwindt 1998).
POP
Planner
Info integration
& Deordering
completePOP π
Scheduler
schedule
Planning
problem
p
Time/Resource info
Durations and
Resource Usage
time
Causal model
PDDL Domain
Representation
PDDL Problem
Definition
Figure 1: The loosely-coupled reference framework, in which a
planning problem p is solved by the planner, ultimately yielding a
scheduling problem π.
time and resource related information to produce what we
shall call a completePOP. In more formal terms:
Definition 1. A completePOP P is a tuple
hT , P, R, C, U, Di where
• T = {T1 , . . . , Tn } ∪ {T0 , Tn+1 } is the set of tasks which
correspond to the n activities in the POP produced by the
planner; T0 and Tn+1 are called source and sink activities;
• P is a set of precedence constraints between the tasks,
where Ti ≺ Tj means that task Ti must be completed
before the execution of task Tj can begin;
• R = {R1 , . . . , Rm } is a set of renewable resources;
• C = {C1 , . . . , Cm } are the capacities of the resources;
• U : T × R → N ∪ {0} is the function which determines
the resource usage attributes of a task, that is, U(T, R) =
u if task T uses u units of resource R (u = 0 indicates
that T does not use resource R);
• D : T → N ∪ {0} is the function which determines the
durations of the tasks, that is, D(T ) = d if the duration of
task T is d time units.
According to the definition above, a completePOP coincides with a Resource Constrained Project Scheduling Problem (RCPSP) (Brucker et al. 1998) and can be visualized in
the form of a precedence graph.
A General Multi-Agent Problem Template
Our aim in this paper is to analyze the performance in terms
of plan quality of different planners within the above mentioned loosely-coupled framework. More specifically, we
are interested in assessing the influence of different planning
strategies on the structural characteristics of the scheduling
problems they lead to. This section briefly illustrates the
main features of the benchmark problems employed in the
experimental evaluation.
We focus on a multi-agent inspired planning domain
which produces completePOPs with high levels of concurrency by encapsulating the notion of executing agent. Concurrency is obtained by inducing the presence of multiple
agents in the POP. Having augmented the STRIPS subset
of the PDDL language with some simple extensions to allow the specification of durations and resource usage for the
operators (these directives are ignored by the planner and integrated into the POP to form the completePOP), the general
structure of a problem which leads to completePOPs which
exhibit high degrees of concurrency is shown in figure 2,
where :capacity 0 denotes an object which is not a resource. Given this structure, the number of agents in the
completePOP is determined by the number of objects with
non-zero capacity, i.e., |{A1, . . . , An}|, and the resource usage of each task is given by the :uses clause in the abstract
operator specification.
(define (domain . . . ) . . .
(:action op
:parameters (?a - agent . . . )
:precondition ( . . . )
:effect ( . . . )
:uses (?a USAGE)
:duration DUR) . . . )
(a)
(define (problem . . . ) (:domain . . . )
(:objects
A1, . . ., An - agent :capacity CAP
B1, . . ., Bm - type :capacity 0 . . . )
(:init . . . )
(:goal . . . ))
(b)
Figure 2: General structure of augmented PDDL domain (a) and
problem specifications (b) which lead to completePOPs with multiple agents. Bold typeface indicates directives which are ignored
by the planner.
A very simple example “instantiation” of this somewhat
general template could be the following: a team of robotic
agents can move from one room of an environment to another (so long as there is a door connecting the two adjacent
rooms), and they can perform a simple operation on certain types of objects (iff they are in the same room as
the object). In order to model the fact that agents do not
perform an operation on two objects at the same time,
each action :uses one unit of the resource agent. This
equates to modeling the agents as binary resources for the
scheduler. The idea is that, once the planning has taken place
(disregarding this resource usage information), the scheduler
will enforce the resource usage constraints, adding as a consequence a precedence constraint during the solving phase
which serializes two operations on the same object which
otherwise were independent.
Notice also that it is possible to model this binary re-
source constraint causally, by adding not (busy ?a)
to the preconditions and busy ?a to the effects of action operation. The busy predicate can be seen as a
“causal excuse” for the resource capacity constraint modeled with the :uses clause, and can be reset by a dummy
finish action (whose effect is not (busy ?a)). Notice that in this manner we would be producing scheduling
problems without resource contention peaks, thus transferring the burden of resolving these contentions to the planning phase. It has been shown in (Pecora & Cesta 2002;
R-Moreno et al. 2002) on a very similar domain that adopting this modeling strategy heavily affects the efficiency of
the planning procedure, making it very difficult to solve even
problems of small size.
Restrictiveness and Resource Constraints
There is a consolidated body of work in the field of scheduling which deals with classifying scheduling problems with
respect to various quantifiable properties. Our aim in this paper is to interpret these structural characteristics in the light
of different implementations of the planning phase which
constitutes the first component of the loosely-coupled framework. In this section we present the two well-known structural factors of scheduling problems which we will focus
on. In the remainder of this paper we will use these factors
to measure the quality of the scheduling problems obtained
with a number of state-of-the-art planners.
Order Strength. The first parameter we focus on quantifies the effect of the precedence constraints contained in the
plan (i.e., the causal structure of the scheduling problem) on
the possible execution sequences of the tasks. Our aim is to
measure to some extent the magnitude of the search space
which is explored by the scheduling phase. To this end, we
borrow the notion of restrictiveness from Project Scheduling.
Given a completePOP hT , P, R, C, Di, let l(P) be the
number of linear extensions of the causal structure determined by the precedence constraints in P. The restrictiveness of the completePOP3 can be defined as follows:
l(P)
|T |!
where |T |! represents the maximum number of linear extentions of a partial order in the set of tasks T . The restrictiveness measures the degree to which the precedences
between activities restrict the number of feasible execution
sequences of the tasks. For completely parallel causal structures, the restrictiveness is 0, while it is 1 for serial task networks. In general, the higher σ, the less different alternatives
exist to resolve resource conflicts between tasks (by defining additional precedence constraints). As a consequence,
the restrictiveness is directly linked to the hardness of many
RCPSP/max problems (both with respect to feasibility and
optimization) (Schwindt 1998).
σ = 1 − log
3
The restrictiveness of a directed graph is used as a control
parameter for the random generation of RCPSP/max problems,
e.g. (Schwindt 1998).
Unfortunately, calculating σ is #P-complete, since it involves counting linear extensions (Brightwell & Winkler
1991). As shown in (Mastor 1970), a good approximation
of the restrictiveness is the order strength. Given a completePOP π, this measure can be determined very efficiently
and is defined as follows:
¯ ¯
¯P ¯
OSπ =
(1)
|T | (|T | − 1) /2
i.e., the number of precedence relations, including the transitive ones (P denotes the set of precedence relations in the
transitive closure of the precedence graph), divided by the
theoretical maximum number of constraints. It is shown
in (De Reyck & Herroelen 1996) that OS outperforms other
significant network measures regarding the correlation with
the hardness of RCPSP/max instances.
The OS parameter thus expresses the properties of the
causal structure determined by the planning phase, and has
no connection with the resource constraints contained in the
problem. As we will see, different planning strategies yield
plans which differ in causal structure, thus the planning algorithms themselves are also connected to the hardness of
the resulting scheduling problems instances.
Resource Strength. One of the main features of schedulers consists in the capability to deal with resources in addition to complex time-constraints. In order to understand
the bias produces by different planning strategies on the
resource-related characteristics of the resulting scheduling
problems, we employ another well-known measure, namely
the resource strength4 . Given the k-th resource in the comk
pletePOP, we denote with rmin
the maximum usage of this
resource by a single task in the completePOP, that is
k
rmin
= max U(T, Rk )
T ∈T
is resource-feasible. We call a problem in which Rk adheres to the first requirement k-parsimonious. Notice also
k
is a lower bound for Ck , since it represents the
that rmin
maximum usage of resource Rk by a single task in the completePOP. Therefore, in a k-parsimonious completePOP we
k
k
have that Ck is bounded in the interval [rmin
, rmax
], since
these bounds represent, respectively, the smallest and largest
capacity which can be given to Rk in order for the scheduling problem to be resource feasible with respect to Rk . If
this is the case, then clearly RSk is also bounded in the interval [0, 1], while RSk may be greater than 1 of the completePOP is not k-parsimonious.
In conclusion, the RS parameter takes into account the
resource constraints which are given by the requirements of
the tasks on one hand, and the limited resource capacities
on the other, thus it measures the scarcity of the resource
availability with respect to the requirements. In the following sections, our analysis focuses on the average RS over
the resources which are employed in the completePOP, that
is
P
RSk
RS π = k:Rk ∈Ru
(2)
|Ru |
where Ru ⊆ R is the set of used resources, i.e., for which
U(T, R) > 0 for some T ∈ T . Notice that a completePOP
may not prescribe the use of all resources modeled in the
original problem, since the particular action instantiations
which achieve the goal (the POP produced by the planner)
are obtained by means of a purely causal deduction, and
as such does not explicitly take into account any resourcerelated metric. Indeed, some among the planners we analyze in the following sections over-commit more than others
with respect to resource utilization. This reflects strongly on
the average resource strength: the higher RS π , the less constraining are the capacities of the resources with respect to
the task network.
k
rmax
Also, let
denote the peak demand of resource Rk in the
precedence-preserving earliest start schedule (i.e., the infinite capacity solution). The resource strength of resource
Rk is thus defined as
RSk =
k
Ck − rmin
k
k
− rmin
rmax
where Ck ∈ C is the capacity of resource Rk . The resource strength expresses the relationship between the resource demand and the resource availability in the given
completePOP.
Notice that a scheduling problem in which the capacity
k
Ck of the k-th resource is at most rmax
(i.e., the maximum
potential claim of resource k in the precedence-preserving
earliest start schedule) is such that resource k is “welldimensioned”. Conversely, if Ck exceeds the peak demand
of Rk , then this resource’s capacity is unnecessarily large,
k
since rmax
is an upper bound on Ck such that the problem
4
This parameter was first defined in (Cooper 1976; AlvarezValdez & Tamarit 1989). The definition we use here is taken
from (Kolisch, Sprecher, & Drexl 1995).
While the scheduling literature points to numerous other parameters which could be useful in the context of this investigation, some of which we will be analyzing in future work,
in the following section we observe how the planning phase
affects the two properties defined above.
As a final note, it is interesting to notice that taking into
account “classical” properties of project scheduling problems rather than the strong-coupling constraint density we
have defined in our previous work, allows us to relax the restricting assumptions we have made with respect to resource
capacities and usage. In particular, parameters such as the
two we analyze in this work are defined for any causal structure in the scheduling problem, as well as for multi-capacity
resources and multiple-resource usage constraints.
Enhancing B LACK B OX for Obtaining Multiple
Causal Solutions
Before describing in detail the structural characteristics of
the scheduling problems which we want to observe, we describe an extention to the loosely-coupled framework aimed
at enhancing the detail of our observations. Among the planners used in our experiments, we employ a planning subsystem which is capable of producing multiple plans, thus
yielding, after the information integration and de-ordering
procedure has taken place, multiple completePOPs. Having
more solutions to the causal part of the P&S problem provides us with a richer set of scheduling problems on which
to perform the structural analysis by means of the measures
defined previously. This extends the results obtained in (Pecora, Rasconi, & Cesta 2004; Cesta, Pecora, & Rasconi 2004)
because it enables us to observe how not only different planning strategies but also different methods for extracting solutions within a certain planning strategy can influence the
structure of the underlying scheduling problem. This is particularly true in the case of planning graph based planning,
in which a planning graph contains many valid plans. Moreover, these plans are partitioned into sets according to the
level of expansion at which they are obtained. In other
words, given the minimum level l at which there exists a
solution to the planning problem, instead of obtaining one
scheduling problem π l , we obtain the set of scheduling problems {π1l , π2l , . . .} which derive from the solution subgraphs
of the planning graph5 .
In order to obtain multiple completePOPs in a single
level of planning graph expansion, we have implemented
a modified version of the B LACK B OX planning system,
which we call M B LACK B OX (for multiple-B LACK B OX).
This enhancement is obtained by replacing the Z C HAFF
SAT-solver (Moskewicz et al. 2001) originally integrated
within the B LACK B OX system with M C HAFF, an enumerative variation of Z C HAFF (Moskewicz et al. 2001;
Moskewicz 2004), i.e., which is capable of finding multiple models of a given well-formed formula (see figure 3).
The experiments described in the following sections thus fo{POPl1 , POPl2 , . . .}
G RAPHPLAN
graph Gl
subgraphs {Gl1 , Gl2 , . . .}
Graph2WFF
WFF
models {φ1 , φ2 , . . .}
Info integration
& Deordering
M C HAFF
{π1l , π2l , . . .}
Figure 3: Enhancing B LACK B OX with enumerative capabilities
by replacing Z C HAFF with M C HAFF. Gl denotes the planning
graph expanded up to level l.
5
Moreover, by increasing the level of expansion of the planning graph we can produce the sets of scheduling problems
{π1l , π2l , . . .}, {π1l+1 , π2l+1 , . . .}, . . . , {π1m , π2m , . . .}, where m is
the index of planning graph level-off. The analysis of OS and RS
on further levels of expansion of the planning graph is outside the
scope of this paper, and will be addressed in future work.
cus also on the variation of OS and RS within the sets of
solutions produced by M B LACK B OX.
Experiments
The two parameters shown above represent attributes of
scheduling problems which are widely accepted as meaningful. In the context of this investigation, OS characterizes the causal structure of planner-derived scheduling problems, measuring the restrictiveness of the task network determined by the planning phase, thus indicating the “degree
of freedom” available to the scheduler in resolving conflicts.
RS on the other hand quantifies the constrainedness of completePOPs with respect to the resources allocation determined by the planner. Both OS and RS provide a means
to interpret the effect of the decisions taken by the planner
with respect to the causal and resource allocation problem
represented by the input planning problem.
Benchmark Problems
For the purpose of the experiments described in this section, the general multi-agent domain definition described
earlier was instantiated according to the dependency graph
shown in figure 4. The domain consists in a total of six
operators, each requiring an object as well as a certain
type of agent (type one or two). Moreover, each operator
:uses a particular amount of the executing agent, and has
a given :duration. Our benchmark consists of 100 randomly
this domain
¤ ®in the form
£
­ generated
£ problems ¤within
p = A1 , A2 , C1min , C1max , C2min , C2max , O , where
• A1 (A2 ) is the number of agents of type one (two);
¤
¤ £
£
• C1min , C1max ( C2min , C2max ) is an interval within
which lies the :capacity of every agent of type one
(two);
• O is the number of objects (i.e., with :capacity 0).
Each problem was generated by randomly choosing A1 , A2 ,
O, as well as the capacity of each agent of type one and two
within the corresponding interval. Un-resolvable problems
were avoided by ensuring that the C1min ≥ 4 and C2min ≥ 3.
Notice that the absence of negative effects in this domain make the corresponding planning problems polynomial. This choice is determined by our interest in aspects
related to the solution quality rather than the performance
of the planners we consider. To this end, the 100 generated
benchmark problems are intended to provide an easy set of
problem instances from the planning point of view. Conversely, the additional resource and time related information poses a more elaborate scheduling sub-problem, with
respect to which the tested planners exhibit a varying degree
of commitment.
As we will see in the experimental analysis, the regularity
of this domain introduces a strong dependency of the results
obtained by the various planners on the size of the problem
instance. In addition, notice that the domain is such that we
expect to obtain little variation in the causal structure of the
plans obtained with different planners6 . Nonetheless, as we
6
This domain can be interpreted as a workflow-management application, in which there is no real difficulty in determining the se-
start
2
a2
2
a1
op12
o
op11
o
P&S problem.
The same set of 100 benchmark problems were solved
by FF (Hoffmann & Nebel 2001), LPG (Gerevini, Saetti,
& Serina 2003) and CPT (Vidal & Geffner 2004), and OSπi
and RS πi were also computed for the 100 solutions obtained
from these planners.
Resources
eff1(o)
a2
3
a2
We begin by analyzing the resource related aspects of the
obtained completePOPs. Figure 5 shows the value of RS
for the various planners considered. In particular, only the
first plan produced by M B LACK B OX is considered (i.e., the
original non-enumerative version of B LACK B OX). Recall
2
op3
o
op2
o
1.4
M B LACK B OX
eff3(o)
eff2(o)
4
i
LPG RS πi
1
a1
RS π0
FF RS πi
1.2
CPT RS πi
0.8
op4
0.6
o
0.4
eff4(o)
0.2
a1
2
0
op5
o
20
30
40
50
60
70
80
90
100
(M B LACK B OX in non-enumerative mode).
Figure 4: The dependency graph of the multi-agent domain used
for the generation of the benchmark set. Ovals represent predicates
and rectangles represent operators.
will see the relative regularity of the benchmark set does not
affect the generality of our observations.
From POPs to Scheduling Problems
To begin with, the benchmark problems described above
were solved using M B LACK B OX. As mentioned, this planner employs M C HAFF in order to produce multiple solutions
to a single planning problem. On the 100 benchmark problems, M B LACK B OX yields a total of 7046 plans (the enumeration performed by M C HAFF yields between 10 and 200
unique plans for each problem). Each solution represents a
completePOP πij , where i ∈ [1, 100] and j refers to the j-th
unique solution to problem i. We then calculate OSπj and
i
RS πj . Moreover, we extract from these measures OSπmax
,
i
max
10
Figure 5: Average resource strength for the various planners
eff5(o)
i
0
min
OSπmin
, RS πi and RS πi , which represent the maximum
i
and minimum values of OS and RS obtained for the i-th
quence of operators which achieves a goal, and in which the planner’s task is essentially that of deciding an allocation of agents to
operations.
that high values of RS indicate that the corresponding plan
is under-constrained with respect to resource capacity constraints, thus that the resources are assigned in such a way
that their capacity limitations are less likely to entail resource conflicts. It is immediate to notice the gap in performance between FF and CPT on one hand, and M B LACK B OX and LPG on the other. In most problems the former
planners employ only one or two agents out of the overall pool of available agents. The strongly sub-optimal allocation of resources (agents) to tasks clearly has no impact on the quality of the solution to the causal sub-problem
(the POP), while it heavily affects the quality of the completePOP as a whole. The scheduling problems produced
by FF and CPT are thus very constrained from the point of
view of resources. Overall, this may have some disadvantages with respect to the scheduling phase, such as longer
makespans, less robust solutions, and so on.
The poorer quality of the plans obtained with FF and CPT
demonstrates the fact that the over-committing nature of the
performance-oriented heuristics employed by these planners
appears to be counter productive in the light of the subsequent scheduling phase. Conversely, B LACK B OX and LPG
have the nice property of committing less to resource peak
leveling decisions, thus maintaining a lower level of invasiveness in the decision space of the scheduler. This phenomenon is even more interesting in the light of the fact that
both B LACK B OX and LPG are (though in different ways)
planning graph based planners. These results thus point to
an interesting load balancing quality of the planning graph
paradigm.
Let us now observe the results obtained with the enumerative planner M B LACK B OX. As shown in figure 5, the performance of LPG is in line with that of the classical singleplan B LACK B OX, reaching for many problems higher peaks.
Conversely, plotting the results obtained with the enumerative variant M B LACK B OX yields the results shown in figure 6. These results show that extracting more solutions
0.3
M B LACK B OX
OSπ0
FF OSπi
LPG OSπi
CPT OSπi
0.25
i
0.2
0.15
0.1
1.6
M B LACK B OX
1.4
max
RS π
LPG RS πi
0.05
i
1.2
0
10
20
30
40
50
60
70
80
90
100
Figure 7: Order strength for the the various planners ( M B LACK B OX in non-enumerative mode).
1
0.8
eration of the benchmark problems. In fact, the domain in
question yields planning problems whose solutions have a
structure which depends strongly on the size of the problem
(see figure 8). In addition to the plateau effect, the results
0.6
0.4
0.2
0
24
0
10
20
30
40
50
60
70
80
90
100
Figure 6: Average resource strength for LPG and the best performance of M B LACK B OX.
20
to a given planning problem affords us a choice between
“equivalent” completePOPs with potentially very different
characteristics from the point of view of resource allocation.
In conclusion, notice that the enumeration of solutions performed by M C HAFF also ignores the resource related information in the problem instance. Indeed, these results suggest
the possibility to equip the SAT solver with a more informed
heuristic for solution space exploration that takes into account resource related metrics.
16
Causal Structure
We now turn our attention to the causal characteristics of
the POPs produced by the planners considered above. As
described previously, the output if the planner consists in a
sequence of tasks which are then de-ordered to produce the
precedence graph hT , Pi. The structural characteristics of
this graph affect the scheduling phase in a number of ways,
and, as noted earlier, a good predictor for the hardness of
a scheduling problem is represented by the restrictiveness,
which can be in turn approximated by OS.
Figure 7 shows the values of OS obtained with the four
planners considered in the previous section. Again, we start
by analyzing M B LACK B OX in non-enumerative mode. We
can immediately notice three characteristics of the results:
the presence of plateaus, the fact that OSπi for all planners
except M B LACK B OX coincide on all problems, and that the
values of OS appear to be scaled. These phenomena are due
to the particular structure of the domain used to produce the
benchmark problems.
The first two aspects can be easily explained in terms of
the particular structure of the domain employed for the gen-
M B LACK B OX OSπ 0 × 100
i
|O|
22
18
14
12
10
8
6
4
0
10
20
30
40
50
60
70
80
90
100
Figure 8: Dependency of the order strength on problem size in the
benchmark set.
show that LPG, CPT and FF all tend to produce very similar solutions, so similar that from the strict point of view
of the causal structure (i.e., not considering how resources
are allocated) the plans are identical. Again, this is a direct
consequence of the “simplicity” (from the planning point
of view) of the domain. Indeed, further experiments show
that this does not occur on more complex planning problems (such as the planning competition benchmarks), where
the three planners yield solutions with values of OS that are
not as easily comparable. This observation indicates that the
benchmark problems considered in this paper yield a search
space whose topology points all three heuristics to solutions
with the same structural characteristics. Indeed, the simplicity of the domain is also the reason for the apparent scaling
between the values obtained with B LACK B OX and those obtained with the other three planners. In fact, experiments on
more complex domains do not yield scaled values of OS.
However, given that the benchmark set is so regular, it
is somewhat unexpected that B LACK B OX obtains solutions
with a different causal structure. These results seem to emphasize the difference between the SAT-solver based plan
extraction mechanism and the search procedures employed
by the other planners. The fact that the value of OS is lower
for B LACK B OX than for the other planners confirms the results obtained in (Pecora, Rasconi, & Cesta 2004), in which
we have shown, by means of a different experimental analysis, that planning graph based planners tend to produce
scheduling problems which are easier than those produced
by planners based on heuristic search (recall that OS is an
indicator of the hardness of a scheduling problem).
A natural question at this point is the following: how does
the enumeration of solutions performed by M B LACK B OX
affect OS? Figure 9 shows the highest (worst) and lowest (best) order strength of the multiple solutions obtained
by M B LACK B OX. An interesting aspect of these results is
0.3
min
OSπ
i
max
OSπ
i
FF, LPG, CPT OS
M B LACK B OX
M B LACK B OX
0.25
0.2
0.15
0.1
0.05
0
10
20
30
40
50
60
70
80
90
100
Figure 9: Order strength for the the various planners, with best
(OSπmin
) and worst (OSπmax
) performance of M B LACK B OX.
i
i
that the best values of OS obtained by M B LACK B OX are
also the first, i.e., OSπmin
= OSπi0 , ∀i. Recall that with
i
respect to the quality of resource allocation (RS), we find
that exploring different solutions to the planning problem is
advantageous (see figures 5 and 6 in the previous section).
Conversely, exploring further solutions does not yield an improvement with respect to the order strength, which is also
increasing (i.e., there exist less different alternatives to resolve resource conflicts). This is to be expected: increasing
the resource strength makes the scheduling problems more
robust with respect to resource failures, but decreases the
ease with which the scheduler can resolve resource conflicts.
Further Analysis of the Results
The experimental results described in this paper allow us to
draw some interesting conclusions on how different planning strategies can benefit loosely-coupled P&S.
Planners as Resource Allocators. First, we have observed the characteristics of the various planning strategies
on resource allocation by analyzing the average resource
strength of the scheduling problems which result from the
planning procedure. In this context we have seen how two of
the most representative heuristic search based planners perform poorly with respect to resource allocation. Conversely,
B LACK B OX and LPG, which base the search space representation on planning graphs, achieve much better results.
The results point to the fact that planning graph based representations retain the intrinsic quality of favoring the extraction of plans with a higher degree of load balancing. Dually,
the search strategies employed by FF7 and CPT favor the
exploration of states achieved by means of a minimal use
of resources for execution. Notice that while it is true that
resource capacities are ignored by the planning phase, the
presence of resources for execution is not. There is no real
“reason” for a planner to prefer the use of many agents rather
than the strict amount necessary for causal satisfiability. Our
results show that nonetheless, the planning graph data structure, which is a compact representation of the entire search
space save the states pruned by mutex propagation, allows
planners such as B LACK B OX and LPG to “inadvertently”
provide solutions which make use of a greater number of resources for execution, thus effectively doing a better job at
load balancing.
It is interesting to notice that a further uninformed exploration of the solution space achieved with the enumerative
SAT-solver M C HAFF improves these results further. This
makes a case for employing an enumerative planner for the
purposes of loosely-coupled P&S, in which it is better to
have more “equivalent” scheduling problems in the presence
of invalidating run-time conditions.
The Bias of Planning on Restrictiveness. Our interest in
the restrictiveness of the scheduling problems produced by
the planners analyzed in this paper lies in the fact that it represents a way to put a number on the causal structure of the
precedence graph which constitutes the scheduling problem.
Also, measuring the restrictiveness is interesting because
this measure quantifies the degree to which the planner restrains the search space of the resulting scheduling problem.
If a planner obtains higher degrees of restrictiveness compared to another planner on the same problems, then this is
an indication that the first planner invades the decision space
of the scheduler more than the latter. Our analysis has shown
that B LACK B OX retains the capability of guiding its search
procedure towards a solution with lower degrees of restrictiveness, thus maintaining a lower commitment with respect
to the subsequent scheduling phase. These result confirm
a previous analysis (Pecora, Rasconi, & Cesta 2004) based
on different structural properties of the scheduling problem,
but which relied on the assumption that all resources were
binary (Ci = 1, ∀i). The use of OS and RS has allowed us
to relax this assumption.
The Tradeoff Between Robustness and Resource Efficiency. The results obtained by means of the enumera7
Notice that FF employs relaxed planning graph propagation to
compute the heuristic estimator. Nonetheless, the heuristic value
so obtained represents only an estimate of goal distance, and does
not incorporate any other information derivable from the planning
graph representation.
tive planner M B LACK B OX show that exploring the neighborhood of the fist satisfying assignment yields plans with
higher degrees of restrictiveness (higher values of OS), but
also improved performance with respect to resource capacity limitations (higher values of RS). Indeed, this does not
come as a surprise. In fact, it is reasonable to expect a tradeoff between these two characteristics of the completePOPs.
On one hand, if the precedences between the tasks in the
scheduling problem allow a high number of valid execution
sequences (i.e., low OS), then it is reasonable to expect that
the only way to safeguard against solutions which are fragile with respect to resource failures is to obtain a constraint
network with higher restrictiveness in order to exclude these
fragile solutions. Dually, it is straightforward to see that in
order to increase the number of solutions to a P&S problem
it is necessary to revert to a completePOP which potentially
forebodes more resource contention peaks (lower RS), but
at the same time allows the scheduler more degrees of freedom for computing a solution which makes efficient use of
the resources. In the light of these considerations, we can
see high restrictiveness (and resource strength) as a symptom of robustness, while low restrictiveness (which entails
low resource strength) implies fragility.
It is important to keep in mind that in the context of
loosely-coupled P&S, OS and RS represent characteristics
of alternative causal solutions to a given P&S problem (i.e.,
alternative scheduling problems). We have shown that it
is possible to obtain a multitude of “equivalent” scheduling
problems (using different planners or an enumerative planner). If we can choose, within the context of one P&S problem, which scheduling problem to schedule for, the considerations made above become quite important. Depending,
for instance, on the projected execution scenario, it may by
useful to immediately select the scheduling problem which
exhibits the appropriate tradeoff between (low) restrictiveness and (high) resource strength.
Conclusions and Future Work
The work described in this paper stems from the general
question of how to extend a strictly causal solver with time
and resource reasoning capabilities. In this context, we focus on a loosely-coupled P&S integration, that is, a framework in which the output of a classical planner is piped into
a scheduler. This approach to P&S integration is suited for
applicative contexts which require scheduling decisions to
occur without the possibility of intervening on the planning
decisions, which have already taken place and are irrevocable. Notice also that this component-driven approach is
attractive because of its ease of implementation using offthe-shelf general purpose tools. In this context, the present
investigation, which extends the results obtained in (Pecora, Rasconi, & Cesta 2004), is aimed at assessing the pros
and cons of different planning strategies within the looselycoupled framework.
The results described in this paper provide an assessment
of how suited different planning strategies are with respect
to two important characteristics of the resulting scheduling
problems. We have shown how restrictiveness and resource
strength, two well-known attributes of scheduling problems,
provide novel and interesting quality metrics for plan quality. The results of these measurements have exposed new
strengths and weaknesses of classical planners, which complement commonly accepted criteria for plan quality such as
those employed in the planning competition, and may point
towards new application scenarios for current AI planning
technology.
We believe that the issues put forth in this paper forebode
interesting new directions of research for planning, such as
new planning strategies which explicitly take into account
the measures described in this paper, as well as investigating
other applicable measures for plan quality.
Acknowledgments. This research is partially supported
by MIUR (Italian Ministry of Education, University and Research) under project ROBO C ARE: “A Multi-Agent System
with Intelligent Fixed and Mobile Robotic Components”,
L. 449/97 (http://robocare.istc.cnr.it). The
Authors wish to thank Riccardo Rasconi and Simone Fratini
for having fostered interesting common research.
References
Alvarez-Valdez, A., and Tamarit, J. 1989. Heuristic Algorithms for Resource-Constrained Project Scheduling: a
Review and an Empirical Analysis. In Slowinski, R. and
Weglarz, J., ed., Intelligent Scheduling Systems. Elsevier.
113–134.
Bäckström, C. 1998. Computational Aspects of Reordering
Plans. Journal of Artificial Intelligence Research 9:99–137.
Brightwell, G., and Winkler, P. 1991. Counting Linear
Extensions. Order 8:225–242.
Brucker, P.; Drexl, A.; Mohring, R.; Neumann, K.; and
Pesch, E. 1998. Resource-Constrained Project Scheduling:
Notation, Classification, Models, and Methods. European
Journal of Operations Research.
Cesta, A.; Fratini, S.; and Oddi, A. 2004. Planning
with Concurrency, Time and Resources. A CSP-Based Approach. In Vlahavas, I., and Vrakas, D., eds., Intelligent
Techniques for Planning. Idea Group Publishing (to appear).
Cesta, A.; Pecora, F.; and Rasconi, R. 2004. Biasing the
Structure of Scheduling Problems Through Classical Planners. In Proceedings of the Workshop on Integrating Planning into Scheduling (WIPIS) at ICAPS, Whistler, Canada.
Cooper, D. 1976. Heuristics for Scheduling ResourceConstrained Scheduling Projects: an Experimental Investigation. Management Science 22:1186–1194.
Currie, K., and Tate, A. 1991. O-Plan: The Open Planning
Architecture. Artificial Intelligence 52(1):49–86.
De Reyck, B., and Herroelen, W. 1996. On the use of
the complexity index as a measure of complexity in activity networks. European Journal of Operational Research
91:347–366.
Edelkamp, S., and Hoffmann, J. 2004. PDDL2.2: The Language for the Classical Part of the 4th International Plan-
ning Competition. Technical report, Technical Report 195
Computer Science Department, University of Freiburg.
Fox, M., and Long, D. 2003. PDDL2.1: An Extension to
PDDL for Expressing Temporal Planning Domains. Journal of Artificial Intelligence Research 20:61–124.
Gerevini, A.; Saetti, A.; and Serina, I. 2003. Planning through Stochastic Local Search and Temporal Action
Graphs. Journal of Artificial Intelligence Research (JAIR).
Ghallab, M., and Laruelle, H. 1994. Representation and
Control in IxTeT, a Temporal Planner. In Proceedings of
the Second International Conference on AI Planning Systems (AIPS-94).
Ghallab, M.; Howe, A.; Knoblock, C.; McDermott, D.;
Ram, A.; Veloso, M.; Weld, D.; and Wilkins, D. 1998.
PDDL — The Planning Domain Definition Language,
AIPS 98 Planning Competition Committee.
Hoffmann, J., and Nebel, B. 2001. The FF Planning System: Fast Plan Generation Through Heuristic Search. Journal of Artificial Intelligence Research 14:253–302.
Jonsson, A.; Morris, P.; Muscettola, N.; Rajan, K.; and
Smith, B. 2000. Planning in Interplanetary Space: Theory and Practice. In Proceedings of the Fifth Int. Conf. on
Artificial Intelligence Planning and Scheduling (AIPS-00).
Kolisch, R.; Sprecher, A.; and Drexl, A. 1995. Characterization and generation of a general class of resourceconstrained project scheduling problems. Management Science 41(10):1693–1703.
Mastor, A. 1970. An Experimental and Comparative Evaluation of Production Line Balancing Tehniques. Management Science 16:728–746.
Moskewicz, M.; Madigan, C.; Zhao, Y.; Zhang, L.; and
Malik, S. 2001. Chaff: Engineering an Efficient SAT
Solver. In Proceedings of the 38th Design Automation
Conference (DAC’01).
Moskewicz, M. 2004. mChaff Website at Princeton:
http://www.princeton.edu/˜chaff/mchaff.
html.
Muscettola, N.; Smith, S.; Cesta, A.; and D’Aloisi, D.
1992. Coordinating Space Telescope Operations in an Integrated Planning and Scheduling Architecture. In IEEE
Control Systems, Vol.12, N.1, 28–37.
Pecora, F., and Cesta, A. 2002. Planning and Scheduling
Ingredients for a Multi-Agent System. In Proceedings of
UK PLANSIG02 Workshop, Delft, The Netherlands.
Pecora, F.; Rasconi, R.; and Cesta, A. 2004. Assessing the Bias of Classical Planning Strategies on MakespanOptimizing Scheduling. In Proceedings of the 16th European Conference on Artificial Intelligence (ECAI-04), August 22nd - 27th, Valencia, Spain.
R-Moreno, M.; Oddi, A.; Borrajo, D.; Cesta, A.; and
Meziat, D. 2002. Integrating Hybrid Reasoners for Planning & Scheduling. In Proceedings of UK PLANSIG02
Workshop, Delft, The Netherlands.
Schwindt, C. 1998. Generation of Resource-Constrained
Project Scheduling Problems Subject to Temporal Con-
sraints. Technical report, Universität Karlsruhe. Report
WIOR-543.
Smith, D. E., and Weld, D. S. 1999. Temporal planning
with mutual exclusion reasoning. In Proceedings of the
Sixteenth International Joint Conference on Artificial Intelligence, 326–337. Morgan Kaufmann Publishers Inc.
Srivastava, B. 2000. RealPlan: Decoupling Causal and
Resource Reasoning in Planning. In AAAI/IAAI, 812–818.
Vidal, V., and Geffner, H. 2004. Branching and Pruning:
An Optimal Temporal POCL Planner based on Constraint
Programming. In Proceedings of the 19th National Conference on Artificial Intelligence (AAAI-04), San Jose, CA.