From: AAAI Technical Report SS-93-03. Compilation copyright © 1993, AAAI (www.aaai.org). All rights reserved. Classical Honeywell Nonlinear Planning in Complex Mark Boddy boddy@src.honeywell.com Systems & Research Center, :]660 Technology Drive Minneapolis, MN 55418 Abstract Classical nonlinear planning has been studied in the abstract since the early 1970’s. More recently, interest has grown in applying these techniques to real problems, or at the least to simplified versions of real problems. In this paper, we explore someof the implications and results of those applications, with an eye to addressing the question of where the main problems lie. What successes have been achieved? What remains to be done? In particular, what fundamental representational or algorithmic issues, if any, need to be addressed before classical planning becomesa useful tool for large, complex, real-world problems? Introduction This paper is primarily a discussion of classical nonlinear planning in complex domains. Our concern is with how the planning problem is changed when we attempt to model more of the characteristics encountered in these domains. Abstraction has both benefits (refining away details to reveal the essential nature of a problem) and disadvantages (some of those details have a profound impact on the nature of the problem). For example, the Blocksworld has proved to be a useful abstraction in which to isolate and examine certain issues in planning. However, as the limitations of the Blocksworld have became apparent, more recent abstract domains (e.g., Tileworld [Philips and Bresina, 1991]) added such complexities as duration and deadlines, events not under the planner’s control, and uncertain action outcomes. Section provides somedefinitions and historical perspective. Wethen investigate the problems caused for planners by the introduction of partially-ordered plans, action durations and deadlines, simultaneous actions, and events not under the agents control. This is not a list of arguments for abandoning classical planning. Quite the contrary: for every difficulty raised here, there is at least a partial solution to the problem. We close with a summary and some recommendations for future work. Domains MN65-2100 Definitions Webegin by defining some terms and providing a limited historical background. A plan is an internal representation of actions an agent may perform to bring about changes in the environment. Representing a plan requires operators: abstract representations of the actions available. Predicting the effects of executing a plan (i.e., performing the corresponding actions) requires some representation of the environment’s state and how that state is modified by the performance of the actions corresponding to the various operators. This prediction is commonly called projection. The effects of actions are frequently modelled as operator poatconditioa8, sometimes specified as add and delete lists of propositions modified by performance of the action. Operator preco~clitions encode the states in which a given action maybe taken and howthe effects of that action depend on the state in which the action is performed. 1 The plannin9 problem starts with an initial state description, and a 9o~1 to be achieved. The goal consists of a set of conditions to be achieved at the end of the plan’s execution. 2 Workon replanning or generative planning adds a pc~rtigl plar~ to be modified so that the goal is achieved. For this paper, I take "classical planning" to be the study of generative methods for solving the planning problem. Included in this definition are specializations such as hierarchical or constraintposting planning. The planner cannot make perfect predictions about the effects of a given plan. This true is independent of any consideration of dynamic domains (e.g., malicious infants knocking over towers of blocks). The issue is purely one of abstraction: any practical model of a physical process omits some details. In order to plan in the classical sense of the term, the planner needs to manipulate models of both the environment and the 1For the moment,we finesse the frame problem and related issues by assuminga STKIPS-rulesemantics on a linear sequenceof completelyspecified operators. aThis definition of a god can be extendedin a straightforward wayto include descriptions of states to be avoided or maintained throughout the plan’s execution. agent’sactions. The moredetails omittedfrom these no possible intervening defeater, thegoalis declared to be satisfied. Thisis notprojection, because wearemakmodels, themoreerrors willoccurin theplanner’s preing onlya localdetermination. No attempthas been dictions. As planners areapplied to morecomplex dowhetherthe preconditions forthe mains,thedomainandactionmodelsrequired to main- madeto determine operator satisfying the goalaretrue.Thisapproach taina reasonable degreeof fidelity becomemorecomsuffices foran environment in whichtheplannerhas plexaswell. complete control overwhathappens: goalscanbe satIn STRIPS[Fikesand Nilsson,1971],statesare represented as listsof predicates andactions aredeisfiedby addinga newoperator or movingexisting operators around. If thereareevents notundertheagent’s scribed by operators including preconditions, addand control, it maynotbe possible to addoperators where deletelists.Theseoperators makeno provision for desired or to forcethenecessary orderings. Thisputs context dependent effects, simultaneous actions, or acbackin theposition of needing projection tion durations.More recently,STRIPSoperators ourplanner in themorecomplete sense. have beenextendedto modelcontext-dependent effects[Pednault, 1988]andadditional inference mechThisresultalsoimplies thatnonlinear planners doanismshavebeenaddedto derivecontext-dependent ingprojection (i.e., thosenotusingsomelocalcriterion effects, e.g.[Wilkins, 1984,AllenandKoomen, 1983]. liketheMTC)areeither solving a hardproblem in some cases,or computing incorrect results in somecases.If Someworkhas beendoneto add duration [Vere,1983, computationally expensive problem instances aresuffiDeand al., 1989, Tats et al., 1992]. The advent of nonlinear planning [Sacerdoti, 1977, ciently rare,thismaybc acceptable. Thepossibility of Tats, 1977] introduced new complications to the probincorrect results is moretroubling. lem of determining the effects of a given plan. WefolIn previouswork[Deanand Boddy,1988],we have low [Minton et al., 1991] in defining nonlinear classical demonstrated thatit is possible to compute an approxplanning to be generative planning that manipulates imation to projection fornonlinear plansin polynomial partially-ordered sets of operators, whether or not the time.Thealgorithm is sound,meaning thatif it says final plan produced is required to be completely orthata factisnecessarily trueata point, thatdetermidered. nationis correct. Theapproximation involved is that thealgorithm is notcomplete: itmayfailto detect the factthatsomefactis necessarily true.s SincethealProjection and Partial Orders gorithm handles negated propositions, it is possible to The original intuition behind nonlinear planning was compute projections involving a fact and its negation the happy thought that we could plan by a process of so as to determine whether a proposition is necessarily adding orderings only as required to ensure that our true at a point, necessarily false at that point, or indeplan has the right effects. This "least commitment" terminately one or the other. Thus, even for planning approach was entended to other forms of constraints problems involving events beyondtheplanner’s control, as the notion of constraint-posting planning was genprojection canbe regarded as a solvedproblem, though eralized to include other aspects of a developing plan onethatrequires a little carein implementation. 4 (e.g., [Stefik, 1981]. Constraint-posting planning can be viewedas an incremental process of eliminating possible Duration and Deadlines plans from a set defined by the current set of constraints by adding additional constraints. For this process to be The fact that actions take time was abstracted out in effective, it must be the case that the planner can dethe earliest domain models. Planners using these modtermine whether or notthereis a planin thecurrent els will be of limited use in domains where synchronizasetofpossibilities thatwillachieve thegoal,or that tion with other events or processes is important. This canbe augmented so thatit willachieve thegoal.If may include such domains as manufacturing planning theplanner cannotconstruct a nontrivial description and scheduling, spacecraft operations, robot planning of theremaining possible plansandhowthatsetwould in any but the most simplified domains, and scheduling be changed by additional constraints, it is essentially distributed problem-solving or other processing. adding constraints arbitrarily soas toobtain a plansufAs mentioned in Section , several planners include ficiently constrained thatit canbe analyzed. Making representations for metric timeandactiondurations. sucharbitrary choices is exactly thekindof behavior In addition, variouspeoplehaveimplemented "temthatthe leastcommitment approachwas intendedto poralreasoning systems" in an attempt to isolate and circumvent. optimize reasoning abouttherelations amongactions, Generating a correct andnon-trivial description of or betweenactionsandotherprocesses [Allen, 1983, theeffects ofa nonlinear planis computationally expensive[Chapman, 1987,DeanandBoddy,1987].One way SThecircumstances underwhich thisalgorithm is incomaround thisdifficulty is to treatplanning as a purely plete aredescribed in[Schrag etal., 1992]. constructive activity. For example, Chapman’s Modal 4Assuming nonlinear plansconsisting of STRIPS operTruthCriterion focuses onlyontheeffects of operators: ators. Itiscertainly possible tocomplicate projection by ifthereis a preceding operator establishing a goal,and adding other features totherepresentation. chical Interval Constraints (HIC). Each HICtype has .................. 1.2................. function defined for calculating bounds on its duration. 0.70 0.70 FigureI: A simpleproblemwithduration andpartial orders For the example in Figure ??, the two activities would be contained in an HIC whose duration was calculated by summingthe duration of the included operators. The problem with this approach is the requirement of a rigid hierarchy. If actions must be ordered for reasons that are not locally determinable (e.g. because of resource conflicts, not because they are sequential steps in some task reduction), this representation will break down. It may be possible to augment Williamson and Hanks’ representation to cope with a limited number of special structures representing such nonlocal information. Simultaneous Actions The general problem of reasoning about the effects of actions, when those effects may depend on what other Dean and McDermott,1987, Arthurand Stillman, actionsoccurred at thesametime,is a computational 1992].Oplan-2contains a separate reasoning module nightmare. Potentially, youneedto construct a descrip(thetimepointnetwork manager) withsimilar function- tionofsimultaneous effects forthepowersetoftheset ality.Thiskindof reasoning tendsto be computation- of operators available to theplanner. Therearea couallyexpensive. Forbin, Deviser, andSipeallsuffer from pleofpossible approaches to representing andreasoning performance problems limiting thesizeof theproblems aboutsimultaneous effects moreefficiently: to whichtheycan be applied. Oplan-2appearsto be ¯ Replaceprojection witha moregeneral representaableto handlelargerproblems thantheotherplanners tionof causality and theevolution of a domainin mentioned here. termsof statevariables [Muscettola, 1990]. Implementing an efficient temporal reasoning system ¯ Restrict the kinds of operator interaction modelled, is not thesolehurdle,however. Addingduration to considering onlyresource usage[Wilkins, nonlinear plansincreases thedifficulty of determining forexample whether or notthecurrent partial plancanbe refined 1988]. intoa planthatwillhavethedesired effects. In fact, Reasoning about resource usage is simpler than the genit becomes difficult to determine simplywhetherthe eral problem of simultaneous effects because the interactions described in thecurrent partial plancaneven actions compose. In other words, it doesn’t matter why be executed. the present current demandis 5 amps. If your task reConsider thesimpleplanfragment in Figure1. There quires 3 amps, the resulting demandis 8 amps. Wecan arctwounordered tasks,eachannotated withan esticertainly construct resource modelsfor which this propmatedduration. If actionscan onlybe takenin seerty does not hold, but from an applications viewpoint, quence, thetwotasksdepicted musteventually be orthe aim is to expand the set of domain characteristics dered.Whentheplannertriesto orderthem,it will we can model, not to prove that the general problem is discover thatneither ordering willwork,because there hard. simply isn’troomforthemto be performed in sequence. In general, determining whether there is an ordering for Summary a set of actions constrained in this way is a hard probIn this paper, we briefly explore some of the modelling lem. and computational problems introduced by augmenting There are two proposed techniques for addressing this problem. The first is 8imulatiom the planner mainour domain models to include additional characteristics of complex domains. The particular characteristics we tains a partial order, and after every modification expends some effort exploring the corresponding set of consider include unordered and simultaneous actions, and duration and deadlines. Webriefly survey some total orders to ensure that there is some feasible toproposed methods for dealing with these issues. It is tal order [Miller, 1985, Muscettola, 1990]. As generally employed, this is a heuristic method: the planner gives not accidental that in doing so we stray in the direcup before exploring the complete set of consistent total tion of scheduling terminology and techniques. It has been clear for some time (certainly the earliest papers orders. Another possible approach is proposed on Forbin in 1985 madethis point) that effective "planning" for real-world domains will require techniques by Williamson and Hanks in [1988]. In this approach, drawn from both the planning and scheduling commuthe partially ordered set of operators is organized into a nities. More people than we have room to cite here hierarchy of abstract operator types, knownas Hierar-