Quantitative Operator Selection for Planning Under...

From: AIPS 1994 Proceedings. Copyright © 1994, AAAI (www.aaai.org). All rights reserved. Quantitative Operator Selection for Planning Under Uncertaintyt Todd Michael ~ Manseli Departmentof ComputerScience The University of Melbonrne Parkville, Victoria 3052,Australi& mansell@modmel.mrl.dsto.gov.au. Abstract Thispaperdescribesthe best fLrst searchstrategyused by U-Plan(Mausell1993a), a planning systemthat constnt~ quantitatively ranked plans given an incompletedescription of an unce~environment.UPlanacceptsuncertainandincomplete information about the environment,characterisesit using a DempsterShaferinterval, andgeneratesa set of multiplepossible world sta~s. Plan cctmtruction takes place in an abstractionhierarchywherestrategic decisimm are made beforetactical decisions.Searchthroughthis abstraction hierarchyis guidedby a quantitativemeasure (expected fxdfilment) based on decision theory. This search strategyis best first withthe provisionto Ul~_~_teexpectedfulfdmentsandreviewpreviousdecisionsin the fight of planningdevelopments. U-Plangenerates multipleplans for multiplepossibleworlds,andwill attemptto use existingplansfor newwm’ld situatictm.A super-planis thenconstructed,basedonmerging the set of plansandappropriatelytimedknowledge acquisition operators, whichare used to decide betweenplan alternativesduringplanexecution. 1 Introduction to U-Plan Traditional planning systems describe the planning problem as the composingof a course of action that transforms the world from a given initial state to a desired goal state. This description makes two assumptions about the planning domain: complete and accurate informationabout the ~rld is available; and the environmentis static. Theseassumptionsensure that a constructedplan can be successfully executedin such an ideal world. However,such planners rarely produceplans that workin the real world, that by its nature is dynamic and its description is imprecise. To devise a useful plan given uncertain and/or incomplete information about a dynamic environment requires the removal of these assumptions. A major problemwhenplanning given incomplete and uncertain informationabout the environment is that it is not possible to construct oneinitial state that precisely tThis work was done under the CRCfor Intelligent Decision Systems, while the author was employedby DSTOAustralia. *Present address Materials Research Laboratory, PO Box 50, Ascot Vale, 3032, Melbourne, Australia. and nnambig~3~qly represents the world. U-Planuses a possibleworldsrepresentation,wherethe availableinitial informationis used to construct every possible initial state of theworld.Associatedwith each possible worldis a numericalmeasureof belief specifying the degree to whichthe evidencesupports each possible world as the onethat representsthe true state of the world. A hierarchical approach to planning is used as it significantly reduces thesearchspacebyfirst planning at abstract levels, and then expandingthese abstract plans into moredetailed plans. At the highestabstraction level strategic decisionsare made,whileat the lowestlevels of abstraction, ta~cal decisions about how best to implement the strategy, are made.U-Planutilises a set of (predefined) goal reduction operators that encodehow planninggoal is reducedby the operatoxasapplication. Whatresults is a planninghierarchytree wherethe goals are broken up into subgoals by the goal reduction operators.This allows us to first makethe strategic decisions, whichthen guidesall other decisions downto the tactical implementation of the subgoals. Ameasureof expectedfulfilment (section 5.1) is used whenselecting whichoperator to apply next. In support of hierarchical planning, eachpossible worldis describedat a numberof predefinedabstraction levels, resulting in decisions being madebased on a descriptionof the worldat a suitablelevel of detail. U-Planconstructs a plan for one possible world at a time, the first plan being constructed for the possible worldwith the greatest likelihoodof representingthe true world. Beforea plan is constructedfor the next possible world, the suitability of reapplyingan existing plan to this worldis assessed. Associatedwith each plan is the set of possible worldsit worksfor. If a plan partially worksfor anotherpossible world(e.g. the strategy works but someof the detail is different), then that part of the plan is used for this possible world, and planning continuesfromwherethe plan failed. Whena plan exists for every possible world, the operator order of all the plans is combinedto obtain a single planning tree that brancheswhenthe operator executionorder differs. At this point the ability to acquire additional knowledge is used. At each branch, a knowledgeacquisition operator can be inserted to gather current informationabout the state of the world and so determinewhichaction in the planningtree to carry out next. MANSELL 311 From: AIPS 1994 Proceedings. Copyright © 1994, AAAI (www.aaai.org). All rights reserved. Central to planningusing U-Planis: the set of states are representedat several abstractionlevels; the selection of reduction operators is not based on the modaltruth criterion, but dependenton a calculation of expected fulfilment; the system will plan to acquire additional knowledgewhenit is advantageous to do so, and an attempt is madeto apply an existing plan to morethan oneinitial state. 1.1 The Air Combat Domain U-Plan has been applied to a simplified air combat domain.The air combatdomainis dynamicand requires agents to act given uncertainand incompleteinformation. Generatinga plan for aircraft operating in such a domain requires the consideration of a large numberof plan strategies. Therole of an AI plannerin sucha domainis to constructa planthat best fulfils the stated goalsusing only available information, with the option to plan to acquireadditional informationif required. A simplified air combatdomainis introduced in this paper that considers two agents: the defender (whose objective is to defendhimselfand a designatedairspace) and the aggressor (who is invading the airspace controlled by the defender). Whilethe various actions availableto the agents in this domainare diverse, certain patterns in the moreabstract strategies can be identified. For examplesomeof the actions available to the defender are target monitoring, strategy selection, strategy implementation,and evaluation of attack strategy. To operate in such an environmentrequires a sophisticated planning and reasoningsystem. Thestrategies available to U-Planare centred around the selection of either a Beyond-Visual-Range Attack or a Visual-RangeAttack. Thedifferent wayssucha strategies can be carried out are numerous,each one involvesa uniquecourseof action. It is intendedthat U-Plangeneratea suitable courseof action for the defenderaircraft givenonly the information available to the aircraft at plan time. This plan should intend to acquire necessary information when appropriate,havea high likelihoodto success,be suitable to as manypossible worldsas feasible, and workfor the worldsthat are mostlikely to be true. Theplan produced by U-Planis intendedfor use in post missionanalysis of the defenderaircraft, not as a real time planningaid (as the systemis not runningin real time). $ State Representation Whenan incomplete model of the world is all that is available,a set of initial states canbe usedto describethe alternative environments.U-Planemploysa set of initial possiblestates (P-states) to describewhatmightbe true the world. AP-state, ps(a), is a completedescription one possible world using propositional statements. Each P-state is described hierarchically with n levels of abstraction, (ps(a)={tl(a) ... in(a)}) wheren is domain 312 POSTERS dependentand selected during knowledgeengineering. Thelevel ti(a ) is a completedescriptionof a worldat the ith level. Thehighestlevel of abstraction gives a coarse description of the state of the world. Thelowest level gives a detailed view of the world. Intermediate levels provide the description required to makea smooth transition betweenboth extremes. Associated with each P-state is a two valued quantitative measure(an evidential interval (Shafer 1976)) that characterises the weight of evidence that supportsthe P-state accuratelydescribesthe true state of the world. This information is used by U-Plan in determiningthe order in whichP-states are plannedfor, and the final executionorder of operators. Adetailed descriptionof howP-states are generatedandused can be found in t~ansel11993a). 3 Reduction Operator Planningoperators represent actions that the systemmay performin the given domain.Therole of an action is to changethe state of the world,the aimof an operatoris to represent howapplying that action will change the system’s view of the state of the world. U-Plan uses reduction operators to give alternative methodsfor achievingthe goal at a lowerlevel of abstraction, or at the tactical level it describesthe direct effects of an action on the P-state. Theseare SIPE-like operators (Wilkins 1988) where the closed world assumption is implemented,and hierarchical planningused. Each operator contains information about howand whenit should be applied (Mansell 1993a). Included this frameworkis the operator’s name,the abstraction level it operatesin, the necessarypreconditionsthat must be true of the P-state, andthe satisfiable preconditionsit will attempt to maketrue. The plot of the operator providesstep-by-step instructions on howto performthe action represented by the operator. This includes a description of the goal reduction operators that are appliedat the next level of abstraction, or at the lowest level of abstraction,howthe operatorchangesthe P-state. A function for calculating the probability of the reductionoperator succeedinggiventhe current P-state is also included. The availability of such a function is domainspecific and non-trivial to formulate. In the air combatexamplediscussed here the function is obtained empirically(basedon historical data). Theprobability succcssdoesnot providesufficient informationto select a reduction operator as it does not take into account the goals of the system.It is for this reasonthat associated with eachreductionoperatorlisted in the plot of a parent reduction operator is a measure of fulfilment, representing the degree to whichthe reduction operator achievesthe goalof the parent. U-Planuses a deductivecausal theozy (Wilkins1988), to deducethe context dependenteffects of applying a c’urca~ ~ [Boo] From: AIPS 1994 Proceedings. Copyright © 1994, AAAI (www.aaai.org). the All rights almsreserved. of the system,andthe [900,SO0] SVt_ATrA~ [,~Ol [2OOl [SOOl ~n~ZN_c~Br~L~01~ f200l [~o,~oJ~ol ARD [1000] / ATTA~ [ [700]--~_~9~.4D ]L~P Figure 1: The strategic portion of the absttaction hierarchy for the ~plified mr combatex~nple. reduction operator to a P-state. The effects that are deducedare considered to be side effects, wherethose that are introduceddirectly by the reductionoperator are the direct effects. Theuse of deducedeffects simplifies the description of the operators by removingthe needfor extensiveacid and delete lists. After the application of each reduction operator a set of triggers arc used to determineif the world has been changedin such a way that the deductive rules need he applied. If so, the deductivecausal theoryis usedto changethe P-state to be consistent with all the effects of an action. Theside effects of applyinganyreductionoperator are recordedin the planninghierarchytree. expansionof each layer in the tree describesa moredetailed level of abstraction in the plan wpresentedby the nodesin the tree. 5 Operator Selection Manyclassical planningsystemsnsc a state-based search strategy to solve planningproblems.To find a solution one applies operators to a state description until an expressiondescribingthe goal state is found. U-Planuses a quantitative measure,called expectedfulfilment, in an abstractionhierarchyto guidethe selectionof operators. A plan constitutes the successive application of reduction operator from~ highest level of abstraction (i.e. the goal function)downto the mosttactical detail. the air combatexample(fig. l) the goal of defending specifiedassets is accomplished by first choosingwhether to attack or turn awaythe aggressor, through specific strategies and manoeuvres available, downto the detailed implementation of a specific manoeuvre. The following sections outline howthe reduction operators are selected and implemented,and the process that is continuallyreviewingthese decisions. 5.1 Calculating Expected Fulfilment Goals in manydomainsdealing with uncertainty (for instance the air combat domain) are not precise requirements. Manygeneral goals can be fulfilled to various degrees by acl,.ieving alternative subgoals. However,not all subgoals are equally likely to be 4 The Abstraction Hierarchy achieved. We adopt an approach to planning by determininga course, of action that is likely to maximise U-plandoes not construct a state-based search tree, but the expected fulfilment of our goal. Consequently,our constructs a strategy hierarchy whichis a decision tree plans are not exhaustive. Theydo not elaborate all the like structure,, wherethe nodesin the hierarchyrepresent alternative actionsrequiredin all possibleworlds.Rather, a continuoustransition of actions fromthe strategic (at they specify alternative actions that are likely to the root node) to the tactical (at the leaf nodes). maximisothe expected fulfilment of our goal in the strategy hierarchy can be represented as an AND/OR possible worlds that are consistent with our partial search tree, the root noderepresentingthe strategic goal descriptionof the environment. of the system,andthe leaf nodesrepresentingthe tactical Expectedfulfilment is a quantitative measureused to details of howthe goal is to be achieved. Eachnode in rank the reduction operators whichachieve the goals of the tree is a subgoalnoderepresenting the current goal the active operator (i.e., the next reduction operator and P-state, and certain pairs of nodes are cormectedby chosen to be expanded) for selection purposes. For arcs representingthe application of a reductionoperator example,if Attack is the active operator who’sgoals we that producesthis subgoalnode. wish to achieve, the expectedfulfilment for BVRAttack For example, figure 1 shows part of the strategy and VRAttack(the operatorsthat achievethis subgoal)is hierarchyfor a simplified air combatdomain.Thegoal of calculatedandusedas a basis for the selection. defendingdesignatedassets can he achievedby either an Probability theory provides an effective methodfor Attack or Turn_Away.Thestrategy hierarchy showsthat choosing actions capable of producing consistently there are two reduction operators that can be applied to accurate choices. Information such as intelligence achieve anAttack, the BVR_Attack(or BeyondVisual reports, preferences, and raw data can be encodedand Range attack) and the VR_Attack (or Visual Range manipulated by a probabilistic inference engine to attack). Thenext level presents the manoeuvresapplied produce useful recommendations.Whereasprobabilities to achieve these specific attacks, whichare achievedby are usedto representthe likelihoodof events, fulfilments the implementationof tactical actions (not shown).The are used as a local measureof the degree to whichthe applicationof these operatorsrepresentsa clarification of consequentof the action achieves the intendedgoal and MANSELL 313 From: AIPS 1994 Proceedings. Copyright © 1994, AAAI (www.aaai.org). All rights reserved. the desirability of achievingthe goal using that action. Thetermfulfilment is usedto captureboth the essenceof utility scaled accordingto the desire to use a particular approach.Theterm utility is not used in this description to avoidconfusion,as there are subtle differencesin how they are usedand whatthey represent. The expected fulfilment is used as a measure of an action’s likelihood to produce the consequent that achievesthe agent’s goals. If weuse the measureF(c) represent the degree of fulfilment of consequentc, then the overall expectedfulfilmentassociatedwith action a is givenby: EF(a) = F(c)P(cla,e), (1) where, P(cla, e) is the probability of achieving consequence¢, conditioneduponselecting action a and observing evidence e. For example, to calculate the expectedfulfilmentof the Attackoperator in figure I, the probability of successfully executing an attack in the given P-state is multiplied by the degree of fulfilment obtainedby executingthe action (depictedin figure 1 as the first number in squarebracketsabovethe operator). Theexpectedfulfilment of action a, EF(a),is regarded as a gauge of the merit of action a. The expected fulfilment is used as a procedure for choosing among alternative (or competing) actions. Whengiven the choice betweentwo action (eg, Attack and Turn-Away) the selectionis basedon the actionthat yields the highest expected fulfilment (ie, EF(Attack)or EF(Turn-Away)). This result will dependon the description of the P-state whenthe selection is made.This process can be thought of as establishing a rank order in which one should attemptedto apply them. The semantics and justification of fulfilments are identicalto thoseoutlinedfor utility theory.Utility theory is not simplya convenientmathematicalformula, but is basedon studies of the psychologicalattitude towardrisk, choice, preference,and likelihood. Theessenceof utility theory, and therefore fulfilment, is captured by the axioms of utility Theory (Von Neumann & Morgenstern 1947). 5.2 Applying an Operator Oncethe reductionoperatorsthat achievethe goals of the parent operator have been ranked using the expected fulfilment calculation, they can be tested to determine their suitability to the P-state. U-Planwill attempt to applythe reductionoperatorsto the P-state in the order in which they were ranked (by their EF), until appropriateaction is found.If the necessarypreconditions of a reductionoperatorare true in the active P-state, then the reductionoperator is provisionallyselected, else the planfail for that operatoris applied(this usuallyinvolves backtracking). Whena reduction operator that satisfies the necessary preconditions has been found, the satisfiable preconditions are tested. If anyof these are not 314 POSTERS true, U-Plancan attempt to satisfy themusing reduction operators of equal or lower abstraction. If these preconditionsare not satisfied, the operator is rejected, andit’s planfail procedureis implemented. Onceboth sets of preconditionsof a reductionoperator can be shownto be true in the active P-state, the operator is accepted and its plot can be applied. The plot represents the effects the reduction operator has on the state of the world, andthe subgoalsthat maybe used to achieve this subgoal. Whenapplying the plot, the next level of the strategy hierarchyis exposed,andagain the subgoalwith the highestexpectedfulfilmentis selectedto be expanded next. The plot of operators ~presents actions at the lowestlevel of abstraction specifyhowthe P-state is physicallychangedbytheir application. This process of applyingoperators continuesuntil the next layer of the strategy hierarchyhas beenexposed.At this point, the earlier selection of specific actions are reviewedas describedbelow. 5.3 Reviewing Selected Operators When constructinga strategyhierarchyit is possiblethat, as a plan’s detail is filled out, it becomes less likely to succeed. This is partially becausethe initial strategic decisionsare basedon informationat a morecoarse level of abstraction. Asthe plan is expandedandmoretactical decisions aboutthe implementation of specific strategies are made, the expected fulfilment of specific plan branchesmaydecreases. Also, the EFcalculation for a higher level operator mayinclude a range of possible useful actions. However,as the plan becomesmore detailed, certain alternatives will be discarded,possibly resulting less that ideal actions beingselected, reducing the true e, xp~edfulfilmentof the plan branch. This makesit important to review earlier decisions while planning. After the application of a group of reduction operators U-Plan compares the e~ected fulfilment of the current subgoals,with those of previous subgoals, to determinesif the current subgoal remains favourable. U-Planuses an offset added to the current subgoal to take into account expected lower EF as increaseddetail is addedto the plan. Includingan offset is an iterative deepeningstrategy includedto avoid the problemof the system jumpingaround from branch to branchin the strategy hierarchy. Theoffset value will dependon the difference in abstraction level of the subgoals. Theiterative deepeningstrategy will not be discussionin the remainderof the paper, whichwill focus on the moregeneris process. To review these selections one must have a wayof updatingthe EFvalues calculated for the nodes in the abstraction hierarchy basedon the detail addedto that nodesplanningbranch.A set of updaterules are used to re-evaluate the fulfilment and probability of previously expanded operators given the most recent planning From: AIPS 1994 Proceedings. Copyright © 1994, AAAI (www.aaai.org). All rights reserved. developments.The update rules used dependon whether the nodeproducesan AND or ORbranch in the tree. In the case whenan operator is expandedproducingan ORbranchin the abstraction hierarchy, the updaterules used to determine the fulfilment and probability of a parentnodegivena set of possiblechildren are givenby: I MAX F(parent) = {F(child)]childreEF(child)} , (2) ]~I~o.o.,, [85o,81o,7oo1 /VR ATFACK - ( / SET_~.ARn~O / {iooo,o.9; [8001 VISUAL LOCK [sso.slo.6~ts] ACQUmE j ~ [~LOSE_D4 ~r_TARGET ~ {800~ "0/ / ’ ’ " AR LOCK pooo, o.80~\ gaoo.s.o} 0ooo.~71 {8o~o.81) \ I MAX e(~nO= {e(ch~ld)[childrenF~(child)L VntE ~ARM_WEAPON READY,m~{IO00,1.O} {zooo, o.9} ~mRE~N {ZO00,O.9} {zooo,o.o} Wherechildren is the set of children operatorsthat have been expandedor considered for expansionand have not beenrejoct~ Simplystated, rules 2 and3 tell us that the fulfilment andprobability values for the parent of an OR nodeis equalto the fulfilmentandprobabilityof the child nodewith the greatest expectedfulfilment that has not beenruled inapplicableto the P-state (i.e. dueto failure of preconditionsduring expansion).If the operator with greatest EFis not fromthe current subgoal, a changein the planningdirection occursand is recorded. The updating of the parent reduction operator at an ANDnode involves updating the probability and fulfilmentas follows: I MIN F(parent) = {F(child)lchildreF(Child)} , (4) P(parent) = I-I P(child). ffoo] (5) children In the AND case the rules and their justifications are less obvious(Mansell1994). The fulfilment of a parent operator is replaced by that of the child fromthe set of children with the lowest fulfilment. Normally, when confronted by an AND node, the children are given the samefulfilment as that of the parent. Thejustification being, that as the parent can ouly be achievedby a single sequence of actions, then this reduction is simply a refinement of the parent operator 0.e., these actions should wholly achieve the desired goal). Howeverthe situation mayarise when, at a lower level in the abstraction hierarchy, the fulfilment valuefor oneof the child operators mayitself be updated(by its subsequent descendants) to a new, lower valuer. Whensuch circumstancearise, the child operator is not fulfilled completely by its descendant, and consequently, the parent operator is no longer completelyfulfilled by its children. In the subset of the air combatexamplegiven in figure 2, a simple scenario for a Close_Inmanoeuvre operator is evaluated. Theexpectedfulfilment for this operator is calculated (the first numberin the squarebrackets above the operator, i.e., [850]) based on the fulfilment and probability values (shown in the braces below the Figure 2: Anexampleof the abstraction hieraroby for a Close In manoeuvre scenario that demon~ates the updating of Palfilments and probabilities. Progressive (fulfilment, probabilUy)and[EF]values are givenfor each operator. operator, i.e., 1000and0.85 respectively) obtainedfrom the operator. Onexpandingthe Close In operator, the next level of the plan is uncovered.This showsthat the Set_Bearing, Acquire_Targetand Fire_Readyoperator are to be applied. Thefulfilments and probabilities for these are calculated and shownin braces below the operators. As this is an AND operation, update rides 4 and5 are usedto updatethe fiflfilments andprobabilities for the parent, Close_In, operator (shownin the second set of braces belowthe operator, (1000,0.81)). These updatedvalues are used to calculate the updatedEFvalue for the Close_Inoperator(i.e., [810]). Figure 2 also includes an examplewhererules 2 and 3 are usedto updatethe fulfilmentandprobabilities for the parent of an ORnode. In this case the Acquire_Target can be achieved by either a I~sual_Lock or a Radar_Lock.The Visual_Lockis chosen as it has the higher EF of the two. However,the V~sual Lock has a m lower fulfilment than its parent and this is propagated back through the branch updating the Close_In EF to [648]. Asa result of propagatingthese valuesbackup the branch, the Side operator becomesfavourable over the expandedbranch Close_In. 5.4 Sensitivity of Expected Fulfilments The ~ fulfilment value is calculated by multiplyingthe probability of success of an operator by the degreeto whichthe operatorfulfils its intendedgoal. It is therefore useful to knowthe relative sensitivity (Karnavas,Sanchez,and Bahill 1993) of probability and fulfilment on the expected fulfilment function. The relative-sensitivity of a function F to the parameterct over the normaloperatingpoints is givenby: SF= %changein F °~’//F ~ ~nor = o~ar//a = ~[NOP FO’ MANSELL 315 From: AIPS 1994 Proceedings. Copyright © 1994, AAAI (www.aaai.org). All rights reserved. whereNOPand the subscripts 0 meansall functions and their parameters are evaluated over their normal operatingpoints. Given,the expectedfulfilment function in equation 1, the relative-sensitivities fimctions are computedas follows: NapEFoFo, $~ =--~ F0 P CO (8) Therelative-sensitivities of probability andfulfilment on the expected fulfilment are independent. This informationis used whendeterminingthe certainty with whichprobabilities and fulfiknents mustbe knowngiven a set of expected fulfilments. For example,given the choice betweentwo actions with close EFvalues, one can calculate the degreeof confidencein the fulfilmentvalues required of the actions, given the percentagedifference betweenthe actions probabilities. 6 Plan Reapplication U-Plan applies plan reapplication in an attempt to determineif a plan generatedfor one initial P-state can be adoptedfor anotherinitial P-state. Thedesiredresult being fewer plans than the numberof initial P-states. This is implementedby attempting to reapply plans generatedfor oneinitial P-stateto other initial P-state. Aplan is reapplicableif all the reductionoperatorsin the plan (that are not redundant)havetheir preconditions metunderthe newinitial P-state, andwhenapplied result in the goal state being achieved. If a plan, during re, application,fails dueto the unsuccessful applicationof an operator, that plan is not entirely discarded. U-Plan will attemptto use the part of the plan that wassuccessful and planning continues from the point where the plan failed. Thedesire is to construct plans with the sameor similar strategies by reusing, at least part of, the plan at the high level of abstraction. Whenmorethan one plan partially worksfor a newinitial P-state the best plan (Mansell1993a)is used. 7 Super-Plans Onceplans exists for all the P-states, with supportand plausibility abovesomethreshold, a single super-planis constructed. This is achievedby mergingthe set of plans constructedfor the set of initial P-states, that is applying identical operator sequencesand branchingat the point whereplans differ. At each branch in the super-plan a knowledgeacquisition operator is added, attaining the informationrequired to select whichaction in the superplan to apply next. Thecase mayarise whenthe required informationto differentiate betweenalternative branches is not available.In this case, the selectionis basedonthe 316 POSTERS degree of evidence supporting for each branch of the super-plan (see (Mansell1993a, Mansell1994) for more detail). This final step of producinga super-planis an importantpart of presentinguseful coarseof action that couldbe applied by the system. In most cases, U-Planproducesa super-plan in less time than it wouldtake a traditional planner to produce one plan for every possible world (see (Man~ll1993a, Mansell1993b)for moredetails on these results). 9 Conclusion U-Plan is a hierarchical planner that deals with information represented at a level of abstraction equivalentto the action being investigated. Outlinedin this paper is the quantitative best-first search method employed by U-Plan for operator selection in an abstraction hierarchy. As this process is a forward propagatingpartial decision tree, a methodfor reviewing previousdecisionsin the light moredetailed information is included. The update rides are presented in some detail, and an exampleof their operation presented. UPlan has provedto be a effective planningsystemin the air combatdomain(Mansell 1993a), and the expected fulfilment calculation a reliable formulafor operator selection. Acknowledgments I would like to thank Dr. GrahameSmith and Dr. Elizabeth Sonenbergfor their manyinsightful comments. References Karnavas,W.J., Sanchez,P. J., andBahill, A. T. 1993. Sensitivity Analysis of Continuousand Discrete Systems in the Timeand FrequencyDomainsIn IEEETrans. on Systems, Man,and Cybernetics, 23(2): 488-501. Mansell, T. M. 1993a. A Methodfor Planning Given Uncertainand IncompleteInformation. In Proceedingsof the Ninth Conference on Uncertainty in Artificial Intelligence, 350-358.WashingtonDC. Mansell, T. M. 1993b. Air CombatPlanning Using UPlan for Post MissionAnalysis. In Proceedingsof the First Australian and NewZealand Conference on Intelligent InformationSystems,644-648.Australia. Van Nenmann,J. and Morgenstern, O. 1947. Theory of Gamesand EconomicBehawour.2nd ed. Princeton, New Jersey: PrincetonUniversityPress. Mansell, T. M. 1994. Planning Given Uncertain and IncompleteInformation. Forthcoming. Shafer, G. A. 1976. A MathematicalTheoryof Evidence. Princeton,NewJersey: PrincetonUniversityPress. Wilkins, D. E. 1988. Practical Planning:Extendingthe Classical AI PlanningParadigm.Los Altos, California: MorganKaufmann.

Quantitative Operator Selection for Planning Under...

Related documents

Products

Support

Quantitative Operator Selection for Planning Under...

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib