Search in the Context of Complex Robot-World Interaction Sanjiv Singh The Robotics Institute Carnegie Mellon University Pittsburgh, PA15213 ssingh@cmu.edu From: AAAI Technical Report WS-97-10. Compilation copyright © 1997, AAAI (www.aaai.org). All rights reserved. Abstract In contrast to typical planningproblems,automatedearthmoving operationssuchas digginga trench or leveling a mound of soil posesomeadditionalchallenges.First, soil is diffuseand thereforea uniquedescriptionof the worldrequiresa verylarge numberof variables. Second,the interaction betweenthe robot andthe worldis verycomplex to the pointthat it is notpossible to accuratelypredict the effect of a candidateaction, in this paper, 1 pose planningfor earthmoving operationsas a problem of constraintoptimization.I discusswhyon-line searchis necessary and presentexperimentalresults froma testbed wherea robotis taskedto diga trench. Introduction In our laboratory, we develop automation for machines that operate in industries such as mining and construction. Three characteristics distinguish such machines. They are capable, e~cient and safe. Capability has to do with scale--forceful interaction with the world requires machinesthat can generate large forces. Efficiency boils downto performinga task as fast as possible without compromisingoutput. Safety is a matter of ensuring collision free operation and in some cases avoiding tipover. other state, through the use of a reasonably competentagent. To be considered successful, the agent must independently cause the transition for a Sufficiently large set of initial and goal states. There are two maindifficulties with operating in this particular domain. First, the dynamics of the interaction between a robot earthmover and the world are very complex. This meansthat it is difficult to produceaccurate forward models of actions that the robot might choose. Second,since soil is diffuse, the numberof variables required to uniquely specify the state of the world is extremely large. In other words, the configurationspacethat the robot operates in is essentially infinite. I have formulated the task of planning earthmovingoperations as a problemof constraint optimization. The nature of the models in this domaindictates the use of search. Further, since the large configuration space affects the choice of the control parameters, there is no hope of doingthe search off-line. Several researchers have adopted a similar view to planning for diverse tasks such as robot juggling and robot pool playing (Jordan 90, Moore 92, Schaai 94). Feiten and Bauer have devised a planner for a mobile robot operating in a cluttered environmentsthat plans in the space of actions rather than in Used judiciously, modelscan greatly help in increasing effithe configuration space of the vehicle (Feiten 94). Kelly ciency and safety. By "models" I mean physical, numerical describes a methodologyfor navigation of a cross country models of mechanismsand their interaction with the world. robot based on like principles (Kelly 95). Myapproach is also (Modelsof planning and sensing are also useful, but I do not similar in motivationto workin classical optimal control (Kirk address themhere.) That is, givefi a candidate input, these mod- 70) in that it seeks to determinethe control signals (plans) that els predict the corresponding output of a (sub) system. will both satisfy someconstraints as well as optimizesomepercourse we would like invertible models--models that compute formancecriterion. the input necessary for a desired effect--but these are rarely BelowI discuss five principles developed from myexperience available. Good forward models are often the best we can with a planner for earthmoving. achieve. They provide the proaction that is necessary for saccessfui goal directed activity, at least in the environmentsin ¯ Develop a tractable representation. Since the configurawhich we must put our autonomousmachines. Wheninvertible tion space for this domainis intractable, another frameworkis necessary. Search is conductedin the space of control paramemodelsare not possible, search is necessary. ters (action space) that abstracts the prototypical task that the In this paper I will concentrate on one task---earthmoving-robot must perform. It is also necessary to develop the notion that highlights the approach espoused above. I have a robot manipulatorthat is equippedwith a scoop ("bucket") and a sen- of an atomic action and a forward modelthat predicts the effect of an action on the world. Finally, we need an evaluation funcsor that allows it to measurethe shapeof the terrain aroundit. I wouldlike the robot to dig a trench in a sandboxas per spec- tion to measuresthe utility of an action. ification, or perhapslevel a moundof soil. Onthe surface, such ¯ Reduce the search space. On-line search dictates compacmessof the search space. In somecases an analysis of the a problemis familiar to researchers in artificial intelligence. The world is in someinitial state and must be modified to some mechanicswill often showthat not all the parametersare inde- 97 pendent.In other cases, if it is foundthat the utility function is monotonicin relation to one of the parameters, then it needn’t be searchedfor. ¯ Use force as well as geometry to constrain planning. Geometryof the world and mechanismprovides useful constraints on the action space. However,force constraints that arise from the mechanicsof robot-world interaction can effectively represent the boundson whatthe robot can actually do in a world wherefriction and torques are significant. Noa priori modelsmaybe available in which case it is necessary to learn these models from experience. ¯ Use multi-resolution search and heuristics. Abstraction of the search into levels of varying granularity makesthe search more efficient. In somedomainsthe addition of commonsense heuristics, can make the progression of the plans muchmore natural. ¯ Take plans generated with a grain of salt. If meaningful models of uncertainty can not be incorporated into the planning, the plans generated through the kind of process described above should be thought of a means of being proactive. The plans are best handedto lower levels controls that are tightly coupled to the world with feedback. Developing A Tractable Representation parameters used to define an action that a robot is capable of executing. Eachpoint in this space represents an atomicaction. The space can be separated into two sets-- the set of all feasible actions and the set of actions knownto fail. Anaction might fail becauseit is impossibleto achieve or it results in an undesirable effect¯ The task of an action space planner, then, is a familiar problemin optimal control: Maximize~Minimize h(lk) subject to g(lli) where hO is a utility function and gO are constraints that delimit the set of feasible actions and lli spansthe set of actions that a robot can execute. Note that the constraints change wheneverthe state of the world changes. Since we cannot predict the state of the world in responseto robot actions and there is a large branchingfactor (manyactions maybe chosenat any step), it is not practically feasible to consider sequences of digging actions. Hence the searching will involve identification of only the action that is optimal over a single planning step. An atomic action Since an action space encodes actions, not mechanisms,our task representation will not include any details about the configuration of the mechanism.Instead, we will encodethe trajectory followed by the excavator bucket (Fig. 2) To develop a planner for a task such as digging a trench, we need three things: a search space, a forward modeland an ~aluation function. The Search Space Instead of abstracting robot and world state, we could abstract the task that the robot is to performsuch that atomicactions are described by a compactset of parameters. The space spanned by these parametersis called an action space. At every step, the robot selects fromthe set of feasible actions, one that optimizes somecost criterion. The chosen plan is guaranteed to satisfy constraints (e.g. avoidcollisions) as well as optimizea cost criterion (e.g. minimizejoint torques). Fig. I provides a schematic view of this optimization. Fig. 2 Manipulatorarm equippedv,~ a bucket, creating a trench. There are an infinite numberof digging trajectories that could be used so it is necessaryto consider a smaller set--those that are of a particular form. For example,we could base the notion of an atomic action on the patterns used by humanoperators. There are typically three phases to an operationm penetrate, drag and curl. Trajectories of this formcan be parameterizedby a smallerset (Fig. 3). ~ataP~neet all constraints andare"optimal" Fig. l A ~hematlc viewof constraintoptimization.I More formally, an action space is spanned by the range of 98 This representation requires six variables and a functional to represent uniquely: k is the distance from a fixed reference frame to the point wherethe bucket enters in the soil, a is the angle at whichthe bucket enters the soil. It travels along this angle for a distance d!, and then follows a horizontal path for a distance d2. Finally the bucket rotates throughthe soil along an Using Force & Geometry to Constrain ,w r Planning Recall that each point in an action space represents a unique action. Wecan poseconstraintsin the regionsof the spacethat correspondto plans that the plannershouldstay awayfrom. / ,,=k , Geometric Constraints .... -d; -- Fill. 3 A typical dig duringa trenchingoperation.A diggingtrajectory of this formcanberepresented by (k, oh1, d2, d3, r, PO). Aplan maybe geometricallyinfeasible becauseit requires a robot to exceedits rangeof motionsor becauseit violates some geometriccriterion associatedwith successfulexecutionof the task. Theteachability constraint. Fig. 4 showsa graphical depiction of the set of diggingactions that are withinthe workspace of the robotfor the veryfirst dig. Thesurfacethat is formedby this three-dimensional set representscritical pointsthat are on The Forward Model the edgeof being reachable. Since the actions are parameterOneinteresting aspect of this problemis that no computation- ized witha larger set of variablesthancanbe visualizedin three ally tractable forwardmodelsare available. That is, given a dimensions,someof the critically reachableplans are in the shapeof the terrain, anda candidateaction, there is nohopeof interiorof the set. predictingthe resultant terrain. Finiteelementmodelsare availjoint limit able to performsuch simulationsbut since a single run might take tens of minutesto simulate,it is not possibleto use such modelsfor the purposesof planning. Wemightsatisfy ourselves with something less. For reasonableplans it is possible to approximatethe resulting trajectories. Underthis assumption, it is possibleto calculatethe forcesexperienced at the cutring edge (using a pseudo-static analysis) as well approximatethe-amountsoil that might be swept into the bucket. arcof lengthd3 andradiusr. Wewill alsoneedto determine the valueof P0 (the pitch angle of the bucket,relative to a fixed coordinateframe)throughoutthe dig. The Evaluation Function Typically,for earthmoving operationsit is essential to optimize the volumeof soil excavated.Sincethere is often morethan one limit actionthat will meetthis requirement, it is possibleto addother criteria such as minimizing the time taken to completethe dig, or minimizing the torqueson the joints of robot. Notethat the evaluationis goodfor only single actions and mustbe recomputedacross the rangeof the action spacewhenever the state of Fig. 4 Twoviewsof set of feasible plans that meetthe teachability the worldchanges.I assumethat the state of the world(topol- constraint.Thisset is specificto the joint limits of the robotusedandto the terrain before the action Is commenced. Thetmjectddesimplied by ogyof the terrain) can be measuredusinga sensor. threepoints are shownexplicitly. Theshadedregion showsthe amount of soil sweptby eachtrajectory. Reducing the Search Space Takennaively, the search spaceas describedaboveis too large to be searchedon-line. However,someanalysis of the space can makethe search moretractable. For example,if weassume that the amountof soil excavatedincreases monotonically with d2, its valuecanbe foundeasily if the othervariablesare instantiated. Valuesof P0, r, d3 can be foundfroma mechanical analysis of contactbetweenthe tool andthe terrain (Singh95a). Theshapingconstraint. TheshapingconsWaJnt for trenching keepsthe trajectory of the bucketfromgoingpast the shapeof the desiredtrench(dashedline in Fig. 5). Thatis, all trajectories that extrudepast these boundariesare excluded. Thevolumeconstraint. Of the geometricallyfeasible actions, somewill yield a partially filled bucket,somewill result in a full bucket andyet others will sweepthroughthe terrain to excavatemoresoil that can possiblybe held in the bucket. A Theresult is that our action space can be spannedby a compact simple calculation of swept volumeis used to predict the three-tuple (k, ft., d!). amountof soil excavatedby a diggingaction. All actions that excavatea volumelarger than a preset percentageof the bucket 99 Force Constraints If a robot earthmoveris infinitely strong, that is, it can muster any torque required, then it is sufficient to consider only geometric constraints. Morerealistically, for robots with torque limits, it is necessary to consider the required forces. If we could estimate the forces required for candidate digs, we would havea goodcriterion by whichto further restrict the set of digs that are geometrically feasible. Unfortunately, the interaction betweenan excavating tool and terrain is complexenoughthat no simple physics-based models are available to predict the resistive forces for arbitrary actions and terrain shapes. In earlier workI have showna methodthat learns to predict resistive forces basedon observationof the resistive forces, tool trajecFill. S Twoviewsof set of feasible plansthat meetthe shaping tories and the shape of the terrain from previously conducted constraint.Notethat all of thedigsherestaywithinthebounds of the digging experiments (Singh 95b). Fig. 8 comparesthe geomettrench that hasbeen specified (dashed line) ric and force constraints for the very first digging actionmfor somevalues of the parameters, the robot is not limited geometcapacity, are excluded(Fig. 6). rically but the corresponding actions can not be achieved becausethe required forces are too great. ~ Co~lmlmJ For~ Cmm~m (a) I ml Fig. 8 Comparison of geometric constraintsvs. forceconstraintsfor onevalueof k. Fill. 6 Twoviewsof set of feasibleplansthat meetthe volume constraint. All trajectoriessweep onlysweep upto asmuch soil that the Todetermineif a candidate dig passes the force constraint, the bucketcanhold force prediction function is called. Thepredicted resistive force and the robot’s trajectory are used to calculate the effective The set of actions that meet all the geometric constraints is joint torques required to overcomethe resistive forces. A canshownin Fig. 7. didate dig is force-feasible if the joint torques due to resistive forces do not exceedthe torque capability of the robot. UsingMulti-resolutionSearchandHeuristics So far I have examinedonly a planar problem, that of digging a trench as wide as the excavator bucket. For more complex problemssuch as the excavation of a foundation footing (Fig. 9), wecould abstract the problemsuch that at a coarser level the ultimate goal is broken downinto a sequence of trenches. The alternative is to makethe planner global by searching for additional variables that dictate base position. This has two problems. First, the search space becomesvery large and second, it is difficult to developan efficient sequenceof actions. ~1~7 Twoviewsof set of feasibleplans that meetthe shaping conslralnt. Sometimes, there are commonsense procedures that can be used at a coarser level of planning. For example,whendigging into a cavity wemight alwaysstart near the top, before digging further downinto the hole for the same reason as one starts 100 ~Cla~ "g.~,~:~, ~ ea~ni rangefinae~ ~./bucket ~_ Fill. I0 Testbed Fig. 9 Planviewof an excavatorbackhoe digginga footing. 1"he machine startsat A anddigsonesideof thefootingto a desireddepth. Second,the machine movesto B fromwhicha secondside can be excavated. A third setup(C) is achievedby rotation of the base. Excavation of thecenteris achieved by a fan-likepatternof digsfrom (D). sweepingstairs from the top and not the bottom. If relevant, such procedures can be incorporated into the coarse planning alleviating the need for search to generate such sequencing. Taking Plans Generated With a Grain of Salt For real worldapplications, it is never possible to modelall the effects of sensors and actuators. At best we might hope to capture first order or secondorder effects. It maybe possible to fold uncertainty modelsinto the planning directly and limit the feasible set by the marginof cumulative uncertainty. However, this is not alwayspossible and after all the modeling,plans are best thought of as a meansof proaction. Final execution of the plan is best left to a lowerlevel control schemethat has a tight coupling with the world. Control schemestypically are applicable in a narrowrange of operating conditions and in this light planning can be thought of as a meansto specify actions that fall within the operating conditions that where properties of convergence hold. Wewould like as muchfeedback as is possible--- state feedback before planning so as to plan properly, during execution so that unmodeledeffects can be compensatedfor, and after executionso as to set the stage for the next plan. Implementation duces an imageof the terrain such that the value of each pixei in the imagecorrespondsto the distance fromthe scanner to the world. This image is used to build a topographical mapof the terrain in the sandbox. The excavation cycle consists of three phases. First, a terrain mapis producedfrom range images taken by the laser scanner. Next, the planner chooses a digging action to execute that satisfies geometricand force constraints and optimizesutility criteria. Last, the action is executed by the robot and the cycle repeats until the terrain is sufficiently close to the specified goal. Whenthe search space is extremely large and answers are needed quickly, at the expense of optimality, a numerical methodsuch as simulated annealing can be used. In fact I have used an augmentedversion of simulated annealing to do the optimization in the past (Singh 91). However,the stochastic nature of the search means that the solutions found are not repeatable and it is very hard to performcontrolled tests where one of the constraints is modified. Further, exhaustive search (test and generate) with intelligent ordering of constraints works well enough for our purposes. While in the worst case this meansthat each constraint has to be evaluated for each candidate point in the action space, in reality testing early for conditions such as reachability prunes the search space.=ffectively. It is possibleto further select fromthis set basedon other criteria. such as time or joint torques required. For example,Fig. I l showstwo different trajectories selected for the sameinitial state of the terrain, given different evaluation functions. Note that both plans sweepthe sameamountof soil. Experimental Wehave developed a testbed to conduct experiments in subsurface sensing and excavation. The testbed consists of a sandbox (2.5m x 2.5m x lm), a hydraulic robot outfitted with an excavator bucket and force sensor, and a laser range finder. The setup of our testbed is shownin Fig. 10. Weuse a large industrial manipulator with an end effector payload of approximately 125 Ibs. A small excavator bucket with a volume of 0.01m3 serves as an end effector. The laser range scanner pro- Results Wehave conducted over 2000 digging experiments with our testbed. Fig. 12 showsa progression of one sequence in which the goal is to producea trench. In this experiment’the digging actions chosen sweepa volumeof soil very close to the bucket capacity. However,the actual yield in the bucket is a function of howcohesive the soil is--digging in loose, granular soil results is someof the soil spilling out of the bucket. Experimen- I01 Conclusions 116 . 1 1.6 : ...... Eanhmoving is an exampleof an application where the interaction betweena robot and the world is significant. To perform such a task efficiently, we will want to use every trick in the book. This includes the use of good forward models, mechanical analysis and multi-resolution search. .......... O.S 0.6 ...... (a) (b) FIg. 11Twodifferent trajectoriesselectedfor the same state of the terrain (a) evaluationlunctlon minimizes the largest torques(b) evaluation functionminimizes thetotal joint torques. 1.5 I have shownhowthe task of digging a trench can be stated as a problem of constraint optimization. Since the optimization happens one action at a time (necessary due to the uncertain forward models and large branching factor at every planning step), the planner is greedy and thus requires a higher level planner that ensures that proper subgoalsare specified. ., ! References (Feiten 94) Feiten, W. and Bauer, R., "Robust Obstacle Avoidance in Unknown& CrampedEnvironments;’ In Proc. IEEE international. Conference on Robotics and Automation, May1994. San Diego. ).s °n’= I~gel m,,~, I 1A vo~um:O.0137 9 9A q 1.6 ).s..........i........:.-.-.:.~.-...... ~........ ~a I.S oloDa ,,~, you,v,: o.m~ ~ ,.s 2 2.s G ......... :....... ".:.::.:.i:......i....... 3 1 I .s 2 2.s i i........ ,.s.......... ::.-.:i.: ......i........ ~;7 .~. Dq;DSm volume: o.o130 (Moore 92) Moore, A. and Atkeson, C., "An Investigation of Memory-baseFunction Approximatorsfor learning Control;’ Technical Report, MITAI Lab, 1992. ~..o.m~ 1 1.5 :) 2.S ......... i ......... ; ............................ i O~,~8 mveptvt~u~,:0.012S .8 1 1 .S 2 (Jordan 90) Jordan, M. I, and Jacobs, R. A, "Learning to Control an Unstable System with ForwardModeling;’ In Advances in Neural Information Sciences 2, MorganKaufmann, 1990. Kelly, A. J., "An Intelligent Predictive (Kzily 95) Control Approach to the High-Speed, Cross Country AutonomousNavigation Problem", Ph.D. thesis, Robotics Institute, CarnegieMellonUniversity, June 1995. Kirk, D. E., Optimal Control Theory, (Kirk70) Prentice Hall, 1970. 2.S Fill. 12Thefirst eightstepsin thecreationof a trench.Thedesired trenchIs shown by thedashed line. Theshaded regionis thevolume of soll Onm3) estimated to beswept by the excavator buCkeL tel results are encouraging. The planner examines approximately 2000plans in a few seconds and almost alwaysfind one that fills the bucket. Theplanner becomesinefficient as the profile of the terrain approachesthe desired goal. This is because the form of the prototypical plans does not produce efficient trajectories. In this case, it is possibleto executeopenlooptrajectories that follow the outline to the desired trench to scoop out the remainingsoil. 102 (Schnai 94) Schaal, S., Atkeson C, "Robot Juggling: Implementation of MemoryBased Learning;’ II~.~R Control Systems,Vol 14 (1), February,1994. (Singh 91) Singh, S. "An Operation Space Approach to Robotic Excavation;’ In Proc. IEEESymposiumon Intelligent Control. Alexandria, August, 1991. (Singh 95a) Singh, S., "Synthesis of Tactical Plans for Robotic Excavation," Ph.D Thesis, Robotics Institute, Carnegie Mellon Univ, January 1995. (Singh 95b) Singh, S., "Learning to Predict Resistive Forces During Robotic Excavation" In Proc. International Conference on Robotics and Automation, Nagoya, May 1995.