Addressing Preemption Costs in Multi-Agent Resource Allocation for Medical Applications Robin Cohen

Expanding the Boundaries of Health Informatics Using Artificial Intelligence: Papers from the AAAI 2013 Workshop Addressing Preemption Costs in Multi-Agent Resource Allocation for Medical Applications John A. Doucette Robin Cohen David R. Cheriton School of Computer Science University of Waterloo 200 University Avenue West Waterloo, Ontario, N2L 3G1 David R. Cheriton School of Computer Science University of Waterloo 200 University Avenue West Waterloo, Ontario, N2L 3G1 lize values computed when generating ‘transfer of control’ (TOC) strategies (Scerri, Pynadath, and Tambe 2002) to provide a basis for accurately estimating costs. Abstract In this paper we offer an approach for reasoning about resource allocation and scheduling in multiagent systems that takes into consideration the costs of preempting an agent from its current task. We apply our methodology to the motivating medical application of allocating doctors to patients in hospitals during mass casualty incidents and demonstrate noticeable improvements in performance (generating far fewer problem patients) over competing approaches that do not model the costs of preemption in sufficient detail. In particular, our approach offers a method for addressing the challenges of cyclical dependencies in the estimation of preemption costs by localized agents through a combination of planning techniques. Background Medical Scheduling and Resource Allocation Recently, partially and fully distributed agent based approaches have begun to appear in hospital scheduling. One of the principal reasons for adopting these approaches is the inherently distributed nature of the hospital scheduling problem: each department within a hospital may set its own schedule independent of the schedules used by other units, which can create inharmonious global schedules. Decker and Li applied their Generalized Partial Global Planning algorithm to hospital scheduling to improve throughput and decrease the duration of patient stays (Decker and Li 1998), but acknowledged that their approach did not generate optimal schedules (though optimal plans for dynamic environments offer limited improvements). Vermeulen et al. proposed a different approach through the utilization of automated Pareto efficient appointment time swaps between different patient agents (Vermeulen et al. 2007). This system had the advantages of being incentive compatible for patients, and of being fully distributed, but limited its scheduling changes to voluntary exchanges. This can result in suboptimal scheduling when the incentives of individual patients do not align with the goals of the hospital itself. Paulussen et al. (Paulussen et al. 2004) address this with a market based coordination mechanism called MedPAge for the problem of patient scheduling, in which the agents representing patients attempt to obtain appointments that maximize improvement in their patients’ ‘years of well being’. This was accomplished through negotiated timeslot swaps, managed by agents on behalf of their respective patients. An agent holding a slot would relinquish it if they expected the adverse effects on their patient to be less than the positive effects of allowing the swap. The mechanism is market based in the sense that the preempting agent specifies an amount it would be willing to pay in order to facilitate the swap, and the present holder of the timeslot will relinquish it only if the amount exceeds its expected losses in terms of years of well being. However, the MedPAge system computes only a Introduction Recent work applying multiagent systems in the medical domain to assign doctors’ limited time to patients with different degrees of need (Paulussen et al. 2003) has produced promising results using a market-based coordination mechanism. The MedPAge approach adopted by Paulussen et al. however falls short in scenarios where preemption of resources is an issue, because of a worst case estimate of resource preemption costs. We envision the scenario of a Mass Casualty Incident (MCI), wherein a few doctors must be dynamically assigned to many patients, as an application area where this shortcomming would become especially important. In this work, we provide a more detailed description of the costs associated with preemption in a MedPAge-like system. We show through simulations that the new model can better balance the requirements of patients during situations where very large numbers of patients are present at once, as in a MCI, and so can reduce the frequency of ‘problem patients’ (i.e. those with critical status). This is accomplished utilizing techniques from the multi-agent subfield of adjustable autonomy (Hexmoor and Castelfranchi 2003), which deals with coordination and planning for agents that act as codecision makers with human entities. In particular we utic 2013, Association for the Advancement of Artificial Copyright Intelligence (www.aaai.org). All rights reserved. 14 first-order approximation of the true costs of these preemptive swaps. In this approximation, the present holder of a resource considers the costs of preempting other timeslots itself when computing its expected losses. Its losses are simply the value of its current appointment time, less the value of the next best time it could reasonably expect to obtain. To determine which timeslots this first agent may reasonably expect to acquire, this agent must discover the costs of preempting the appointment times of other agents. However, each of these agents has the same problem: to determine the costs of losing their current timeslots, they need to know the costs associated with taking the slots of other agents. The presence of cycles in these dependencies can make the discovery of true costs computationally expensive. To avoid this issue, when one agent (the preemptor) asks another (the preempted) to compute the costs of a preemption, the preempted MedPAge agent assumes all other agents in the system lack capacity for planning, and that any third agent (the victim) that it attempts to take the timeslot of will be left with the worst timeslot of all: the back of the queue. This worst case estimation of the costs of these victims can cause overestimation of the preempted agent’s costs. As a consequence, the preempted agent will require a larger payment from the preemptor than is strictly necessary, which can prevent needed preemptions from taking place. The model we present in this paper attempts to address this issue using a combination planning and user modeling techniques. Using these techniques, our agents can more accurately estimate the costs of preempted agents, and thus allow more preemptions to take place. tems like the Electronic Elves project (Scerri, Pynadath, and Tambe 2002). A TOC strategy is a plan specifying who to engage when an agent needs to obtain a decision. For example, an agent might be asked to schedule a meeting time between many different humans. The agent may have some idea about the optimal time, but knows that there is some uncertainty in its guess. To minimize the probability of a mistake, the agent can transfer control of decision making to an alternative entity (i.e. a human or an agent) which it believes could make a more informed decision. If the entity declines to make the decision, or if circumstances change before the entity decides, the agent can transfer control to other entities or make the decision itself (Scerri, Pynadath, and Tambe 2002). A sample TOC strategy might look like {e1 t1 e2 t2 A}, representing a transfer of control to entity 1 initially, then to entity 2 at time 1 if e1 has declined to respond, and then having the agent (A) make the decision itself if necessary at time 2. The agent can plan out the optimal sequence of TOC operations by estimating the quality of decisions each entity could make, and by incorporating information about their availability for decision making (e.g. a human’s calendar and sleep habits; an agent’s available computational resources). (Cheng and Cohen 2005) extended TOC strategies to incorporate partial transfers of control (PTOCs) which first ask entities questions, before deciding to make full transfer of control. Vitally, the process of generating transfer of control strategies is one of optimization, typically conducted using branch-and-bound search heuristics, and the value used in the optimization procedure is equal to the expected utility of executing the strategy. That is, optimization proceeds by maximizing the integral of the expected quality of the decision made by the entity currently in control of decision making multiplied by the probability that the decision is actually made at the current time: ∞ PT (t) ∗ EUedcurrent (t) dt EU = Mass Casualty Incidents A mass casualty incident (MCI) occurs when a large number of patients simultaneously and unexpectedly suffer (potentially) critical injuries in the same time and place (Mistovich 2000). They are distinct from large scale emergencies such as floods insofar as such events are typically anticipated by communities in which they occur, and plans for mitigating their effects already exist. In regions where MCI are common, first responders rapidly reach the site of an attack and transport patients to a nearby hospital (Hirshberg 2004). Crucially, patients are overwhelmingly likely to be transported to the nearest hospital, rather than the hospital best equipped to care for a large number of serious patients (the associated odds ratio has been measured at nearly 250) (Hirshberg 2004). Consequently, MCI represent scenarios where a large number of critical patients may unexpectedly arrive at an ill-equipped hospital simultaneously, and so may pose problems for existing scheduling systems. 0 For example, if the expected utility of a decision made by entity 1 in the TOC strategy above is 5, and the probability of entity 1 making the decision before time 1 is 0.4, then the expected utility of starting a TOC strategy with e1 t1 is 0.4 ∗ 5 = 2. PTOCs are handled by applying a weight to the expected value of the TOC operations that would be taken given each possible response, equal to the probability estimate for that response being produced. The value of EU for a TOC strategy represents the expected value of executing the strategy. Later, we use this fact to provide an estimate of the costs of preemption. Transfer of Control Strategies Cyclic Dependencies in Preemption Cost Estimation Although many planning techniques could be used with our proposed system, we elect to use Transfer of Control (TOC) strategies. In the health domain, resources may often take the form of doctors or other humans. TOC strategies are a natural representation of this domain, as they were developed to model systems involving human interaction. TOC strategies were created as a planning technique for use in mixed-initiative adjustable autonomy multi-agent sys- The principal contribution of this work is a novel method for addressing the appearance of cyclic dependencies in preemption cost estimation. Cyclic dependences arise when the cost of an attempt by one agent (A) to preempt the resource or timeslot of another agent (B) is coupled to the cost of B preempting the timeslot of a third agent (C). If the cost of B preempting C is also dependent on the cost of C preempting A, then computation of the costs cannot be accomplished 15 directly. In previous work on multi-agent medical scheduling (e.g. (Paulussen et al. 2003)), these issues were avoided through the implicit use of worst-case bounds as estimations of the costs of the second preemption (i.e. B preempting C), but this approach results in an overly cautious use of preemption. In contrast, the system we propose in this paper has agents relinquish their timeslots only if, given their local information, they believe a reasonable appointment time could be acquired. The following example illustrates issues with the existing approach which can prevent valid, important preemption from taking place. Suppose for example, three agents (Alice, Bob, and Charlie) all need a medical procedure, and the present schedule for the procedure is {Bob, Charlie, Alice}. Yet suppose: Alice must be treated first, or she will suffer life long debilitating illness; Bob needs to be treated either first or second to avoid the same fate, but the order does not make a difference; Charlie can be treated at any of the first three time steps, but suffers a high chance of unpleasant, but transient, illness if he is treated in the third step. The optimal schedule is thus {Alice, Bob, Charlie}. Given the present schedule, Bob and Charlie are both content with their arrangement, and do not have any incentive to change it. 1: Alice needs to be treated first, so her agent opens negotiations to preempt Bob’s appointment. 2: A swap of Alice and Bob’s slots will only result in a positive net utility if Bob can preempt Charlie’s slot, so Bob’s agent opens negotiations with Charlie’s agent. 3: A swap of Bob and Charlie’s slots will only produce a positive net utility if Charlie can preempt Alice’s slot. 4: If we do not allow cyclic dependencies, and use a worst case heuristic (following (Paulussen et al. 2003)), Charlie’s agent reports the cost for placing Charlie in position 4 in the schedule. This would result in no net gain, so the preemption is rejected. Using a worst case heuristic for estimating the costs of preemption prevents convergence to the optimal schedule. Consequently, a more precise estimate is required to obtain optimal performance. disengage from the system. Similarly, if the professional is involved in a sensitive task (e.g. surgery), they may not acknowledge interruptions by the system. Following (Cohen, Cheng, and Fleming 2005), for each user, we adopt a user-specific exponentially decaying model of bother, where requests for direct preemption contribute to a human resource’s bother, and the contribution is proportionate to the importance and relevance of the request. Relevance is estimated using a U ser U nwillingness F actor (U U F ) and an Attentional State F actor (ASF ), problem specific values which are based respectively on the user’s ability or willingness to do the task in question and the inverse of the user’s attention level. For example, a doctor who is presently doing routine work in a clinic would probably have very low U U F if asked to save an emergency patient’s life instead. Once in surgery, the doctor would have a very high ASF from the system’s prospective because she is unlikely to respond to requests while engaged in that task. The base cost of initiating a new TOC operation in this model is given by Init = U U F × ASF × T OC Base Bother Cost, where the last term refers to the inate difficulty of the task. Typically this is low when querying a user for their preferences and high when asking a user to make a complex decision, but the precise values are problem dependent. The ultimate bother cost of a TOC operation is given by BC = Init + BC Inc F n(BSF, U U F ). The Bother Increase Function BC Inc F n describes the rate at which bother increases as a function of the user’s present perception of how often the system is bothering them, and how frivilious the task appears to the user. Typically this is a function of the form BSF U U F , where BSF is an exponentially weighted moving average of the bother from previous interactions. To support a fully distributed system, we utilize the type IV multiagent coordination algorithm from (Cohen, Cheng, and Fleming 2005). Agents build an estimate of the bother costs associated with a request by querying a resource’s proxy agent to obtain the values of U U F and ASF at the current time. When the time comes to make the TOC request, the querying agent verifies with the resource’s proxy agent that the estimate is within some small range of the true value. If it is not, the agent adjusts it TOC strategy as necessary, and may try again. Finally, patients within the system are modeled by an initial health state H(0), and an initial condition C = dH dt . At time t from the patient’s appearance at the hospital, the patient will have health state t Cs ds H(t) = H(0) + Model In our scenarios,the resources we have in mind are primarily medical professionals (e.g. doctors). Medical resources are assigned proxy agents, who can estimate the time taken by a resource to perform tasks (e.g. a doctor treating a patient). User Modeling In the case of human medical resources, we also incorporate techniques from user modeling to estimate the probability of response to a direct preemption. In a direct preemption, an agent requests a resource from the person who is currently using it, rather than acquiring an appointment for some future work. Direct preemptions facilitate the acquisition of medical resources presently in routine use for treatment of an emergency patient. For example, a doctor making the rounds might be called from his current patient into surgery. If a system makes many direct preemption requests, especially requests which do not account for the severity of the situation currently being addressed by the medical professional, the professional may 0 where Cs is the condition of the patient at time s (since conditions can change over time). Note that C = dH dt may contain non-deterministic components if desired, though we do not make use of this possibility here. A patient whose health state reaches 0 is deemed to be cured, while a patient whose state reaches some problem-specific maximum (Hmax ) requires immediate treatment to avoid long term health repercussions, and is referred to as a ‘problem patient’. Problem patients consume a disproportionately large amount of hos- 16 pital resources, and should consequently be avoided. TOC strategy with respect to the associated resource. When another agent attempts to preempt an agent’s presently held appointment time, or inquires about the present value of the appointment to the agent for use in construction of a plan of its own, the agent reports the expected utility of keeping its current timeslot, less the expected gain from executing its plan. The former term is the negation of the expected change in the patient’s health state H(t) if they keep their current resource, perhaps in years of life lost, over a fixed time scale. The latter is simply the expected utility of the agent’s optimal TOC strategy: the maximization of the integral in the Transfer of Control Strategies section above, with respect to the associated patient’s utility, again in terms of H(t). The plan may also be periodically regenerated to avoid having it become out of date. Our TOC strategies support three operations. A full transfer of control (FTOC) is used to initiate negotiation with another agent over a timeslot. The expected value of this FTOC is based on the probability that the preemption will succeed, computed using G(told , tnew ), the preemption cost estimate reported by the other agent at the time of strategy generation, and a gradual decay in value based on the length of time into the future that the FTOC would take place. A partial transfer of control, or PTOC, can be used to ask other agents or hospital staff for information regarding the status of other patients, and take actions dependent on their reply. Finally, a strategy generation node, or SG Node, is used to initiate generation of a new TOC strategy to replace the existing one. This might take place if new information comes to light which could dramatically improve the agent’s planning, or if the agent makes a TOC request which the bother model determines to be using out of date information. TOC strategy generation can be accomplished using branch-and-bound search, or using a dynamic programming approach. The latter produces solutions which are certain to be optimal, while the former has superior run times for many problems(Scerri, Pynadath, and Tambe 2002). In this work, we use the dynamic programming approach to remove this possible source of error from our preemption cost estimates. Using TOC strategies constitutes pre-computing contingency plans and allows agents to respond to requests for preemption without querying other agents at the time of the request. This resolves the cyclic dependency problem by providing a reasoned estimate of the true preemption cost without allowing cycles to form. Negotiation The core of our new framework lies in its negotiation protocol. We use a protocol similar to MedPAge (Paulussen et al. 2003), but with several notable differences. First, our protocol places no restrictions on the number of agents who can act at once. That is, there is no concurrency requirement in our protocol. Second, our protocol allows agents to interact with human actors directly, as well as with other agents. This is useful insofar as the system can be proactive in gathering information, or attempting direct preemptions (which might be required in MCI). The underlying bother model mentioned above helps ensure that the system makes only reasonable requests of human users. Third, our system allows for probabilistic response times as a natural consequence of supporting interaction with humans. Instead of receiving an immediate response, agents in our system expect some delay (following a known distribution) which might be caused by network latency, damage to the hospital, or prolonged reasoning (by humans or machines). Finally, our system assumes fully cooperative agents, rather than the partially cooperative ones used in earlier work (Paulussen et al. 2003). The basic negotiation process is presented below as part of the behavior of a patient agent throughout: 1. Let A be an agent representing a patient with health state HA , and condition C. 2. Let P be a plan of action for A which specifies an optimal (or near optimal) order for A to request medical appointments (see Preemption Cost Estimation below). 3. Let p be the appointment time suggested for acquisition at the current time by P , and told be the present appointment time for A with the specified resource. 4. If p is unowned, acquire it. 5. Otherwise, if p is owned by agent B, compare the gain made by A preempting (G(told , p)) (as calculated below) with the TOC-based preemption cost estimate produced by B. 6. If the gain exceeds the cost, A takes p, and B accepts the closest open appointment time as a temporary location, and now searches for a new appointment time based on its own plan of action. 7. If p is the last step in P , generate a new plan. 8. Otherwise, unless A’s patient has recovered, remove p from P , and GOTO step 3. Experiment To validate the new model, we implemented a version of it for a MCI scenario, and compared it with two ablated models using worst-case estimates of preemption costs, following (Paulussen et al. 2004). The first ablated model additionally prohibits direct preemption (as in MedPAge) while the second relaxes this constraint and constructs a MedPAge variation that is more fairly compared to our proposed algorithm. The comparison systems were otherwise identical to our proposed framework, including the same bother cost model, and the same modified negotiation procedure. Our simplified scenario featured four abstracted conditions, Routine, Minor, Major and Emergency. These con- Preemption Cost Estimation Our framework ties the costs of preemption to the generation of the plans of action mentioned above. In essence, a plan of action is a set of TOC strategies, each of which specifies the sequence of TOC operations which is expected to lead to the optimal decision for the appointment time of the agent’s patient. At each time step, an agent processes any responses received since the last time step, and then executes any TOC operations required by its plan. Whenever an agent acquires an appointment time, it computes a new 17 80 ● ● 20 40 %PP / %PS 60 action. We assume that the hospital had sufficient capacity to hold all patients who arrived there. Three parameters were varied in our simulations. First, the number of medical resources was varied between 5, 10 and 15. Again, small numbers were used to facilitate efficient computation on the limited computational resources. Second, the number of patients produced by the MCI was varied between 25, 50, 100, 200 and 400. Finally, the distribution of initial patient health states was varied between U (0, 50), U (0, 100), and U (50, 100), where U (x, y) is a uniform distribution over (x, y). We set Hmax to 200 for consistency with the bother model. In the first experiment, we utilized a single parameterization with 400 patients, 15 doctors and low initial patient health states. In the second experiment, we conducted runs on every combination of values for the three parameters. In each case, we ran both the new model and the ablated comparison model on the same 100 random seeds, resulting 100 unique initial conditions. A patient who reached a health state of Hmax was ruled a problem patient, and removed from the simulation in much the same way as a cured patient to insure the eventual end of the simulation. A single parameterization took approximately 30 minutes to simulate from start to finish on a single 1.8GHz CPU core. ● Full Ablated Paired Figure 1: Violin plots (Hintze and Nelson 1998) depicting the distribution of problem patients (%PP) for the new system (left) and an ablated version incapable of preemption (center). Also shown is the distribution of the pairwise improvement produced by using the new system on each scenario, in terms of patient’s saved (%PS) by using the new system (right). Violin plots depict a box plot with a rotated kernel density plot (smoothed histogram) on either side. Results Our results are summarized in figures 1 and 2. The former depicts the mean difference between the performance of our method and the very simple ablated model in terms of the percentage of patients who became ‘problem patients’ (%PP) during the course of the simulation, or the percentage of patients saved (from becoming problem patients) by using the new system (%PS). The latter shows the same measurements for the more complex ablated model. Tests of significance used parametric or non-parametric measures depending on whether the data were consistent with a normal distribution, and Bonferroni corrections were used where applicable. R produced the results (Team 2009). The results of comparing our model with an ablated version incapable of direct preemption are shown in figure 1. The first two plots show the overall distributions of %PP for each method over 100 runs, while the third shows the paired difference for each scenario in terms of %PS. The paired difference is approximately normally distributed with a mean %PS of 20.79% in favour of the new method and 95% confidence interval (19.55, 22.05). The results of comparing our model with an ablated version capable of direct preemption appear in figure 2. Confidence intervals were omitted for clarity, but were never larger than ±1%. We a used four factor ANOVA to compare the %PP of the two models, with the three simulation parameters and the preemption model as predictors. All predictors were highly significant (p < 1.0e − 16), with a mean difference in favour of the new model of roughly 4%PS averaged across all parameter settings. Nearly all interaction terms were significant as well. In particular, we found that larger numbers of medical resources and patients tended to produce larger gains for the new system, up to some saturation point, while low initial patient health states produced gains as well. ditions produced a value for dH dt of 0.04H, 0.08H, 0.12H and 0.16H respectively. Thus, for example, a patient with a Routine condition and a criticality of 50 would have a criticality of 50 ∗ 0.04 + 50 = 52 if left untreated for a single time step. Medical resources were generated with uniformly random efficiencies for treating each condition. Efficiencies were drawn uniformly from the range [0 − 20], and functioned as a negative modifier to dH dt during treatment. For example, a doctor efficiency for 20 for the Emergency condition, treating a corresponding patient produces a change in health state of dH dt = 0.16 ∗ H − 20 We limited TOC strategies to a fixed depth of 5 time steps ahead. Since patients had only one condition, a plan of action consisted of just a single TOC strategy. This was done to limit the computational cost and number of experiment parameters. We utilized the default values for the bother cost model suggested by (Cohen, Cheng, and Fleming 2005) for simplicity. Attentional State F actor and U ser U nwillingness F actor were generated as a function of health state of a medical resource’s current patient (if any) and the expected utility gain of the preemption (G). Our MCI scenario starts with a small number of patients at the hospital (drawn with uniform probability from the range (0 − 15), all with routine conditions. The MCI is simulated by generating a large number of patients (the exact number is a simulation parameter, see below). Each patient is assigned one of the four conditions with uniform probability, and an initial health state drawn from a simulation specific distribution. Patients are then assigned an arrival time at the hospital, drawn with uniform probability from the range of (0 − 20) time steps. At the time of arrival, each patient is assigned a new patient agent, which generated a new plan of 18 40 ● ● ● 5 Resources 10 Resources 15 Resources 20 40 ● 5 Resources 10 Resources 15 Resources 20 ● 20 40 ● Very High Initial Health State High Initial Health State Medium Initial Health State 5 Resources 10 Resources 15 Resources ● ● ● 200 300 400 ● ● ● −20 −40 −40 −40 100 ● 0 ● %PS 0 %PS ● −20 0 ● −20 %PS ● ● 100 # Patients 200 # Patients 300 400 100 200 300 400 # Patients Figure 2: Mean improvements in %PS over the three parameters. Points above 0 show the advantage of using the new algorithm. Discussion namically. We also hope to apply the system directly to realworld MCI data. We have proposed a method for multi-agent resource allocation using TOC strategies to provide more detailed estimates of preemption costs, illustrated for the scenario of medical scheduling. Our new system relies on pre-computed contingency plans in the form of TOC strategies to provide estimates of the utility costs of a preemption. Using TOC strategies also provides us with the ability to interact directly with human entities (inside a hospital, nurses and doctors), to gather additional information or to make ‘direct’ preemption requests (moving a doctor from a routine case to an emergency mid-treatment). A detailed bother cost model prevents the system from harassing human users with frivolous requests. We verified our proposed framework in simulated mass casualty incidents, and showed that it performed better than using worst-case bounds on preemption cost, when: (i) There were many resources; (ii) There were many users; (iii) Most users could be serviced given the time allowed and the resources available. Conversely, although the ablated system most similar to existing work did not perform well, a variant which supported direct preemption performed well when: (i) There were very few resources; (ii) There were very few users; (iii) Most users could not be serviced, even with an optimal schedule. We suspect that these highly congested domains cause the agents in our system to overplan, which leads to incorrect estimates of preemption costs. Though this issue will need to be addressed in future versions of the system, we conclude that the new system could be well suited for use in hospitals during MCI, and in other similar problem domains which require direct preemption and have both many users and many resources, possibly since these scenarios are more likely to contain cyclic dependencies. In future, we plan to repeat our experiments with the incorporation of learning algorithms. We expect to address the fact that agent plans may be out of date by integrating the learning of both congestion (ratio of supply and demand for resources) and churn (rate of change of assignments in the system), with past experiences providing the starting point. This should provide more robust performance across all scenarios by adapting the plan length and utility estimates dy- References Cheng, M. Y. K., and Cohen, R. 2005. A hybrid transfer of control model for adjustable autonomy multiagent systems. In Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems, AAMAS ’05, 1149–1150. New York, NY, USA: ACM. Cohen, R.; Cheng, M. Y. K.; and Fleming, M. W. 2005. Why bother about bother: Is it worth it to ask the user? In AAAI05 Symposium on Mixed-Initiative Problem-Solving Assistants. Decker, K., and Li, J. 1998. Coordinated hospital patient scheduling. In Proceedings of ICMAS’98, 104 –111. Hexmoor, H., and Castelfranchi, C. 2003. Agent Autonomy. Kluwer Academic Publishers Group. Hintze, J. L., and Nelson, R. D. 1998. Violin plots: a box plot-density trace synergism. American Statistician 52. Hirshberg, A. 2004. Multiple casualty incidents lessons from the front line. Annals of Surgery 239(3):322–324. Mistovich, J. J. e. a. 2000. Brady Prehospital Emergency Care Sixth Edition. Prentice Hall Health. Paulussen, T. O.; Jennings, N. R.; Decker, K. S.; and Heinzl, A. 2003. Distributed patient scheduling in hospitals. In Proceedings of IJCAI’03, 1224–1232. Paulussen, T. O.; Zller, A.; Heinzl, A.; Pokahr, A.; Braubach, L.; and Lamersdorf, W. 2004. Dynamic patient scheduling in hospitals. Agent Technology in Business Applications. Scerri, P.; Pynadath, D. V.; and Tambe, M. 2002. Why the elf acted autonomously: Towards a theory of adjustable autonomy. In Proceedings of AAMAS’02. Team, R. D. C. 2009. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Comp. Vermeulen, I.; Bohte, S.; Somefun, K.; and Poutr, H. 2007. Multi-agent pareto appointment exchanging in hospital patient scheduling. Service Oriented Computing and Applications 1:185–196. 19

Addressing Preemption Costs in Multi-Agent Resource Allocation for Medical Applications Robin Cohen

Related documents

Products

Support

Addressing Preemption Costs in Multi-Agent Resource Allocation for Medical Applications Robin Cohen

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib