Robust Distributed Task Allocation for Autonomous Multi-Agent Teams

Ph.D. Candidate: Sameera Ponda
Thesis Committee: Prof. Jonathan P. How, Prof. Mary L. Cummings, Prof. Devavrat Shah
May 31, 2016

Motivation
- Modern missions involve networked heterogeneous multi-agent teams cooperating to perform tasks
  - Unmanned aerial vehicles (UAVs) – target tracking, surveillance
  - Human operators – classify targets, monitor status
  - Ground vehicles – rescue operations
- Key Research Questions:
  - How to coordinate team behavior to improve mission performance?
  - How to hedge against uncertainty in dynamic environments?
  - How to handle varying communication constraints?

Problem Statement
- Objective: Automate task allocation to improve mission performance
  - Spatial and temporal coordination of team
  - Computational efficiency for real-time implementation
- Problem Statement:
  - Maximize mission score
  - Satisfy constraints
  - Decision variables: team assignments, service times
- Key Technical Challenges:
  - Combinatorial decision problem (NP-hard) – computationally intractable
  - Complex agent modeling (stochastic, nonlinear, time-varying)
  - Constraints due to limited resources (fuel, payload, bandwidth, etc.)
  - Dynamic networks and communication requirements
  - Robustness to uncertain and dynamic environments

Planning Approaches
- Optimal solution methods are computationally intractable for large problems
  - Typically use efficient approximation methods [Bertsimas ’05]
- Most involve centralized planning [Bertsimas ’05]
  - Base station plans & distributes tasks to all agents
  - Requires full situational awareness
  - High bandwidth, slow reaction to local changes
- Motivates distributed planning [Sariel ’05, Lemaire ’04]
  - Agents make plans individually & coordinate with each other through consensus algorithms [Olfati-Saber ’07]
  - Faster reaction to local information
  - Increased agent autonomy
- Key questions for distributed planning:
  - What quantities should the agents agree upon?
    - Information / tasks & plans / objectives / constraints
  - How to ensure that planning is robust to inaccurate information and models?

Distributed Planning
- Centralized Problem:
  - Maximize mission score
  - Satisfy constraints
  - Decision variables: team assignments, service times
- Distributed Problem:
  - Maximize mission score individually
  - Satisfy constraints
  - Decision variables: agent assignments, service times
- Main issues: Coupling & Communication
  - Agent score functions depend on other agents’ decisions
  - Joint constraints between multiple agents
  - Agent optimization is based on local information
- Key challenge: How to design appropriate consensus protocols? [Johnson ’10]
  - Specify what information to communicate
  - Create rules to process received information and modify plans
  - Performance guarantees – is distributed problem good representation of centralized?
  - Convergence guarantees – will algorithm converge to a feasible assignment?

Distributed Planning – CBBA
- Consensus-Based Bundle Algorithm (CBBA) [Choi, Brunet, How ’09]
  - Iterations between 2 phases: Bidding & Consensus
[Flowchart: Phase 1: Build Bundle & Bid on Tasks (individual agents) -> Phase 2: Consensus (all agents) -> All agents consistent? Yes / No]
- Core features of CBBA:
  - Sequential greedy task selection – polynomial-time, provably good approximate solutions
  - Guaranteed real-time convergence even with inconsistent environment knowledge
- Key Contributions – extensions to CBBA framework:
  1) Time-varying score functions (e.g. time-windows of validity for tasks)
  2) Guaranteeing connectivity in limited communication environments
  3) Robust planning for uncertain environments

CBBA with Time-Windows
- In realistic continuous-time missions, have time-varying task scores
[Figure: three score profiles, each plotted as Score vs. Arrival Time: Time-window (e.g. monitor status, security shifts); Time-critical (e.g. rescue ops, target tracking); Peak-time (e.g. rendezvous, special ops)]
- Extended CBBA to continuous-time domains [ACC 2010]
  - Task optimization involves decisions on task assignments and task service times
  - Preserves convergence properties
- Embedded the algorithm into dynamic planning architecture
  - Real-time simulation framework for dynamic missions
  - Experimental flight tests for UAV/UGV teams
  - Demonstrates real-time feasibility

Cooperative Distributed Planning
- Often have fleet-wide hard constraints on assignments
  - Agent assignments coupled through joint team constraints
- Example: Maintaining network connectivity in dynamic environments
  - Often have limited communication radius, line-of-sight requirements
  - As agents move around environment – dynamic networks, potential disconnects
- Several issues:
  - Some tasks rely on continuous connectivity (e.g. streaming live video)
  - Cannot perform consensus, cannot deconflict plans
- How to include network connectivity constraints into distributed planner?
[Figure: disconnected network]

Example: Baseline Scenario
- Motivating example – Surveillance Mission around base station
  - UAVs travel to tasks and stream live video back to base station
  - Successful task execution relies on continuous connectivity
  - Limited comm radius (R_COMM)
[Figure: baseline scenario; two tasks annotated "No connectivity!"]

Example: Network Prediction
- Conservative solution – predict network connectivity violations
  - Drop tasks if disconnects will occur
  - Only execute tasks in local vicinity – conservative

Example: Planning with Relays
- Can use some agents as communication relays!
  - Coordinated team behavior leads to higher mission performance
- Goal: Develop cooperative planning algorithms to coordinate team
[Figure: scenario with two agents labeled "Relay"]

CBBA with Relays
- CBBA with Relays [JSAC 2012, Globecom 2011, Infotech 2011, Globecom 2010]
  - Generate CBBA assignments
  - Predict network over mission duration
  - Repair connectivity by creating relay tasks
- Key features:
  - Explicit consideration of dependency constraints
  - Predict network topology only at select mission-critical times – avoids discretizing time
  - Leverages information available in CBBA consensus phase
  - Preserves polynomial-time and convergence guarantees
- CBBA with Relays improves performance
  - Agents accomplish higher value tasks
  - Guaranteed network connectivity
  - Demonstrated real-time applicability
[Figure labels: Real-time experiment; Field experiment; iRobot Create; Pelican quad]

Distributed Planning Under Uncertainty
- Uncertainty in planning process
  - Inaccurate models (simplified dynamics, parameter errors)
  - Fundamentally non-deterministic processes (e.g. sensor readings, stochastic dynamics)
  - Dynamic local information changes
- Can hedge against uncertainty to improve planning
- Robust planning involves several challenges
  - Optimal solutions computationally intractable – increased dimensionality of planning problem
  - Non-trivial coupling of distributions – analytically intractable
  - Current approaches involve many limiting assumptions
[Figure: agent schedule for a target identification mission (Tasks vs. Time, one task annotated "Late!"); distribution for operator target identification. Figure from [D. Southern, Masters Thesis, 2010]]
- Key questions:
  - How to propagate uncertainty through planner to generate agent assignments?
  - How to distribute planning given additional complexity due to uncertainty?
  - How to ensure real-time performance and computational tractability?
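
Illustrative sketch (not from the slides): the "Late!" annotation in the agent-schedule figure reflects timing uncertainty accumulating along an agent's task sequence. One simple way to propagate such uncertainty is Monte Carlo sampling of the schedule. Everything specific here is an assumption made for illustration: the function names, the uniform multiplicative noise model, the `spread` parameter, and the per-task deadlines.

```python
import random


def simulate_arrival_times(travel_means, service_means, spread, rng):
    """Sample one realization of arrival times along a fixed task sequence.

    Assumption: each travel and service duration is its nominal value
    scaled by an independent uniform factor in [1 - spread, 1 + spread].
    """
    t = 0.0
    arrivals = []
    for travel, service in zip(travel_means, service_means):
        t += travel * rng.uniform(1.0 - spread, 1.0 + spread)
        arrivals.append(t)  # time at which the agent reaches this task
        t += service * rng.uniform(1.0 - spread, 1.0 + spread)
    return arrivals


def late_probabilities(travel_means, service_means, deadlines,
                       spread=0.3, n_samples=5000, seed=0):
    """Estimate, per task, the probability of arriving after its deadline."""
    rng = random.Random(seed)
    late_counts = [0] * len(deadlines)
    for _ in range(n_samples):
        arrivals = simulate_arrival_times(travel_means, service_means,
                                          spread, rng)
        for k, (arrival, deadline) in enumerate(zip(arrivals, deadlines)):
            if arrival > deadline:
                late_counts[k] += 1
    return [count / n_samples for count in late_counts]


if __name__ == "__main__":
    # Hypothetical two-task schedule: nominal arrivals are t = 10 and t = 19.
    print(late_probabilities([10.0, 5.0], [4.0, 3.0], [12.0, 18.0]))
```

Later tasks inherit the timing noise of every earlier leg, which is the coupling between distributions that the slide describes as analytically intractable; sampling sidesteps the closed-form convolution at the cost of extra computation.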
Distributed Planning Under Uncertainty
- Chance-Constrained CBBA – Extended CBBA to incorporate risk into planning process [ACC 2012]
  - Model coupling using numerical approx (sampling)
  - Preserves polynomial-time
  - Probabilistic performance guarantees for given risk
- Key features:
  - Improved CBBA to handle non-submodular score functions (e.g. stochastic scores) [CDC 2012]
  - Approximate distributed agent risk given mission risk using Central Limit Theorem assumption
- Improved performance under uncertainty
  - Higher scores within allowable risk
  - Distributed approximation on par with centralized
- Current work is exploring dynamic aspects
  - Dynamic risk allocation
  - Model learning using Nonparametric Bayesian techniques [GNC 2012]

Conclusion
- Distributed task allocation strategies for autonomous multi-agent teams
  - Extended CBBA algorithm to include time-varying score functions
  - Addressed cooperative planning in comm-limited environments using relay tasks
  - Presented robust risk-aware distributed extensions to deterministic planning
- Acknowledgments:
  - Prof. Jonathan How for his invaluable advice and support
  - My committee members Prof. Cummings and Prof. Shah
  - My collaborators and colleagues at ACL, esp. Luke Johnson and Andrew Kopeikin
  - Aero/Astro faculty and staff
  - Graduate Aero/Astro friends!
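
Backup sketch (illustrative, not from the slides): the chance-constrained idea of seeking "higher scores within allowable risk" can be illustrated with sampled mission scores. The sketch assumes the common formulation "maximize y such that P(score >= y) >= 1 - epsilon" and estimates y from samples as an empirical quantile. The function names, the plan dictionary, and the sample values are hypothetical, and the code evaluates whole candidate plans centrally; it does not reproduce the distributed risk allocation of Chance-Constrained CBBA.

```python
import math


def chance_constrained_score(samples, epsilon):
    """Score level y such that at least a (1 - epsilon) fraction of the
    sampled scores is >= y (an empirical estimate of the guaranteed score)."""
    ordered = sorted(samples)
    n = len(ordered)
    # ordered[k] has n - k samples at or above it, so k <= epsilon * n
    # keeps the failure fraction within the allowed risk.
    k = min(int(math.floor(epsilon * n)), n - 1)
    return ordered[k]


def pick_plan(plan_samples, epsilon):
    """Return the plan name with the best chance-constrained score."""
    return max(plan_samples,
               key=lambda name: chance_constrained_score(plan_samples[name],
                                                         epsilon))


if __name__ == "__main__":
    # Hypothetical sampled mission scores for two candidate plans.
    plans = {
        "safe": [10.0] * 20,                # always scores 10
        "risky": [0.0] * 4 + [20.0] * 16,   # scores 20, but fails 20% of the time
    }
    print(pick_plan(plans, epsilon=0.10))
    print(pick_plan(plans, epsilon=0.25))
```

The risky plan has the higher mean score but only wins once the allowed risk exceeds its failure rate, which is the trade-off a risk parameter is meant to expose.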
