Decentralized Mission Planning for Heterogeneous Human-Robot Teams Sameera Ponda Prof. Jonathan How

advertisement
Decentralized Mission Planning for
Heterogeneous Human-Robot Teams
Sameera Ponda
Prof. Jonathan How
Department of Aeronautics and Astronautics
Massachusetts Institute of Technology
May 31, 2016
Motivation
• Modern day complex missions involve networked teams of heterogeneous
agents executing several tasks simultaneously:
– Unmanned aerial vehicles (UAVs) – target tracking, surveillance
– Human operators – classify targets, monitor status
– Ground convoys – rescue operations
• Key Research Questions:
– How can we coordinate team behavior to improve mission performance?
– How should planning strategies evolve as we acquire more information?
2
Problem Statement
• Goal: Automate task allocation to improve mission performance
– Spatial and temporal synchronization
– Reduce costs and improve efficiency
• Key Technical Challenges:
–
–
–
–
–
Combinatorial decision problem – computationally intractable (NP-hard)
Complex agent modeling & constraints (stochastic, non-linear, time-varying)
Limited resources (bandwidth, fuel, etc)
Task3
Task1
Dynamic networks and communication constraints
Task2
Unknown and dynamic environments
Task4
Agent5
Agent1
Agent4
Agent6
Agent2
Task6
Task9
Task5
Agent3
Task8
Task7
3
Planning Approaches
• Optimal solution methods are computationally intractable for large problems
– Typically use approximation methods
Agent2
Agent1
• Centralized Planning approach
– Mission Control Center (MCC) plans & distributes tasks for all agents
– High bandwidth, slow reaction, resource intensive
MCC
Agent3
Agent4
• Recent research in Decentralized Planning
– Individual agents make their own plans and coordinate with each other
– Faster reaction to local information changes
Agent1
– Trade-off between communication and computation
Agent2
• Key Questions:
– What quantities should the agents agree upon?
Agent3
• Information / tasks & plans / objectives / constraints
– How do we ensure that the planning is robust to inaccurate information and models?
Agent4
4
Consensus-Based Bundle Algorithm
• Decentralized task allocation approach called Consensus-Based Bundle
Algorithm (CBBA) [Choi, Brunet, How 2009]
– CBBA iterates between 2 phases: Bidding & Consensus
1
Phase 1:
Build Bundle &
Bid on Tasks
(individual
agents)
2
3
Phase 2:
Consensus
(all agents)
All agents
consistent?
Yes
N
No
• Core features of CBBA:
– Polynomial-time decentralized algorithm with provably good approximate solutions
– Consensus on task assignments, not information – guaranteed real-time convergence
even with inconsistent information
Key extensions to CBBA:
1) Temporal constraints – Time-windows of validity for tasks
2) Connectivity issues and constraints
3) Planning for teams with Humans-in-the-loop
5
CBBA with Time-Windows
e.g. monitor status,
security shifts
Arrival Time
Time-critical
e.g. rescue ops,
target tracking
Score
Flat
Score
Score
• In realistic missions, task scores often depend on arrival times and have
associated time-windows of validity:
Peak-time
Arrival Time
e.g. rendezvous,
special ops
Arrival Time
• Issue: Planning algorithms usually involve time discretization
– Extra planning dimension – computationally intractable!
• CBBA extended to include time-windows
– Solution does not discretize time!
– Preserves convergence properties
– Planner decides arrival times, producing
task schedules for agents
• Embedded CBBA with Time-Windows
into a real-time system architecture
6
CBBA with Time-Windows
• CBBA successfully used in real-time fight test environments
– Cooperative search, acquisition, and track (CSAT)
– Coordination of agents under dynamic network topologies
• Further information available online at: http://acl.mit.edu/projects/cbba.html
7
Connectivity: Network Challenges
• As agents move around in the environment, expect varying network topologies
– Limited communication radius between agents
– Potential broken comm links and/or disconnected networks
Task1
Task3
Task2
Task4
Agent1
Agent2
Agent5
Disconnected
Network
Agent4
Task9
Agent6
Agent3 Task
7
Task6
Task5
Task8
• Main issue: Planner cannot converge with a disconnected network, leading to
conflicting assignments
• Developed two solution approaches:
– CBBA with Relays – Creates relay tasks to ensure connectivity
– CBBA with Network Handling Protocols – Protocols to adjust task lists prior to planning
8
Connectivity: CBBA with Relays
• Extended CBBA to include relay tasks –
(Published in GlobeComm 2010)
– Employs underutilized agents as relays
– Key feature: Agents use bid info to predict
network structure at select times
– Guarantees connectivity
– Computationally efficient - converges in real-time
Relay Task
• CBBA with Relays improves team
performance and network connectivity
9
Connectivity: Network Protocols
• If preventing disconnects is too conservative: Network Handling Protocols to
adjust task lists for agents prior to planning – (Published in ACC 2010)
Baseline
Central
(no adjustment) Adjustment
Local
Adjustment
All tasks available
to all agents
MCC distributes
tasks to networks
at each replan
MCC distributes
new tasks to
closest agents
(once per task)
Low
Bandwidth
High
Bandwidth
Low
Bandwidth
Low
Computation
High
Computation
Low
Computation
• Local Adjustment improves
mission
performance
Conflicting
Guaranteed
Assignments
Deconfliction
––
lower
mission
missionscores
scores
and
and
with low bandwidth and higher
computation
requirements
wastedfuel
lower
fuel
consumption
10
Planning for Human-Robot Teams
• Most modern missions involve human-robot teams
– Human operators perform several tasks
(e.g. supervisory, target classification, monitoring)
– Need to coordinate robotic agents and operators
Predator UAV Operations – Associated Press
• Main Issue: Operator performance is stochastic
– Heterogeneous operator capabilities (“slow” vs. “fast”)
– Robustness to uncertainty in team performance
• Recent research has explored modeling operators
using probabilistic distributions – [Cummings et al ‘10]
Log-Normal Distribution for Operator
Target Identification
Figure from [D. Southern, Masters Thesis, 2010]
• Key Challenge: Incorporate uncertainty into planner to increase robustness
11
Planning for Human-Robot Teams
• Consider a time-critical mission with operators performing target classification
• As expected vs. actual service times differ, planner performance degrades
– Adding a margin of conservatism can mitigate this problem
– Tradeoff between late penalties and number of tasks assigned
Agent Schedule
Mission Performance
• Simulation Observations: Late!
– Performance is best when expected & actual
are close (ridge line)
– Steeper drop for overestimating (optimistic)
Optimistic
Conservative
vs. underestimating (conservative)
Planning
Planning
Time
– Conservative Planning performs better
than Optimistic Planning
Tasks
550
500
Mission Score
450
400
350
300
250
200
150
100
• Developing a Robust Planning Framework
– Explicitly embed PDFs of plan parameters
– Adapt as estimates improve
50
0
0
0.2
0.2
0.4
0.4
0.6
Expected Operator
Percentile
0.6
0.8
0.8
1
1
Actual Operator
Percentile
12
Planning for Human-Robot Teams
• Currently performing Human-in-the-loop
experiments at Cornell
– CBBA used to allocate targets to agents (MIT)
– Image processing and sensor fusion used to
update target PDFs (Cornell)
– Human-in-the-loop for target classification and
PDF updates through HRI (Cornell)
13
Conclusions
• Explored strategies to coordinate team behavior to improve mission performance
• Extended the Consensus-Based Bundle Algorithm (CBBA) to address the
demands of more realistic multi-agent mission planning
– Included task time-windows of validity
– Addressed connectivity issues and communication constraints
– Explored planning for heterogeneous human-robot teams
• Current research and expected thesis contributions:
1) Robust decentralized planning framework
• Embed distributions of parameters into planner
• Preserve computational tractability and scalability
(e.g. avoid discretization, explore efficient sampling techniques)
2) Flexible planner structure that adapts to dynamic uncertainty representations
• Modular uncertainty representations (Nonparametric Bayesian models, etc)
• Modify planning strategy without recomputing all scenarios
3) Efficient strategies for information consensus to improve planner performance
• Decide what information and when to share (e.g. hyperparameter consensus)
• Cooperative decentralized strategies to update global distributions
14
Download