PowerPoint

advertisement
Distributed Control of
Multiagent Systems:
From Engineering to Economics
Prof. William Dunbar
Autonomous Systems Group
Computer Engineering
What are Systems? … ANYTHING
in Engineering, usually with
Dynamics.
Some familiar examples:
QuickTime™ and a
TIFF (LZW) decompressor
are needed to see this picture.
How do we describe (or predict) systems?
… with math!
(Images courtesy of http://www.cds.caltech.edu/~murray/cdspanel/ unless stated otherwise)
2
Math: Describing Diverse Engineering
Systems in a Common Way
Internet backbone
CA power grid
San Fran ATC
In these examples:
3
Control Systems are Hidden
Engineering Systems
“A Control System is
a device in which a
sensed quantity is
used to modify the
behavior of a
system through
computation and
actuation.”
4
Abstraction: Multiagent Systems
• The Internet
• Air traffic control
• Supply Chain
Control Problems with:
• Subsystem dynamics
• Shared resources
(constraints)
• Communications
topology
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
• Shared objectives
5
(SC image courtest of www.vipgroup.us)
Multiagent Systems: Distributed and
(presumed) Cooperative
Multiagent System:
• autonomous agents
• communication network
Agent
output
action
sensor
input
Environment
Distributed: local
decisions based on local
information.
Cooperative: agents agree
on roles & dynamically
coordinate.
6
A Relevant Decision Method: Model Predictive
Control (MPC)
MPC uses optimization to find feasible/optimal plans for near future.
objective
Minimize (distance to pump & fuel)
s.t. Car model (dynamics)
Without hitting wall (constraint)
To mitigate uncertainty, plan is revised after a short time.
actual
X
computed
7
Mathematics of MPC is Finite Horizon
Optimal Control
objective
Minimize (distance to
pump& fuel)
s.t. Car dynamics
Without hitting wall
(constraint)
8
Convergence of MPC Requires
Appropriate Planning Horizon
Theoretical conditions sufficient & in absence of explicit uncertainty.
*[Mayne et al., 2000]
9
MPC Compared to Other Techniques
• Gives planning &
feedback with builtin contingency
plans.
z(t0)
state
• Only technique that
handles state and
control constraints
explicitly.
• Tradeoff:
computationally
intensive.
t0
t0+d
z*(t;t0)
time
T
zk(t)
10
MPC Successful in Applications:
Process to Flight Control
Caltech flight control
experiment: Tracking
ramp input of 16 meters in
horizontal, step input of
1m in altitude.
MPC updates at 10 Hz,
trajectories generated by
NTG software package.
Movie
11
MPC Admits Cooperation
ok
follow
3
ok
2
follow
1
Get 1 to pump, 2 follow
1 & 3 follow 2.
Decoupled dynamics
Avoid collision
12
MPC of Multiagent Systems: What’s
Missing?
Enables autonomy of single
agent.
Amenable to cooperation for
multiple agents.
Missing?…Distributed Implementation*
Why not Centralized?…Local decision require Global information
Parallelization**?…If you can, but sometimes not applicable.
*[Krogh et al, 2000, 2001]
**[Bertsekas & Tsitsiklis, 1997]
13
My Contribution: A Distributed
Implementation of MPC
Distributed: local decisions based on local information.
Decoupled subsystem dynamics/constraints, Coupled cost L
Decomposition
14
Solution of Sub-problems requires
Assumed Plan for Neighbors
Agent 3 
z3(t0)
What 3 does
state
What 2 assumes
t0
t0+d
z3*(t;t0)
z3k(t)
15
time
Compatibility of Actual and Assumed
Plans via Constraint
Compatibility
constraint
Assumed plan
Bounds discrepancy
z3(tk)
state
tk
tk+d
time
16
Distributed Implementation Requires
Synchrony & Common Horizon T
17
Conditions for Theory are General
18
Convergence Conditions: Centralized
plus Bound on Update Period
*[Dunbar & Murray, Accepted to Automatica, June, 2004]
19
Venue: Multi-Vehicle Fingertip
Formation
4
2
1
d31
qref
3
20
Simulation Parameters
4
2
1
3
: Reference signal
: Actual COM of {1,2,3}
21
Centralized MPC:
Benchmark for Comparison
QuickTime™ and a
Microsoft Video 1 decompressor
are needed to see this picture.
22
Centralized MPC Simulation
23
Distributed MPC is Comparable to
Centralized MPC
QuickTime™ and a
Microsoft Video 1 decompressor
are needed to see this picture.
24
Distributed MPC Simulation
25
Naive Approach Produces Less
Desirable Performance
QuickTime™ and a
Microsoft Video 1 decompressor
are needed to see this picture.
26
Naïve Approach: Bad Overshoot
27
Summary of Contribution
Distributed implementation of MPC is provable
convergent, performs well, and is applicable to a
class of Multiagent Systems:
•
•
Distributed & cooperative structure:

Local decisions based on local information

Decomposition and incorporation of compatibility constraint

Coordination via sharing feasible plans
Applicable for:

Heterogeneous nonlinear dynamics

Generic objective function (need not be quadratic)

Coupling constraints and coupled dynamics
28
Supply Chain Management (SCM) is an
Attractive Venue for DMPC
•
Dynamics (Linear/Nonlinear) s.t. constraints and moving
set points.
•
Forecasts of measurable inputs often available, which MPC
can easily incorporate.
•
Dynamic time scales and inter-stage communication BW
are not limiting factors.
•
Active research area. Why? Companies don’t compete their supply chains do. Thus, SCM will make or break
companies. Examples: Dell, Walmart.
Challenge: distributed (asynchronous) coordination in the
presence of time delays.
29
Overview
1. Define three stage SCM problem from supply chain
literature
2. Distributed Problem ==> Distributed MPC Implementation
3. Nominal decentralized feedback policy from supply chain
literature
4. Numerical Experiments for Comparison
5. Conclusions and Extensions
30
SCM: Information Flows Upstream (orders) and
Material Flows Downstream (goods)
Three Stages: Supplier S, Manufacturer M, Retailer R
shipment rate
delay
acquisition rate
UP
stream
DOWN
stream
order rate
For each stage
Control: order rate
Measurable Exogenous Inputs:
demand rate
31
Bi-drectional Coupling in the Dynamics
For each stage
Dynamics:
Constraints:
Coupling: x depends on downstream order rate & upstream backlog
Objective:
Keep stock and unfulfilled order at desired levels
32
DMPC: Parallel Updates Assuming Remainder of
Previous Response for Neighbors
Q-cost with move suppression
DMPC Controller for each stage
1.
A.
B.
C.
2.
Move suppression ~ DMPC theory bounds
33
Experiments Show Comparable Performance
with Nominal Policy: Single Stage Case
Nominal:
Devised to
match observed
responses
Response to
initial stock
offset
Standard MPC
(not DMPC)
34
Step in Demand Rate: Comparable
Performance + Advantage of Anticipation
Add
anticipation
35
Three Stage with Pulse in Customer Demand:
Comparable then Better with Anticipation
Nominal
DMPC
36
Conclusions and Extensions
•
•
•
Realistic SCM problem (classic MIT “Beer Game”)
DMPC comparable to validated nominal feedback policy.
Clear advantage when customer demand can be reliably
forecasted (anticipation).
•
A detailed relative degree, controllability and stabilizability
analysis to come. Unfulfilled order in stages M and R exhibited
nonzero steady-state error.
Next leap: multi-echelon chains - at least two (and possibly
many) players operate within each stage, e.g., the S stage in
Dell's ``build-to-order" supply chain management strategy
might contain several chip suppliers such as Samsung, Intel and
Micron.
Extend theory asynchronous time conditions.
•
•
37
Download