Mutual Information

Mobile Sensor Networks for Informative Forecasting Han-Lim Choi Postdoctoral Associate Dept. Aeronautics & Astronautics Massachusetts Institute of Technology Nov. 10, 2009 Research Theme: Networked Information Systems • Planning of Sensor Networks – Allocate sensing resources to extract information – Information quantification in large-scale systems – Balance between information and energy [ChoiPhD] H.-L. Choi, “Adaptive sampling and forecasting with mobile sensor networks,” PhD Thesis, MIT. • Multi-Agent Task Planning – Distributed task allocation over a network of autonomous agents – Realities in dynamic, uncertain environment – Interaction with humans [ChoiTRO09] H.-L. Choi, L. Brunet, and J. P. How, “Consensus-based decentralized auctions for robust task allocation,” IEEE Transactions of Robotics, 25(4), 2009. Introduction • Forecasting of Environmental Systems – Increasing potential damages due to adverse environmental conditions (e.g., snow storms, hurricanes) – Need accurate forecasting of future environmental conditions • Intelligent Measurement Systems – Under limited sensing and computation resources – Allocate fixed or redeployable sensors to gather information directed by particular interests Han-Lim Choi (MIT), Nov. 10 3 Introduction • This research – Theoretical framework for intelligent measurement systems for improved forecasting of large-scale systems • Applications – Weather forecasting, Plume source tracking, Wildfire tracking, Geoscientific investigation, Smart building © MIT EAPS © UC Berkeley Han-Lim Choi (MIT), Nov. 10 4 Overall Architecture • Dynamic Data-Driven Loop Closure – Use model to make measurement plans ) Plan execution ) Use collected data to update model/plans – Integration of physical environment, (mobile) sensors, numerical model of environment, planning & control algorithms Sensors Models - Environment - Sensor - Uncertainty Performance Evaluation Environment Planning Planning Data Focus on Planning – given model, make measurement plans Han-Lim Choi (MIT), Nov. 10 5 Planning Challenges • Environmental dynamics nonlinear and typically have large state dimension – E.g. weather system ~O(106) • Multi-scale dynamics – What is the right choice of time- and length-scale to make the decisions of interest? • Uncertainty in model states, parameters, and observations – Need to be able to quantify uncertainty propagation through large-scale nonlinear dynamics • Planning: Large-Scale Complex Optimization Han-Lim Choi (MIT), Nov. 10 6 Weather Forecasting • Example: Improve weather forecast by developing supplementary sensing networks with teams of Unmanned Aerial Vehicles • Related work – Targeting for crewed aircraft to localize error-sensitive or error-reducing regions [Lorenz98,Palmer98, Majumdar02, Daescu04] • High cost, high risk, less adaptability – NOAA’s Unmanned Aerial Vehicles Program (announced ’08) 4 UAV sensors for better forecast – UAV path planning with considering realistic over red squares in 3days [Rubio04, Frew09] weather conditions (e.g., icing, storm) – Development of Micro Air Vehicles for weather sensing[Lawrence07] • This work: higher level decision on where and when to take measurement using network of mobile sensors Han-Lim Choi (MIT), Nov. 10 7 Planning for Information • Design mobile sensor networks to extract best possible information – Objective: Reduce uncertainty in knowledge about some environmental quantities of interest at some time of interest (called verification variables and time) – Quantification: Define information reward of sensing path to represent uncertainty reduction – Complications: • Combinatorial decision making • Constraints in mobility, communication, power • More complicated if large state dimension and/or long verification horizon – Most previous work in the context of tracking targets (small state) • Goal: Efficient algorithms for allocation of mobile sensor networks in a large-scale complex systems for maximum information Han-Lim Choi (MIT), Nov. 10 8 Approach • Bi-level decision framework  Better tractability and better accounting for multi-scale dynamics Scale of motion Long Short Decision space Discrete Continuous Problem Class Combinatorial selection Optimal control o o 72 W Informative Steering How to get there 78oW Information-rich waypoints Where to go 84 oW Objective o 60 W Motion Planning 66 W Targeting o 48 N • Mutual Information – Information-theoretic notion of uncertainty reduction – Information reward for both abstractions • Key Focus: Efficient quantification of Mutual Information Han-Lim Choi (MIT), Nov. 10 o 44 N Motion Targeting Planning o 40 N o 36 N o 32 N 9 Targeting as Sensor Selection s : measurement candidate (n) Z1 Z3 ZS V Z2 S : Search Space (N) Zn V : Verification Variables time • Objective: Select n sensing points from search space that maximize uncertainty reduction of verification variables (V ) – – : entropy of V (i.e., degree of randomness) : conditional entropy of V after knowing • Baseline formulation for problems with dynamics and constraints Han-Lim Choi (MIT), Nov. 10 10 Forward Approach s : measurement candidate (n) Z1 Z3 ZS V Z2 S : Search Space (N) Zn V : Verification Variables (M) • Explicitly compute conditional entropy for all possible measurement candidate – : same for all candidates  minimize conditional entropy – Gaussian  Entropy = log det of Covariance • Issues: Computational complexity – Combinatorial number of candidates (75 million for N=100, n=5) – Calculation of each conditional entropy takes non-trivial time (Covariance update, determinant calculation) Han-Lim Choi (MIT), Nov. 10 11 Key Intuition for Alternative Approach • Mutual Information – Represents entropy reduction – Commutative [Cover91]: Area = Entropy Han-Lim Choi (MIT), Nov. 10 12 Backward Approach s : measurement candidate (n) Z1 Z3 ZS V Z2 S : Search Space (N) Zn V : Verification Variables (M) • Look at entropy reduction of measurement candidate – Motivated by [Williams05] but identified new insights: • Single covariance update to compute conditional covariance of entire search space – Calculation of : submatrix extraction – Still combinatorial number of determinant calculation Han-Lim Choi (MIT), Nov. 10 13 Efficiency of Backward • Probabilistic representations – Covariance vs. Ensemble – Ensemble: typical framework for weather forecasting • Computation time comparison – Asymptotic analysis – Numerical experiments Numerical experiments: 104 103 Ensemble 102 Asymptotic analysis in flops 10 Covariance 1 • Backward approach is never slower than Forward approach • Significant benefit for ensemble representations [ChoiACC07] H.-L. Choi, J. P. How, and J. A. Hansen, “Ensemble-based adaptive targeting of mobile sensor networks,” American Control Conference, 2007. [ChoiTCST09] H.-L. Choi, and J. P. How, “Efficient targeting of sensor networks for large-scale systems,” IEEE Transactions on Control Systems Technology, submitted. Han-Lim Choi (MIT), Nov. 10 14 Summary of Targeting • Commutative mutual information  backward – Provable efficiency over forward approach by reducing the number of covariance updates – Significant computational saving for ensemble-based large-scale targeting • Constrained problem can be embedded in the backward framework [ChoiGNC07] o 72 W 78oW o 60 W o 66 W 84 oW [ChoiGNC07] H.-L. Choi and J. P. How, “A mult-UAV targeting algorithm for ensemble forecast improvement,” AIAA Guidance, Navigation, and Control Conference, Aug. 2007. o 48 N o 44 N • Next: Given where to go, How to get there by maximizing information Motion Targeting Planning o 40 N o 36 N Targeting o 32 N Han-Lim Choi (MIT), Nov. 10 15 Continuous Exploration During 0 to t θ At T Forecast Verification Sensing path • Linear environmental dynamics: – State variables Xt : environmental variables at grid points – Short time-scale/local behavior of original nonlinear dynamics – Additive Gaussian process noise • Linear measurement: – Off-grid measurement = linear combination of state variables • Vehicle Motion: Han-Lim Choi (MIT), Nov. 10 16 Issues in Continuous Planning During 0 to t θ At T Forecast Verification Sensing path • Continuous abstraction – Better models environmental sensing – Better scalability than increasing resolution of discrete targeting • Issues – Less-established theory on computation of mutual information – Simple extension of previous work is computationally inefficient Han-Lim Choi (MIT), Nov. 10 17 Continuous Mutual Information • Straightforward way: – Explicitly calculate entropy of verification variables at T – Need to integrate matrix differential equations for long time interval to propagate effect of measurement into the future  Computational inefficiency in optimal planning • Mutual Information by conditional independence – Once the state is known at the current time, no additional information can be obtained from the past in order to predict some future quantity – Difference in mutual information between before and after knowing Han-Lim Choi (MIT), Nov. 10 and 18 Smoother Form • Smoother form mutual information – – – : inverse covariance of state (Lyapunov) : inverse covariance of state conditioned on V (Lyapunov-like) : covariance of state conditioned on measurement (Riccati) • In planning, integration of matrix differential equation during  Computation time reduction by factor of • On-the-fly track of information accumulation at arbitrary time t  Expression of rate of information accumulation [Choi08cdc & Choi09auto] [ChoiCDC08] H.-L. Choi and J. P. How, “Continuous motion planning for information forecast,” IEEE Conference on Decision and Control, Dec. 2008. [ChoiAuto09] H.-L. Choi and J. P. How, “Continuous trajectory planning of mobile sensors for informative forecasting,” Automatica, submitted. Han-Lim Choi (MIT), Nov. 10 19 Information Potential Field • Distribution of information based on rate of information accumulation – Instantaneous information increment when taking measurement at location (x,y) at time t • Variation by design objectives Minimize uncertainty in V at T Han-Lim Choi (MIT), Nov. 10 Minimize uncertainty in current state 20 Summary • Conditional Independence  Smoother Form – Computational efficiency by reducing interval of integration of matrix differential equations – On-the-fly access to information accumulation  Legitimate information potential field – More insights in broader class of planning problem [ChoiACC09] [ChoiACC09] H.-L. Choi and J. P. How, “On the roles of smoothing in planning informative paths,” American Control Conference, June 2009. • Key Contributions: – Efficient quantification of information reward: Backward and Smoother – Information-theoretic planning of mobile sensor networks in the context of weather forecasting  Guideline for tractable design of measurement systems for multi-scale complex nonlinear systems Han-Lim Choi (MIT), Nov. 10 21 Future Research • Long-term goal: Innovations in Robotics and Control towards Ubiquitous Intelligent & Sustainable Living Environment • Information-driven decision making (for cyber-physical systems) – Sensor/actuator networks for large-scale systems • Richer class of uncertainties and constraints – Enrich notions of information • How to define information in constrained space • How to incorporate notions like risk & safety Indoor aerial robots – Navigation and control of autonomous robots – Uncertainty quantification & Design of experiments for complex systems – Funding sources • NSF cyber-physical systems, NSF cyber-enabled discovery & innovation – Related on-going work • Planning for mobile sensor networks with model uncertainty • Sensor management under limited communication budget [ChoiACC08] [ChoiACC08] H.-L. Choi, J. P. How, and P. I. Barton, “An outer-approximation algorithm for generalized maximum entropy sampling,” American Control Conference, June 2008. Han-Lim Choi (MIT), Nov. 10 22 Future Research • Networked (semi)-autonomous vehicles – Distributed decision making • Robustness in dynamic, uncertain environments • Multi-agent learning & information fusion – Interaction of humans and autonomy – Fundamental theories of networks Cooperative UxV missions • Information flow in heterogeneous networks – Funding sources • AFOSR, ONR, ARO, NSF Network science & engineering – Related on-going work • Distributed task allocation algorithm for network of agents robust to inconsistent situational awareness • Its extensions to handle: complex missions [Choi10acc, Ponda10acc], uncertainty [Bertuccelli09gnc, Redding10acc], human-autonomy interaction [Ponda10info] • Distributed decisions for sensor networks [Choi07gnc] Human-robot interactions ©UMBC eBiquity Research Group Social network Han-Lim Choi (MIT), Nov. 10 23 Future Research • New Applications (in Aerospace, Energy, Environment) – – – – Geosciences, Weather, Climate Air traffic management Smart building, Water resource Smart grid, Sustainable energy • Collaborations – Center for automation technologies and systems – Multi-scale science and engineering – Scientific computation research center Networked wind turbines Han-Lim Choi (MIT), Nov. 10 Networked electricity grid Air traffic management Building emergency Underwater mixing 24 Educational Synergy • Education Philosophy – Encourage inter-disciplinary thinking – Engineering insights & Mathematical foundation • why do we care about this • how can we solve it ©RPI Center for Innovation in Undergraduate Education • Educational impacts – Graduate research – Undergraduate research • Modeling/analysis/optimization of a variety of systems • Hands-on experiments with multi-robot testbeds – Existing curriculum • Enrich description of system dynamics – Curriculum development • Senior: Introduction to high-level control & autonomy • Graduate: Decision making under uncertainty Han-Lim Choi (MIT), Nov. 10 25 Acknowledgment • NSF Dynamic Data Driven Application System • AFOSR, ONR, Boeing • Collaborators: – – – – – – – Prof. Jonathan P. How (MIT Aero/Astro) Prof. Nicholas Roy (MIT Aero/Astro) Dr. James A. Hansen (Naval Research Laboratory) Prof. Paul I. Barton (MIT ChemE) Prof. Emilio Frazzoli (MIT Aero/Astro) Sooho Park, Daniel Gombos (MIT DDDAS) Luca Bertuccelli, Luc Brunet, Cameron Fraser, Sameera Ponda, Andrew Whitten, Josh Redding (MIT ACL) Han-Lim Choi (MIT), Nov. 10 26 References • [Lorenz98] E. Lorenz and K. Emanuel, “Optimal sites for supplementary weather observations: simulation with a small model,” Journal of the Atmospheric Sciences, 55(3), pp. 399-414. • [Parmer98] T. Palmer, R. Celaro, J. Barkmeijer, and R. Buizza, “Singular vectors, metrics, and adaptive observations,” Journal of the Atmospheric Sciences, 55(4), pp. 633-653. • [Majumdar02] S. Majumdar, C. Bishop, B. Etherton, and Z. Toth, “Adaptive sampling with the ensemble transform Kalman filter. Part II: Filed programming implementation,” Monthly Weather Review, 130(3), pp. 1356-1369. • [Daescu04] D. Daescu and I. Navon, “Adaptive observations in the context of 4D-var data assimilation,” Meteorology and Atmospheric Physics, 85(111), pp. 205-226. • [Williams05] J. Williams, J. Fisher III, and A. Willsky, “An approximate dynamic programming approach to a communication constrained sensor management,” Int’l conf. of Information Fusion, 2005. • [Cover91] T. Cover and J. Thomas, Elements of Information Theory. Wiley Series In Telecommunications, 1991. • [Rubio04] J.C. Rubio, J. Vagners, R. Rysdyk, “Adaptive path planning for autonomous UAV oceanic search missions,” AIAA Intelligent Systems Technical Conference, 2004. • [Frew09] J. Elston, E. W. Frew, “Unmanned Aircraft Guidance for Penetration of Pre-Tornadic Storms,” AIAA Journal of Guidance, Control, and Dynamics, 2009. • [ChoiACC10] H.-L. Choi, A.K. Whitten, and J.P. How, "Decentralized task allocation for heterogeneous teams with coordination constraints," American Control Conference, Baltimore, MD, USA, July 2010, submitted. • [PondaInfo10] S. Ponda, H.-L. Choi, and J.P. How, "Predictive planning for heterogeneous teams with human agents," AIAA Infotech@Aerospace, Atlanta, GA, USA, Apr. 2010, submitted. • [PondaACC10] S. Ponda, J. Redding, H.-L. Choi, J.P. How, B. Bethke, M. A. Vavrina, and J. Vian, "Distributed task planning for complex missions with communication constraints,"American Control Conference, Baltimore, MD, USA, July 2010, submitted. • [ReddingACC10] J. Redding, H.-L. Choi, and J.P. How, "An intelligent cooperative control architecture,"American Control Conference, Baltimore, MD, USA, July 2010, submitted. • [ChoiTRO09] H.-L. Choi, L. Brunet, and J.P. How, "Consensus-based decentralized auctions for robust task allocation," IEEE Trans. on Robotics, , Vol. 25, No. 4, pp. 912 - 926, 2009 • [ChoiAuto09] H.-L. Choi and J.P. How, "Continuous trajectory planning of mobile sensors for informative forecasting,"Automatica, submitted • [ChoiTCST09] H.-L. Choi and J.P. How, "Information-theoretic targeting of sensor networks for large-scale systems," IEEE Trans. on Control Systems Technology, submitted • [ChoiACC09] H.-L. Choi and J.P. How, "On the roles of smoothing in informative path planning," American Control Conference, St.Loius, MO, USA, June 2009, submitted. • [ChoiCDC08] H.-L. Choi and J.P. How, "Continuous motion planning for information forecast," IEEE Conference on Decision and Control, Cancun, Mexico, Dec. 2008. • [ChoiACC07] H.-L. Choi, J.P. How, and J.A. Hansen, "Ensemble-based adaptive targeting of mobile sensor networks," American Control Conference 2007, New York City, NY, USA, July 2007. • [ChoiGNC07] H.-L. Choi and J.P. How, "A multi-UAV targeting algorithm for ensemble forecast improvement," AIAA Guidance, Navigation, and Control Conference 2007, Hilton Head, SC, USA, Aug. 2007, AIAA-2007-6753. • [ChoiACC08] H.-L. Choi, J.P. How, and P.I. Barton "An outer-approximation algorithm for generalized maximum entropy sampling," American Control Conference 2008, Seattle, WA, USA, June 2008 Han-Lim Choi (MIT), Nov. 10 Han-Lim Choi (MIT), Nov. 10 Probabilistic Representations • Covariance Form – Mean and covariance – Conditional distribution: Update covariance (Kalman Filter[Grewel01]) • Ensemble Form – Monte-Carlo samples (a total of LE ) – Covariance is approximated as sample covariance – Conditional distribution: Update each sample (Ensemble SquareRoot Filter[Whitaker01]) – Preferred for estimation of large-scale nonlinear systems  Comparison of efficiency of backward approach in these two forms Han-Lim Choi (MIT), Nov. 10 29 Motion Planning Results Strategy Reward Optimal minimizing 0.69 uncertainty in V at T Gradient-ascent in 0.62 potential field Minimize uncertainty 0.20 in X at Minimize uncertainty 0.14 in current X Best Straight-line 0.43 Worst Straight 0.14 Type OCP Closed OCP Closed NLP NLP • Goal: 6-hr sensing trajectory to reduce uncertainty over in 3 days • Optimal Control Problem solution by NLP reformulation (<2mins per initial guess) • Reasonable performance of gradient-ascent • Potential performance degradation in decisions based on short-term perspectives Han-Lim Choi (MIT), Nov. 10 32 Probability of Correct Decision (PCD) • PCD = Prob [ Correctly distinguish the best candidate from a total of q candidates] = PCD (RNR, q) – Expressed as CDF of (q-1)-dimensional multivariate normal distribution • ε-PCD = Prob [ Better than (1- ε)-optimal solution obtained ] ≈ PCD (RNR, 1/ε) – Useful for large-scale targeting with very large q • Useful quantification: – Required RNR to achieve given level of optimality with a certain surety • RNR=16.5 for 90% optimality with 90% confidence – Achievable level of Optimality for given RNR with a certain surety • 75% optimality guaranteed for RNR=6 with 90% confidence Han-Lim Choi (MIT), Nov. 10 Confidence

Mutual Information

Related documents

Products

Support

Mutual Information

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib