Learning Automata

V-Lab … An Intelligent Virtual Laboratory for Autonomous Agents – Towards Simulating System of Systems Mo Jamshidi, Ph.D., DEgr., Dr. H.C. F-IEEE, F-ASME, F-AAAS, F-NYAS, F-HAE, F-TWAS Regents Professor, Electrical and Computer Engr. Department and Director, Autonomous Control Engineering (ACE) Center University of New Mexico, Albuquerque Advisor, JPL (1992-93), NASA Headquarters (1996-2003) Sr. Research Advisor, US AF Research Lab. (1984-90,2001-present) Consultant, US DOE Office of Renewable Energy (2001-2003) http://ace.unm.edu www.vlab.unm.edu pursue.unm.edu, moj@wacong.org OUTLINE • Introduction to V-Lab ® • Discrete-EVent simulation Specification (DEVS) • Intelligent DEVS … I-DEVS • Applications to Multi-agent Systems • System of Systems Engineering • Simulation and Experiment Movies V-Lab® is … … a NASA Research Announcement (NRA), Cross-Enterprise Grant with NASA Ames Research Center and a joint effort between UNM ACE Center, JPL, in cooperation with the Arizona Integrate Modeling and Simulation Center (U Arizona and Arizona State U) Dr. Edward Tunstel, Jr. of JPL is one of the 3 Co-PIs on the grant. V-Lab®’s potential role for NASA Robotic Colonies – System of Systems Paradigm MER MER MER MER Simulation Design Qualities  Layered Architecture   Object Modularity   Breaks up the entire simulation into layers, each with a distinct purpose and function Functional components of the individual layers should be broken up into distinct objects to allow for: • Maintainability – Changing a single object without modifying entire simulation. • Reuse – Distinct objects can be reused in different simulations • Object Hierarchy – Provide a structure for the composition and communication of objects Our Solution – V-Lab® V-Lab®  V-Lab® Environment Layers  Hardware Networking • Networking foundation for intermachine communication  Middleware • Software environment for interprocess communication (CORBA, HLA, Sockets, etc.)  I-DEVS • Library of tools for computational intelligence in DEVS formalism  V-Lab® • Organizational structure for DEVS objects. • Management objects to control time and message flow in multiagent systems V-Lab® I-DEVS MIDDLEWARE PHYSICAL NETWORK DEVS    Provides C++ and Java classes for object design A-BC A Hierarchical structure of objects B-C B Two Types of Objects  Coupled (A-BC, B-C) C Hierarchal tree • Contains other objects  Atomic (A, B, C) • Basic functionality to be used by coupled objects A-BC A B-C B C Coupling Relation DEVS execution  Create Models Determine imminent models  Imminent models send messages to output ports  Imminent models call Internal Transition function  Other models receive external messages  Other models call External Transition function Repeat until no models can be imminent   IDEVS … Intelligent DEVS DEVS Discrete Events Coupled Models Atomic Models Computational Intelligence Fuzzy-DEVS Object: Implement various fuzzy functions and operations within a DEVS Java environment. Example – fuzzy inferencing A DEVS model for a typical fuzzy rule: IF x is A OR y is B THEN z is C x X μ A Y OR y X μ AMF B Y X μ CMF C Y z Fuzzy-DEVS, NEG ZERO Cont’d. POS a) Membership functions for fuzzy-DEVS b) The fuzzy-DEVS controller 1 Theta 0.5 0 -0.5 c) The close-loop system 0 5 10 15 20 Time(sec) 25 30 35 d) The output of the system () Example of using fuzzy-DEVS to control an inverted pendulum GA’s Basic Cycle New Population Old Population Recombination Evaluation Crossover Mutation Selection A binary chromosome 0 1 0 0 1 1 0 1 0 1 00 GA-DEVS, Cont’d. GA-DEVS implementation inside V-Lab® GA-DEVS, Cont’d. 2 A simulation window of GA-DEVS Stochastic Learning Automata     What is SLA? SLA is a sequential machine. Given a finite number of actions that can be performed in a random environment, when a specific action is taken place the environment provides a random response which is either favorable or unfavorable. The objective in the design of the automaton is to determine how the choice of the action at any stage should be guided by past actions and responses. Stochastic Learning Automata – Reinforcement Learning Stochastic Learning Automaton Update Action Probabilities s(n) y(n) G(.) F(w(n),s(n)) SLA-DEVS Stochastic Learning Automaton – A statistical Learning approach to learning … Schematic of SLA inside the DEVS Environment Simulation, DEVS-SLA … 2 Agents (a) (c) (b) (d) Simulation, Cont’d … (e) (f) (g) (h) 2 Agents Neural Networks Multi-Layer Perceptron Output 1 Input Output 2 Input Layer Output Layer Hidden Layers The interest in neural networks comes from the networks’ ability to mimic human brain as well as its ability to learn and respond. NN-DEVS Neural Networks – A neural-based approach to learning … BPNN A perceptron structure inside DEVS NN-DEVS Neural Networks – Logical connectives “AND”, “OR”, and “XOR” are implemented in IDEVS (a) AND (b) OR Number of training data errors versus epochs (X, training epochs; Y, number of training data errors) (c) XOR V-Lab    Uses the I-DEVS structure for creating objects Organizes objects into 6 different categories  SimEnv/SimMan  Terrain Models  Agent Models  Physics Models  Control Models  Dynamic Models Provides objects for the time and message flow of simulation SimEnv Terrain Physics Agents Control SimMan Dynamic Hierarchical Tree SimMan and SimEnv  SimEnv      Highest leveled coupled model Instatiates all other models Houses all other models Couples all other models together SimMan   Message liaison providing indirection between all other models Controls flow of time for the simulation SimEnv Agent Control in/out in/out in in Physics in/out SimMan in/out Dynamic out in/out Terrain Coupling Relation out High-Level Models  Physics    Terrain   Model physical phenomena Possibly a differential equation solver and set of differential equations Information about the layout of the environment Dynamic  Analyze agent’s actuator state to make environmental changes, e.g. agent’s position and velocity. High-Level Models  Control   Algorithms that control the actuators of an agent model Agent   Represent the agents in the simulation Contain Sensor models • Sensor models contain all information about external world agent is aware of  Contain Actuator models • Actuators contain state information that dynamic models use V-Lab® Cycle Simulation execution is iteration through a 5 phase cycle  Phase 1   Phase 2   Control Algorithms update Agent Actuators Phase 4   Update Agent Sensors Phase 3   Check for termination conditions Wait for all actuators to change state Phase 5  Dynamic models update agents and environment A V-Lab® THEME EXAMPLE SIMULATION SimEnv Rover Dynamics Sensors Terrain Model SimMan Controller Block Diagram of Theme Example Plot Model A THEME EXAMPLE SIMULATION – Rover Dynamics 1  vr  vl  cos  2 1 y   vr  vl  sin  2 1    vr  vl  l x inw ||vv rr ,max xv cos syv w &   &   x vl vr dx/dt=vcos dy/dt=vcsin d/dt=w y  A THEME EXAMPLE SIMULATION – Fuzzy Controller a) Distance to objectiveb) Angle error b) Angle error c) Forward velocityd) Angular velocity. d) Angular velocity. Rule1: IF distance is ‘Near’, THEN forward velocity is ‘Slow’. Rule2: IF distance is ‘Far’, THEN forward velocity is ‘Speedy’. Rule3: IF angle is ‘Right’, THEN angular velocity is‘TurnRight’; forward velocity is ‘Slow’. etc. Pushing task: G B fl<0 rB rA A wl fl=0 fl>0 Pushing region fr>0 fr=0 wr fr<0 Cooperative pushing task: G wl wr Pushing regions B rB Cooperative Pushing Task “in” “in”?object_reached “in”?command “in”?object_lost “out”!goal_reached “out”?object_touched “in”?outside_pushing_region “out”?object_reached_the_goal “in”?everybody_ready “out” Simulation Results GA-DEVS Controller (1) y   x   y     Goal (xg, yg) ir2 ir1 vl w θg v θ l   1 (v r  vl ) cos  2 1 (v r  vl ) sin  2 1 (v r  v l ) l  x  cos   0   y   v   sin    w  0           0  1 ir3 (x, y,θ)  vr x GA-DEVS Controller (4) Goal 1 Goal 2 Goal 4 Goal 3 Goal 5 Goal 6 First simulation sample task using the GA control system, T=142 steps GA-DEVS Controller (5) Goal 1 Goal 2 Goal 5 Goal 3 Goal 6 Goal 4 Goal 7 Goal 8 Second simulation sample task using the GA control system, T=236 steps GA-DEVS Controller (6)  Trajectory of the robot in a task X X Trajectory of the robot for first task Y Trajectory of the robot for second task Y THEME EXAMPLE SIMULATION - Sensors Infra-red sensors. … Each sensor has a unique ID,X_s, Y_s and theta_s, where X_s is the X-position, Y_s is the Y-position and theta_s is the angle of each sensor relative to the rover in X-Y plane. GPS sensors … An atomic model simply simulates a position sensor, giving out the current position of the rover Compass Sensors … Respective atomic models simulate the sensing by converting the angle  of the rover from radians to degrees. A THEME EXAMPLE SIMULATION – Distributed 2 Parts of a distributed simulation with GenDEVS and Sockets Commercial ROVERS Two Pioneer II Mobile rovers are used to implement IDEVS-JAVA (V-Lab®). Movies will show demos at the end. Experiment Results (DEVS-Fuzzy) Experiment Results (GA-DEVS)   Application: Pioneer 2 mobile robot Two experiments of path planning  Goal searching task w/o obstacles. Robot (10, 9, -π/2)  (4, 4, -π/2)returns to the home point (10, 9, -π/2) after traveling to the goal point. Robot task of goal seeking (Home -> Goal -> Home) System of Systems - Introduction   Changing Aerospace and Defense Industry Emphasis on “large-scale systems integration”    Customers seeking solutions to problems, not asking for specific vehicles Mix of multiple systems capable of independent operation but interact with each other Emerging System of Systems Context EMERGING CONTEXT: SYSTEM OF SYSTEMS  Meeting a need or set of needs with a mix of independently operating systems   New and existing aircraft, spacecraft, ground equipment, other independent systems System of Systems Examples     Coast Guard Deepwater Program FAA Air Traffic Management Army Future Combat Systems Robotic Colonies System of Systems  SoS: A metasystem consisting of multiple autonomous embedded complex systems that can be diverse in:         Technology Context Operation Geography Conceptual frame An airplane is not SoS, an airport is a SoS. A robot is not a SoS, but a robotic colony is a SoS Significant challenges:  Determining the appropriate mix of independent systems  The operation of a SoS occurs in an uncertain environment  Interoperability Keating, et al., 2003 System of Systems Definitions SoS: No universally accepted definition 1. Operational & Management independence+Geographical Dist. + Emerging Behavior+Evolionary Dev. (ML, Space) 2. Integration+InterOperability.+Optmiz. to enhance battlefield scenarios (ML) 3. Large scale + distributed Systems Leading to more complex systems (Private Enterprize) 4. Within the context of warfighting systems – Inter Op.+Com’d. Synergy+Cont.+ Comp.+ Comm. +Info. (C4I) & Intel. (ML) Keating, et al., 2003 DISTINCTION BETWEEN SYSTEM ENGINEERING AND SoSE SoSE represents a necessary extension and evolution of traditional system engineering.      Greatly expanded SoS requirements for tiered levels of discipline and rigor. Centralized control structure vs. decentralized control structure A typical individual system (well defined end state, fixed budget, well defined schedule, technical baselines,homogeneous) A typical System of Systems (not well defined end state, periodic budget variations, heterogeneous) Nature of SoSE Engineering Existing Complex Systems Exclusive, Autonomous, Local Transformation Keating, et al., 2003 System of Systems System of Systems Integrated, Aligned, and Interconnected, Integrated Mission, Transforming Global, Emergent Structure Keating, et al., 2003 System of Systems Engineering The design, deployment, operation, and transformation of metasystems that must function as an integrated complex system to produce desirable results. Keating, et. al 2003 Jamshidi, 2005 System of Systems PROBLEM THEMES 1. Fragmented Perspectives 2. Lack of Rigorous Development 3. Lack of Theoretical Grounding 4. IT Dominance 5. Limitations of trad. SE single system focus 6. Whole Systems Analysis Keating, et al., 2003 Modeling Issues Modeling of Systems of Systems? TOP SoSE LSS TOP BOT. ss SS-1 SS-2 SS-2 LSS ss SS-1 LSS SS-1 … SS-2 Traditional LSS Modeling LSS LSS LSS LSS BOT. LSS SoSE Modeling Difficulty EXAMPLES       Air Traffic Control Personal Air Vehicles Future Combat Systems Internet Intelligent Transport Systems US Coast Guard Integrated Deepwater System, etc., etc. US COAST GUARD INTEGRATED DEEPWATER SYSTEM  The United States Coast Guard    Protect the public, the environment, and U.S. economic and security interests in any maritime region International waters and America's coasts, ports, and inland waterways. Missions      Maritime Security Maritime Safety Maritime Mobility National Defense Protection of Natural Resources US COAST GUARD INTEGRATED DEEPWATER SYSTEM   An integrated approach to upgrading existing assets while transitioning to newer, more capable platforms with improved systems for command, control, communications, computers, intelligence, surveillance, and reconnaissance and innovative logistics support. Ensure compatibility and interoperability of deepwater asstes, while providing high levels of operational effectiveness. Conclusions   V-Lab® is still at the beginning of its development Three sub-groups are working:    Software: Responsible for the design and implementation of V-LAB® (from Network to V-LAB®) Hardware: Implementation of its tasks, features on industrial or commercial rovers Theory: Develop new theories to bridge the gap between Soft Computing (FL, NN, GA, GP, SLA, etc.) and DEVS environment. Future: Leverage V-Lab ® success and 35 years of research on LSS for SoSE applications is among future directions. Cover , Modeling and Simulation Magazine, 2003 Publications – to date 1. J. Burge, A. El-Osery, M. Jamshidi and M. Fathi “V-LAB : A Virtual Laboratory for Distributed Robotic Modeling and Simulation,” Proc. IEEE Int. Conference on Systems, Man and Cybernetics, Tucson, AZ, October, 2001. 2. A. El-Osery, J. Burge, M. Jamshidi and M. Fathi “Stochastic Learning Automaton for Learning Control of Robotic Agents, “Proc. IEEE Int. Conference on Systems, Man and Cybernetics, Tucson, AZ, October, 2001. 3. A. El-Osery, J. Burge, A. Saha, M. Jamshidi, M. Fathi and M. Akbarzadeh-T., “V-Lab – A Distributed Simulation and Modeling Environment for Robotic Agents – Control Through Stochastic Learning Automata,” to appear in IEEE Transactions on Systems, Man and Cybernetics , Vol. 32, No. 6, pp. 795-804, December, 2002. Publications – to date 2 4. A. El-Osery and M. Jamshidi, “Implementation of an SLA-Based Controller in a Virtual Laboratory,” Proc. IEEE Int. Conference on Systems, Man and Cybernetics, Hammamet, Tunisia, October, 2002 5. M. Jamshidi, S. Sheikh-Bahaei, J. Kitzinger, P. Sridhar, S. Xia, Y. Wang, J. Liu, E. Tunstel, Jr , M. Akbarzadeh, A. El-Osery, M. Fathi, X. Hu, and B. P. Zeigler, “A Distributed Intelligent Discrete-Event Environment for Autonomous Agents Simulation,” Chapter 11, Applied Simulation, Kluwer Publishers, Amsterdam, the Netherlands, 2003. 6. M. JAMSHIDI, S. Sheikh-Bahaei, J. Kitzinger, P. Sridhar, S. Beatty, S. Xia, Y. Wang, T. Song, U. Dole, J. Liu, E. Tunstel, Jr., M. Akbarzadeh, P. Lino, A. El-Osery, M. fathi, X. Hu, and B. P. Zeigler, “V-Lab© - A Distributed Intelligent Discrete-Event Environment for Autonomous Agents Simulation,” Intelligent Automation and Soft Computing - Autosoft Journal, Vol. 9, No. 3, pp. 181-214 2003. 6. M. Jamshidi, S. Sheikh-Bahaei, J. Kitzinger, P. Sridhar, S. Xia, Y. Wang, J. Liu, E. Tunstel, Jr , M. Akbarzadeh, A. El-Osery, T. Song, M. Fathi,“A Distributed Intelligent Discrete-Event Environment for Autonomous Agents Simulation,” Chapter 11, Modeling and Simulation Magazine, (Cover article), M&S International, Snn Diego, CA, 2003. 7. P. Sridhar and M. Jamshidi, “A Framework for Multi-agent Discrete Event Simulation: V-Lab®,” Proc. IEEE SMC, the Hague, the Netherlands, Oct 10-12, 2004. Publications – to date 3 8. S. Sheikh-Bahaei, M. Jamshidi,” Discrete Event Fuzzy Logic Control with Application to Sensor-Based Intelligent Mobile Robot,” Proc. WAC 2004, Seville, Spain, June 28-July 1, 2004. 9. P. Sridhar and M. Jamshidi, “A Multi-agent Discrete Event Simulator: V-Lab®,” Proc. WAC 2004. Seville, Spain, July 28-July 1, 2004. 10. S. Sheikh-Bahaei, M. Jamshidi, and P. Lino, “An Intelligent Discrete Event Approach to Modeling, Simulation and Control of Autonomous Agents,” Intelligent Automation and Soft Computing - Autosoft Journal, Vol. 10, No. 4, pp. 337-348, 2004. THANK YOU!

Learning Automata

Related documents

Products

Support

Learning Automata

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib