
NERDLab
Neuro-Engineering Research & Development Laboratory
Research Activities
Mechanical Engineering Department
The University of Texas at Austin, 1 University Station, C2200, Austin, Texas, 78712-0292.
Voice: +1 (512) 471-7852. Fax: +1 (512) 471-7682. Email: [email protected]
1993-2015

Contents

1 Research Projects
  1.1 Parallel Implementation of Neural Networks
  1.2 Active Vibration Quenching
  1.3 Optimal Mapping and Initialization
    1.3.1 Problem Specific Networks
    1.3.2 Initialization
    1.3.3 Mapping of Complex Systems
    1.3.4 Quantization of Complexity
  1.4 Self-tuning PID Controllers for Boiler
  1.5 Problem Statement
    1.5.1 A Heuristic Error Definition for Backpropagation
  1.6 Modeling Environment of Nonlinear Dynamic Systems using Neural Networks and Bond Graphs
  1.7 Digital VLSI Neuro Chip Design
  1.8 Remanufacturing Engineering Tools for Water-Jet Cutting
    1.8.1 NFD — Neural–Network Feature Detector
    1.8.2 MADA — CMM Automated Data Acquisition
    1.8.3 NOR — Neural–Network Object Reconstruction
    1.8.4 CCG — CNC Code Generation
    1.8.5 Deliverables
  1.9 Robust Automation for Manufacturing
  1.10 System Identification of Non-Linear Systems
  1.11 Comparative Analysis of Control Techniques
2 Relationships to On-Going Projects
3 Research Facilities
4 Research Personnel
5 Recent Publications

Executive Summary

It is indeed a pleasure for me to introduce the Neuro-Engineering Research & Development Laboratory staff of the Mechanical Engineering Department, The University of Texas at Austin. We have, I believe, a very strong team for the research and development of neuro-engineering technologies. We have secured support from state and federal agencies as well as from industry to develop and transfer technology in the area of neural networks applied to engineering. Our projects range from the development of new architectures, learning algorithms, and paradigms, through system identification of nonlinear dynamic systems, adaptive (intelligent) control, fault diagnostics, and optimization, to knowledge representation, reverse engineering, design, and precision machining.

This document gives a glimpse of the resources and current projects at the NERDLab. We welcome your questions about our research. If you would like additional information, please contact us electronically at EMAIL:SMTP%“[email protected]”, by voice at (512) 471-7852, or by fax at (512) 471-7682.

Thank you for your interest.

Very truly yours,
Benito Fernández
Assistant Professor and NERDLab Director

The NERDLab at UT is one of the newest programs of the Mechanical Engineering Department.
The rapid growth in the size and quality of this program in the last year has corresponded with the growing recognition of The University of Texas as one of the top-ranked American universities in overall academic quality, and with the recognition of Neural Networks as a viable and important technology for the years to come. This document provides information on the objectives, programs, staff, research interests, and computational and laboratory facilities available for graduate studies and research in this area.

Program Objectives/Mission: Neural Networks constitute a frontier area of research and industrial activity in several disciplines of Science and Engineering. The underlying unifying basis that groups the technology is the effort to discover and imitate how the human brain works, and then apply this technology to solve complex engineering problems. Our mission is thus to serve as a bridge between research in the area of intelligent systems and the application of these concepts to real-world problems. Our focus is aimed at two fronts: to develop research tools to study, analyze, select, and test Neural Network paradigms; and to develop technology-transfer vehicles through education and applications of this area of research to industry.

1 Research Projects

1.1 Parallel Implementation of Neural Networks

Artificial Neural Networks (ANNs) are massively parallel computational models that can be mapped onto parallel computers to achieve high Connections-per-Second rates. We are investigating the mapping of neural networks onto multicomputer networks, together with a general data structure and a message-based processing algorithm for computation that keeps the processing granularity sufficiently coarse. The nodes of these networks are the processors, each having a locally addressable memory. A network communication controller is used to pass information among nodes.
Neural Networks have large communication overheads, and the simulation will be parallelized to reduce inter-process communication to a minimum. The issues of avoiding deadlocks and of optimally mapping a network onto the machine, given the Parallel Neural Network Simulator (PNNS) kernel, are under study. Node parallelism is the most efficient method of achieving a high degree of parallelism in small multicomputer systems with minimum communication overhead. The hardware consists of three Pac8 NuBus Link Adaptors, each carrying four Inmos 32-bit T805 25 MHz Transputer modules, interfaced to an Apple Macintosh. The NuBus Link Adaptor has a data transfer rate of 1 MB/sec.

The basic building block of Artificial Neural Networks is a node, a simple processing element. The inputs to a node are the weighted activation levels of other nodes; the weights associated with each input form the information-storing elements. In ANN simulations these nodes are topologically arranged in layers. The inputs to the nodes of a given layer come from the nodes of previous layers, while inputs from nodes of layers arranged topologically ahead carry values from the previous timestep (dynamic networks). The nodes of a given layer must be evaluated before proceeding to the next layer, as the inputs to the next layer share the same timestep. Thus, a layer can be regarded as a wavefront in the computational model during the simulation of neural networks. Information (or computation) progresses along the layers during forward propagation and backwards along the same front during backpropagation of error. Networks of this type are termed Back-Propagation Networks (BPN) or Multi-Layer Perceptrons (MLP) and are among the most commonly used. Forward propagation involves calculating the weighted sum of the inputs to a node and passing it through a non-linear element. In the case of cross-talk, the input is the previous-timestep activation of a node in the same layer.
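The layer-wavefront computation just described can be sketched as follows; this is a minimal illustration (the layer sizes, random weights, and logistic nonlinearity are assumptions for the sketch, not taken from the PNNS implementation):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(layers, x):
    """Propagate an input vector layer by layer (one 'wavefront' at a time).

    Each front is evaluated in two stages: first the weighted sums for
    every node of the layer, then the non-linear element.
    """
    activation = x
    for W, b in layers:
        net = W @ activation + b   # stage 1: weighted sums for the whole front
        activation = sigmoid(net)  # stage 2: pass through the non-linearity
    return activation

# Hypothetical 3-2-1 network, just to exercise the wavefront evaluation.
rng = np.random.default_rng(0)
layers = [(rng.standard_normal((2, 3)), np.zeros(2)),
          (rng.standard_normal((1, 2)), np.zeros(1))]
y = forward(layers, np.array([0.5, -0.2, 1.0]))
```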
To accommodate such cases, a front is sub-divided into two subfronts: in the first stage the inputs to each node of a layer are calculated and synchronized, and in the second stage the outputs are evaluated by passing them through the non-linear function.

1.2 Active Vibration Quenching

Limitations of passive isolation systems have made non-linear control systems more attractive for vibration damping. To exploit the properties of terfenol, which provides magnetostrictive strains for vibration damping, a plant consisting of three such actuators was controlled in the presence of persistent vibration using Sliding Mode Control (SMC) and Neural Networks. The principle of SMC is to define an error surface as a differential operator such that the control guarantees the system reaches and maintains the error dynamics within the bounds established by the surface thickness and profile. The plant is an aluminum platform to which the disturbance signal of the shaker table is mechanically transmitted through the passive structural impedance of the unenergized terfenol actuator. Three velometers record vibrations from the table, and these signals are conditioned using digital filters. The conditioned signals are tapped as inputs to the A/D on host computer 1 for control, as well as to a Fourier analyzer on host computer 2 for analysis. The control apparatus consists of host computer 1, an i386 machine with three Banshee mother boards (Atlanta Signal Processors Inc.), each with a TMS320C30 Digital Signal Processing chip and an A/D daughter board. The control algorithm is generated by compiling C code; the A/D–D/A routines are written in assembly to speed up the data acquisition process. The executable is loaded onto the TMS320C30 processor boards for standalone operation. The signal received at the A/D is processed by the SMC algorithm on the TMS320C30, and the calculated control signal is output through the D/A.
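In scalar form, the sliding-mode law executed on the DSP can be sketched as below; the surface coefficient, switching gain, and boundary-layer thickness are illustrative assumptions, not the laboratory values:

```python
import numpy as np

def smc_control(e, e_dot, lam=20.0, k=5.0, phi=0.05):
    """Scalar sliding-mode control: define the error surface
    s = e_dot + lam*e and push the state toward s = 0.  The switching
    term is saturated over a boundary layer of thickness phi to limit
    chatter (the 'surface thickness' mentioned in the text)."""
    s = e_dot + lam * e
    return -k * np.clip(s / phi, -1.0, 1.0)

u = smc_control(e=0.01, e_dot=-0.1)  # state well outside the boundary layer
```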
This analog output, after passing through the power amplifier, is fed to the terfenol actuator to achieve the desired reduction in the amplitude of the vibrating table. Host computer 2 is an i386 machine that receives its input from the signal analyzer (Tektronix 2630 four-channel Fourier analyzer). This provides an alternative method of checking the transfer function and other desired responses of the system without interfering with its operation. We are currently waiting for a neurochip under development at Motorola to implement the control algorithm in this architecture.

1.3 Optimal Mapping and Initialization

There has been active interest in faster learning methods to make backpropagation networks a feasible option for various applications. Toward this goal, the group is investigating issues relating to optimal mapping and initialization. Optimal mapping results in faster learning and a problem-specific network in which the information content is more transparent. Initialization of the network is made possible by the complexity reduction that mapping provides. The main idea being pursued at the NERDLab is to achieve faster learning by defining problem-specific networks and by initializing the network using various techniques. For complex problems, graph-theoretic approaches are being investigated.

1.3.1 Problem Specific Networks

Problem-specific networks give rise to an optimal network size and a better distribution of complexity over the network. The idea of custom networks is, instead of using the black-box approach, to allocate a part of the network to an identifiable sub-problem. Therefore, if the network is required to map a function, say, f(x, y, z) = g(x, y) · h(y, z), then the network consists of two subnets, mapping g(x, y) and h(y, z) respectively. This reduces the complexity of each network, overcoming the poor scaling of the backpropagation algorithm as complexity increases.
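The custom-network idea above can be sketched as two small subnets whose outputs are multiplied; the subnet sizes and random parameters are illustrative assumptions, not from the original work:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def subnet(params, inputs):
    """A tiny one-hidden-layer subnet: inputs -> hidden -> scalar output."""
    W1, b1, W2, b2 = params
    return (W2 @ sigmoid(W1 @ inputs + b1) + b2).item()

def f(params_g, params_h, x, y, z):
    """Problem-specific network for f(x,y,z) = g(x,y) * h(y,z):
    each identifiable sub-problem gets its own low-dimensional subnet."""
    g = subnet(params_g, np.array([x, y]))  # this subnet sees only (x, y)
    h = subnet(params_h, np.array([y, z]))  # this subnet sees only (y, z)
    return g * h

rng = np.random.default_rng(1)
make = lambda: (rng.standard_normal((4, 2)), np.zeros(4),
                rng.standard_normal((1, 4)), np.zeros(1))
out = f(make(), make(), 0.1, 0.2, 0.3)
```

Each subnet has input dimensionality two rather than three, which is exactly the complexity reduction the custom mapping is meant to buy.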
Instead of learning with huge fully recurrent architectures, it is better to eliminate unnecessary connections. The problems of over-generalization and long learning times in feedforward networks carry over to recurrent networks. The custom network can be used for both dynamic and static systems. In the case of a dynamic system, the state equations can be used to derive the causality relations between the states and thus obtain the mapping. An alternative method to find the mapping has been proposed using Bond Graphs.

1.3.2 Initialization

Neural networks have the drawback of long training times. The training set, the initial conditions, and the learning algorithm play a crucial role in network convergence. Based on the curve-fitting properties of static feed-forward nets, initialization algorithms (Neural Space Initialization) have been developed. NSI together with the NeuroBond Graph is an ideal combination for initializing the recurrent networks used for system identification. The NeuroBond Graph, or the dynamical equations, can give the interplay and causality relationships between the different dynamic elements, giving rise to a sparse network. Each subnet of the network can then be initialized to yield shorter learning periods. For system identification, such methods give a representative estimate of the system parameters. Initializing the network results in considerable improvement in the learning process; one reason for the good performance of Radial Basis Function (RBF) networks is the initialization of their centers, which is equivalent to network initialization. Some methods of initialization are described in (3).

1.3.3 Mapping of Complex Systems

The “optimal” mapping of a complex system is of utmost importance for handling large systems. Maximum parallelism and maximum decomposition are crucial for real-time implementation and for a faster learning process, respectively.
At the extreme of maximum parallelism, i.e., when a given function is mapped completely by a single three-layered perceptron, there is no decomposition of the function. This results in a large-dimensional map in which initialization of the network is impossible, and the learning time is large owing to the high dimensionality. At the other extreme, the decomposition is carried as far as possible, resulting in a network with many layers: the time taken for forward propagation becomes large, and the problem of backpropagating through a network with many layers is not well studied. Decomposition of a complex function, on the other hand, facilitates the initialization of smaller networks, making it possible to learn very complex functions in reasonable time. The decomposition of a function is illustrated in the following example [1].

Example 1 (decomposition).

x = f(a, b, c, d, e)                                                (1)
  = f( f1[ f11(a, b), f12(a, c), f13(b, d) ], f2[ f21(e, d), f22(a, c) ] )

The equation can be rewritten as

x11 = f11(a, b)    x12 = f12(a, c)    x13 = f13(b, d)
x21 = f21(e, d)    x22 = f22(a, c)
x1  = f1(x11, x12, x13)    x2 = f2(x21, x22)
x   = f(x1, x2)                                                     (2)

The network emulating equation (1), with input dimensionality five, is broken down into eight sub-networks emulating f11, f12, f13, f21, f22, f1, f2, and f from equation (2), in which the dimensionality is reduced to at most three, making initialization feasible. The resulting network has 3 × 3 layers instead of 1 × 3 in the original network.

The dependency relation can be expressed as a directed graph G(V, E), giving the dataflow information. The vertices represent the variables in the set of equations and the edges describe the dependencies: the edges Ei incident onto the vertex Vi come from the variables required to compute the i-th variable.

1.3.4 Quantization of Complexity

The graph G gives a way of mapping the set of equations onto a neural network.
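The dependency-graph construction of Example 1 can be sketched as a graph evaluated in dependency order; only the dataflow structure comes from the example, and the function bodies below are placeholders standing in for trained subnetworks:

```python
# Dependency graph G(V, E) for Example 1: each vertex is a variable,
# its incoming edges name the variables needed to compute it.
deps = {
    "x11": ("a", "b"), "x12": ("a", "c"), "x13": ("b", "d"),
    "x21": ("e", "d"), "x22": ("a", "c"),
    "x1": ("x11", "x12", "x13"), "x2": ("x21", "x22"),
    "x": ("x1", "x2"),
}

# Placeholder constitutive relations; in the real system each would be
# a small subnetwork trained on the corresponding sub-problem.
funcs = {v: (lambda *args: sum(args)) for v in deps}

def evaluate(inputs):
    """Evaluate the graph wavefront by wavefront: a variable is computed
    as soon as all of its incident edges (dependencies) are available."""
    values = dict(inputs)
    pending = set(deps)
    while pending:
        ready = {v for v in pending if all(d in values for d in deps[v])}
        for v in ready:
            values[v] = funcs[v](*(values[d] for d in deps[v]))
        pending -= ready
    return values["x"]

result = evaluate({"a": 1, "b": 2, "c": 3, "d": 4, "e": 5})
```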
Each vertex Vi represents a multilayered (three-layer) network with the number of nodes in the first layer equal to the number of edges incident on Vi, and one output node. The hidden nodes can be selected based on the complexity of the function hi that the vertex Vi maps. In a complex set of equations, the number of layers can become arbitrarily large if all the equations are broken down into equations as simple as possible. To obtain a network with an upper bound on the number of layers, the graph G has to be reduced to Gl, where some vertices Vj ∈ Gl represent a composition of two or more vertices of G. The reduction G → Gl is achieved while keeping the increase in complexity of the resulting graph to a minimum. The process requires a definition of network complexity. Here the complexity of a network is defined as the number of inputs × outputs; since any given network has only one output, the complexity reduces to the number of inputs. Note that this definition is inconsistent in some cases. For example, the function f : (x, y, z) → x + y + z can be mapped using just one linear node, whereas a function of the form g : x → x², although rated much lower on this complexity scale than f, cannot. The definition of complexity has to be extended to be function-dependent (non-linearity-dependent) to cover all cases.

1.4 Self-tuning PID Controllers for Boiler

This part of the project is concerned with self-tuning PID control of a power plant. The problem can be stated as follows:

1.5 Problem Statement

Find the optimal PID gains for each channel (in the MIMO plant), optimal in the sense of a standard performance measure, i.e., the integral of a weighted squared error and input energy:

J = ∫ (xᵀ Q x + uᵀ R u) dt

The objective is to minimize this performance index for a linear MIMO system. The linear MIMO systems are to be obtained at various operating points of the boiler.
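As a concrete illustration, the performance index J can be approximated in discrete time by summing the weighted quadratic terms along a closed-loop trajectory; the plant matrices, feedback gains, and weights below are invented for illustration (a simple state feedback stands in for the tuned PID loop):

```python
import numpy as np

def performance_index(xs, us, Q, R, dt):
    """Discrete approximation of J = integral of (x'Qx + u'Ru) dt."""
    return sum(float(x @ Q @ x + u @ R @ u) for x, u in zip(xs, us)) * dt

# Hypothetical discrete plant x(k+1) = A x(k) + B u(k).
A = np.array([[0.9, 0.1],
              [0.0, 0.8]])
B = np.array([[0.0],
              [1.0]])
Q = np.eye(2)
R = 0.1 * np.eye(1)
K = np.array([[0.5, 0.2]])   # candidate gains to be scored by J

x = np.array([1.0, 0.0])
xs, us = [], []
for _ in range(200):
    u = -(K @ x)             # control input from the candidate gains
    xs.append(x)
    us.append(u)
    x = A @ x + B @ u        # advance the plant one step

J = performance_index(xs, us, Q, R, dt=0.01)
```

A self-tuning scheme would search over the gains to minimize J at each operating point.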
The self-tuning PID has been tested on a simple SISO system and on a MIMO system.

1.5.1 A Heuristic Error Definition for Backpropagation

For backpropagation, the network is divided into two sub-networks: one subnet representing the system under study and the other modeling the controller. Note that backpropagation and the MIT rule are the same if the error function of the network is defined as the squared error (error = DesiredOutput − NetworkOutput). The performance measure to be minimized lacks uniformity in the sense that the terms to be minimized (the quadratic of the output error and the control input to the system) are defined at different layers of the network, and the intermediate layers are fixed (i.e., the weights corresponding to these layers are fixed). To quantify a compound error at the output stage of the controller, a concept of “error of the controller output,” based on the MIT rule, is defined: the error at the output of the plant (xᵀQx) is referred back to the controller output (the plant input), in addition to the error term uᵀRu already defined there. The input to the plant is treated as a parameter, and the error in this parameter is defined as the change required by the MIT rule. Consider a subsystem

x(k + 1) = A x(k) + B u(k)                                          (3)
y(k)     = C x(k) + D u(k)                                          (4)

E(j) = error(j)ᵀ error(j) = (y_d(j) − y(j))ᵀ (y_d(j) − y(j))        (5)

According to the MIT rule,

∇u(k) = −µ Σ_j ∂E(j)/∂u(k)

Defining the “error of the controller output” corresponding to the error at the plant output as res_error(t) ∝ ∇u(t), the compound error for backpropagation is res_error(t) + u(t) for all t = 0, 1, 2, ..., T.

1.6 Modeling Environment of Nonlinear Dynamic Systems using Neural Networks and Bond Graphs

Artificial neural networks (ANNs) have proved themselves to be universal approximators (2; 4) and work well for system identification (6; 5; 3; 1).
The usual approach in the literature is to train an ANN as a fully connected multi-layer perceptron (MLP) to attain the desired mapping. This “black-box” approach works very well, except for the usual complaint that it is hard to interpret what the network has learned, or that the result lacks any association with the physical system from which the training data were obtained. Using NeuroBondGraphs (NBG), our intention is to bridge this gap. Based on energy-based modeling techniques, we are able to qualitatively characterize the interaction of the system's components in a straightforward manner, allowing the construction of an ANN that resembles the physical system. The end result is a sparse network that, after training, has an almost one-to-one correspondence with the physical system it imitates, giving a “qualitative” mapping of the system's component nonlinearities.

The basic idea is to use Bond Graphs (BGs) as the method to identify and differentiate the basic elements of the system being modeled. Then, based on each element's unique defining characteristics and behavior (see Fig. 1) and the BG connectivity, we can generate the dynamic structure, i.e., the set of first-order nonlinear differential equations that describe the system's energy interactions. The “unknown nonlinearities” are the constitutive relations between elements, and are approximated/learned by the neural network from experimental or simulated input/output training data. When combined with the network integration method and the system's inherent mappings, we get a multi-layer dynamic (recursive) network, as shown in Fig. 2, through the use of the Neural Space Representation (NSR) topology. This approach gives (1) the neural network a specific architecture to work with, so that when training is done the “knowledge” acquired can be referred back to physical parameters, and (2) the bondgraph community a tool to represent and identify generalized nonlinear systems.

Figure 1: BondGraph Energy-based Modeling Philosophy.
Figure 2: DyNet “Hybrid” Architecture.

In summary, the basic objectives are to:

• Use BondGraphs to identify (qualitatively) the basic “elements” of the system to be modeled;
• Generate, from the bondgraph, the neural network dynamic structure;
• After training with input/output data, generate (quantitatively) the “elements’” constitutive relations.

1.7 Digital VLSI Neuro Chip Design

The objective of this research is to produce in silicon the neural network paradigms and real-time learning algorithms being developed at the NERDLab. The neural-network-based algorithm reduces to a weighted average of the various previous inputs to produce an output. Given a set of patterns representative of the I/O characteristics of the non-linear plant, the Network for System Identification (NFSI) learns the plant characteristics at different points in state space and maps them to a structured parameter list. The output of the actual plant is compared to the network output, and the resulting error dynamics are analyzed by the Error Modeller, which updates the error dynamics of the actual plant and the NFSI. The error information is then backpropagated to the plant. This process leads to an optimal (or sub-optimal) controller robust to uncertainty and measurement error (since the network also acts as an adaptive filter).

In selecting the architecture for VLSI implementation of a neural network, it is imperative to optimize the operations of the basic cell, i.e., the neuron. A single neuron is a part of the network having connections to neurons in other layers. Connectivity to neurons in the same layer, to enable cross-talk, is a feature that will be incorporated in future versions. Each neuron is to be laid out in CMOS 3µ technology on a MOSIS 40-pin pad.
The present architecture consists of a 32-bit data bus and a 6-bit control bus, with Vdd and GND forming the other pins. Each neuron is designed to receive inputs from two neurons in the previous layer and send its output to two neurons in the next layer. Each link consists of a bus for the actual signal and one for the error signal on that link; a link is active only if its link control line is enabled. The weights are stored in on-chip static RAM and are updated on a signal from the global controller. The global controller performs the tasks of enabling the links on the various neurons and providing the initialization and weight-update enable signals. The exact architecture of the global controller is yet to be decided; it is to incorporate provisions to upload weights at initialization and to download them to off-chip memory at the end of the learning process. With this architecture it is possible to build an arbitrarily large network in which the final output delay is proportional only to the number of intermediate layers.

1.8 Remanufacturing Engineering Tools for Water-Jet Cutting

The basic objective of this project is to develop and implement tools that aid the reverse-engineering process from part/concept to product using Applied Intelligence tools such as Artificial Neural Networks (ANNs). We propose to automate the “reverse engineering” process, starting with the planning and data acquisition from an existing part (or a drawing of it) and ending with CNC-code generation. It is difficult to manage a maintenance utility without overstocking replacement components, so the capability of reproducing different components within a reasonable turnaround time not only alleviates the stocking problem but increases the readiness and independence of the utility. Remanufacturing is expensive and time-consuming; if “one-of-a-kind” reproduction is desired, the current process cost/benefit ratio may not be favorable.
With a neural network, we propose an architecture to improve this ratio, allowing “retooling” to become a competitive and almost vital technology. Most of the required technology already exists and has been proven at UT Austin, and most of the required hardware is already available in-house at Kelly AFB. Accurate coordinate measurements can be obtained with apparatus such as the CMM (Coordinate Measuring Machine); unfortunately, current data acquisition is performed manually. If an approximate mathematical description of the part can be generated and fed into an automatic manufacturing process such as CNC machining, then we can achieve a fully automatic remanufacturing process. Considerable resources (time and cost) will be saved by this proposed technology. The basic architecture of the project is described in Fig. 3. The setup consists of three separable subsystems, plus an optional fourth:

NFD The Neural Feature Detector, which consists of an inexpensive vision system to acquire a “rough” bitmap picture of the part, which is then fed to a neural-network algorithm to detect the part outline, some salient features, and their relative positions.

MADA The CMM Automated Data Acquisition. This subsystem automates the data gathering with the CMM system that is currently operated manually. The system will take the NFD findings and plan the path of the CMM probe (or a mounted camera) for outline and feature coordinate measurements.

NOR The Neural Object Reconstructor. This subsystem will take the data collected by MADA, together with the NFD information, and extract (compress) the information into features and/or a part description. A possible alternative is to generate an AutoCAD file as the subsystem output.

CCG The CNC Code Generation. This item is optional and will consider the development of CNC code given the AutoCAD description.
There is commercially available software to perform this task, but if Kelly's AFLC staff feels complete integration is desirable, additional funding could be used to establish a neural-network-based database retrieval and CNC-code generation. Also related to this subsystem, a parallel proposal is being submitted on the UDM (Universal Die Maker), which will allow the generation of universal (reusable) dies for metal forming.

Figure 3: RE-Tool diagram.

In what follows, a more detailed description of the different subsystems is given.

1.8.1 NFD — Neural–Network Feature Detector

Using the bitmap image captured by the vision system, a neural network can be devised to “detect” the basic features of the part, create a list of the features encountered, and determine the relative positions of the features and an outline of the part. As a starting point we will try the concept on 2-D objects with some basic features. Neural network architectures well suited for this task exist in the open literature; we anticipate the use of the MLP (Multi-Layer Perceptron), ART (Adaptive Resonance Theory) networks, and the Neocognitron.

1.8.2 MADA — CMM Automated Data Acquisition

After the outline of the object, its features, and their relative positions have been established, we are in a position to plan a sweep of the CMM system to automatically obtain the true distances associated with the specific list of features. Standard path-generation algorithms will be used for this task, with a small variation to account for reflective force feedback at the CMM probe. MADA will generate a data set associated with each feature and with the part outline.

1.8.3 NOR — Neural–Network Object Reconstruction

Given the feature list and the data associated with it, confirmation of features and reconstruction of the part are possible. Note that the NFD is not an extremely accurate system, since the vision system is not accurate enough.
NFD is used as a filter for the features, while the accuracy of the measurements is left to the CMM system. Existing architectures and learning algorithms under development at UT will be used to detect new features (confirming the NFD guesses) and to estimate their parameter values. The output of this unit could be a file readable by AutoCAD.

1.8.4 CCG — CNC Code Generation

This is an optional task/component of the overall system. If there are no tools to automatically generate CNC code from an AutoCAD file, suitable code can be implemented, or the developed code can be interfaced with commercial software tools. A neural network could be implemented for the library search and matching operations; in particular, networks such as the ASDM (Adaptive Sparse Distributed Memory) have proven well suited for such tasks.

1.8.5 Deliverables

At the end of the project we expect to have the major components that prove the feasibility of such a system as an effective tool in remanufacturing. Further development, training, and improvements are expected, given the large amount of parts processed at Kelly.

1.9 Robust Automation for Manufacturing

The feasibility of using Artificial Neural Networks (ANNs) as a tool for modeling and controlling complex machining processes is being studied in the Neuro-Engineering Research & Development Laboratory. Machine tools are justifiably called the foundation of industry: most industrial parts are made by machine tools, especially in mass production by Computer Numerical Controlled (CNC) machines. Improving the precision of CNC machining processes will be an important contribution to the automation of the manufacturing industry. A novel research program on Robust Automation for Manufacturing (RAM), concentrating on Light Weight High Precision Machining (LWHPM) using Artificial Neural Networks, is currently under implementation in the laboratory.
The most common error causing machining inaccuracy is the lack of precision of the machine itself over a large control volume, due to faulty calibration, thermal compliance, and permanent deformation of the machine structure. The second most common error is due to the dynamic interaction between the machine and the workpiece: in CNC turning processes, as the tool cuts into the part, the part deflects, and the larger the depth of cut, the more the part deflects, resulting in larger error. At present the most common way of maintaining accuracy is to minimize the interaction by using a small depth of cut and feed rate, which results in small throughput.

In our unified approach for improving machining precision in the turning process, a neural network is used to learn the inverse mapping between the commanded and machined (actual) part dimensions generated by a CNC turning process. Several different part profiles are machined on the same CNC turning machine with fixed feed rate and spindle speed, with multiple samples of each profile. It is observed that the error is largest around the center of the part, and larger toward the supported end than toward the clamped end. The positions, the actual diameters at these locations, and the desired diameters are taken as training data for the neural network. The network learns the nonlinear mapping from actual to desired dimensions along the entire part length. For given desired part dimensions, the network then generates the CNC codes required to correct the static and dynamic errors of the machine. After training, the network's generalization capability is tested by presenting new part profiles. Based on the experimental results, it can be concluded that the neural network can learn the characteristics of the machine tool and generate compensation codes to correct the inaccuracies due to calibration errors and deformation interactions.
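A minimal sketch of the inverse-mapping idea follows, with a polynomial fit standing in for the trained network; the deflection model and all numbers are invented for illustration:

```python
import numpy as np

# Invented ground truth: the machined diameter falls short of the commanded
# one by a deflection that peaks near the part center (position z in [0, 1]).
def machine(z, commanded):
    return commanded - 0.05 * np.sin(np.pi * z)

# "Training data": positions, commanded diameters, resulting actual diameters.
z = np.linspace(0.0, 1.0, 50)
commanded = np.full_like(z, 10.0)
actual = machine(z, commanded)

# Stand-in for the neural network: fit the position-dependent error so that
# (position, desired diameter) -> compensated commanded diameter.
coeffs = np.polyfit(z, commanded - actual, deg=4)

def compensate(z, desired):
    """Generate the commanded dimension expected to yield `desired`."""
    return desired + np.polyval(coeffs, z)

# New desired profile: the compensated commands nearly cancel the error.
desired = 12.0
result = machine(z, compensate(z, desired))
max_err = float(np.max(np.abs(result - desired)))
```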
Further research with more parameters, such as changes in speed and feed, is in progress.

1.10 System Identification of Non-Linear Systems

The goal of this project is to develop robust controllers based on dynamic neural network architectures. The proposed controller is neural based: it assumes a model of the plant/process in the form of a recurrent multilayer perceptron (DyNet™) and builds a robust controller based on sliding mode control principles. Since sliding controllers assume full state feedback, we will also develop nonlinear observers based on the same neural network architecture. The complexities involved in using non-linear basis functions for system identification, deriving identification algorithms that can be applied to a reasonably large class of non-linear systems, and/or using the physical non-linear model for control restrict the use and viability of non-linear techniques. The most commonly used nonparametric identification methods for non-linear systems employ the Volterra series, the Wiener kernel approach, and the expansion of the restoring force in a series of orthogonal sets of functions such as Chebyshev polynomials. The Volterra series and Wiener approach require excessive computation time and have serious convergence problems. In this respect neural networks offer a viable technique for non-linear system identification and control. The capability of a static neural network to map any function to any desired accuracy and, in addition, the system-independent adaptive learning algorithm that absorbs online parameter variations and modeling errors make neural networks a promising tool for non-linear control. The neural network, a universal approximator, forms a universal structure for a large class of non-linear mappings. In addition, neural networks can be thought of as a massively parallel architecture that takes care of the computational needs of any non-linear system.
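As a structural sketch of a recurrent multilayer-perceptron plant model of the kind mentioned above, two static sub-networks can implement the state update x[k+1] = f(x[k], u[k]) and the readout y[k] = g(x[k]). The weights below are arbitrary placeholders, not an identified model or the actual DyNet implementation.

```python
# Sketch of a recurrent plant model built from two static tanh networks:
#   x[k+1] = f(x[k], u[k])   (state sub-block)
#   y[k]   = g(x[k])         (output sub-block)
# All weight values are illustrative placeholders.
import math

def mlp(w1, w2, inp):
    """One-hidden-layer static net with tanh units and linear output."""
    hidden = [math.tanh(sum(w * v for w, v in zip(row, inp))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]

# f: (2 states + 1 input) -> 2 states;  g: 2 states -> 1 output
Wf1 = [[0.5, -0.2, 0.3], [0.1, 0.4, -0.5]]
Wf2 = [[1.0, 0.2], [-0.3, 0.8]]
Wg1 = [[0.6, 0.4], [-0.2, 0.7]]
Wg2 = [[0.9, -0.4]]

x = [0.0, 0.0]                           # internal state
outputs = []
for u in [1.0, 0.5, -0.5, 0.0]:          # input sequence
    x = mlp(Wf1, Wf2, x + [u])           # advance state through f(x, u)
    outputs.append(mlp(Wg1, Wg2, x)[0])  # read output through g(x)

print(len(outputs))  # one output sample per input sample
```

Feeding the state back through f is what makes the composite network dynamic even though each sub-block is an ordinary static approximator.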
This project focuses on a five-layered network which brings out the dynamics of the system explicitly. It also gives rise to a fixed and defined number of system states which can be altered, making it possible to model only the dominant dynamics. Such an approach can give a hierarchical structure of system identification, with one network modeling only the dominant dynamics whereas another models all the dynamics. Such a structure may result in more robust system identification and fault diagnosis. The architecture consists of two generic sub-block structures, modeling f (·, ·) and g(·) respectively, thus giving the flexibility of sub-identification for each block as well as input-output identification. In general we always have a mathematical model before starting, which can be used to initialize each sub-block by Neural Space Initialization (NSI) [3] and static backpropagation. The architecture also makes it possible to associate some physical meaning with the dynamic set of nodes. It can again be noted that each sub-block consists of an ordinary static network with the properties of a nonlinear approximator.

1.11 Comparative analysis of control techniques

A real-time control system has been implemented in the NeuroEngineering Laboratory, Mechanical Engineering, The University of Texas at Austin. Several control techniques such as PD, Fuzzy Logic Control, Sliding Mode Control, Expert System Control, and Neural Network Control are compared in terms of their control surfaces, stability, and robustness through computer simulation and experimental real-time implementation. The cart-inverted-pendulum is used to demonstrate the performance of each controller in real time. It is critical to validate the performance of controlling the nonlinear system through real-time control implementation. Since the results of the simulation cannot give a complete measure of the performance of the controller, computational requirements play an important role.
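Among the controllers compared, the PD law is the simplest. As a toy illustration only, the loop below regulates a linearized inverted-pendulum model; the model, time step, and gains are illustrative assumptions, not the laboratory rig or its tuned values.

```python
# Minimal PD regulation loop on a linearized inverted pendulum:
#   th'' = (g/l)*th + u   (toy model with l = 1 m; unstable without control)
# Gains kp, kd are illustrative placeholders.
dt = 0.001
kp, kd = 40.0, 8.0
g_over_l = 9.81

th, om = 0.1, 0.0           # initial tilt (rad) and angular rate (rad/s)
for _ in range(5000):       # 5 s of simulated time
    u = -kp * th - kd * om  # PD law on the measured angle and rate
    om += (g_over_l * th + u) * dt
    th += om * dt

print(abs(th) < 1e-3)       # pendulum regulated upright on the linear model
```

On the linearized model the PD loop converges cleanly; the experiments described next show why this success does not carry over to the real nonlinear plant under disturbances.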
We have implemented real-time controllers using the LabView package from National Instruments, Inc. and intend to compare the controller performances. In the data acquisition part, we have adopted a high-resolution, audio-frequency-range analog I/O NB-A2100 board for the Macintosh II family of computers. This board has 16-bit, simultaneously-sampled analog input with 64-times oversampling delta-sigma modulating ADCs and digital anti-aliasing filters for extremely high-accuracy data acquisition. The sensor for measuring the angle of the pendulum is a potentiometer. All the controllers are designed using the utilities in LabView and Think-C. From the viewpoint of controller design, the PD is the easiest controller to implement. However, in the real-time control process, a PD loses its stability due to the effect of disturbances and is very sensitive to changes in the plant parameters. This verifies the fact that it is hard to control a nonlinear system with a linear controller. Based on the experimental results, a fuzzy controller has a high capability to accommodate the nonlinearity and uncertainty problems in the control system. The membership function plays a very critical role in this control process. Once a good membership function is defined, the performance of a fuzzy controller will be very robust in the face of system uncertainty. However, one weakness is that fuzzy control lacks formal synthesis and analysis techniques. The control process depends upon the experience of human operators whose qualitative “rules of thumb” can be described as fuzzy decision rules. The sliding mode controller has the capability to handle modeling imprecision and disturbances based on a good robust nonlinear control algorithm. However, it may excite high-frequency dynamics when switching control is selected.

Figure 4: Real-Time Control block diagram.
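The high-frequency chattering risk of switching control is commonly softened by replacing the discontinuous sign function with a saturation inside a thin boundary layer. The simulation below sketches that idea on a toy double-integrator plant; the plant, gains, boundary-layer width, and disturbance are all illustrative assumptions, not the cart-pendulum rig.

```python
# Sliding-mode regulation with a boundary layer on a toy plant x'' = u + d.
# Surface: s = de + lam*e; sat() replaces sign() inside the boundary layer.
# All constants are illustrative placeholders.
import math

lam, k, phi = 2.0, 5.0, 0.05   # surface slope, switching gain, layer width
dt = 0.001

def sat(s):
    """Saturated switching term: linear inside the boundary layer."""
    return max(-1.0, min(1.0, s / phi))

x, v = 1.0, 0.0                 # tracking error and its rate
for i in range(10000):          # 10 s of simulated time
    s = v + lam * x             # sliding surface
    u = -lam * v - k * sat(s)   # equivalent control plus switching term
    d = 0.5 * math.sin(i * dt)  # bounded unmodeled disturbance
    v += (u + d) * dt           # double-integrator plant: x'' = u + d
    x += v * dt

print(abs(x) < 0.05)            # error held near zero despite the disturbance
```

Because the switching gain k exceeds the disturbance bound, the state is driven into the boundary layer and stays there, trading a small steady-state error for chatter-free control.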
Expert system control uses the same rules as fuzzy control, but, in general, it is discrete and cannot predict all the possibilities of system uncertainty with its search algorithm. Training the neural network control system is time-consuming because the backpropagation algorithm requires a long time to train the controller network. The neural controller does have satisfactory performance compared with the others. In this research, the simulations and experiments are conducted to illustrate the possible and actual performance of each controller. There is not enough reason to conclude that one controller is the best. The research indicates the weaknesses and strengths of each controller and provides a basic concept for choosing a controller for real-time applications.

2 Relationships to On-Going Projects

Currently the principal investigators are involved in several projects with immediate relevance and significance to these research areas. The projects providing the technology base are with the Department of Energy (DOE), the State of Texas Energy Research Application Program (ERAP), and the National Science Foundation (NSF). The DOE project is focusing on enhanced diagnostics and control of nuclear power plant components through the use of ANNs. The major emphasis is on local (component) diagnostics and model predictive control. The project uses a recurrent multilayer perceptron and an associative memory for diagnostics and control. Accurate computer models of nuclear power plants and plant data can be made available to the proposed NSF project. The ERAP project objective is to reduce the consumption of fossil fuels used by electric-generating power plants through a combined thermal optimization and control approach based on ANNs.
The plan to achieve that goal involves developing general-purpose optimization algorithms that can be used to optimize the thermal set-point profiles of the plant variables, and redesigning the plant control systems using a multivariable control approach to enhance the plant transient responses to various input disturbances. A hierarchical control is proposed that will be implemented in three phases. The NSF project addresses system fault diagnosis issues associated with the operation of power systems, using Adaptive Critic NNs. The approach includes studies toward the augmentation of the Adaptive Critic (AC) into an integrated diagnostics approach which incorporates an ART network as a trainer and Competitively Inhibited Neural Networks (CINNs) as critics. These networks are used for incipient fault detection, where the physical relevance of the network data to the process is crucial. Neuro Bond Graph (NBG) is an offspring tool developed as part of this task, but due to resource limitations, more funds are needed to support this effort and carry these ideas through validation and implementation. For classification, growing networks are being developed to incorporate new (never encountered) situations.

3 Research Facilities

For Neuro Engineering research, a parallel processor consisting of 12 INMOS T805 transputers is hosted by a Macintosh IIci with an external chassis. Software has been developed, and development continues, for parallel as well as sequential processors. The software developed, DyNet and µNet, can emulate virtually most Neural Network paradigms. They are flexible, user-friendly shells with graphic interfaces that are portable to a variety of platforms. So far there are working versions on Macintosh, IBM PC, RISC 6000, and Sun with Motif under X-Windows.
For real-time control there is an i386 machine with 3 Banshee Mother Boards (Atlanta Signal Processors Inc.), each with a TMS320C30 Digital Signal Processing chip and an A/D daughter board. There are also a Macintosh IIci and IIfx with National Instruments LabView software, 2 TMS320C30 Digital Signal Processing (DSP) boards, Direct Memory Access boards connected through the Real-Time System Interface (RTSI), and an Audio Frequency Fourier Analyzer (AFFA) board for fast data acquisition and processing. The Mechanical Engineering Department at The University of Texas at Austin has NeuroComputing facilities with extensive applications and graphics software. Additionally, the main campus of The University of Texas at Austin provides various digital computers and extensive software. The following is a partial list of some of the available computer hardware:

Machine                                                 Number
Cyber 170/750                                           2 (dual)
Convex C120                                             1
Cray X-MP/24                                            1
Cray X-MP/14se                                          1
DEC VAX 11/750                                          6
DEC VAX 11/785s                                         4
DEC VAX 11/875                                          2
DEC VAX 6410                                            1
DEC VAX 8300                                            2
DEC VAX 8800                                            1
Encore UNIX System                                      8 NS32332 CPUs
IBM 3081-D                                              1
IBM 3090/200E                                           1
IBM SYSTEM 36                                           1
LEVCO Super Server (12 INMOS T-805 Transputers each)    2
Macintosh IIfx                                          2
Macintosh IIci                                          2
Macintosh Quadra 900                                    1
Sun-2/170                                               3
RISC 6000                                               10

4 Research Personnel

The Neuro Engineering Research and Development (NERD) Laboratory at The University of Texas at Austin has been established and is devoted to exploring new neural network architectures, developing fast and robust learning rules, and applying ANNs to solve real-world engineering problems. Currently, the NERDLab has a staff of 12, actively working on ANN research in the areas of system identification, optimization, fault diagnosis, algorithms, controls, and manufacturing. Faculty participating in the NERDLab are well known in their fields and show a strong commitment to the technology. Dr.
Benito Fernández, an Assistant Professor in ME at The University of Texas at Austin, has worked in modeling, simulation, and control of nonlinear systems for the past 11 years. His areas of specialty are multivariable nonlinear control (he has published 8 papers in this area) and neural networks for nonlinear system identification (he has 8 papers in this area). He has obtained funding from DOE, ERAP, and NSF in the area of neural networks applied to prediction, optimization, and control of power systems. He has developed computer codes for this purpose. Dr. Fernández has technical degrees in Chemical, Materials, and Mechanical Engineering. Dr. Michael D. Bryant, an Associate Professor in ME at The University of Texas at Austin, has worked on tribology-related research for the past 13 years. His main research interests are in tribology, surface contact, system dynamics, and modeling. Dr. Bryant is a PYI (Presidential Young Investigator Award) recipient, has published approximately 20 technical papers on these topics, has served in various capacities in national and international meetings on the topic, and has consulted for several companies. He is especially noted for his research activities in surface contact.

5 Recent Publications

[1] Bryant, M.D., B. Fernández, N. Wang, V.V. Murty, V. Vadlamani, T.S. West, “Active Vibration Control in Structures using Magnetostrictive Terfenol with Feedback and/or Neural Network Controllers”, Conference on Advances in Adaptive Sensory Materials and Their Applications, Blacksburg, VA, April 27-29, 1992. Also submitted to the Journal of Intelligent Material Systems and Structures.

[2] Fernández, B., “NeuroBondGraph: Development Environment for Modeling of Nonlinear Dynamic Systems using Neural Networks and Bond Graphs,” Artificial Neural Networks In Engineering Conference, ANNIE’92, Nov. 15-18, 1992, Saint Louis, MO.

[3] Fernández, B.
“Tools for ANN Learning”, Artificial Neural Networks In Engineering Conference, ANNIE’91, Nov. 10-13, 1991, Saint Louis, MO.

[4] Chang, W.R., R.K. Tumati, and B. Fernández, “Improving Machining Precision in Turning Process Using Artificial Neural Networks”, ASME Journal of Dynamic Systems, Measurement, and Control, 1992. (Submitted for publication.)

[5] Fernández, B., “NeuroBondGraphs: Physically-based Modeling of Nonlinear Dynamic Systems using Neural Networks and Bond Graphs”, ASME Journal of Dynamic Systems, Measurement, and Control, 1992. (Submitted for publication.)

[6] Fernández, B., “Why are Neural Networks Suitable for Nonlinear Systems?”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 90-NL-001. October 1990.

[7] Fernández, B., “About Learning in Neural Networks”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 90-NL-002. October 1990.

[8] Fernández, B., “DyNet: Shell for Developing Artificial Neural Networks”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 90-NL-003. November 1990.

[9] Fernández, B., “Neural Space Representation of Nonlinear Systems Dynamics”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 90-NL-004. November 1990.

[10] Chang, W.R., R.K. Tumati, and B. Fernández, “Improving Machining Precision in Turning Process Using Artificial Neural Networks”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 91-NL-005. October 1991.

[11] Fernández, B., R.K. Tumati, “A Literature Survey of Electro Rheological (ER) Fluids & its Applications in Engineering”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 91-NL-006. November 1991.

[12] Bryant, M.
D., B. Fernández, N. Wang, V.V. Murty, V. Vadlamani, T.S. West, “Active Vibration Control In Structures Using Magnetostrictive Terfenol With Feedback And/or Neural Network Controllers”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 91-NL-007. December 1991.

[13] Fernández, B., “µNet: An Interactive Environment for the Development of Networks”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 91-NL-008. December 1991.

[14] Fernández, B., R.K. Tumati, “A Review on Fixture Design Automation”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 91-NL-009. February 1992.

[15] Fernández, B., “NeuroBondGraphs: Physically-based Modeling of Nonlinear Dynamic Systems using Neural Networks and Bond Graphs”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 92-NL-010. February 1992.

[16] Fernández, B., R.K. Tumati, “Applications of Artificial Neural Networks in Manufacturing”, NERDLab (NeuroEngineering Research & Development Laboratory), The University of Texas at Austin, Austin, Texas. T.R. 92-NL-011. August 1992.

References

[1] Chu, R., Shoureshi, R., and Tenorio, M., “Neural Networks for System Identification,” Proceedings of the American Control Conference, Pittsburgh, PA, pp. 916-921, June 1989.

[2] Cybenko, G., “Approximation by Superposition of a Sigmoidal Function,” Mathematics of Control, Signals, and Systems, 2:303-314, 1989.

[3] Fernández, B., “Tools for Artificial Neural Networks Learning,” in Intelligent Engineering Systems Through Artificial Neural Networks, C.H. Dagli, S.R.T. Kumara, and Y.C. Shin, Editors, Proceedings of the Artificial Neural Networks in Engineering (ANNIE’91) conference, Nov. 10-13, 1991, St. Louis, MO.
[4] Hecht-Nielsen, R., “Theory of the Backpropagation Neural Network,” Proceedings of the International Joint Conference on Neural Networks, Vol. I, pp. 593-605, 1987.

[5] Narendra, K.S. and Parthasarathy, K., “Identification and Control of Dynamical Systems Using Neural Networks,” IEEE Transactions on Neural Networks, Vol. 1, No. 1, March 1990.

[6] Rumelhart, D.E., J.L. McClelland, and the PDP Research Group, Editors, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Vol. I: Foundations, MIT Press/Bradford Books, Cambridge, MA, 1986.