Predictive human operator model to be utilized as a controller... neuro-fuzzy and fuzzy-ARX modeling techniques

ARTICLE IN PRESS Engineering Applications of Artificial Intelligence 23 (2010) 595–603 Contents lists available at ScienceDirect Engineering Applications of Artificial Intelligence journal homepage: www.elsevier.com/locate/engappai Predictive human operator model to be utilized as a controller using linear, neuro-fuzzy and fuzzy-ARX modeling techniques Ozkan Celik ,1, Seniz Ertugrul Department of Mechanical Engineering, Istanbul Technical University, Inonu Caddesi, No. 87 Gumussuyu, 34437 Istanbul, Turkey a r t i c l e in fo abstract Article history: Received 10 April 2009 Received in revised form 31 July 2009 Accepted 28 August 2009 Available online 12 November 2009 Modeling human operator’s behavior as a controller in a closed-loop control system recently finds applications in areas such as training of inexperienced operators by expert operator’s model or developing warning systems for drivers by observing the driver model parameter variations. In this research, first, an experimental setup has been developed for collecting data from human operators as they controlled a nonlinear system. Appropriate reference signals and scenarios were designed according to the system identification and human operator modeling theory, to collect data from subjects. Different modeling schemes, namely ARX models as linear approach, and adaptive-networkbased fuzzy inference system (ANFIS) as intelligent modeling approach have been evaluated. A hybrid modeling method, fuzzy-ARX (F-ARX) model, has been developed and its performance was found to be better in terms of predicting human operator’s control actions as well as replacing the operator as a stand-alone controller. It has been concluded that F-ARX models can be a good alternative for modeling the human operator. & 2009 Elsevier Ltd. All rights reserved. Keywords: Human operator Predictive modeling Fuzzy-ARX model Human controller model Fuzzy identification 1. Introduction A human carries out many different control tasks that range from simple manipulation to relatively more complex tasks such as operating a crane for carrying loads, driving a vehicle or piloting an airplane. Human operator (HO) models become an important tool when one needs to design a system to be controlled by a human (Sheridan and Ferrell, 1974). The model provides constraints for the design and the system can be tested with the model at hand in a closed-loop fashion if the model is accurate and reliable enough. Earlier studies in this area were motivated by a need for pilot models in order to utilize during the design phase of warplanes (McRuer, 1980). Some more recent studies on human operator modeling concentrate on obtaining models for drivers’ control behavior (Hess and Modjtahedzadeh, 1990; Prokop, 2001; Delice and Ertugrul, 2007; Macadam, 2003; Pellecchia et al., 2005). The ultimate and challenging goal of driver modeling is to obtain a robust and reliable model that can eventually replace the human driver. However, these models can more quickly find application Corresponding author. Now with Department of Mechanical Engineering and Materials Science, Rice University, MEMS MS-321, 6100 Main Street, Houston, TX 77005, USA. Tel.: +1713 348 2300; fax: +1713 348 5423. E-mail addresses: celiko@rice.edu (O. Celik), seniz@itu.edu.tr (S. Ertugrul). 1 This manuscript is based on the M.Sc. Thesis of O. Celik prepared with advisory of Dr. S. Ertugrul at Istanbul Technical University. 0952-1976/$ - see front matter & 2009 Elsevier Ltd. All rights reserved. doi:10.1016/j.engappai.2009.08.007 areas as tools for implementing warning systems that would alert the driver when it detects driver fatigue by keeping track of several model parameters and comparing them with a nominal model (Pilutti and Ulsoy, 1999; An and Harris, 1996; Ungoren and Peng, 2005). Another application for human operator models is utilizing them for human skill transfer. If a successful model that approximates and predicts human operator’s control behavior well enough can be obtained, then a model of an expert operator could be used as a teacher for novice operators by demonstrating what the expert operator’s control inputs would be throughout the control task. This approach has been applied by utilizing an artificial neural network model in Nechyba and Xu (1995). More examples of possible similar applications can be found in Nechyba and Xu (1997). A different approach is presented in Vingerhoeds et al. (1995) to the use of hybrid artificial intelligence. In general, there are two main alternatives to obtain a dynamic model; either based on knowledge or assumptions on the dynamics of the system (dynamic modeling) or based on recorded data and system identification techniques. It should be noted that, although the term ‘‘modeling’’ usually refers only to the dynamic modeling approach, throughout this paper it will be used in a context that covers both dynamics-based and identification-based methods and that all methods used in this study are identification-based. Human behavior as a controller is generally quite nonlinear (Sheridan and Ferrell, 1974). Human operators are able to gain ARTICLE IN PRESS 596 O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 experience and learn by repeating control tasks and improve their behavior. Human operators can adjust themselves according to the changes in the dynamics of the system that they control. These aspects allow them to accomplish very different kinds of control tasks while making the modeling more difficult compared to other dynamic systems. Although the control behavior of humans can be accepted as quasi-linear under specific circumstances, these models require the satisfaction of important and extensive limitations (Sheridan and Ferrell, 1974; McRuer, 1980). In order to obtain a model that is valid for a wide range of outputs of the plant under human control or for plants that have nonlinear or nonstationary dynamics one needs to use nonlinear modeling techniques. Advancements in soft computing based identification and modeling methods which includes artificial neural networks (ANNs) and fuzzy logic (FL) triggered a more widespread use of these methods in modeling of nonlinear dynamic systems (Abonyi, 2003; Narendra and Parthasarathy, 1989). Hybrid modeling schemes that take advantage of both ANNs and FL have also been proposed in Jang (1993), Jang et al. (1997) and Nelles (2001). This study had two main purposes: (1) construction of an experimental setup that allows implementation of previous modeling attempts by Ertugrul that used computer based experiments and simulations only (Ertugrul and Hizal, 2005; Ertugrul, 2008), (2) investigating the possibility of utilizing local ARX models by fuzzy transition modeling approach, called fuzzy switching ARX (F-ARX) together with other well-known models such as linear ARX and ANFIS using the constructed setup and compare the performances in terms of prediction capability. Main contributions of this study are the development and the application of F-ARX modeling scheme to human operator modeling and proposing an alternative for decomposition of the operating region to use as the antecedents of the fuzzy inference system in an F-ARX model. In this paper, first, an experimental setup that has been constructed for the purpose of conducting data acquisition sessions with human operators is explained. The setup was used to capture human operators’ control actions while they were in charge of controlling a nonlinear system. The setup’s properties are given in Section 2. The data acquisition protocol with five subjects using this system is presented in Section 3. Among three different methods, initially the linear system identification techniques have been utilized to obtain AutoRegressive with eXogenous (ARX) inputs structured models. The second modeling method made use of the developed ARX models by means of a fuzzy inference system that allowed appropriate transitions between linear ARX models (a fuzzy switching ARX or F-ARX model). The last method was ANFIS, which produced a fuzzy inference system model by training an initial model structure using the data. These modeling techniques are covered in Section 4. Results obtained by different models and observations on the setup after replacing the human operator by the models are given in Section 6. Subsequently, findings and future work are discussed. 2. Experimental setup A flexible arm with dimensions of 2 mm 3 cm 50 cm is attached to a 12 V brush DC motor with gear box forms the mechanical part of the system to be controlled by a human operator. The flexible arm moves in the horizontal plane only. The bottom position of the arm is measured using a potentiometer. The complete mechanical setup can be seen in Fig. 1. A similar system was used by Sasaki et al. (1990) for similar purposes, Fig. 1. Mechanical setup: the plant to be controlled by operators is a flexible arm mounted to a DC gearmotor shaft. Weights are added on the tip to facilitate oscillations. however their system had a built-in closed-loop position controller and operators only changed the reference input to the system via a potentiometer. Our system differs in several aspects, the most important one is that the human operator is the only active controller in the closed loop in our setup. Operator uses the x-axis of a PC joystick. Joystick command is read into the computer and a proportional output is sent to DC motor. Operators do not observe the plant directly while they are controlling it; instead, they observe an online video of the system provided by a color camera (Sony SSC-DC18P) assembled above the plant. This video of the setup together with other components of the operator interface is displayed on a 17" computer screen. Operator uses only the x-axis of a standard PC joystick (Logitech Extreme 3D Pro) to control the direction and the voltage to be applied, hence approximately the speed of the motor. The joystick is mounted on top of a stand so that operators can hold it while their arms are on the armrest of the chair. The video is acquired at 720 576 pixels resolution and then cropped to a size of 493 300 pixels which corresponds to the area of interest. A white semicircle that is slightly larger than the trace that the tip position follows along the 03 21803 workspace is drawn on the video in software together with a white ‘‘þ’’ sign that illustrates the position of the rotation axis of the arm. The reference signal which appears as a 53 interval in which the tip of the arm should be positioned is also displayed on the video by two short yellow lines at both ends of the intervals (see Fig. 2). The video is arranged such that a left deflection on the joystick causes a counterclockwise rotation of the arm and a right deflection causes a clockwise movement. A potentiometer for bottom position measurement is also used but it is utilized just as a safety limit switch on the two ends of the workspace. The acquired video is processed to measure the tip point position using image processing techniques. In order to improve the performance of this algorithm, the background is covered with black fabric as well as the base portion of the setup. The arm is marked with a 0.5 mm wide orange strip and a 1 cm 1 cm blue marker is placed on the tip. The weights on the tip are also covered with a cover that also held the blue marker on it. In the tip position measurement algorithm, first the blue layer is extracted and then the pixel with maximum value (hence the bluest pixel) is determined. The position is then converted to its degree equivalent using trigonometric relations. The algorithm performed satisfactorily, allowing all tasks to be run at 20 Hz loop rate. A computer that had a P4 3.0 GHz processor and 1 GB RAM and a PCI data acquisition card was used. All codes and interfaces were prepared in LabVIEW 7 Express. IVision LabVIEW Toolkit was utilized for the image gathering and processing portions of the ARTICLE IN PRESS O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 597 Fig. 2. Operator interface. The camera display has black background, the arm is orange and its tip is marked with a blue square. The white semicircle and the ‘‘ þ ’’ representing the rotation axis of the arm are imposed on the gathered video by image processing. The reference interval that the operator should place the tip of the arm in is demonstrated by two small yellow lines on the semicircle. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) program. More information on the setup and the interface can be found in Celik (2006). The task assigned to the operators is to control the tip position of the arm so that the tip point follows the appearing references within its working space of 1803 , and they should do this as fast as possible and with minimum oscillations. Even tough flexible arm control resembles a real world crane control system, compared to a real crane, this system is faster and exhibits higher frequency oscillations. In order to emphasize the importance of the task goals, i.e. positioning the tip of the arm in the reference interval quickly and without too many oscillations, and reflecting these goals to operators, a scoring scheme has been developed. According to this scheme, a score counter is set to an integer value that is proportional to the difference between the last reference position and the new one each time a new reference position appears on the screen. Then this score is decreased at a certain rate until the tip of the arm is satisfactorily positioned in the interval. This satisfaction criterion includes a threshold that the position error between the midpoint of the interval and the tip of the arm should stay below a specific value for a specific time. This threshold ensures that the oscillations are almost completely damped. Successful completion of a positioning task for a reference is indicated to the operator by a virtual LED above the video window. The score counter, the video window, total score indicator and the ‘‘target achieved LED’’ which constitute the operator interface are shown in Fig. 2. All signals are recorded during data acquisition sessions at 20 Hz sampling frequency. There are several nonlinear effects in the plant. The proportional voltage created by the joystick deflection generates a dead zone in the control loop, i.e. there is a range of control inputs on both directions that does not make the motor move. Once this threshold is exceeded, the speed of the motor is approximately proportional to the joystick deflection. Another much less pronounced nonlinearity stems from the backlash in the gearbox of the motor. This backlash causes a play that significantly changes the tip position of the arm when it oscillates. A friction element is placed between the arm and the base to keep the bottom position of the arm fixed during oscillations. But the backlash is still in effect when a change in movement direction occurs. These nonlinearities are left as is, since real world plants controlled by HOs also contain various nonlinearities and they are important from the perspective that they induce the nonlinear control behavior. 3. Data acquisition sessions Data for modeling purposes is collected from five operators, all male graduate students. Their ages varied between 25 and 28 and they were all computer users and had used a joystick before. At each session, operators completed five different 3-min length scenarios, namely 153, 303 , 453 , 603 , and random amplitude (R) scenarios. In a 153 scenario, reference positions (the midpoints of the reference intervals) appeared on 153 to the left or 153 to the right of the center (zero) position which was the complete vertical position of the arm on the screen. The other scenarios were similar, except that in the R scenario, the position of the reference signal was determined randomly between the 7 603 limits. The duration between two successive reference positions was random in all scenarios but the minimum was set to 2 sec since the task becomes unrealistic or too challenging for reference inputs that change more frequently than at every 2 sec. This lead to a restriction on the obtained models: they are valid only for inputs that have frequency content up to 0.5 Hz. In Fig. 3 a portion of reference position and arm’s tip positions recorded in an R scenario of Operator 3 is shown. Each operator completed all scenarios in the mentioned order above in each 15-min long session, and was required to complete 10 sessions in total. A specific schedule was not followed; all operators completed the sessions in less than three weeks. 4. Modeling methods Three different model structures have been used: linear system identification techniques with an ARX model structure, a fuzzy switching algorithm that provides switching among different linear ARX models and ANFIS. ARTICLE IN PRESS 598 O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 1 60 0.9 0.8 0.7 20 Coherence Position (degrees) 40 0 −20 0.6 0.5 0.4 0.3 −40 Reference position Displayed reference interval Arm’s tip position −60 0.2 0.1 0 10 20 30 40 50 60 Time (sec) Fig. 3. Example of a reference signal and the tip position of the arm while the setup was being controlled by Operator 3 in an R scenario. For all models, input and output signals are determined to be the position error between the reference position and the current position of the tip of the arm and the voltage applied to the motor (proportional to the deflection of the joystick), respectively. The outputs are scaled to the interval ½1; 1 while the inputs are scaled by a factor of 0.1, producing a comparable range for the input–output pair. The reference signals for the first four scenarios are pseudorandom binary sequences (PRBS) with a lowest possible duration limit of 2 sec (Soderstrom and Stoica, 1989). However, the assumed input to the model is different and the order of persistence excitation provided by the generated input signals should be confirmed. For this purpose, the covariance matrices of the inputs for a sample from each scenarios are calculated and it was found that the input signals in all scenarios were persistently exciting of order at least 10, allowing models of orders less than or equal to 10 to be estimated (Soderstrom and Stoica, 1989). For validating that a linear model could represent the HO reasonably well for the ARX model case, the linearity in HOs control actions was tested using coherence analysis (Bendat and Piersol, 1986). Coherence analysis provides the frequency range within which a linear model structure assumption would be valid. For data belonging to different operators and scenarios, it was found that the coherency between the input and the output always remained above 0.6 along the 0–0.5 Hz range. This lowest value varied between 0.6 and 0.8 for different data sets, implying that an almost linear relationship existed between input and output across this portion of the frequency spectrum. An example coherence plot where the minimum coherency value within the specified frequency range was 0.8 is given in Fig. 4. 4.1. ARX models According to the definition of quasi-linear human operator model given in Sheridan and Ferrell (1974), a model that is composed of a linear differential equation and an error term in the form of random noise should be sufficient to describe the dynamics of a human operator. The structure of the ARX model, as given in Eq. (1), is composed of a linear differential equation and error term, besides it has fewer parameters to be determined than ARMAX. Therefore, ARX model structure is chosen instead of ARMAX, ARMA, OE, or other structures as it showed the best prediction capability after estimating parameters for models of 0 10−1 100 Frequency (Hz) 101 Fig. 4. Example coherence plot for the input–output pair. For this specific data set, coherency is equal to or greater than 0.8 for the frequency range of 0–0.5 Hz. Testing with additional data sets revealed that coherency remained above 0.6–0.8 across the frequency range of 0–0.5 Hz for all scenarios and operators. different orders and also due to its simpler structure. In the ARX model structure, the output of the system at a specific time is assumed to be linear combinations of the previous outputs and inputs and the current input. A discrete-time designation of the ARX model is yðtÞ þ a1 yðt 1Þ þ a2 yðt 2Þ þ þ ana yðt naÞ ¼ b1 uðt nkÞ þ b2 uðt nk 1Þ þ þ bnb uðt nk nb þ 1Þ þ eðtÞ ð1Þ where t represents integer time steps, eðtÞ is the modeling error, y is the output, u is the input, ai ’s and bj ’s are model parameters to be estimated using the data, and na, nb and nk are the orders of the output, input and input–output delay, respectively (Ljung, 1999). na is the model order. Least squares estimation (LSE) algorithm is used for determining the model parameters (Soderstrom and Stoica, 1989). Using this structure, estimations are carried out and a parameter set is obtained for each scenario in each session for all operators. Then these values are averaged across the sessions to obtain a set of average parameters for each scenario of each operator. Trials with different model order values revealed that models with orders na ¼ 2 and nb ¼ 1 produced reasonably well fits to the data and further increase in order did not improve the results significantly. As for the order of the delay, generally an order of 6 or 7 gave the best results, which corresponded to 300 and 350 ms, noting that the sampling time is 50 ms. Since it is known that the reaction times for human motor actions varies between 200 and 300 ms, the order of the delay was found to be in good correlation with the literature (Sheridan and Ferrell, 1974). The delay order nk is selected to be 6 for all ARX models. This structure will be called ARX216 in short hereafter. 4.2. Fuzzy-ARX models The fuzzy-ARX (F-ARX) model represents a usual ARX structure whose model parameters are updated based on a fuzzy inference system, as illustrated in Fig. 5(a). This fuzzy transition scheme provides an interpolation and smooth transition between an infinite number of different local linear ARX models. The resulting model is a nonlinear ARX (NARX) model that can be considered as ARTICLE IN PRESS O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 599 a |Δr(t)| u(t) Fuzzy System q-nk y(t) ARX q-nk-1 q-1 q-nb-nk+1 q-2 q-na b Output membership functions Input membership functions |Δr| Rule Base ARX model parameters Defuzzification a1 Defuzzification a2 Defuzzification b1 Fig. 5. (a) F-ARX model structure. Fuzzy system updates the ARX model parameters based on the absolute reference value changes. (b) Schematic structure of the fuzzy system. Input is the absolute change in the reference and outputs are the ARX model parameters. a parameter varying model as well. The outputs of the fuzzy inference system are model parameters, as stated, while for the input(s) there are several possible choices: previous inputs/ outputs of the model, reference signal, or different combinations of these or their variations. This model structure is in resemblance to a fuzzy gain scheduling method (Zhao et al., 1993), however here the fuzzy transition is between local linear models as opposed to controller gains. There are several applications of fuzzy gain scheduling to mimic or attain human-like control behavior in the literature. Fuzzy gain scheduling was used to achieve smooth automatic lane-keeping of a vehicle in Wu et al. (2005) as well as the lateral control of the vehicle (Chiang et al., 2006). It was also applied on a car-like robot to obtain automated parallel parking (Chiu et al., 2005). In all of these studies fuzzy gain scheduling was used as part of tuning the parameters of the designed automatic controllers. Here we propose to use fuzzy transitions as part of a human operator model, which then can be used as a controller in closed-loop fashion. In Abonyi et al. (1999) and Abonyi and Babuska (2000), inputs of a fuzzy system model were selected to be a subset of the previous outputs and inputs of the system to be modeled, as in regular Takagi–Sugeno type fuzzy modeling of dynamic systems (Takagi and Sugeno, 1985). Thus the inputs are a subset of fyðt 1Þ; yðt 2Þ þ þyðt naÞ; uðt nkÞ; uðt nk 1Þ þ þ uðt nk nb þ1Þg, following the same notation in Section 4.1 (Abonyi et al., 1999). This selection leads to a direct fuzzy model of the system that interpolates between local ARX models. However, the identification scheme is not based on establishing individual local ARX models by separate parameter estimations for predefined working regions. It is based on either a global least squares estimation (LSE) or a local LSE that divides the problem into weighted LSE problems. A similar approach to the one in this study was given by Johansen (1994). Johansen defined the problem on a multipleinput multiple-output (MIMO) NARX model, stated that the modeling problem could be decomposed into local models in a set of operating regimes. Remarks on how to choose operating regions were made and it was mentioned that there were various alternatives based on system’s characteristics. In Hadjili and Wertz (2002), a fuzzy modeling method that is used to identify the IF- and THEN-parts of a Takagi–Sugeno model separately is proposed and explained. In the IF part identification, again the previous input and output data were utilized, however a systematic way of eliminating the components that are irrelevant by a fuzzy clustering algorithm was developed. Differing from the mentioned studies, in this study, the input for the fuzzy system is selected to be the absolute value of the change in the reference signal, as shown in Fig. 5(a). This selection provides a constant model between two consecutive reference positions, hence decreasing the computational load in implementation. Also, utilization of the reference signal which is always ARTICLE IN PRESS 600 O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 available increases the applicability of this modeling method to different systems. One of the rules for F-ARX model can be given as 3 IF the absolute change in Reference is 15 THEN a1 ¼ X1 ; a2 ¼ X2 ; a3 ¼ X3 ð2Þ For the F-ARX model structure of concern in this study, a fuzzy inference system is designed with four input membership functions (two trapezoidal and two triangular) that correspond to each reference input case, and four output membership functions (fuzzy singletons) that are equal to the average model parameters obtained in the previous ARX modeling step. A schematic of this fuzzy system is shown in Fig. 5(b). Hence, a two step modeling procedure was followed for the FARX modeling method in this study, similar to the decoupled IFand THEN-part identification method in Hadjili and Wertz (2002). First the local linear ARX models were identified based on each scenario, then a fuzzy inference system were designed to establish the transitions between the local models. 4.3. ANFIS models A problem that arises during pure fuzzy modeling is the extensive number of free parameters and number of rules that should be adjusted by trial and error (Zapata et al., 1999). An approach that utilized ANNs for solving this problem has been proposed by Jang (1993). In this hybrid modeling method called ANFIS, an initial fuzzy inference system is manipulated and adapted by training with the data collected from the system to be modeled. Initial system is a first order Takagi–Sugeno fuzzy system. The membership functions get adjusted through the training by gradient descent in backward passes while the coefficients of the first order output polynomial are tuned by least squares method. When the training is complete, the final model is obtained as a stand-alone fuzzy inference system. Further information on ANFIS can be found in Jang (1993) and Jang et al. (1997). ANFIS modeling scheme was applied for modeling HO actions in Ertugrul and Hizal (2005), Ertugrul (2008) and found to give successful results. The ANFIS model structure used in this study was selected to be equivalent to the ARX216 in order to allow fair comparison in terms of prediction performance. Same input–output pair, namely, the position error between the reference position and the tip position of the arm and the control applied by the HO were used in this modeling method. Thus the inputs of the ANFIS model were ^ fyðt 1Þ; yðt 2Þ; uðt 6Þg and its output was yðtÞ. Gaussian membership functions (MFs) given in the literature as 2 mðx; s; cÞ ¼ eðxcÞ =2s2 ð3Þ were used in this study. Number of MFs to be used for each input is another parameter that should be chosen for an ANFIS model. The number of rules that will be trained is given by mn, where m denotes the number of MFs for each input and n denotes the number of inputs. Consequently, although increasing the number of MFs leads to a better approximation, training time will increase significantly. In this study, initial models with two and three MFs were used, but the model with three MFs was chosen due to its slightly better performance. The training process was limited by 10 epochs. There are many parameters that should be determined for an ANFIS model and arriving at the optimal combination can be time consuming. Here only the structure that is equivalent to the ARX216 model was considered. A wide set of ANFIS models with different combinations were tested with the purpose of HO modeling in Ertugrul (2008). 5. Prediction performance of models Models obtained by different methods are compared in terms of their success in predicting HOs’ actions for varying prediction horizons. If Y t ¼ fyðtÞ; yðt 1Þ; . . .g represents the information available at time t, the equation ^ þ kjtÞ þ a1 yðt ^ þ k 1jtÞ þ þana yðt ^ þ k najtÞ ¼ b1 uðt þ k nkÞ yðt þ b2 uðt þ k nk 1Þ þ þ bnb uðt þ k nk nb þ1Þ ð4Þ can be used to simulate and predict the output of the model at ^ þ kjtÞ represents time t þ k (Soderstrom and Stoica, 1989). Here yðt the value of the simulated output at time t þ k, in which the information in Y t is used. It should be noted that ( ^ þ ijtÞ if i 40 yðt ^yðt þ ijtÞ ¼ ð5Þ yðt þ iÞ if i r0 During simulations, all initial conditions are assigned as zero. Prediction performance of the models is evaluated using the formulation Jy yr J ð6Þ Percent fit ¼ 100 1 m Jyr y r J where ym is the model’s output vector, yr is the real output vector (recorded from the HO), and y r is the mean value of the real output vector (Ljung, 1999). All models are evaluated for their one, five and infinite step prediction performances. Infinite step prediction is equivalent to pure simulation of the model with zero initial conditions. 6. Results Mean and standard deviation values for 10 sessions of model parameters in the ARX216 model for all scenarios are given for Operators 1 and 2 in Table 1. The table is limited to only two operators’ data, since it has been observed that both mean and standard deviation values of other operators did not show major difference from the data set presented here. As it is apparent from the table, parameters estimated using different operators’ data were very similar. Also, relatively small values of the standard deviations imply that the estimated parameters did not vary considerably throughout 10 sessions. Furthermore, although b1 exhibits some variability for different scenarios, parameters did not vary significantly with respect to the scenarios. All these observations lead us to consider the possibility of obtaining a single model with constant parameters suitable for all cases. Table 1 Means and standard deviations (m 7 s) of ARX216 model parameters of Operators 1 and 2 for 10 sessions. Operator 1 Scenario 3 15 303 453 603 R Operator 2 Scenario 3 15 303 453 603 R a1 a2 b1 1:583 7 0:053 1:573 7 0:059 1:571 7 0:045 1:559 7 0:047 1:564 7 0:061 0:679 7 0:048 0:660 7 0:048 0:640 7 0:039 0:618 7 0:045 0:640 7 0:056 0:01554 7 0:00176 0:01125 7 0:00299 0:00730 7 0:00182 0:00531 7 0:00085 0:00905 7 0:00127 a1 a2 b1 1:629 7 0:030 1:584 7 0:028 1:544 7 0:034 1:516 7 0:031 1:599 7 0:027 0:729 7 0:025 0:686 7 0:018 0:637 7 0:023 0:596 7 0:023 0:685 7 0:020 0:01792 7 0:00178 0:01457 7 0:00210 0:01046 7 0:00143 0:00769 7 0:00091 0:01101 7 0:00114 ARTICLE IN PRESS O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 Table 2 Prediction performance for ARX models obtained from different scenarios and simulated on a randomly selected (sixth session) 153 scenario data of Operator 2. Scenario from which the parameters are estimated 153 303 453 603 R Parameters Prediction performance (%) a1 a2 b1 One step Five step Infinite step 1.629 1.584 1.544 1.516 1.599 0.729 0.686 0.637 0.596 0.685 0.01792 0.01457 0.01046 0.00769 0.01101 89.4 89.1 88.8 88.5 89.0 60.8 57.0 52.5 49.9 55.5 55.5 49.7 40.5 33.0 45.3 Table 3 Prediction performance of the model derived from the R scenario data of Operator 2 (a1 ¼ 1:599; a2 ¼ 0:685; b1 ¼ 0:01101), simulated with the R scenario data of the last session of each operator. Operator data 1 2 3 4 5 Prediction performance (%) One step Five step Infinite step 88.2 88.7 93.3 89.8 90.8 64.2 63.5 67.7 63.7 61.7 58.5 59.9 56.9 55.3 50.7 A single global model is highly desirable since it facilitates the utilization and implementation of the obtained model for control or prediction purposes, as in a model predictive controller. The feasibility of a single constant model is further investigated by simulating different models (models based on different scenarios) on a 153 scenario data recorded from Operator 2. Operator’s recorded response and the predicted response by the model for different horizons are used to calculate the prediction performance values (see Section 5). Table 2 summarizes the results of the described procedure for Operator 2. Estimations based on R scenario data provide an average model, since these estimations are based on the data calculated from random amplitude reference scenario. All models perform equally well based on their one step prediction performance. However for five step and especially for infinite step, prediction performances become considerably different and the average model is not satisfactory. In Table 2, it is apparent that small changes in model parameters has a significant impact on the infinite step prediction performance. Although the model obtained from the random scenario or the average model that has average coefficients calculated from the first four scenarios are successful to some extent, approximately a 10% decrease in performance occurs for infinite step predictions compared to the model derived from 153 scenarios only. These results motivated utilization of an F-ARX model structure that could switch to the best set of parameters for an approaching reference signal that should be followed. In the case of generalization capability of a single model for different operators, the average model derived from the R scenario data of Operator 2 has reasonably well results for the R scenario section of the last sessions of all operators, as summarized in Table 3. The reason for selecting Operator 2 for a base model is that he is assumed to be one of the two most expert operators by outperforming others in terms of average total points, hence the 601 Table 4 Prediction performance of the F-ARX model based on Operator 2’s ARX216 models, simulated with the random scenario data of the last session of each operator. Operator data 1 2 3 4 5 Prediction performance (%) One step Five step Infinite step 88.0 88.9 93.3 90.0 90.7 64.4 65.9 69.2 65.8 60.5 60.7 63.6 61.4 59.9 51.6 most successful controller for the defined control task. A onetailed paired t-test for mean total points across 10 sessions revealed that Operators 1 and 2 performed significantly better than all other operators [p o0:05] and the performance of Operators 1 and 2 was not significantly different [tð9Þ ¼ 0:025; p ¼ 0:490]. Since five step and infinite step predictions using a single constant ARX model were not satisfactory, an F-ARX model was obtained as described in Section 4.2, again using Operator 2’s ARX models. The prediction capacity of this model is evaluated and given in Table 4. The prediction results for only the R scenarios of each operator’s last session is considered descriptive enough, since this scenario included random amplitude references throughout the whole range. The prediction capacity of a single average model given for the same set of data in Table 3 allows a comparison. Comparing the results in Tables 3 and 4 reveals that although the F-ARX model generally performs better in infinite step predictions, the difference between the performances of two models remain around 1–4%. A one-tailed paired t-test on the mean difference between the infinite step prediction performance, the F-ARX model performed significantly better than the single average ARX model [tð4Þ ¼ 4:455; p o0:05]. Results in Table 4 offer that the F-ARX model carries the ability to produce equally well results for different scenarios and operators. To further illustrate this, the same data set used to produce Table 2 was used to evaluate the F-ARX model. F-ARX model demonstrated 89.4%, 59.8% and 53.7% fit values for one step, five step and infinite step prediction horizons, respectively. Especially an increase in fit around 8% in infinite step simulations can be accepted as a considerable gain compared to the single constant average ARX216 model’s performance. The prediction performance of the ANFIS model for different operators’ random scenario data in their last session is given in Table 5. It is observed that the ANFIS model was not as successful as other models. A single-tailed paired t-test indicated that the infinite step prediction performance of the ANFIS model was significantly lower than both the average ARX model [tð4Þ ¼ 3:640; p o0:05] and the F-ARX model [tð4Þ ¼ 4:525; p o 0:05]. The ANFIS model based on Operator 2’s random scenario data in the last session demonstrated problems in the simulation. An output that alternated between values well above or below the upper and the lower limits (1 and 1) in a very fast manner for short intervals of time was occasionally produced by the model. When the same model structure is trained by using Operator 1’s data (the other most expert operator), this problem occurred less frequently but it did not completely vanish. Due to this aspect, the ANFIS model evaluated in Table 5 is based on Operator 1’s data. Also, the output of the model was saturated with the thresholds 1 and 1 in the simulations in order to limit the percent fit error values for fair comparison and later to implement the model as the active controller on the setup. ARTICLE IN PRESS 602 O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 ARX216 Model Table 5 Prediction performance of the ANFIS model based on Operator 1’s last session random scenario data, simulated with the random scenario data of the last session of each operator. HO and Model Outputs (normalized to [−1,1]) 1 2 3 4 5 F−ARX Model Reference position Reference interval Tip position 40 Prediction performance (%) One step Five step Infinite step 85.7 86.4 91.4 84.5 87.2 66.1 60.3 65.4 58.1 60.1 55.7 52.3 53.7 51.4 49.3 Position (degrees) Operator data ANFIS Model 60 20 0 −20 −40 1 −60 0.5 0 10 20 30 40 50 60 Time (sec) 0 Fig. 7. Behavior of the setup in closed-loop control, with the developed HO models integrated as the controller. −0.5 Original HO output Constant ARX216 model’s output F−ARX model’s output ANFIS model’s output −1 0 5 10 15 Time (sec) Fig. 6. HOs (Operator 3) output for the random scenario in the last session and the simulation results of the three models. Constant ARX216 model uses the parameters given in Table 3. To better illustrate the imitating capabilities of the different models, the simulations with zero initial conditions of all three models are given together with the recorded output of a human operator in Fig. 6. Data belong to the random scenario section of the last session of Operator 3. Although the infinite step prediction performances of the models remain around 50–60%, it is seen in this figure that all models effectively reflect the basic aspects of the actions taken by the HO. F-ARX outputs are generally the closest ones to the HOs, followed by the constant ARX model whereas ANFIS model’s outputs are relatively less representative of the original signal. These observations are in agreement with the results summarized in Tables 3–5. ANFIS is a promising method for HO modeling as reported in Ertugrul (2008). The relatively low performance of ANFIS model in the application presented here may be due to the following: (1) The data set used to train the model was not sufficiently rich. A rich and representative training data set is an important step for obtaining successful ANFIS models. (2) Imposing the model structure ARX216 on the ANFIS model probably prevented the best possible model to be obtained. The fast alternating outputs problem was a major factor that affected the prediction performance of this type of model. This problem can potentially be eliminated if the two limitations above were overcome and it is decided to be addressed in a future study. 7. Implementation of the models to replace HO as controller The obtained models were implemented as controllers in closed loop to replace the HO. While ARX and F-ARX models were successful in replacing the HO, ANFIS model displayed the problem mentioned above. All models were occasionally able to position the tip of the arm in the interval without inducing many oscillations, while the expert operators could almost always do. In Fig. 7, closed-loop behavior of the setup under the control of the implemented models is given. The 20 sec portions randomly selected from an R scenario test run are given for each model. None of the models could perform precisely as well as a HO, in the context of the control task defined in Section 2. All models suffered from a common problem. For small reference value changes, i.e. when a new reference position appeared that was close to the last reference position, the models did not respond. The position error at these instances was not large enough to make the controller produce an output that could move the motor. This property can be observed in Fig. 7, at the ends of the 20 sec portions for the ARX216 and the F-ARX model. ANFIS model seemed to suffer less from this issue, as it is demonstrated in Fig. 7. Apparently, this problem originated from the dead zone in the DC motor, which was mentioned in the Section 2. This hard nonlinearity in the setup found its reciprocal in the HO; HOs learned the thresholds of this dead zone and responded by just merely exceeding the limits of the zone when fine movements or corrections were needed. A nonlinear element that resembles this behavior of the HO can be explicitly introduced in the models to overcome this problem as a future extension of this study, following a similar approach that was used in Chen and Ulsoy (2000). 8. Conclusion An experimental setup that allowed the implementation of a human operator control task has been used in this study. The setup is used to collect data from five operators that performed a defined task for 10 sessions each being 15 min long. Each session also included five different scenarios which were later used to implement different local and global models. Modeling methods included well-known linear identification techniques with an ARX model structure for local models, a fuzzy switching model that provided smooth transition between appropriate local ARX models (the F-ARX model), and the adaptive network-based fuzzy ARTICLE IN PRESS O. Celik, S. Ertugrul / Engineering Applications of Artificial Intelligence 23 (2010) 595–603 inference system (ANFIS) model. Models were evaluated and compared based on their prediction performances as well as their performance after being implemented on the system as a replacement for the HO. Prediction performances of all models were found to be relatively successful as reported in Tables 3–5. Developed F-ARX models performed better than both ARX and ANFIS models. The nonlinearity in the HOs behavior that was capable of overcoming the dead zone in the plant was not captured by any of the models, which is a direction for future work. Another important direction for future work is the on-line training of F-ARX model and taking over the control from HO at any desired moment during real-time operation. References Abonyi, J., 2003. Fuzzy Model Identification for Control. Birkhauser, Basel. Abonyi, J., Babuska, R., 2000. Local and global identification and interpretation of parameters in Takagi–Sugeno fuzzy models. In: Proceedings of the IEEE International Conference on Fuzzy Systems, San Antonio, TX, USA, May, pp. 835–840. Abonyi, J., Babuska, R., Setnes, M., Verbruggen, H.B., Szeifert, F., 1999. Constrained parameter estimation in fuzzy modeling. In: Proceedings of the IEEE International Conference on Fuzzy Systems, Seoul, Korea, August, pp. 951–956. An, P.E., Harris, C.J., 1996. An intelligent driver warning system for vehicle collision avoidance. IEEE Transactions on Systems, Man and Cybernetics, Part A 26 (2), 254–261. Bendat, J.S., Piersol, A.G., 1986. Random Data: Analysis and Measurement Procedures. Wiley, New York. Celik, O., 2006. Intelligent modeling of human operators. Master’s Thesis, Istanbul Technical University, Istanbul (in Turkish). Chen, L., Ulsoy, A.G., 2000. Identification of a nonlinear driver model via NARMAX modeling. In: Proceedings of the American Control Conference, vol. 4, Chicago, IL, June, pp. 2533–2537. Chiang, H.H., Ma, L.S., Perng, J.W., Wu, B.F., Lee, T.T., 2006. Longitudinal and lateral fuzzy control systems design for intelligent vehicles. In: Proceedings of the 2006 IEEE International Conference on Networking, Sensing and Control, 2006, ICNSC’06, pp. 544–549. Chiu, C.S., Lian, K.Y., Liu, P., 2005. Fuzzy gain scheduling for parallel parking a carlike robot. IEEE Transactions on Control Systems Technology 13 (6), 1084–1092. Delice, I.I., Ertugrul, S., 2007. Intelligent modeling of human driver: a survey. In: 2007 IEEE Intelligent Vehicles Symposium, pp. 648–651. Ertugrul, S., 2008. Predictive modeling of human operators using parametric and neuro-fuzzy models by means of computer-based identification experiment. Engineering Applications of Artificial Intelligence 21 (2), 259–268. Ertugrul, S., Hizal, N.A., 2005. Neuro-fuzzy controller design via modeling human operator actions. Journal of Intelligent and Fuzzy Systems 16 (2), 133–140. Hadjili, M.L., Wertz, V., 2002. Takagi–Sugeno fuzzy modeling incorporating input variables selection. IEEE Transactions on Fuzzy Systems 10 (6), 728–742. Hess, R.A., Modjtahedzadeh, A., 1990. A control theoretic model of driver steering behavior. IEEE Control Systems Magazine 10 (5), 3–8. Jang, J.S.R., 1993. ANFIS: adaptive-network-based fuzzy inference system. IEEE Transactions on Systems, Man and Cybernetics 23 (3), 665–685. Jang, J.S.R., Sun, C.T., Mizutani, E., 1997. Neuro-Fuzzy and Soft Computing. PrenticeHall, Upper Saddle River, NJ, USA. Johansen, T.A., 1994. Fuzzy model based control: stability, robustness, and performance issues. IEEE Transactions on Fuzzy Systems 2 (3), 221–234. Ljung, L., 1999. System Identification: Theory for the User. Prentice-Hall, Upper Saddle River, NJ, USA. Macadam, C.C., 2003. Understanding and modeling the human driver. Vehicle System Dynamics 40 (1), 101–134. 603 McRuer, D., 1980. Human dynamics in man–machine systems. Automatica 16, 237– 253. Narendra, K.S., Parthasarathy, K., 1989. Adaptive identification and control of dynamical systems using neural networks. In: Proceedings of the IEEE Conference on Decision and Control, pp. 1737–1738. Nechyba, M., Xu, Y., 1995. Human skill transfer: neural networks as learners and teachers. In: Proceedings of the IEEE International Conference on Intelligent Robots and Systems, vol. 3, pp. 314–319. Nechyba, M., Xu, Y., 1997. Human control strategy: abstraction, verification, and replication. IEEE Control Systems Magazine 17 (5), 48–61. Nelles, O., 2001. Nonlinear System Identification. Springer, Germany. Pellecchia, A., Igel, C., Edelbrunner, J., Schoner, G., 2005. Making driver modeling attractive. IEEE Intelligent Systems 20 (2), 8–12. Pilutti, T., Ulsoy, A.G., 1999. Identification of driver state for lane-keeping tasks. IEEE Transactions on Systems, Man and Cybernetics, Part A 29 (5), 486– 502. Prokop, G., 2001. Modeling human vehicle driving by model predictive online optimization. Vehicle System Dynamics 35 (1), 19–53. Sasaki, M., Inooka, H., Ishikura, T., 1990. Manual control of a flexible arm and its application to an automatic control system. IEEE Transactions on Systems, Man and Cybernetics 20 (3), 718–722. Sheridan, T.B., Ferrell, W.R., 1974. Man–Machine Systems. MIT Press, Cambridge, MA. Soderstrom, T., Stoica, P., 1989. System Identification. Prentice-Hall, Hertfordshire, UK. Takagi, T., Sugeno, M., 1985. Fuzzy identification of systems and its applications to modeling and control. IEEE Transactions on Systems, Man, and Cybernetics 15, 116–132. Ungoren, A.Y., Peng, H., 2005. An adaptive lateral preview driver model. Vehicle System Dynamics 43 (4), 245–259. Vingerhoeds, R.A., Janssens, P., Netten, B.D., Aznar Fernandez-Montesinos, M., 1995. Enhancing off-line and on-line condition monitoring and fault diagnosis. Control Engineering Practice 3 (11), 1515–1528. Wu, S.J., Chiang, H.H., Perng, J.W., Lee, T.T., Chen, C.J., 2005. The automated lanekeeping design for an intelligent vehicle. In: IEEE Intelligent Vehicles Symposium Proceedings, 2005, pp. 508–513. Zapata, G.O.A., Kawakami, R., Galvao, H., Yoneyama, T., 1999. Extracting fuzzy control rules from experimental human operator data. IEEE Transactions on Systems, Man and Cybernetics, Part B 29 (3), 398–406. Zhao, Z.Y., Tomizuka, M., Isaka, S., 1993. Fuzzy gain scheduling of PID controllers. IEEE Transactions on Systems, Man and Cybernetics 23 (5), 1392–1398. Ozkan Celik was born in Sivas, Turkey. He earned his B.Sc. and M.Sc degrees in Mechanical Engineering in 2004 and 2006, respectively, both at Istanbul Technical University. He was the recipient of TUBITAK (The Scientific and Technological Research Council of Turkey) fellowship for M.Sc. students during his M.Sc. studies. He is currently pursuing Ph.D. studies at Rice University and is a research assistant at the Mechatronics and Haptic Interfaces (MAHI) Lab. His current research interests are human motor control system, motor adaptation and learning, robotic rehabilitation, mechatronics and haptics. Seniz Ertugrul was born in Bursa, Turkey. She graduated from Istanbul Technical University Mechanical Engineering Department with her B.Sc. and M.Sc. degrees in 1988 and 1992, respectively. She earned her Ph.D. degree at the Wichita State University, KS, USA in 1996. She worked as a lecturer at WSU and as a visiting researcher at the University of Michigan. She is currently an associate professor at the Istanbul Technical University, Mechanical Engineering Department. Her areas of interest are human operator modeling, mechatronics and intelligent systems.

Predictive human operator model to be utilized as a controller... neuro-fuzzy and fuzzy-ARX modeling techniques

Related documents

Products

Support

Predictive human operator model to be utilized as a controller... neuro-fuzzy and fuzzy-ARX modeling techniques

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib