Advanced Markov Chain Monte-Carlo Simulation For Gaussian Filtering Based Inverse Problems A report for the state of art seminar presentation Submitted by Arpita Ghosh (196104102) Under the supervision of Prof. Anjan Dutta Dr. Arunasis Chakraborty DEPARTMENT OF CIVIL ENGINEERING INDIAN INSTITUTE OF TECHNOLOGY GUWAHATI June 25, 2024 Acknowledgements I want to grab this opportunity to acknowledge the greatness of all those people who help and support me to complete this report. First of all I want to thank both of my supervisor, Prof. Anjan Dutta and Prof. Arunasis Chakraborty , for giving endless important suggestions and critical reviews of my work through their guidance. I also want to thank the other members of my Doctoral Committee, Prof. Sajal K. Deb, and Dr. Sandip Das , Prof. Debabrata Chakraborty for keeping their patience and believe over my work. And finally I will not let go this chance without mentioning the infinite support and help of my senior research fellows, my friends and family to keep me motivated for completing this report in time. Date:25-01-2021 Arpita Ghosh (196104102) IIT Guwahati,India i Abstract Combining mathematical modeling techniques with experimental tools offers most powerful means to study existing structures. Such studies lie at the heart of problems of structural health monitoring. A major component in these problems consists of identification of parameters of mathematical models of structural systems based on a limited set of measurements. The problem of structural system identification constitutes an important and difficult class of inverse problems in structural engineering.Methods based on dynamic state estimation offer powerful means to tackle these problems. These methods recognize that the mathematical model as well as measurements are invariably imperfect in nature and employ random processes to model these imperfections. Consequently, the problem of structural system identification is tackled within a probabilistic framework and one seeks to determine the joint probability density function of the system parameters conditioned on the measurements made. The solution to this problem becomes increasingly difficult as systems behave nonlinearly, size of the problem increases, and noises become multiplicative and (or) nonGaussian. The advent of powerful computing tools and advances in sensing techniques has spurred research activities to tackle these challenges. This report belongs to the literature review of this area of research. In the past two decades the development of Structural Health Assessment (SHA) techniques has become a booming research field in civil engineering community. Vibration based SHA techniques can be broadly classified into two categories, namely frequency domain and time domain. In frequency domain method, change in modal parameters (natural frequency, mode shape, modal damping etc.) are taken as a signal of damage occurrence . Although frequency domain techniques could successfully identify the global damaged condition of the structure, but it is insensitive to local damages. Time domain approach provides better flexibility to locate and quantify damage at local level. Commonly used time domain approach utilizes two techniques, least square and Bayesian filtering algorithm. Least square technique estimate the unknown structural parameter by minimizing the sum of square error between predicted and measured vibration response. Bayesian filter method for parameter estimation is more suitable in case vibration response contains ambient noise signatures. When noise characteristics follow Gaussian distribution, Kalman filter based (Extended Kalman filter, Ensemble Kalman Filter, Unscented Kalman filter etc.) methods are used. This limitation can be overcome using Particle filter method as it does not seek any approximate density function in parametric form. However, Particle filter method is more computationally expense than Kalman filter based method. In comparison to Kalman filters, they are more general and applicable to systems where model and measurement equations are highly nonlinear. ii Contents Acknowledgements i Abstract ii Contents vi List of Figures vi List of Abbreviations viii 1 Introduction 1 1.1 Structural system identification . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Application of system identification . . . . . . . . . . . . . . . . . . . . . . . 3 1.3 Books and review papers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2 Dynamic State Estimation 10 2.1 Bayesian Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.2 Bayesian Model Updating . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.2.1 Kalman filter (Kalman 1960) . . . . . . . . . . . . . . . . . . . . . . 14 2.2.2 Extended Kalman filter (EKF) . . . . . . . . . . . . . . . . . . . . . 15 2.2.3 Unscented Kalman filter (UKF) . . . . . . . . . . . . . . . . . . . . . 16 2.2.4 Monte Carlo Methods . . . . . . . . . . . . . . . . . . . . . . . . . . 17 2.2.4.1 Perfect Sampling and Sequential Importance Sampling (SIS) 17 2.2.4.2 Sequential Importance Resampling (SIR) and Bootstrap Filter 21 2.3 Hamiltonian Monte Carlo Method . . . . . . . . . . . . . . . . . . . . . . . . 23 2.3.1 General Description . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 2.3.2 Books and review papers . . . . . . . . . . . . . . . . . . . . . . . . . 24 2.3.3 Overview of HMC Method . . . . . . . . . . . . . . . . . . . . . . . . 27 iv Guides and tutorials 2.4 Overleaf 2.3.4 Time Evolution of the Hamiltonian System . . . . . . . . . . . . . . . 28 2.3.5 HMC for Model Updating . . . . . . . . . . . . . . . . . . . . . . . . 29 Brief model and analysis from the implementation of particle filter . . . . . . 30 2.4.1 System Identificatio of single-degree-of-freedom (SDOF) oscillator . . 30 2.4.2 System Identification of Linear Time Invariant (LTI) Synthetic model 31 3 Research proposal 40 3.1 Gap Areas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 3.2 Scope of Research . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 3.3 Objectives of Research . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 References 47 Page v List of Figures 1.1 Dynamic System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Schematic flowchart of system identification (Source:Soderstrom (2001)) . . . 4 2.-1 System identification of SDOF oscillator excited by Elcentro earthquake . . . 34 2.0 Parameter values for solving the forward problem . . . . . . . . . . . . . . . 36 2.1 System identification of LTI System . . . . . . . . . . . . . . . . . . . . . . . 39 vi List of Abbreviations SSI SHM KF EKF PCE GA RMS MLS MPP HDMR RSM FEA FEM MC MCMC MCMC MH Structural System Identification Structural Health monitoring Kalman Filter Extended Kalman Filter Polynomial Chaos Expansion Genetic Algorithm Root Mean Square Moving least square Most probable failure point High dimensional model representation Response surface method Finite Element Analysis Finite Element method Monte Carlo Markov Chain Monte Carlo Transitional Markov Chain Monte Carlo Metropolis-Hastings viii Introduction 1.1 Structural system identification Structural system identification (SSI) is the field of mathematical modeling of the inverse problem from the experimental data. It consists of determination of properties of a structure based on a set of measurements on system response and applied actions. It has acquired widespread applications in several areas like controls and systems engineering where system identification methods are used to get appropriate models for synthesis of a regulator, design of prediction algorithm and in signal processing applications (such as in communications, geophysical engineering and mechanical engineering). Models obtained by system identification are used for spectral analysis, fault detection, pattern recognition, adaptive filtering, linear prediction and other purposes. These techniques are also successfully used in other fields such as biology, environmental sciences and econometrics to develop models for increasing scientific knowledge on the identified object, or for prediction and control. These system identification problems constitute an important class of inverse problems in structural mechanics and form a crucial step in structural health monitoring (SHM) A dynamic system can be conceptually described in Fig 1.1. The system is driven by user controlled input variables u(t) while disturbances v(t) cannot be controlled. The output y(t) provide useful information about the system. There are several kinds of mathematical models used Figure 1.1: Dynamic System for solving the inverse problem which are mostly governed by the underlying differential 1 equations. The mathematical models can be segregated into two paradigms • Modeling, which refers to derivation of models from the basic laws of physics. Often, one uses fundamental balance equations for range of variables like energy, force, mass etc. • Identification, which refers to the determination of the model parameters from the experimental data. It includes the set up of identification experiment i.e data acquisition and determination of a suitable form of the model which is fitted to the recorded data by assigning suitable numerical values to its parameters. Though SSI methods are useful for large and complex structures where it is difficult to obtain the mathematical models directly, it has some limitations. They have a limited validity i.e they are valid for a certain working point, a certain type of input, a certain process,etc. SSI is not a foolproof methodology that can be used without interaction from the user. The reasons for this are • An appropriate model structure must be found. This can be a difficult problem, particularly if the dynamics of structure is non-linear. • The real life recorded data is not perfect always as these are always disturbed by noises. • The process may vary with time, which can cause problems if an attempt is made to describe it with a time invariant model Therefore these complicating features, such as, ill-posedness of governing equations, presence of measurement noise, modeling imperfections, structural nonlinearities, spatially incomplete measurements, lack of measurements on applied actions, and possible need for online identification of system parameters, make problems of SSI very challenging. It may be emphasized that these problems are studied essentially with respect to structures that already exist and the powers of both mathematical and experimental modeling are available for solving such problems. The recent advances in computational and sensing hardwares have spurred research interest in this class of problems. The two modeling approaches have flourished in structural engineering practice with extensive developments taking place in the areas of computational mechanics [essentially the finite element method (FEM)] and progress in experimental testing and analysis methods. These developments also have important bearing on studies on existing structures which provide the best opportunities to combine the two modeling approaches. Problems of structural system identification sprawl at the forefront of this class of problems. 2 1.2 Application of system identification In general terms an identification experiment is performed by exciting the system and observing its output over an interval of time. These signals are normally recorded in a computer mass storage for subsequent information processing. We then try to fit a parametric model of the process to the recorded input and output sequences. The first step is to determine an appropriate form of the model (typically a differential equation of certain order). In the second step, several statistical approaches are used to estimate the unknown parameters of the model. This estimation is often done iteratively. The model obtained is then tested to see whether it is an appropriate representation of the system. If this is not the case, some more complex model structure is considered, its parameters estimated and validated again. Fig 1.2 shows the schematic of the steps used in system identification. Following the above discussion, there are two main purposes of model updating or system identification of the structural system. The common goal is to identify the physical parameters e.g. (stiffness) of a structural element. These identified parameters can further be used as indicator for the status of the system. For example, stiffness parameter of a structural member can be monitored from time to time and an abnormal reduction indicates possible damage of the member.But this reduction may also be simply due to statistical uncertainty. Hence the quantification of uncertainty becomes important. Another purpose of model updating can be to obtain a mathematical model to represent the underlying system for future prediction. This is broadly known as Structural Health Monitoring. Another important area of application of system identification is structural vibration control which has received great attention in the last several decades (Housner et al.,[1]). The various ingredients in problems of SSI are: excitation signals, response quantities measured, mathematical model for the system being identified, and model validation. Depending on the manner in which these ingredients are included in the identification procedure, different schemes for SSI are possible. 1.3 Books and review papers Structural System identification has remained an active area of research for several decades with civil engineering researchers showing interest from 1970-s. Many researchers have come up with various methods and have solved several problems ranging from experimental models to real life large scale structures. There exists several monographs and review papers that address various topics within this discipline of research and here we make brief mention 3 Figure 1.2: Schematic flowchart of system identification (Source:Soderstrom (2001)) 4 about these works. The general approach can be divided in following categories. • Conventional model-based approaches • Time domain identification methods • Biologically inspired approaches such as neural network and genetic algorithm • Time-frequency based approaches using Wavelet, Hilbert Transform • Chaos theory Conventional model-based approaches for system identification typically use a computer model of the structure, such as a Finite-Element Method (FEM) model, to identify structural parameters primarily from field or laboratory test data. Damage identification in beams is a common theme in system identification. (Kim and Stubbs [2]) studied damage identification of a two-span continuous beam using modal information. (Lee and Shin, [3]) detected the changes in the stiffness of beams based on a frequency-based response function. Model based system identification methods cannot be used effectively for large and complicated real-world structures with nonlinear behavior. For such cases, biologically-inspired or soft computing techniques such as Neural Networks[4], Genetic Algorithms (GA), or particle swarm optimization have been proposed as a more effective approach. Nelles [5] has covered approaches based on neural networks, fuzzy models, optimization and classical polynomial based methods (e.g., Volterra-series models). (Franco et al., [6]) used an evolutionary algorithm to identify the structural parameters of a 10-DOF shear frame. (Raich and Liszkai [7]) used Genetic algorithm to identify the stiffness changes in a steel beam and a 3-story, 3 bay frame. for nonlinear SSI problems. In the past two decades because of their ability to retain both time and frequency information, wavelets have been used increasingly to solve complicated time series pattern recognition problems in different areas. (Liew and Wang [8]) used wavelets to identify cracks in simply supported beams. Worden and Tomlinson [9] cover wide ranging topics in nonlinear structural dynamical systems; these topics cover higher order spectral analysis, Hilbert transform technique, force state mapping, and time series models. The book also covers experimental studies in the area of nonlinear SSI.(Bao et al. [10]) employed the Hilbert-Huang transform for system identification of concrete-steel composite beams. Mahato et al.[11] developed a continuous wavelet transform based filtering strategy combined with Hilbert transform for modal identification of reinforced concrete road bridge with vehicular excitation. The book by Bendat [12] focuses on frequency domain methods for nonlinear SSI and 5 discusses the use of one-dimensional spectral functions arising in multiple input and single output nonlinear systems. A few researchers have employed the chaos theory and fractal concept [13] to model the complicated structural dynamics for system identification[14, 15]. However, the emphasis of discussion mainly focus on the literature review of the time domain dynamic state estimation methods which directly utilizes the response time histories to detect structural damage. These methods are particularly appropriate for situations with time-varying structural properties and where sequential data set is observed. The dynamic state estimation methods derive their origin from the Bayesian Methods. Bayesian theory was originally discovered by the British researcher Thomas Bayes in a publication (Bayes[16]).The methods have been widely used in many areas due to the pioneering work done by Thomas Bayes. The modern form of the theory was rediscovered by French mathematician Simon de Laplace in in Theorie Analy-tiquedes Probabilites. One of the earliest researches in iterative Bayesian estimation can be found in (Ho and Lee,[17]). (Spragins,[18]) discussed the iterative application of Bayes rule to sequential parameter estimation and called it as ”Bayesian learning”. The subject of dynamic state estimation has close links with problems of system identification. The methods for dynamic state estimation can be categorized into two groups. The first includes the well-known Kalman filter (Kalman[19]) and its variants[20,21],etc. and the other is the Monte Carlo simulation based algorithms named as particle filters (Gordon et al.[22]). The works of Jazwinski [23], Maybeck [24], Kailath [25], Brown and Hwang [26], Chui and Chen [27], and Grewal and Andrews [28] provide comprehensive accounts of the Kalman filters (KF). The KF represents recursive algorithms to exactly compute the evolution of mean and covariance of system states conditioned on measurements for linear state space models with additive Gaussian noise models. It provides an exact solution to the problem of state estimation for linear Gaussian state space models. The most popular variant of Kalman filter is the Extended Kalman filter (EKF), where linearization of the process equation is done to provide a Gaussian approximation of what really is a non-Gaussian quantity (Hoshiya and Saito[20]). Ghanem and Shinozuka[29] provided a review of methods of system identification by application to experimental data obtained on three and five-story steel building structures subjected to seismic loading, including the EKF, maximum-likelihood technique, recursive least-squares, and recursive instrumental-variable method. (Moaveni et al., [30]) examined six variations of model-based approach including, data- driven stochastic subspace identification, frequency domain decomposition, observer/Kalman filter identification, and general realization algorithm for system identification of a full-scale 7-story RC building structure subjected to shake table loading, and concluded that probabilistic system identification methods in connection with FE model updating provide the most desirable 6 results. Sen and Bhattacharya [31]coupled generalised Polynomial Coas with EKF algorithm in which the uncertainty propagation from parameter to measurement was described through gPC expansion of parameters and outputs. Subsequently, the gPC coefficients of the parameter expansion are estimated from available measurements employing EKF. Thus, instead of selecting the system parameters as states, we consider the associated parameter gPC coefficients as state variables which reduces the problem of estimating the complete distribution of parameters down to identification of a few gPC coefficients. The proposed method is tested on systems with either Gaussian or non-Gaussian parameters. The error in estimating non-Gaussian parameters using KF based techniques is demonstrated. The second group of methods known as the Monte Carlo methods begin by considering the exact form of the recursive integral equations that govern the evolution of filtering pdf and employ Monte Carlo simulation procedures to solve these equations in a recursive manner by approximating the complex integrals. Monte Carlo (MC) methods are stochastic computational algorithms and these are efficient for simulating the highly complex systems. The MCB approach was conceived by Ulam [32], developed by Ulam and von Neumann [33], and coined by Metropolis([34])(Candy[35]). The technique evolved during the Manhattan project in 1940-s, when the scientists were investigating the calculations related to atomic weapon designs. MC methods have wide variety of applications in engineering and finance. It offers alternative approach to solve numerical integration and optimization problems.The application of these methods to problems in structural mechanics is not yet widely explored. This more general class of problems solving involving nonlinear process and measurement equations, multiplicative/additive non-Gaussian noises implemented using MC methods has been studied by Tanizaki [36, 37], Doucet et al.,[38], Liu and West (2001)[39] and Ristic et al., [40]. Monte Carlo simulation methods are known variously as Monte Carlo filters, Bayesian filters, population Monte Carlo algorithms, or particle filters (Gordon et al.,[22], Tanizaki [36], Doucet [41], Liu and Chen [42], Pitt and Shephard [43], Doucet et al., [44], Iba [45], Doucet et al., [38], Crisan and Doucet [46], Ristic et al., [40], Cappe et al., [47], Doucet and Johansen [48]) and [49]. There are several variants of the particle filters available in the literature as well (Chen[50]). These methods have been widely used in robotics and for solving the tracking problems (Thrun[51]). The following passage describes the work done by the researchers on implementation of these methods to structural mechanics. Ching et al., [52]compared performance of EKF and particle filter by applying them on planar four-story shear building with time-varying system parameters and non-linear hysteretic damping system with unknown system parameters. The mass of the shear building is assumed to be time invariant with time varying stiffness and damping parameters at each floor level and applied synthetic data contaminated with noise. The non-linear SDOF Bouc7 Wen hysteretic damping system model was considered. He concluded that Particle filter is advantageous over EKF and EKF may sometimes create misleading results. Further EKF is not suitable for highly non-linear models. Manohar and Roy,[53] identified the parameters of nonlinear structures using dynamic state estimation techniques. They considered two single-degree of freedom nonlinear oscillators, namely, the Duffing oscillator and the one with Coulomb friction damping. They identified of parameter alpha and mu based on noisy observations using the density based MC filter, bootstrap filter and sequential importance sampling filter. The basic objective of the study was to construct the posterior pdf of the augmented state vector based on all available information. Nasrellah and Manohar, [54] did the combined computational and experimental study using multiple test and sensor data for structural system identification. They considered the problem of identification of parameters of a beam with spatially varying density and flexural rigidity as well as the identification of parameters of a rigidly jointed truss. It was concluded that various factors affect the accuracy of identification like number of particles used in filtering, closeness of the initial guess on system parameters to the true values, number of global iterations, noise levels in measurements and model,imperfections, the number of parameters to be identified and sensitivity of measurements with respect to the parameters being identified. Radhika and Manohar [55] The authors presented a approach for state estimation based on both Kalman filter and Sequential Monte Carlo filter algorithm. Authors utilizes the concept of substructure scheme to problems of hidden state estimation in structures with local non-linearities, response sensitivity model updating in nonlinear systems,and characterization of residual displacements in instrumented inelastic structures. Author divided the structure in linear and nonlinear substructure and solved the linear estimation problem with Kalman filter and nonlinear problem with particle filter method. It was illustrated with the help of numerical examples and the performance of the solution from the proposed scheme is compared with exact solutions and with alternative approximate solutions which do not employ sub-structuring, and the results are shown to be in satisfactory agreement. Sen et al. [56] have developed a conceptual framework taking the input excitation force to be an additional state that is estimated in parallel to the structural parameters and employed two concurrent filters for parameters and force respectively. They used two filters one for the parameters, an interacting Particle-Kalman filter to target systems with correlated noise and another to estimate the seismic force acting on the structure. For stability in the estimation the parameters and the inputs estimated as being conditional on each other. They validated numerically the proposed algorithm on a 6-DOF mass-spring8 damper system and a five-story building structure. Thier estimation results confirmed the applicability of the proposed algorithm. Similar studies were carried out by (Namdeo and Manohar, [57]), (Ghosh et al.,[58]) and (Sajeeb and Roy,[59]) ,[60], Radhika and Manohar [61] and Sen et al. [62].The particle filter algorithm has also been used for identification of fatigue cracks in vibrating beams (Rangaraj[63]).[64] the following sections briefly outlines the development of some of above strategies in the following sections. [65] 9 Dynamic State Estimation The problem of dynamic state estimation (DSE) can be stated by considering three entities: 1. the postulated mathematical model for the system behavior encapsulated through a process equation of the form xk+1 = hk (xk , θ, vk ); k = 0, 1, 2, .... (2.1) where k is a independent variable (typically discretized time variable) along which the n×1 system state vector system xk evolves; hk is a nx ×1 vector function: vk is a nv ×1 vector of random variables such that the sequence {vk },k = 1, 2, ... from an independent, identically distributed (iid) sequence of random variables with {vk } ∼ p(vk ); here p(k )is the n dimensional joint probability density function (pdf ) of k ; thesymbol ∼ is taken to mean ‘is distributed according to’; and θ is a p×1 vector of system parameters, 2. prior pdf of x0 , p(x0 ) at initial time step k = 0, and 3. a measurement equation of the form yk = qk (xk , θ, wk ); 2.1 k = 1, 2, ... (2.2) Bayesian Methods In many of the engineering problems, modeling of uncertain parameters is necessary for various purposes. In this context, Bayes Theorem offers the framework of modeling and inferring the uncertain models from the measurements. These methods have been applied to many different disciplines of natural sciences, social sciences and engineering, especially in statistical physics, engineering hydrology, econometrics, archaeology, information sciences, medical 10 sciences, forensic sciences, marketing, mechanical engineering, computer science, engineering geology, aerospace engineering, finance, population migration, and many other areas. In Structural Engineering , these methods have been used in system reliability, prediction of concrete strength , structural dynamics and system identification. Bayesian inference is very important for Structural engineering applications because of the wide variety of uncertainty associated with the structures. The examples of such uncertainties can be earthquake ground motion or complete time varying description of the wind pressure, material properties which are difficult to determine for heterogenous materials like concrete and the number and size of cracks present in concrete. Not only this, the modeling errors and uncertainties are also associated with the joints.Therefore, Bayesian statistics has wide application in Structural engineering as well. In many scenarios, the solutions gained through Bayesian inference are viewed as “optimal”. 2.2 Bayesian Model Updating Many real world data analysis tasks involve estimating the unknown quantities from some given observations. In such type of problems, generally the prior knowledge about the phenomenon to be modeled is available. This knowledge can be used to formulate the Bayesian models where prior knowledge about the state is updated using the likelihood function to generate a posterior distribution. Often the measurements arrive sequentially and it is possible to both carry out the offline as well as online inferences. Thus, one of the most important steps of this process is to update the states recursively once the measurements are available.The focus of dynamic state estimation techniques is to estimate the state of the system using the measurement data. The governing equation can be written as: X(t) = q(P (t), t) (2.3) where X(t) is the response of the structure when an input force P (t) is given to the system and q(.) relates the input to the output. Since the measurements are available at discrete time steps, it becomes obvious to discretize the above model equation as Xk+1 = qk (Xk , wk ) (2.4) where Xk represents the state of the system at time t = k ; Xk+1 represents predicted state at time t = k + 1 and wk represents the white noise. The discretized measurement equation can be written as Yk = hk (Xk , vk ) 11 (2.5) where Yk is the measurement at time t = k corresponding to the state Xk and vk is the measurement noise similar to the model noise. However the model as well the measurement noise has been assumed as uncorrelated. The measurements from the sensors are sampled at a particular rate and can be denoted as a vector Mk = [Y1 , Y2 , ., Yk ] (2.6) The objective of this formulation is to estimate the current state Xk based on the measurement Yk . As the model and the measurements are corrupted with noise it is required the problem of state estimate reduces to estimating the probability density function p(Xk |Mk ). Since estimating p(Xk|M k) is itself not easy, so the more simplified problem is to determine the moments of Xk . Mathematically, this can be written as: Z µ = Xk p(Xk |Mk )dXk Z σ= (Xk − µ)T (Xk − µ)p(Xk |Mk )dXk (2.7) (2.8) where µ and σ are the first moment or mean and the second moment or variance of the pdf p(Xk |Mk ) respectively. In the following, a detailed derivation of the recursive Bayesian Estimation is presented, which underlines the principles of sequential Bayesian filter. Two assumptions are used to derive the recursive Bayesian Filter. • The states follow a first order Markov process p(Xk |X0:k−1 ) = p(Xk |Xk−1 ); (2.9) • The observations are independent of the given states. At any time t, the posterior is given by the Bayes theorem as p(X0:t |Y1:t ) = R p(Y1:t |X0:t )p(X0:t ) p(Y1:t |X0:t )p(X0:t )dX) (2.10) The recursive equation can be obtained as p(X0:t+1 |Y1:t+1 ) = p(X0:t |Y1:t ) 12 p(Yt+1 |Xt+1 )p(Xt+1 |Xt ) p( Yt+1 |Y1:t ) (2.11) The following recursive relations are used for prediction and updating The prediction equation is given by Z p(Xt |Y1:t−1 ) = p(Xt |Xt−1 )p(Xt−1 |Y1:t−1 )dXt−1 (2.12) Based on this prediction the model updating equation is p(Xt |Y1:t ) = R p(Yt |Xt )p(Xt |Y1:t−1 ) p(Yt |Xt )p(Xt |Y1:t−1 )dXt (2.13) It is however difficult to compute the normalizing constant p(Y1:t ) and the marginal of the posterior p(X0:t |Y1:t ) as it requires evaluation of complex high dimensional integrals. The above expressions are modified in the following way when the system and the model noise are also present. Adopting the notations as p(Xk |Yk−1 )is the estimate of the state at time k based on the measurements Yk−1 and p(Xk |Yk ) denotes the pdf of the state at time k based on the measurements Yk . Therefore, the first one is the priori pdf or the prediction, while the latter is the posteriori pdf or the correction to the state once the measurements are available at time k. It is also assumed that p(X1 |M0 ) = p(X1 ) is known. The prediction equation can be expressed as: Z p(Xk |Mk−1 ) = p(Xk |Xk−1 )p(Xk−1 |Mk−1 )dXk−1 (2.14) (2.15) Here p(Xk |Xk−1 ) can be derived from 2.6. The conditional density can be used to write the following expressions. Z p(Xk |Xk−1 ) = p(Xk |Xk−1 , wk−1 )p(wk−1 |Xk−1 )dwk−1 (2.16) Since wk is independent of the state, it can be written that p(wk−1 |Xk−1 ) ≡ p(wk−1 ) (2.17) It can be clearly seen from the process equation that if Xk−1 and wk−1 are known, then Xk can be obtained deterministically from the process equation 2.6. Therefore the pdf of p(Xk |Xk−1 , wk−1 ) can be mathematically written as p(Xk |Xk−1 , wk−1 ) ≡ δ(Xk − fk−1 (Xk−1 , wk−1 )) (2.18) where δ(.) is the Dirac-Delta function. Substituting in this in the above Eq. 2.16 we get Z p(Xk |Xk−1 , wk−1 ) = δ(Xk fk−1 (Xk−1 , wk−1 ))p(wk−1 |Xk−1 )dwk−1 (2.19) 13 The above expression can be substituted in Eq. 2.12 As soon as the measurement Yk is available at the time step k the prediction can be updated using the Bayesian relation p(Xk |Mk ) = p(Yk |Xk )p(Xk |Mk−1 ) p(Yk |Mk−1 (2.20) where the normalizing denominator is given by Z p(Yk |Mk−1 ) = p(Yk |Xk )p(Xk |Mk−1 )dXk (2.21) The only unknown in the Eq. 2.20 is p(Yk |Xk ) which can be obtained as: Z p(Yk |Xk ) = p(Yk |Xk , vk )p(vk )dvk (2.22) which again takes the form of the Dirac-Delta function if Xk and vk are known. The measurement Yk is obtained from the measurement Eq. 2.5. Thus the above equations form the basis of the recursive Bayesian Model updating. If the functions f (.) and h(.) are linear and the noise wk and vk are Gaussian; then the closed form expressions of the above integrals are available and this leads to the wellknown Kalman Filter,[19]. However if the f (.) and h(.) are non-linear, then several other methods have been prescribed in literature like EKF, [20]. However the most recent interest is to exploit the cheap and faster computational facilities to develop methods based on the Monte Carlo Simulations for approximating the integrals in the above equations. 2.2.1 Kalman filter (Kalman 1960) Here we consider the following version of equations 2.12 and 2.13 xk+1 = Hk xk +k ; k = 0, 1, ... (2.23) yk = Qk xk + wk ; k = 1, 2, ... vk 0 Σv 0 ∼N , wk 0 0 Σw (2.24) (2.25) Here N (µ, Σ) denotes a multi-normal pdf with mean vector µ and covariance matrix Σ. The sizes of the quantities xk , Hk ,k , yk , Qk ,and wk appearing in the above equation are respectively ,n × 1, nx × nx , nx × 1, ny × 1, ny × nx andny × 1. Accordingly, the covariance matrices Σv and Σw have dimensions nx nx and ny ny , respectively. It is assumed that x0 ∼ N (µ0 , P0 ) 14 is known and is independent of k and wk . It is clear that p(xk+1 |xk ) ∼ N (Hk xk , σ) and p(yk |xk ) N (Qk , xk , Σ). Based on these facts, the functional recursive relations in equations 2.12 and 2.13 can be shown to lead to the following set of recursive relations for the evolution of conditional mean and covariance of the system states (see, for example, Tanizaki [36], Brown and Hwang [26]). Kk = Pk− Qtk [Qk Pk− QTk + Σw ]−1 x̂k = x̂k + Kk (yk − Qk x̂k ) Pk = [I − Kk Qk ]Pk− (2.26) x̂k+1 = Hk x̂k − Pk+1 = Hk Pk Hk + Σv Here a hat (.̂) denotes the estimate and a superscript ‘−′ denotes that the estimate is evaluated prior to assimilation of data at time tk ; the superscript t denotes matrix transposition; Pk denotes the covariance of estimation error ek = xk − x̂k ; Kk is the blending matrix (gain matrix) that relates the updated estimate x̂k to the measurement yk through the relation x̂k = x̂k + Kk [yk − Qk x̂k ] . In this model p(ck |y1:k ) ∼ N (x̂k , Pk ) 2.2.2 Extended Kalman filter (EKF) The extended Kalman filter (EKF) offers an approximate solution to the filtering problem governed by the following equations xk+1 = hk (xk ) + wk ; k = 0, 1, 2, ... yk = qk (xk ) + vk ; k = 1, 2, ... vk 0 Σv 0 ∼N , wk 0 0 Σw (2.27) Here vk and wk are as in the preceding section. The basic idea here is to linearize the functions hk (xk ) and qk (xk )around a reference state xR , that is, we approximate h(xk ) = hk (xR + xk − xR ) ≈ hk (xR ) + ∇hk |xk =xR (xk − xR ) q ( xk ) = qk (xR + xk − xR ) ≈ qk (xR ) + ∇qk |xk =xR (xk − xr )(2.28) Accordingly, we replace the model in Eq. 2.12 by the approximation xk+1 = hk (xR ) + ∇hk |xk =xR + wk ; yk = qk (xR ) + ∇qk |xk =xR (xk − xR )k = 1, 2, ...(2.29) 15 k = 0, 1, 2, ... Here ∇hk and ∇qk are, respectively, nx × nx and ny × nx Jacobian matrices given by ∇hk (i, j) = δqk (i) δhk (i) and ∇qk (i, j) = δxk (j) δxk (j) (2.30) The problem now is amenable for solution via the Kalman filtering. Typically, at the kt h step, one selects xR = x̂k − 1 , that is, as the assimilation proceeds, we apply local linearization on the fly around the evolving trajectory of x̂ˆk . The development of this method in DSE problems has been discussed by several authors (see, for instance, the books by Jazwinski [23], Maybeck [24], Brown and Hwang [26] and Grewal and Andrews [28]). The estimate provided by EKF is biased and the accuracy of its performance, in a given problem, is difficult to assess. The estimate does not permit interpretation as being minimum mean square error estimator. The method assumes that the functions hk (xk ) and qk (xk ) are differentiable with respect to xk - a condition that may not always be satisfied in problems of nonlinear structural mechanics (e.g., in dealing with friction, gaps and hereditary nonlinearities). The recent work of Ghosh et al., [58] presents an EKF method based on concept of transversal linearization which avoids the computation of the Jacobian matrices. It may also be noted that the idea of linearization can be extended to include higher order terms in Taylor’s expansion in equation (1.19). Here one needs to make a Gaussian closure approximation to handle the resulting recursions for x̂P and Pk Details of these formulations can be found in the work of Tanizaki [36]. 2.2.3 Unscented Kalman filter (UKF) This algorithm is based on the notion of an unscented transform which is a deterministic sampling technique to select realizations of random variables with a view to determine moments of functions of these random variables. In the context of nonlinear filtering problems, this method avoids the linearization of the nonlinear functions that appear in process and (or) measurement equations: instead, the method postulates Gaussian model for the posterior pdf and aims to estimate the mean and covariance of the transformed variables with higher accuracy than is possible in extended Kalman filtering. The method thus avoids the computation of gradients and, hence, could be used in dealing with nonlinearities that are not differentiable. The deterministic points here are termed as sigma-points and the filtering algorithm propagates these sigma points as measurements are assimilated. The posterior mean and covariance matrix of the system states are computed based on these propagated control points and associated weights. These response moments are shown to be accurate upto the second order. 16 2.2.4 Monte Carlo Methods The underlying principle of the MC methods is that they utilize Markov chain theory. The resulting empirical distribution converges to the desired posterior distribution through random sampling. The method is widely used in signal processing where one is interested in determining the moment of the stochastic signal f (X) with respect to some underlying probabilistic distribution p(X). However the similar concept is used in system identification problem where one is interested to estimate the expected values of the system parameters. The methods have the great advantage since these are not subject to constraints of linearity and Gaussianity. The methods as well have appealing convergence properties. Several variants of MC methods are available in the literature. This includes Perfect Monte Carlo sampling,Sequential importance sampling, Sequential importance resampling and the Bootstrap particle filter. The following section presents the mathematical formulation of each of the method. The concept has been illustrated by solving single degree of freedom oscillator at the end of the chapter. 2.2.4.1 Perfect Sampling and Sequential Importance Sampling (SIS) Monte Carlo methods use statistical sampling and estimation techniques to evaluate the solutions to mathematical problems. The underlying mathematical concept of Monte Carlo approximation is simple. Consider the statistical problem of estimating the expected value of E[f (x)] with respect to some probabilistic distributionp(X): Z E[f (X)] = f (X)p(X)dX (2.31) Here the motivation is to integrate the above expression using stochastic sampling techniques rather than using the numerical integration techniques. Such a practice is useful to estimate complex integral where it is difficult to obtain the closed form solution. In MC approach, the required distribution is represented by random samples rather than analytic function. The approximation becomes better and more exact when the number of number of such random samples increases. Thus, MC integration evaluates Eq. 2.31 by drawing samples X(i) from p(X). Assuming perfect sampling, the empirical distribution is given by N 1 X p(x) = δ(X − X(i)) N i=1 The above equation can be substituted to give Z N 1 X E[f (x)] = f (X)p(X)dX ≃ f (X(i)) N i=1 17 (2.32) (2.33) Generalization of this approach is known as Importance sampling where the integral is written as Z I= Z p(x)dx = where p(x) q(x)dx q(x) (2.34) Z q(x)dx = 1 (2.35) Here q(X) is known as the importance sampling distribution since it samples p(X) nonuniformly giving more importance to some values of p(x). The Eq. 2.34 can be written as N p(X) 1 X p(X(i)) I = Eq = q(X) N i=1 qX(i)) (2.36) where X(i) are drawn from the importance distribution q(.). The central theme of importance sampling is to choose importance distribution q(.) which can approximate the target distribution p(.) as close as possible. Using the concept of importance sampling, it is possible to approximate the posterior distribution. Since it is generally not easy to sample from the posterior, we use importance sampling coupled with an easy to sample proposal distribution q(Xt |Yt ).This is one of the most important steps of the Bayesian importance sampling methodology. Using the importance sampling concept the mean of f (Xt ) can be estimated as follows: Z E[f (Xt )] = f (Xt )p(Xt |Yt )dXt (2.37) where (Xt |Yt ) is the posterior distribution. Here, we insert the importance proposal density function q(Xt |Yt ) such that the estimate becomes Z p(Xt |Yt ) F (t) = E[f (Xt )] = f (Xt ) q(Xt |Yt )dXt q(Xt |Yt (2.38) Now using Eq. 2.20 (Bayes Rule) to the posterior distribution and defining the weighting function as p(Xt |Yt ) q(Xt |Yt ) p(Yt |Xt )p(Xt ) = p(Yt )q(Xt |Yt ) W̃ (t) = (2.39) Calculation of W̃ (t) requires the knowledge of the normalizing constant p(Yt ) which is given by Z p(Yt ) = p(Yt |Xt )p(Xt )dXt 18 (2.40) This normalizing constant is generally not available and hence the new weight W (t) can be defined by substituting Eq. 2.39 into Eq. 2.38. Z 1 p(Yt |Xt )p(Xt ) F (t) = f (Xt ) q(Xt |Yt )dXt p(Yt ) q(Xt |Yt ) Z 1 = W (t)f (Xt )q(Xt |Yt )dXt p(Yt ) 1 Eq [W (t)f (Xt )] = p(yt ) (2.41) The above equation can be also be written as: W (t)q(Xt |Yt ) = p(Yt |Xt )p(Xt ) (2.42) Thus, the normalizing constant in Eq.2.40 can be replaced by Eq.2.42 Eq [W (t)f (Xt )] p(Yt ) Eq [W (t)f (Xt )] = R W (t)q(X|Y ) Eq [W (t)f (Xt )] = Eq [W (t)] F (t) = (2.43) Now, if the samples are drawn from the distribution q(Xt|Y t), from perfect sampling distribution we have N 1 X q̃ = δ(X − X(i)) N i=1 (2.44) and therefore, the normalized weights w̃i of the it h sample can be written as W i(t) w̃i = PN i=1 Wi (t) where W i(t) = p(Yt |Xti )p(Xti ) p(Yt )q(Xti |Yt ) (2.45) (2.46) Therefore the final estimate of the Eq. 2.38 becomes F (t) ≈ N X w̃i f (Xt (i)) (2.47) i=1 As the number of samples (N → ∞), the approximation of posterior becomes p(Xt |Yt ) ≈ N X w̃i δ(Xt − Xt (i)) i=1 19 (2.48) With the above mathematical framework in place, we can derive the expressions for sequential interfacing of measurement data available at time instant t = k. One can write the approximation of posterior as: p(Xk |Y1:k ) ≈ N X w̃ki δ(Xk − Xk (i)) (2.49) i=1 where δ(.) is the dirac delta function and w̃i is the normalized weight of the it h particle at time k. p(X0:k |Y1:k ) ∝ p(Yk |X0:k , Y1:k−1 )p(X0:k |Y1:k−1 ) = p(Yk |Xk )p(Xk |X0:k−1 , Y1:k−1 )p(X0:k−1 |Y1:k−1 ) (2.50) = p(Yk |Xk )p(Xk |Xk−1 )p(X0:k−1 |Y1:k−1 ) i We could now construct an importance distribution X0:k ∼ q(X0:k |Y1:k ) and compute the corresponding (normalized) importance weights as i i p(Yk |Xki )p(Xki |Xk−1 )p(X0:k−1 |Y1:k−1 ) i w̃k ∝ 1 q(X0:k |Y1:k ) (2.51) The recursive form of the importance distribution can be written as: q(X0:k |Y1:k ) = q(Xk |X0:k−1 , Y1:k )q(X0:k−1 |Y1:k−1 ) (2.52) Substituting Eq. 2.52 in Eq. 2.51 we obtain the following expression w̃ki = i i p(Yk |Xki )p(Xki |Xk−1 )p(X0:k−1 |Y1:k−1 ) i i i q(Xk |X0:k−1 , Y1:k )q(X0:k−1 |Y1:k−1 ) (2.53) Thus the recursive weight can be given as: w̃ki ∝ i p(Yk |Xki )p(Xki |Xk−1 ) i w̃ i i q(Xk |X0:k−1 , Y1:k ) k−1 (2.54) So, the algorithm works the following way • Initilization: Draw N samples X0i from the prior X0i ∼ p(x0 ) (2.55) • Prediction: Draw N new samples Xki from importance distribution X ∼ q(Xk |X0:k−1i , Y1:k ) 20 (2.56) • Update: Calculate new weights according to Eq. 2.54. Once the weights are updated the posterior can be calculated using Eq. 2.39 One of the major problems associated with SIS Filter is the degeneracy where all the particles have negligible weight except one particle after few iterations. The variance of the importance weights increases with time and it becomes impossible to control the degeneracy phenomenon. A suitable measure of the degeneracy of the algorithm is the effective sample size (Gordon et al.,[22]) Nef f which can be defined as Ne f f = Ns 1 + V ar(wk∗i ) (2.57) where wk∗i can be obtained from Eq.2.39 The estimate of Nef f is given by the following relation Ñef f = PN 1 i 2 i=1 (w̃k ) (2.58) where w is the normalized weight obtained using the Eq. 2.54 When Nef f becomes less than N ; it implies degeneracy and a small Nef f indicates severe degeneracy. Therefore to counter this (Arulampalam et al.,[66]) suggested two ways • Good choice of Importance density: This involves the choosing the importance density such that the V ar(w∗i ) can be reduced and hence the value of Nef f increases. • Resampling: This is another important step which differentiates SIR filter from SIS filter and has been discussed in detail in the following section. Both of the above issues form the basis of “Sequential Importance Resampling” also known as “Adaptive Particle Filters” and have been discussed the following section. 2.2.4.2 Sequential Importance Resampling (SIR) and Bootstrap Filter The SIR filter is an MC method which can be applied to recursive Bayesian filtering problems. To use SIR algorithm, both the state dynamics Eq. 2.3 as well as the measurement equations 2.43 must be known. Further it is required to be able to sample from the noise distribution of the process as well as from the prior. A likelihood functions p(Yk |Xk ) needs to be known for computing the particle weights. SIR algorithm is very similar to SIS filter except the choice of optimal importance density as well as Resampling step included in the SIR algorithm. 21 The SIR algorithm can be easily derived from SIS algorithm by appropriate choice of the importance density. The optimal importance density used in SIR is i i q(Xki |X0:k−1 , Y1:k ) = p(Xki |Xk−1 , Yk ) (2.59) By substituting Eq. 2.59 in Eq. 2.54 the updated weight becomes i i wki ∝ wk−1 p(Yk |Xk−1 ), (2.60) This optimal importance distribution can be used when the state space is finite. The present report also uses the similar assumption of importance density. However, the report deals with the problem of system identification where we are more interested in identifying the system parameters rather then tracking the sate vector. The algorithm can be implemented in the following manner • Draw particles Xi from the importance distribution i Xki ∼ q(Xk |X0:k−1 , Y1:k ), i = 1, ..., N (2.61) • The new weights can be calculated from Eq. 2.54 for all the particles an normalize them to unity. • If Nef f calculated in Eq. 2.58 becomes too low, perform the resampling step. • Interpret each weight wki as the probability of obtaining the sample index i in the set Xki for [i = 1, . . . ,N]. • Draw N samples from that discrete distribution and replace the old sample set with this new one. • Set all weights to the constant value wki = N1 . The Bootstrap filter is a special case of SIR filter where the dynamic model is used as importance distribution as in Eq. 2.60 and the resampling is done at each step. A brief algorithm is presented here for a more clear illustration. However, the problem formulation section gives the detailed implementation of Bootstrap filter to System identification problem. • Draw point Xki from the dynamic model i ) , i = 1, ..., N Xki ∼ p(Xk |Xk−1 22 (2.62) • Calculate new weights and normalize them to unity. wki ∝ p(Yk |Xki ) , i = 1, ..., N (2.63) • Perform resampling after each iteration. One of the important steps in the above algorithm is resampling from the discrete probability mass function containing the normalized weights. Resampling ensures that particles with larger weights are more likely to be preserved than particles with smaller weights. Although the resampling solves the degeneracy, but it introduces sample impoverishment. There are wide variety of resampling algorithms available in the literature (Li, 2013[67]). 2.3 Hamiltonian Monte Carlo Method 2.3.1 General Description HMC is a non-random-walk based MCMC method which employs a deterministic mechanism derived from the Hamiltonian dynamics to simulate the samples according to a target distribution. It uses the concept of canonical ensemble from statistical mechanics in conjunction with the Hamiltonian principles to give a physical interpretation of the problem with useful intuitions. A canonical ensemble is a probability distribution which assigns a finite probability to a particle attaining a particular state of the system based on the energy of the state. For a system, where x represents the state and E(x) is the energy function, the canonical ensemble is defined as 1 E(x) f (x) = exp − Z T (2.64) where, T is the temperature of the system and Z is a normalizing constant. If we consider a Hamiltonian system such that it is entirely determined by its position (β) and momentum (γ), then the total energy of the system can be defined in terms of the Hamiltonian as H(β, γ) = U (β) + K(γ) (2.65) where, U (β) represents the potential energy of the system, which depends only on the position and K(γ) is kinetic energy, which depends on the momentum. For this system, the canonical ensemble becomes 1 f (β, γ) = exp Z 23 H(β, γ) − T (2.66) If the potential energy and kinetic energy of a system are defined such that the canonical ensemble represents a target probability distribution then the solution of the governing Hamiltonian equations, describes the time evolution of the system. A more detailed description of the properties of this evolution can be found in [68–70]. 2.3.2 Books and review papers The groundbreaking work in this domain is done by Beck and Katafygiotis [71] and Katafygiotis et al. [72] where the authors extended the central idea of Bayesian inference for updating the continuous-valued uncertain parameters θ using the measurement of continuous-valued structural response D [73]. Through their continual research, the authors have developed a general Bayesian statistical formulation for updating the model parameters and quantitative assessment of their relative accuracy using Bayesian inference [68, 72–85]. In this approach, the values of uncertain parameters θ, attaining a specific value is considered as a hypothesis while the initial degree of belief towards this hypothesis is updated using measured structural response D in terms of posterior probability which is conditioned by the structural behavior and a class of mathematical model C such that p(D|θ, C)p(θ|C) p(θ|D, C) = = K0 p(D|θ, C)p(θ|C) p(D|C) (2.67) The term p(θ|D, C) is the updated or posterior pdf of the uncertain model parameters θ based on the findings D (i.e. measurement) which provides a quantitative judgment of the plausibility of a set of uncertain parameter values while the term p(D|θ, C) is called the Likelihood function which quantifies the likelihood of obtaining the structural response D for a specified set of model parameter values θ. The term p(θ|C) is called the prior pdf, as it gives the initial plausibility of model parameters θ for a mathematical model in its actual 1 state when the actual structural response is not available. K0 = p(D|C) is a normalizing constant, such that integrating the right-hand side of the above equation over the parameter space Θ gives unity i.e. K0−1 = Z Z p(θ|D, C)dθ = Θ p(D|θ, C)(p(θ|C)dθ (2.68) Θ In the above formulation, exact evaluation of the posterior pdf involves a multidimensional integral which cannot be performed using analytical procedure. In early research work, since the exact nature of this Bayesian predictive pdf is not known a priori, it is approximated by a Gaussian distribution centered at the optimal points which globally minimize the objective function [73] and an algorithm for efficient searching of parameter space to find these optimum points has also been proposed [71]. 24 However, the asymptotic Gaussian approximation is feasible only for the identifiable cases where a non convex global optimization exists while for locally identifiable and unidentifiable cases, finding these optimal points involve a series of local optimization problems. Therefore, the computational efforts required to solve this problem grows with the dimension of the parameter space which makes it infeasible for high-dimensional cases [86]. These factors of model identifiability and asymptotic approximation of the posterior pdf are studied by Katafygiotis et al. [71], where the authors have also addressed practical issues like noise and incomplete measurement due to limited number of sensors. Early work in model updating is carried out using ambient time histories for updating model parameters. Subsequently, the formulation is extended to incorporate load-dependent Ritz vectors [87] and eigenvalues [88] as the parameter to detect damage. Here, it may be noted that the Metropolis-Hastings (MH) algorithm [80] is often used to simulate the samples of the random variable distributed according to an arbitrary continuous pdf . Hence, this method finds its way into Bayesian model updating where samples for the successive iterations are simulated by this method. It has been successfully implemented for model updating, bypassing the computationally exhaustive optimization algorithms [86, 89]. However, since the high probability region of target pdf p(θ|D, C) is concentrated in narrow bands of probability space, direct application of the MH technique to simulate samples distributed as per target pdf may lead to higher rejection rate of the generated samples and form a chain with large number of repeated samples which may leave the chain into nonergodic state. These problems are dealt with various adaptive MH algorithms developed in the recent past. The adaption is done either on the proposal pdf used to generate the candidate state [75, 83, 90] or the target posterior distribution [81, 86, 91] to obtain the information about the manifold in an iterative manner. The later case involving the adaption over the target pdf uses a sequence of intermediate pdf s with their spread such that they gradually converge to posterior pdf while maintaining the proper acceptance rate. In this context, use of intermediate pdf s solves the problem of high rejection rate in Markov chain caused due to narrow high probability region which cannot be anticipated before-hand. Beck and Au [86] have proposed an adaptive MetropolisHastings (AMH) method which uses predefined sequence of intermediate pdf s and requires kernel density estimation to approximate them which makes it inefficient for high dimensional problem. Ching and Chen [81] have proposed a new adaptive approach called Transitional Markov Chain Monte Carlo (TMCMC) where it uses the resampling to bypass the kernel density estimation. Unlike AMH method [86], TMCMC can automatically generate intermediate pdfs. The recent developments for efficient simulation techniques include the algorithms developed using Darwinian theory in the genetic algorithm [92], Gibbs-sampling 25 algorithm [78], enhanced MCMC method [93] and Online model updating [94]. All the above mentioned Markov Chain based simulation techniques exhibit random walk behavior while generating the candidate state, which needs higher time to discover the area of significant probability content. Duane et al. [68] have presented a new technique for the numerical simulation of lattice field theory called Hybrid Monte Carlo (aka Hamiltonian Monte Carlo) technique. It uses a deterministic mechanism inspired by the principles of Hamiltonian dynamics for simulating samples following a target distribution. It alleviates random-walk theory with a consistent exploration of the probability space. Its application in Bayesian model updating is first studied by Cheung and Beck [79], where the authors have used this method for finite element model updating of a ten-storey building using simulated data. In general, HMC offers faster convergence as compared to conventional random walk based MCMC methods [79]. Boulkaibet et al. [77] have proposed an improved version of the HMC method called Shadow Hybrid Monte Carlo technique where the sampling is done from a modified Hamiltonian function instead of the conventional Hamiltonian function. In both these versions of HMC algorithm, the authors have used Leapfrog Algorithm to generate the candidate state where the gradient of the logarithm of posterior pdf acts as a guide to explore the high probability region of the target pdf . However, numerical evaluations of the gradient over all the uncertain parameters are computationally exhaustive and even infeasible for a multi-dimensional problem involving large scale complex finite element model. Mbalawata et al. [95] developed a parameter estimation method for Stochastic differential equations(SDEs) based on the HMC and utilized EKF for evaluating the (approximate) marginal likelihood of the parameters. Septier and Peters[96] provided a unifying framework for Sequential Markov Chain Monte Carlo (SMCMC) approache and proposed inclusion of the principle of Langevin diffusion and Hamiltonian dynamics in order to cope with the increasing number of highdimensional applications. Thier simulation results show that the proposed algorithms are advantageous over existing algorithms. In the recent work Wang et al. [70] have proposed a modified version of HMC algorithms where the authors have derived a closed form solution for the simulation of Standard Gaussian random variables. The authors have also studied its application in reliability analysis using subset simulation where this method has shown great improvements over the standard MCMC algorithm. Baisthakur and Chakraborty [97] developed a Hamiltonian Monte Carlo-based algorithm for finite element model updating under Bayesian framework by proposing adaptive prior-based approach to generate the intermediate pdf s. The authors endorse efficiency of this method using synthetic experiments and actual test data for updating the finite element model of a steel truss bridge, suggesting performance of this algorithm as advantageous over the standard MCMC algorithm. Nikbakht et al.[98] They quantified rare events probabilities by introducing a gradient-based Hamil26 tonian Markov Chain Monte Carlo (HMCMC) framework, termed Approximate Sampling Target with Post-processing Adjustment (ASTPA). The basic idea was to construct a relevant target distribution by weighting the high-dimensional random variable space through a one-dimensional likelihood model, using the limit-state function. To sample from this target distribution they utilize HMCMC algorithms that produce Markov chain samples based on Hamiltonian dynamics rather than random walks. The performance of this Quasi-Newton based mass preconditioned HMCMC algorithm was found faster and superior over the conventional scheme. They further examined the performance of the proposed methodology against Subset Simulation in a series of static and dynamic low- and high-dimensional benchmark problems. Zhang et al.[99] analysed the applied effect of MCMC simulation method. They proposed a reliability assessment process based on MCMC simulation method with MH Algorithm. The advantage of this method is to improve the application efficiency and accuracy of reliability assessment based on bridge health monitoring data. 2.3.3 Overview of HMC Method Developing a deterministic Monte Carlo method using the concept of Hamiltonian dynamics requires constructing a Hamiltonian system which is mathematically equivalent to the probability space of interest. To establish such a relation, the variable (x) is viewed as position β of the Hamiltonian system, and an auxiliary momentum variable γ is introduced to complete the position-momentum phase space. The potential energy, V (γ), of the system is expressed in terms of the target pdf as a canonical ensemble in the following form V (β) ≡ − log(π(β)) (2.69) Also, the Kinetic energy K(γ) of the system is expressed in terms of the momentum as follows γ T M −1 γ K(γ) ≡ (2.70) 2 where, M is a positive-definite symmetric matrix. For simplicity, this matrix is always chosen as a scalar multiple of the identity matrix. Thus, the Hamiltonian of the equivalent mathematical system becomes H(β, γ) = V (β) + K(γ) (2.71) γ T M −1 γ Setting the temperature T = 1, the canonical ensemble for this system in terms of position = −log(π(β)) + vector β and the momentum vector γ is given by −γ T M −1 γ 1 1 1 2 f (x) = π(β, γ) = e−H(β,γ) = e−V (β) − e−K(γ) = π(β)e Z Z Z 27 (2.72) where, Z is a normalizing constant such that it makes the area under the pdf equal to unity. Above equation shows that the position vector β and the momentum vector γ are statistically independent of each other. Also, it is evident that the position vector β follows the target probability distribution π(β) and the momentum vector γ follows a Gaussian distribution with zero mean and covariance M . Therefore, if the samples are simulated as per the pdf π(β, γ), then the momentum component γ can be easily projected out to get the samples distributed as per the target probability distribution π(β). 2.3.4 Time Evolution of the Hamiltonian System To simulate a random variable distributed as per the target distribution π(β), the equivalent Hamiltonian system can be assumed as a conservative system for which the governing Hamiltonian equations are dβ ∂H = (2.73) dt ∂γ dγ ∂H = (2.74) dt ∂β These equations are solved numerically and the solution of these equations generate the trajectories of(β, γ) following the target pdf, π(β), and the Gaussian distribution for γ, respectively. Generally leapfrog algorithm is used to solve the above mentioned differential equations [[68, 79]]. However, when the target pdf follows Standard Gaussian distribution and M becomes an identity matrix [i.e. K(γ) = γ T γ/2]. Hamiltonian for such a system reduces to H(z, γ) = V (z) + K(γ) = −log(ϕ(z)) + = γT γ 2 (2.75) zT z γT γ + 2 2 The system in this form has an analytical solution which is given by [49] z(t) = γin sin t + zin cos t (2.76) γ(t) = γin cos t − zin sin t (2.77) where, zin and γin denote the initial position and momentum, respectively. This closed form solution describes the time evolution of the system and bypasses the computationally exhaustive solution of leapfrog algorithm. This candidate proposal has been successfully implemented for reliability analysis using subset simulation [[70]] . For generating the candidate state, the time parameter t is assigned with an arbitrary value at the beginning of the 28 chain which is then modified based on the acceptance ratio of the candidates and the limits of acceptance [[70]] such that if the rate of acceptance (a) is less than minimum acceptance rate (amin ) then t is decreased to t = sin−1 sin t exp[(a − amin )/2] (2.78) whereas, if the acceptance ratio exceeds the maximum rate of acceptance (amax ), then t is increased using the formula t = sin−1 {sin t exp[(a − amax )/2]} (2.79) However, considering the circular behavior of Eq. 2.78 and 2.79 , it is reasonable to assign an initial value to t such that t ∈ [−π/2, π/2]. Using the above formulation and the acceptance -rejection criteria, HMC method can be applied for Bayesian model updating with Standard Gaussian prior. 2.3.5 HMC for Model Updating HMC algorithm is applied for model updating by Cheung and Beck [79] where the leapfrog strategy is used to generate the candidate state. In this approach, generation of a new candidate state requires the evaluation of gradient of the Potential Energy function with respect to each uncertain parameter. However, gradient evaluation using numerical techniques or approximate allied methods are a cumbersome task and the computational cost grows with the dimension and the complexity of the Potential Energy function. Cheung and Beck [79] have used Simultaneous Perturbation Stochastic Approximation (SPSA) optimization algorithm to estimate the starting point θ0 closer to the high probability region of the posterior pdf which itself requires large number of evaluations of the posterior pdf, further consuming the computational resources. Also, the problem of low acceptance rate and multi-modal posterior density are often encountered in model updating due to its inverse nature and the narrow regions with high prob- ability concentration. These problems are addressed in MCMC based simulation methods with the help of intermediate pdf s and adaptive proposal distribution, respectively. However, in HMC based techniques, time parameter (δt) can be tuned to maintain a consistent exploration of the probability space with a required rate of acceptance. But, as the candidate state is proposed through a close form solution or a numerical time-stepping method, the liberty of using the information from the simulated posterior is not available. Therefore, this method fails to identify well-separated high probability regions leading to a multimodal posterior density function. These shortcomings limit the application of this method to less complicated mathematical models having unimodal 29 posterior distribution. A new HMC based model updating algorithm is presented in the following section that provides a way to overcome these drawbacks using the closed-form solution in Eq. 2.76 and Eq. 2.77 for simulating the random variables following an adaptive prior based formulation. It can be used to update more complicated finite element model having multimodal posterior pdf. 2.4 Brief model and analysis from the implementation of particle filter 2.4.1 System Identificatio of single-degree-of-freedom (SDOF) oscillator This section presents the results from the implementation of particle filter for identifying the stiffness of a single-degree-of-freedom oscillator excited by Elcentro ground motion. The SIS filter have been used to solve this example. The measurement data has been synthetically generated by solving the forward problem by assuming known values of the system parameters. Once the synthetic measurements are known, the inverse problem is solved using time domain methods. A schematic diagram of the oscillator is shown in Fig 2.1a. The governing equation of motion of SDOF oscillator is given by the second order differential equation as: M ü(t) + C u̇(t) + Ku(t) = −M üg(t) (2.80) where , M is the Mass, C is the damping, K is the stiffness, üg(t) is the acceleration due to the ground motion.The stiffness of the SDOF oscillator was identified in this problem. The forward problem was solved using the β Newmark algorithm which is an implicit unconditionally stable time marching algorithm (Newmark, [100]). The ground excitation due to the Elcentro earthquake has been plotted in Fig 2.1b. The overall duration of the excitation is 40s. The time step considered in the analysis for solving the forward problem is 0.01 sec. Hence the total number of data points are 4000. The SIS filter was then applied to identify the stiffness value of the oscillator. The total number of particles considered are 50. The initial values of the stiffness are generated in the domain of [10000, 90000] from the uniform distribution. The algorithm is dependent on the parameter values generated at time t = 0. The identified value over the entire time history is shown in the Fig 2.0c. Hence, the algorithm acts as a filter and returns the best value among all the values generated at t = 0. The effect of domain dependency can be bypassed and the algorithm can be made more general by mutating the particles so obtained by adding a small Gaussian noise with a controlled 30 value of σ which can be obtained by several test run of the algorithm. For simplicity and clarity, only time history of 4 particles is given in Fig 2.0d. The evolution of the posterior density with time is given by Fig 2.-1e. The estimated states and the states of the original system are plotted in Fig 2.-1f. 2.4.2 System Identification of Linear Time Invariant (LTI) Synthetic model The problem of identifying the system parameters using the Bootstrap Particle filter algorithm has been considered here. The natural frequency is calculated by solving the Eigen value problem involving Mass and Stiffness Matrix. The general equation of motion for a linear system can be written as M ü(t) + C u̇(t) + Ku(t) = −M üg(t) (2.81) where, M is the mass matrix, C is the damping matrix, K is the stiffness matrix, üg(t) is the ground excitation and u(t), u̇(t) and ü(t) are respectively the displacement, velocity and acceleration of the nodes where the sensors are placed for recording. The system parameters to be identified are represented by ϕ. Here ϕ could represent parameters such as stiffness, damping, mass density etc. The above equation can be represented in continuous state space form as Ż(t) = τ (Z(t), ϕ(t), t) (2.82) where Ż(t) is the state vector of the vibrating system and τ (.) relates the state of the system to its first order derivative with respect to time. Now the problem in hand is to identify the system parameters ϕ. The particle filter algorithm simulates particles based on the updated posteriori distribution of the state. More samples are generated from the region where the likelihood is greater. To solve the problem for identification of parameters, ϕ, the state vector can be augmented as Xk = [Zk ϕk ] and assuming model noise as the sequence of i.i.d random variable wk , the above Eq 2.82 can be discretized in the form of Eq 2.82 The dimension of the problem is equal to sum of vector Z and ϕ. Hence one is able to identify the state vector as well as the parameters. In the system identification problem we are generally interested in identifying the system parameters rather than tracking the state of the system. (Nasrellah and Manohar, [54]) suggested that a larger computational effort can be reduced by formulating the problem in terms of system parameters . Hence,the systems which remain invariant with time, the system equation can be expressed as: dϕ =0 dt 31 (a) Schematic diagram of SDOF system (b) Ground excitation due to Elcentro earthquake 32 (c) Estimation of ratio of identified stiffness to original stiffness as function of time (d) Evolution of weights of particles over time 33 (e) Evolution of posterior density with time (f) States estimation from the original and the identified system Figure 2.-1: System identification of SDOF oscillator excited by Elcentro earthquake 34 ϕj(0) = ϕ0 j = 1, 2.........n(2.83)where ϕ0 is the value of the value of the system parameters at time t = 0. The discrete version of the equation can be presented as ϕk+1 = ϕk + wk (2.84) where ϕk is the system parameters at time k, wk is the model noise. The corresponding measurement equation can be written as Yk = hk (ϕk ) + vk (2.85) The Fig 2.1a shows the plan and elevation of three story shear building model kept in the Structural Engineering laboratory of Department of Civil Engineering, IIT Guwahati.Synthetic measurement from the known value of system parameters have been considered and then validated using Bootsrap filter algorithm. The lumped mass for the slab (600 × 300 × 10mm) has been calculated for solving the forward problem. The density of the steel taken is 8400 kg/m3. The classical damping matrix is obtained by considering the Rayleigh Damping, (Chopra,[?]). C = αM + βK (2.86) Here M ,K and C are the Mass, Stiffness and Damping matrix respectively, α and β are the coefficients. For the given model, the mass and the stiffness matrix is given as: m1 0 0 M = 0 m2 0 0 0 m3 k1 + k2 −k2 0 k2 + k3 −k3 K = −k2 0 −k3 k3 (2.87) where m1 , m2 and m3 are the lumped mass at the floor levels and k1 , k2 and k3 are the stiffness of the each story. The coefficients α and β can be obtained from specified damping ratios i and j for the it h and jt h mode respectively. If both the modes are assumed to have same damping ratios then arpita α=ζ (2.88) (2wi wj ) wi + wj 2 β = ζ wi +w j 35 Figure 2.0: Parameter values for solving the forward problem where wi and wj are the natural frequency of the system in it h and jt h modes respectively. The response of the model has been calculated using the values of the parameters given in Table 2.0 below. This is known as solving the forward problem. The response of the structure due to the ground motion excitation by the 1940 El-Centro Earthquake has been considered. The response of the model for the given excitation is shown in Fig 2.1b. The inverse problem starts with simulating random values from the uniform distribution, for the parameters which are to be identified, at time t = 0. The simulation of random values for stiffness and damping at all the floor levels is done firstly. The domain over which the stiffness values are simulated is between 10000 to 90000 N/m and the damping values are between 0 to 50 N − s/m. The number of values generated at t = 0are 100. This number remains constant for each and every iteration of the algorithm. Normal distribution has been used to calculate the likelihood or the weights of the particles with error covariance been chosen as 0.001. . Fig 2.1a shows the identified values of the stiffness using the Bootstrap filter. The standard deviation of the parameters has been plotted in Fig 2.1b. The statistical fluctuations die out once the parameters are identified and the standard deviation becomes zero. The robustness of the algorithm is clearly depicted in Fig 2.1b and Fig 2.1a which shows the mode shapes of the identified as well original system and the estimated states of the identified and original system for El-Centro earthquake. The study suggests that the bootstrap filter gives a good estimate of the stiffness values. A detailed discussion of the various techniques used to m Based on the previous discussions the problem statement for the present work is formulated, which are explained below. 36 (a) Plan and Elevation of Synthetic Model (b) Ground excitation due to Elcentro earthquake 37 (a) Ratio of identified stiffness to original stiffness: Elcentro earthquake (b) Standard deviation of stiffness: Elcentro earthquake 38 (a) Original and estimated states of model: El-Centro earthquake (b) Mode shape of the original and identified structure Figure 2.1: System identification of LTI System 39 Research proposal A review of selected topics in the area of structural system identification has been presented in the preceding sections. The focus of the review has been mainly on SSI methods based on dynamic state estimation methods and bayesian updating. . These methods have roots in Markov vector methods and Bayes’ theorem and provide a systematic framework to sequentially assimilate the measurements into mathematical models and they address several features of SSI problems in a unified manner. The basis of these methods lies in the exact functional recursive relations that the posterior pdf-s of the system states satisfy. These equations can be solved in an exact manner for a limited class of problems but are amenable for approximate solutions, via numerical methods, such as Monte Carlo simulation techniques, for, a more general class of problems. Some of the outstanding features of these methods are : (a) they operate in time domain and hence they are most suited for study on nonlinear SSI problems; (b) the method could also be suitably formulated to identify system parameters even when the measurements originates from static tests, (c) both process and measurement equations could be nonlinear; (d) issues related to specification of initial conditions can be taken care of; (e) spatial incompleteness of measurements does not pose any special difficulties; the method does not need any model reduction methods to be invoked to match measured and mathematically modeled dof-s; (f) imperfections in measurements and in mathematical modeling are explicitly taken care of by modeling these imperfections as white noise random processes;the governing equations can be interpreted as Ito- stochastic differential equations and the power of Ito’s calculus based approaches can be brought to bear on discretizing the mathematical models; (g) the method leads to probabilistic description of parameters to be identified and hence one could assess the confidence in the estimates obtained; (h) the dual problem of state and parameter estimation can be tackled; (i) the method could be applied in online or offline manners; and (j) the method can take advantage of modern sensing and computing facilities. The Kalman filter provides exact solution to the problem of DSE for linear state space 40 models with additive Gaussian noises. The method leads to exact recursive relations for evolution of posterior mean vector and covariance matrix. Ensemble Kalman filters are suited when number of system states becomes very large. Extended Kalman filter provides a conceptually simple approach to deal with nonlinearities in process and measurement equations. Here the system nonlinearities are linearized around the evolving expected trajectory of the system states. The method is ad hoc in nature and leads to biased estimates for system response moments. Unscented Kalman filter avoids linearization of the system nonlinearities but instead assumes Gaussian models for posterior pdf-s and deduces mean and covariance of system states with improved accuracy. Both the extended Kalman and unscented Kalman filters provide Gaussian models for posterior pdf-s and they assume additive Gaussian noise models. For more general class of problems, involving nonlinear state space models and multiplicative/additive non-Gaussian noises, the particle filtering methods provide systematic framework to tackle the state estimation problems. These methods are based on Monte Carlo simulation strategies and, hence, are computationally demanding; the existing computational power can however be exploited to obtain acceptable solutions. Various versions of particle filtering methods exist and are widely used in modern engineering including applications to tracking and signal processing applications. The use of these methods in structural engineering problems is of recent origin and their power has not yet been fully explored in the existing literature. The challenge here lies in combining particle filtering methods with commonly used structural modeling techniques, such as the finite element method, to develop suitable identification tools. When fully developed, these methods can become powerful tools for structural system identification based on response measurements under ambient loads such as wind, earthquake and traffic loads. 3.1 Gap Areas Based on the review of literature, we recognize the following as worthy of further research and to contribute to their solution: • The recently developed HMC algorithm has shown great potential in the Subset Simulation. Therefore, application of this algorithm in stochastic model updating problem is expected to improve the efficiency of searching the probability space which is yet to be studied. Further most of the studies involving Bayesian algorithms are focused on its performance using synthetic experiments and/or small-scale laboratory applications. Only a handful of studies are available for large-scale field structures. In particular, the application of HMC algorithm for stochastic simulation and load carrying capacity 41 of bridges has never been studied before. • Within the framework of dynamic state estimation methods, the problem of SSI and estimation of hidden states are often solved as dual problems. Questions on the possibility of avoiding state estimation and focusing only on system parameter identification require careful considerations. • There resides almost no studies in the literature under review to develop a Hamiltonian close from solution integrated with the Gaussian filters or particle filters to solve bridge health monitoring problems applied to existing bridge structures. Developing such scheme can be used to characterize fundamental modes and other associated parameters of an existing bridge system. • Within the framework of dynamic state estimation methods, the problem of SSI and estimation of hidden states are often solved as dual problems. Questions on the possibility of avoiding state estimation and focusing only on system parameter identification require careful considerations. • The recently developed HMC algorithm in the literature has shown great potential in the Subset Simulation. Therefore, application of this algorithm in model updating problem is expected to improve the efficiency of searching the probability space which is yet to be studied. The fusion of HMC methods and the state estimation filtering techniques can be utilized for reliability assessment of existing bridge systems. • The particle filtering methods are eminently suited for computation on parallel computers although very few researchers have explored this method for implementation on such computers. The identification procedures can be efficiently implemented if the particle filtering steps are embedded within professional FE softwares instead of interfacing through batch files. • Finite element methods are perhaps the most popular modeling tools that are currently being used in structural engineering practice. A host of professional softwares are readily available for simulating a wide range of structural behavior under static and dynamic loading situations. In studies on condition assessment of large structures, such as bridges, it would be most advantageous if the structural models are made to reside on readily available finite element softwares and the identification tools are designed to communicate with these models. To achieve this, we have to suitably formulate the identification algorithm and develop the necessary suite of softwares. 42 • Most of the studies in literature has been limited to static behavior of linear systems. Hence study for nonlinear static and dynamical systems w.r.t. synthetic models and existing structures is needed. One of the interesting questions in this context would be on identifying properties of degrading systems under severe dynamic loads. Such problems are of significance in earthquake engineering problems. • A very few literatures have been found where optimal sensor locations on the structure have been identified in time domain approaches for acquiring responses of the structure. The optimal number of sensors and their locations required for accurate identification of structural parameters and input force need further development. • The modern sensors, such as scanning laser vibrometres, distributed fibre optic sensors, wireless sensors in collaborative mode, and image based distributed strain measurement devices, provide measurements that are spatially extensive and their assimilation into mathematical model for the purpose of system identification pose interesting challenges. • Most of the studies in literature has assumed that the measurement and process noises are additive and have zero mean, stationary and Gaussian distributed with a specified covariance matrix. This assumption has been found to be not restrictive, especially when the identification procedures have been applied on experimental/field data. Notwithstanding this, a rational justification of the assumptions made on noise characteristics needs to be explored. One possibility would be to treat the parameters of the noise process themselves to be unknowns to be estimated as a part of the overall identification procedure. Also, statistical processing of electronic noise from measurement circuits would also enable to quantify a part of the model parameters. Characterizing modeling errors, however, would require substantial efforts. • In problems of bridge condition assessment, the moving vehicles serve as the simplest means to excite the structures. The identification of bridge parameters here requires the use of measurement data that originate from dynamic vehicle-structure interactions. Methods to achieve this, based on particle filtering strategies, are presently not been studied much in the existing literature. 3.2 Scope of Research A part of the present study will be conducted in the context of an ongoing research project at the Department of Civil Engineering, Indian Institute of Technology, Guwahati on in-situ 43 condition assessment of an existing bridge in West Bengal, India. The main objective of this part of the project will be to analyse the test data/measurements for in-situ condition assessment of this bridge, so that appropriate measures can be taken, if the need be. The emphasis of the present work will be mainly on time domain parametric methods for structural system identification. Moreover, the focus will be on methods that can be generalized to include structural nonlinearities, finite element models for structures, measurements on static and dynamic responses and imperfections in structural models and measurement noises. Methods based on dynamic state estimation offer promise to achieve these ends. Consequently, the review of literature on methods of SSI is basically limited to cover these methods. Within this framework, methods that are applicable to linear state space models and nonlinear systems are to be studied. 3.3 Objectives of Research The following mentioned problems will be addressed during the course of the study to develop generalised solutions: • Formulation of a closed form HMC solution to carry out stochastic simulation and model updating. • Develop a framework for Bayesian parametric state estimation using fusion of the closed form formulation and Gaussian filter for structural system identification. • Extending the closed form formulation for stochastic simulation using particle filter to solve load carrying capacity and rating factor for bridges. • Using the above developed schemes for Reliability assessment and service life estimation of existing bridge The present work aims to tackle the above-mentioned problems. A satisfactory demonstration of identification tools thus developed requires measurement data from laboratory based experiments and field studies. The work, accordingly, also aims to employ field data from studies on an existing bridge. 44 Time Frame 45 References [1] G Wea Housner, Lawrence A Bergman, T Kf Caughey, Anastassios G Chassiakos, Richard O Claus, Sami F Masri, Robert E Skelton, TT Soong, BF Spencer, and James TP Yao. Structural control: past, present, and future. Journal of engineering mechanics, 123(9):897–971, 1997. [2] J.-T. KIM and N. STUBBS. Improved damage identification method based on modal information. Journal of Sound and Vibration, 252(2):223 – 238, 2002. [3] U. Lee and Jinho Shin. A structural damage identification method for plate structures. Engineering Structures, 24:1177–1188, 09 2002. [4] Abhinav Subramanian and Sankaran Mahadevan. Bayesian estimation of discrepancy in dynamics model prediction. Mechanical Systems and Signal Processing, 123:351–368, 2019. [5] Oliver Nelles. Nonlinear dynamic system identification. In Nonlinear System Identification, pages 547–577. Springer, 2001. [6] G. Franco, R. Betti, and H. LuÅŸ. Identification of structural systems using an evolutionary strategy. Journal of Engineering Mechanics, 130(10):1125–1139, 2004. [7] Anne Raich and Tamas Liszkai. Improving the performance of structural damage detection methods using advanced genetic algorithms. Journal of Structural Engineeringasce - J STRUCT ENG-ASCE, 133, 03 2007. [8] K. M. Liew and Q. Wang. Application of wavelet theory for crack identification in structures. Journal of Engineering Mechanics, 124(2):152–157, 1998. [9] Keith Worden and Geoffrey R Tomlinson. Nonlinearity in experimental modal analysis. Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 359(1778):113–130, 2001. 47 [10] Chunxiao Bao, Hong Hao, Zhong-Xian Li, and Xinqun Zhu. Time-varying system identification using a newly improved hht algorithm. Computers Structures, 87(23):1611 – 1623, 2009. [11] Swarup Mahato, Meda Vinay Teja, and Arunasis Chakraborty. Combined wavelet– hilbert transform-based modal identification of road bridge using vehicular excitation. Journal of Civil Structural Health Monitoring, 7(1):29–44, 2017. [12] Julius S Bendat. Nonlinear systems techniques and applications. Wiley, 1998. [13] Zhenyu He, Xinge You, Long Zhou, Yiuming Cheung, and Jianwei Du. Writer identification using fractal dimension of wavelet subbands in gabor domain. Integrated Computer-Aided Engineering, 17(2):157–165, 2010. [14] Franck Schoefs, Humberto Yáñez-Godoy, and Francesca Lanata. Polynomial chaos representation for identification of mechanical characteristics of instrumented structures. Computer-Aided Civil and Infrastructure Engineering, 26(3):173–189, 2011. [15] H Li, Y Huang, WL Chen, ML Ma, DW Tao, and JP Ou. Estimation and warning of fatigue damage of frp stay cables based on acoustic emission techniques and fractal theory. Computer-Aided Civil and Infrastructure Engineering, 26(7):500–512, 2011. [16] Thomas Bayes and null Price. Lii. an essay towards solving a problem in the doctrine of chances. by the late rev. mr. bayes, f. r. s. communicated by mr. price, in a letter to john canton, a. m. f. r. s. Philosophical Transactions of the Royal Society of London, 53:370–418, 1763. [17] Y. Ho and R. Lee. A bayesian approach to problems in stochastic estimation and control. IEEE Transactions on Automatic Control, 9(4):333–339, 1964. [18] J. Spragins. A note on the iterative application of bayes’ rule. IEEE Transactions on Information Theory, 11(4):544–549, 1965. [19] Rudolph Emil Kalman. A new approach to linear filtering and prediction problems. Transactions of the ASME–Journal of Basic Engineering, 82(Series D):35–45, 1960. [20] Masaru Hoshiya and Etsuro Saito. Structural identification by extended kalman filter. Journal of Engineering Mechanics, 110(12):1757–1770, 1984. [21] Simon J Julier and Jeffrey K Uhlmann. New extension of the kalman filter to nonlinear systems. In Signal processing, sensor fusion, and target recognition VI, volume 3068, pages 182–193. International Society for Optics and Photonics, 1997. 48 [22] N.J. Gordon, D.J. Salmond, and A.F.M. Smith. Novel approach to nonlinear/nongaussian bayesian state estimation. Radar and Signal Processing, IEE Proceedings F, 140:107 – 113, 05 1993. [23] Andrew H Jazwinski. Mathematics in science and engineering. Stochastic processes and filtering theory, 64, 1970. [24] Peter S Maybeck, Any Form, Or Byany Means, Or Mechanical, et al. Stochastics models, estimation, and control: Introduction. 1979. [25] Thomas Kailath. Lectures on wiener and kalman filtering. In Lectures on Wiener and Kalman Filtering, pages 1–143. Springer, 1981. [26] R Brown and P Hwang. Introduction to random signals and applied kalman filtering (john wiley & sons, new york). 1992. [27] Charles K Chui, Guanrong Chen, et al. Kalman filtering. Springer, 2017. [28] Mohinder S Grewal and Angus P Andrews. ”kalman filtering: Theory and practice using matlab, ”john wiley &sons. New York, 2001. [29] Masanobu Shinozuka and Roger Ghanem. Structural system identification. ii: Experimental verification. Journal of Engineering Mechanics, 121(2):265–273, 1995. [30] Babak Moaveni, Xianfei He, Joel Conte, Jose Restrepo, and Marios Panagiotou. System identification study of a seven-story full-scale building slice tested on the ucsd-nees shake table. Journal of Structural Engineering, 137, 01 2011. [31] Subhamoy Sen and Baidurya Bhattacharya. Non-gaussian parameter estimation using generalized polynomial chaos expansion with extended kalman filtering. Structural Safety, 70:104–114, 2018. [32] SM Ulam and John Von Neumann. Random ergodic theorems. Bull. Amer. Math. Soc, 51:660, 1945. [33] Stanislaw Ulam, Robert D Richtmyer, and John von Neumann. Statistical methods in neutron diffusion. LAMS-551, Los Alamos National Laboratory, pages 1–22, 1947. [34] Nicholas Metropolis and S. Ulam. The monte carlo method. Journal of the American Statistical Association, 44(247):335–341, 1949. 49 [35] J. V. Candy, K. Sale, B. L. Guidry, E. Breitfeller, D. Manatt, D. Chambers, and A. Meyer. Bayesian processing for the detection of radioactive contraband from uncertain measurements. In 2007 2nd IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, pages 49–52, 2007. [36] Hisashi Tanizaki and Roberto S Mariano. Nonlinear filters based on taylor series expansions. Communications in Statistics-Theory and Methods, 25(6):1261–1282, 1996. [37] Hisashi Tanizaki. Nonlinear filters: estimation and applications. Springer Science & Business Media, 2013. [38] Arnaud Doucet, Nando De Freitas, and Neil Gordon. An introduction to sequential monte carlo methods. In Sequential Monte Carlo methods in practice, pages 3–14. Springer, 2001. [39] Jane Liu and Mike West. Combined parameter and state estimation in simulationbased filtering. In Sequential Monte Carlo methods in practice, pages 197–223. Springer, 2001. [40] Branko Ristic, Sanjeev Arulampalam, and Neil Gordon. Beyond the Kalman filter: Particle filters for tracking applications, volume 685. Artech house Boston, 2004. [41] Arnaud Doucet. On sequential simulation-based methods for bayesian filtering. 1998. [42] Jun S Liu and Rong Chen. Sequential monte carlo methods for dynamic systems. Journal of the American statistical association, 93(443):1032–1044, 1998. [43] Michael K Pitt and Neil Shephard. Filtering via simulation: Auxiliary particle filters. Journal of the American statistical association, 94(446):590–599, 1999. [44] Arnaud Doucet, Simon Godsill, and Christophe Andrieu. On sequential monte carlo sampling methods for bayesian filtering. Statistics and computing, 10(3):197–208, 2000. [45] Yukito Iba. Extended ensemble monte carlo. International Journal of Modern Physics C, 12(05):623–656, 2001. [46] Dan Crisan and Arnaud Doucet. A survey of convergence results on particle filtering methods for practitioners. IEEE Transactions on signal processing, 50(3):736–746, 2002. 50 [47] Olivier Cappé, Simon J Godsill, and Eric Moulines. An overview of existing methods and recent advances in sequential monte carlo. Proceedings of the IEEE, 95(5):899–924, 2007. [48] Adam M Johansen and Arnaud Doucet. A note on auxiliary particle filters. Statistics & Probability Letters, 78(12):1498–1504, 2008. [49] Simon Godsill. Particle filtering: the first 25 years and beyond. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 7760–7764. IEEE, 2019. [50] Zhe Chen. Bayesian filtering: From kalman filters to particle filters, and beyond. Statistics, 182, 01 2003. [51] Sebastian Thrun. Probabilistic robotics. Commun. ACM, 45(3):52–57, March 2002. [52] Jianye Ching, James L. Beck, and Keith A. Porter. Bayesian state and parameter estimation of uncertain dynamical systems. Probabilistic Engineering Mechanics, 21(1):81 – 96, 2006. [53] Cs Manohar and Debasish Roy. Monte carlo filters for identification of nonlinear systems. Sadhana, 31:399–427, 08 2006. [54] H.A. Nasrellah and C.S. Manohar. Finite element method based monte carlo filters for structural system identification. Probabilistic Engineering Mechanics, 26(2):294 – 307, 2011. [55] B Radhika and CS Manohar. Nonlinear dynamic state estimation in instrumented structures with conditionally linear gaussian substructures. Probabilistic engineering mechanics, 30:89–103, 2012. [56] Subhamoy Sen, Antoine Crinière, Laurent Mevel, Frédéric Cérou, and Jean Dumoulin. Seismic-induced damage detection through parallel force and parameter estimation using an improved interacting particle-kalman filter. Mechanical Systems and Signal Processing, 110:231–247, 2018. [57] Vikas Namdeo and C.S. Manohar. Nonlinear structural dynamical system identification using adaptive particle filters. Journal of Sound and Vibration, 306(3):524 – 563, 2007. [58] Debraj Ghosh and Charbel Farhat. Strain and stress computations in stochastic finite element methods. International Journal for Numerical Methods in Engineering, 74(8):1219–1239, 2008. 51 [59] R. Sajeeb, C.S. Manohar, and D. Roy. Use of particle filters in an active control algorithm for noisy nonlinear structural dynamical systems. Journal of Sound and Vibration, 306(1):111 – 135, 2007. [60] HA Nasrellah and CS Manohar. A particle filtering approach for structural system identification in vehicle–structure interaction problems. Journal of Sound and Vibration, 329(9):1289–1309, 2010. [61] B Radhika and CS Manohar. Dynamic state estimation for identifying earthquake support motions in instrumented structures. Earthquakes and Structures, 5(3):359– 378, 2013. [62] Subhamoy Sen, Neha Aswal, Qinghua Zhang, and Laurent Mevel. Structural health monitoring with non-linear sensor measurements robust to unknown non-stationary input forcing. Mechanical Systems and Signal Processing, 152:107472, 2021. [63] P. Rangaraj, Abhijit Chaudhuri, and Sayan Gupta. Polynomial Chaos in Bootstrap Filtering for System Identification, pages 653–667. 01 2013. [64] Simon Rouchier, Maria José Jiménez, and Sergio Castaño. Sequential monte carlo for on-line parameter estimation of a lumped building energy model. Energy and Buildings, 187:86–94, 2019. [65] Tijana Radivojević and Elena Akhmatskaya. Modified hamiltonian monte carlo for bayesian inference. Statistics and Computing, 30(2):377–404, 2020. [66] M. Arulampalam, Simon Maskell, Neil Gordon, and Tim Clapp. A tutorial on particle filters for online nonlinear/non-gaussian bayesian tracking. Signal Processing, IEEE Transactions on, 50:174 – 188, 03 2002. [67] Tiancheng Li, Miodrag Bolic, and Petar Djuric. Resampling methods for particle filtering: Classification, implementation, and strategies. Signal Processing Magazine, IEEE, 32:70–86, 05 2015. [68] Simon Duane, Anthony D Kennedy, Brian J Pendleton, and Duncan Roweth. Hybrid monte carlo. Physics letters B, 195(2):216–222, 1987. [69] Radford M Neal et al. Mcmc using hamiltonian dynamics. Handbook of markov chain monte carlo, 2(11):2, 2011. [70] Ziqi Wang, Marco Broccardo, and Junho Song. Hamiltonian monte carlo methods for subset simulation in reliability analysis. Structural Safety, 76:51 – 67, 2019. 52 [71] Lambros S Katafygiotis, Costas Papadimitriou, and Heung-Fai Lam. A probabilistic approach to structural model updating. Soil Dynamics and Earthquake Engineering, 17(7-8):495–507, 1998. [72] LS Katafygiotis, NC Mickelborough, and KV Yuen. Statistical identification of structural modal parameters using ambient response data. In Proceedings 3rd International Conference on Computational Stochastic Mechanics, pages 93–102, 1998. [73] James L Beck and Lambros S Katafygiotis. Updating models and their uncertainties. i: Bayesian statistical framework. Journal of Engineering Mechanics, 124(4):455–461, 1998. [74] James L Beck and Ka-Veng Yuen. Model selection using response measurements: Bayesian probabilistic approach. Journal of Engineering Mechanics, 130(2):192–203, 2004. [75] Iman Behmanesh and Babak Moaveni. Probabilistic identification of simulated damage on the dowling hall footbridge through bayesian finite element model updating. Structural Control and Health Monitoring, 22(3):463–483, 2015. [76] Iman Behmanesh, Babak Moaveni, Geert Lombaert, and Costas Papadimitriou. Hierarchical bayesian model updating for structural identification. Mechanical Systems and Signal Processing, 64:360–376, 2015. [77] I Boulkaibet, L Mthembu, T Marwala, MI Friswell, and S Adhikari. Finite element model updating using the shadow hybrid monte carlo technique. Mechanical Systems and Signal Processing, 52:115–132, 2015. [78] Sai Hung Cheung and Sahil Bansal. A new gibbs sampling based algorithm for bayesian model updating with incomplete complex modal data. Mechanical Systems and Signal Processing, 92:156–172, 2017. [79] Sai Hung Cheung and James L Beck. Bayesian model updating using hybrid monte carlo simulation with application to structural dynamic models with many uncertain parameters. Journal of engineering mechanics, 135(4):243–255, 2009. [80] Siddhartha Chib and Edward Greenberg. Understanding the metropolis-hastings algorithm. The american statistician, 49(4):327–335, 1995. [81] Jianye Ching and Yi-Chu Chen. Transitional markov chain monte carlo method for bayesian model updating, model class selection, and model averaging. Journal of engineering mechanics, 133(7):816–832, 2007. 53 [82] Michael Friswell and John E Mottershead. Finite element model updating in structural dynamics, volume 38. Springer Science & Business Media, 2013. [83] Heikki Haario, Eero Saksman, Johanna Tamminen, et al. An adaptive metropolis algorithm. Bernoulli, 7(2):223–242, 2001. [84] François M Hemez and Scott W Doebling. Review and assessment of model updating for non-linear, transient dynamics. Mechanical Systems and Signal Processing, 15(1):45–74, 2001. [85] Maryam Kalantarnia, Faisal Khan, and Kelly Hawboldt. Dynamic risk assessment using failure assessment and bayesian theory. Journal of Loss Prevention in the Process Industries, 22(5):600–606, 2009. [86] James L Beck and Siu-Kui Au. Bayesian updating of structural models and reliability using markov chain monte carlo simulation. Journal of engineering mechanics, 128(4):380–391, 2002. [87] Hoon Sohn and Kincho H Law. Application of load-dependent ritz vectors to bayesian probabilistic damage detection. Probabilistic engineering mechanics, 15(2):139–153, 2000. [88] Michael W Vanik, James L Beck, and SK2000 Au. Bayesian probabilistic approach to structural health monitoring. Journal of Engineering Mechanics, 126(7):738–745, 2000. [89] George Kuczera and Eric Parent. Monte carlo assessment of parameter uncertainty in conceptual catchment models: the metropolis algorithm. Journal of hydrology, 211(14):69–85, 1998. [90] Christophe Andrieu and Johannes Thoms. A tutorial on adaptive mcmc. Statistics and computing, 18(4):343–373, 2008. [91] Ka-Veng Yuen, James L Beck, and Siu Kui Au. Structural damage detection and assessment by adaptive markov chain monte carlo simulation. Structural Control and Health Monitoring, 11(4):327–347, 2004. [92] Feng Qian and Wei Zheng. An evolutionary nested sampling algorithm for bayesian model updating and model selection using modal measurement. Engineering Structures, 140:298–307, 2017. 54 [93] Heung-Fai Lam, Jiahua Yang, and Siu-Kui Au. Bayesian model updating of a coupledslab system using field test data utilizing an enhanced markov chain monte carlo simulation algorithm. Engineering Structures, 102:144–155, 2015. [94] Roberto Rocchetta, Matteo Broggi, Quentin Huchet, and Edoardo Patelli. On-line bayesian model updating for structural health monitoring. Mechanical Systems and Signal Processing, 103:174–195, 2018. [95] Isambi S Mbalawata, Simo Särkkä, and Heikki Haario. Parameter estimation in stochastic differential equations with markov chain monte carlo and non-linear kalman filtering. Computational Statistics, 28(3):1195–1223, 2013. [96] Francois Septier and Gareth W Peters. Langevin and hamiltonian based sequential mcmc for efficient bayesian filtering in high-dimensional spaces. IEEE Journal of selected topics in signal processing, 10(2):312–327, 2015. [97] Shubham Baisthakur and Arunasis Chakraborty. Modified hamiltonian monte carlobased bayesian finite element model updating of steel truss bridge. Structural Control and Health Monitoring, 27(8):e2556, 2020. [98] Hamed Nikbakht and Konstantinos G Papakonstantinou. A direct hamiltonian mcmc approach for reliability estimation. arXiv preprint arXiv:1909.03575, 2019. [99] Liye Zhang, Lijuan Dong, Shoushan Cheng, Wanheng Li, Bingjian Wang, Hanyong Liu, and Ke Chen. Efficient reliability assessment method for bridges based on markov chain monte carlo (mcmc) with metropolis-hasting algorithm (mha). In IOP Conference Series: Earth and Environmental Science, volume 580, page 012030. IOP Publishing, 2020. [100] Nathan M Newmark. A method of computation for structural dynamics. Journal of the engineering mechanics division, 85(3):67–94, 1959. 55