Estimating the Change Point of Binary Profiles with a Linear Trend Disturbance Alireza Sharafi (corresponding author) M.S.C. Department of Industrial Engineering, Amirkabir University. a.sharafi@aut.ac.ir, Majid Aminnayeri Associate Prof. Department of Industrial Engineering, Amirkabir University. mjnayeri@aut.ac.ir, Amirhossein Amiri Assistant Professor. Department of Industrial Engineering, Faculty of Engineering, Shahed University. amiri@shahed.ac.ir, Mohsen Rasouli M.S. student. Department of Industrial Engineering, Isfahan University of Technology. mohsen.rasouli@in.aut.ac.ir, sooner and corrective action can be implemented more quickly. Identifying the real time of the process change is known as change-point estimation problem. Abstract Identification of a real time of a change in a process, when an out-of-control signal is present is significant. This may reduce costs of defective products as well as the time of exploring and fixing the cause of defects. Another popular topic in the Statistical Process Control (SPC) is profile monitoring, where knowing the distribution of one or more quality characteristics may not be appropriate for discussing the quality of processes or products. One, rather, uses a relationship between a response variable and one or more explanatory variable for this purpose. In this paper, the concept of Maximum Likelihood Estimator (MLE) applied to estimate of the change point in binary profiles, when the type of change is drift. Simulation studies are provided to evaluate the effectiveness of the change point estimator. Change point problems are mainly classified according to change types including step, drift and isotonic shifts. To find the real time of a change, many authors have suggested several methods such as Maximum Likelihood Estimator (MLE), Cumulative Sum (CUSUM), Exponentially Weighted Moving Average (EWMA) and intelligent method, (artificial network, clustering and decision tree). For example Pignatiello and Samuel [1- 4] proposed a Maximum Likelihood Estimator in different control charts to find the real time of change point under a step shift. Perry and Pignatiello [5, 6] used this method to estimate the change point in the mean of normal and poisson distribution with a linear trend disturbance, respectively. Keywords: binary profiles; change point; maximum In the area of SPC, usually the quality of a process or a product was represented by the distribution of one (or more) quality characteristic and monitored by univariate (or multivariate) control chart. However, Kang and Albin [7] presented a popular topic with widespread application in SPC namely, profiles monitoring, in which a relationship between a response variable and one or more independent variables, known as profiles is monitored over time. Scientists classify profiles based on the type of relationships into the linear (simple, multiple and polynomial), nonlinear and geometric profiles. Profiles are studied in Phases I and II. In Phase I, Mestek et al. likelihood estimator (MLE); statistical process control (SPC), control chart 1- Introduction Statistical Process Control (SPC) is used to monitor and reduce the variation of a process. Control charts are the most important tools in SPC which are used to monitor quality characteristics. These charts can detect any changes or shifts in a process; however, a shift usually occurs much earlier before it is detected. When a control chart signals a special cause, quality engineers should identify and remove the cause(s) of variation and return the process to in-control state. However, knowing when a process has changed would help quality engineers to limit the time window within which they should search for assignable causes. Consequently, the assignable causes can be identified 2 [8] used a T based control charts to monitor the regression parameters of a simple linear profiles. Mahmoud and Woodall [9] used a global F-test for monitoring the regression coefficients in conjunction with a univariate control chart to monitor the variance. Mahmoud et al. [10] applied a likelihood ratio test to monitor simple linear profiles. In Phase II, Kang and 1 Albin [7] proposed two methods including a 2- Logistic Regression Model 2 multivariate T chart and a combination of EWMA-R chart to monitor simple linear profiles. Kim et al. [11] recommended three independent univariate EWMA control charts to monitor shifts in parameters of a simple linear profile. Many categorical response variables have only two categories: for example, whether you take public transportation today (yes, no), or whether you have had a physical exam in the past year (yes, no). Denote a binary response variable by y and E ( y ) . The value of can vary as the value of x changes, and we replace by ( x ) . For n independent observations, the number of successes has the binomial distribution specified by the index n and parameter . The relationships between ( x ) and Identifying the change point in profiles has been studied by some researchers. Mahmoud et al. [10] used a likelihood ratio based method to identify the real time of a step change in Phase I monitoring of a simple linear profiles. Zou et al. [12] proposed a method based on likelihood ratio statistics to find the change point in parameters of a simple linear profile in Phase II. Kazemzadeh et al. [13] used the same method to estimate the change point in polynomial profiles under a step shift in Phase I. Note that the change point problem in profile data is under a different sampling framework from that of the other models. In the profile applications, multiple data sets are collected over time in a functional data sampling framework and a possible change point is take place after any data set. x are usually nonlinear and the S -shaped curves are often realistic shapes for this relationship which are called the logistic regression models. In a logistic regression model, there are n independent experimental sets, with p predictor variables for each set, which is shown by xi ( xi1 , xi 2 ,, xip )T as well as corresponding Bernoulli response variables namely zi for i 1, 2,..., n . The probability of success in Most researches in profile monitoring, assume the response variable is continuous (usually Normal) and characterize profiles with linear or nonlinear models. However, in many industrial applications the response variable is discrete such as binary (as in the case of a product can be classified as defective or non defective) or countable (as the number of defect products or number of patients in a hospital). Yeh et al. [14] studied binary profiles in Phase I. They proposed each set is denoted by i and each i is a function of xi . In the logistic regression model this function is characterized by the link function g i , defined as g i 2 different T control charts for monitoring logistic regression profiles. Shang et al. [15] proposed a control scheme based on EWMA-GLM to monitor the relationship between the binary response and random explanatory variables in Phase II. Their approach assumes that explanatory variables are random variable and takes different values in each profile sample. To the best of our knowledge, Only Sharafi et al. [16] suggested a method to identify the real time of a step change in Phase II monitoring of binary profiles and there is no more researches in this area. In this paper we propose a Maximum Likelihood Estimator to find the real time of a change in Phase II monitoring of logistic regression profiles. Here, we assumed that the type of the disturbance is drift. The rest of the paper is organized as follows: Section 2 illustrates logistic regression model and explains the steps of estimating the model parameters. Section 3 presents the change point model and assumptions of the problem. The performance of the proposed model is investigated in Section 4. Conclusions and some future researches are provided in the final section. log i 1 log i 1 xi1 2 xi 2 ... p xip where 1 , 2 ,..., p T (1) is the model parameters vector. It is usual to set xi1 1 in order that 1 be the intercept of the model. Equation (1) simplifies to i exp( xiT ) 1 exp( x ) T i exp(i ) 1 exp(i ) T In this equation i xi (2) . p x k ik . We also assume k 1 that data are grouped so that for the i th setting of the predictor variables, there are i 1,2,..., n . mi observations, n M mi is total number of i 1 mi observations. The response variable is yi zik , k 1 where 2 zik is the k th observation (0 or 1) in i th yi follows a binomial distribution with parameters mi , i . Albert and predictor variable settings, Initialize the estimate of ˆ (0) . Set i 0 Anderson [17] proposed MLE method to estimate the model parameters for this purpose. They used the following likelihood function: m y m y L (, y) i i i 1 i i i , i 1 y i ˆ (i ) Calculate ˆ (i ) , ˆ (i ) , ˆ (i ) and W (i ) ˆ (i ) ˆ (i ) W qˆ n (3) Where 1 , 2 ,..., n and y y1 , y2 ,..., yn Taking the logarithm of the Equation (3) and using i xiT i p x log k ik k 1 1 i ˆ (i ) X ˆ ( i 1) XT W l (, y ) mi n ( i 1) ˆ p i Yes (4) (i) ˆ ˆ Figure I. The procedure of estimation of logistic regression parameters Taking derivative of Equation (4) with respect to , and using iterative weighted least square estimation method suggested by McCullagh and Nelder [18], the logistic regression parameters can be estimated as follows: ˆ XT WX In the ˆ XT Wq Equation 3- MLE Change Point Estimator Here, it is assumed that the underlying process initially operates in a state of statistical control, with observations coming from a poisson distribution with the known parameters 0 ( is a p-dimensional (5) (5) X x1 , x2 ,..., x n , T ˆ diag m ˆ 1 ˆ , m ˆ 1 ˆ ,..., m ˆ 1 ˆ , W 1 1 1 2 2 2 n n n vector); ˆ 1 y ˆ and ˆ m ˆ , m ˆ ,..., m ˆ . qW 1 1 2 2 n n f yij T The procedure iterations is described in the figure I. McCullagh and Nelder [18] proved that as large or mi is constant, is 1 so, the mass probability mi y m y i ij 1 i i ij where y ij function is yij is the value taken by the response variable for the i th value of the predictor variable in the j th profile. n becomes distributed The change type is also assumed to be drift or linear trend change in the parameter i . Consider an in- asymptotically as a p-dimensional normal distribution N p , XT WX No 1 ˆ ( i )q ( i ) XT W k ik p mi log 1 exp k xik , i 1 k 1 1 i 1 k 1 i n (i ) ( i 1) ˆ ˆ log y y x i 1 (i ) , one can reexpress the log-likelihood as n 1 Update the estimation of i i 1 T T y ˆ . This procedure will be used in control process with independent observations coming from a binomial distribution with parameters mi and MLE change point estimator described in Section 3. i . After elapsing an unknown amount of time, the parameter i changes from its in control state of i 0 to an unknown out-of-control state of i1 , i1 i0 , and the function of i1 is given as i1 i 0 b j 3 j 1, 2,..., T , (6) where Ln L ( , b y ) b is the slope (or magnitude) of the linear trend i . T During the formulation of profiles j 1,2,..., the process parameter i become larger with linear trend some unknown value i1 T b and equal to i1 b j 1 b j j 1 i 1 i1 yij 1 i1 b j n n mi (7) n j 1 i 1 T i0 1 i 0 n j 1 i 1 n m Ln 1 i j 1 i 1 i0 (10) b j yij Ln i1 j 1 i 1 1 i1 b j T n T b j (8) i1 b j mi Ln 1 i 0 yij Ln 1 i1 j 1 i 1 b , and denote it as b̂ . ˆ arg max { yij Ln mi n yij Ln i 0 1 i0 yij j 1 i 1 T j b j i1 Substituting b for b in Equation (8) and computing Equation (8) over all possible change points in search of the maximum log-likelihood function yields Ln L( , b y ) Ln j 1 i 1 i MLE of the rate parameter, The MLE of is the value of that maximizes the likelihood function in Equation (7). Taking the logarithm of Equation (7) n m 1 In Equation (9) there is no closed-form solution for b . Perry and Pignatiello [6] proposed the Newton’s method (also known as the Newton–Raphson method) to find the MLE of the rate parameter, b . Newton’s method is one of the Numerical analysis which is used to find successively better approximations to the roots of a function. We use Newton’s method to find the n n yij j 1 i 1 i 0 b j , where mi n i 0 mi L( , b y ) yij 1 1 i 0 j 1 i 1 i0 j 1 i 1 T j 1 i1 b j i1 b j (9) T is the last profile sampled in which the control chart signaled an out-of-control state. Two unknown parameters in model are and b , representing the last profile taken from an in-control process and the rate of the linear trend, respectively. To estimate these unknown parameters, this paper used a MLE approach, similar to Perry and Pignatiello [6]. We denote the MLE estimator of the change point as ˆ . The likelihood function is as follows: n j 1 i 1 is equal to its known in-control value. For profiles j 1, 2,..., T the parameter i b disturbance in n m Ln 1 j 1 i 1 i i1 b j } where ˆ is the MLE for the last profile taken from the in-control process. Whenever this chart signals an outof-control state, the real time of a change can be estimated via Equation (10). n mi Ln 1 i1 b j , j 1 i 1 To determine the unknown parameters in Equation (8), ( and b ), an expression is required for b in terms of which maximize the log-likelihood function in Yeh et al. [14] introducted five Hotelling T control charts to monitor binary profiles in Phase I. The plotting statistic for profile j is defined as Equation (8), defined as b . Obtaining this expression is not a trivial task. The partial derivative of Equation (8) with respect to b is given by T j2 j 2 T S 1 j , (11) where ˆ j is estimator of the logistic regression parameters in j th profile and S is the variance covariance matrix of ˆ j . Any of these T 2 charts uses a different method to estimate the mean vector and 4 Now, suppose an out-of-control process whose parameter vector i shifts from i 0 to 2 covariance matrix. They showed TI control chart which estimate the covariance matrix by averaging the covariance estimates of each given sample, provides more effective way to detect step shifts and drift. These control charts also can be used in Phase II. So i1 i 0 b j . A Monte Carlo simulation study is accomplished to examine the performance of the estimator. The change point is simulated at profile 50 ( 50 ). For profiles j 1, 2,...,50 the independent observations is produced by binomial distribution with parameters 50 and i 0 . Starting at 2 we used TI control chart to detect the out-of-control state in Phase II. Covariance matrix in Phase II is computed using the following equation: ( XT WX ) 1 , where profile 51, observations are simulated from the out-of- (12) W is equal control process with i1 to i 0 b j 50 until the diag{m 1 (1 1 ), m 2 (1 2 ),, m n (1 n )} TI2 chart signals an out-of-control. At this time, the . The upper control limit for the proposed control chart change point estimator is used and the real time of the process change is determined. This procedure is repeated 10,000 times for each magnitude of the change under study, which is described in the figure II. is also equal to 2, ,the percentile points on the 2 chi-square distribution with 2 degrees of freedom. The results are summarized in Tables I and II and Figure III. Table I shows the expected length of each simulation run E T which is the expected value of 4- Performance of the MLE Estimator To estimate the performance of the MLE estimator, an example is given. In this example, number of explanatory variables is equal to 2, (p=2). Thus the link function is simplified as g i 1 2 xi , the number of samples taken until the first alarm is given by the control chart, i.e. E T ARL 50 . Table I also shows the average change point estimate and the standard deviation of the change point estimator under different magnitude of the shifts considered. Because the actual change is at time 50, the average change point estimate, , should be close to 50. where 1 and 2 are the intercept and the slope of the regression function respectively and is shown by the vector 1 , 2 T We set the matrix X as: T 1 ... 1 1 X = , log 0.1 log 0.2 ... log 0.9 It is assumed that the number of experiments in each predictor variable is constant and equal to 50 , Produce 50 independent incontrol profiles with the vector of i 0 ( mi 50 for i 1,2,...,9 ) and the in-control i 0 is 0.04,0.06,0.13,0.27,0.48,0.68,0.82,0.89, 0.91T . initial is estimated as 2.5, 3.46 T from historical 2 2,0.005 10.59 . i The dataset in Phase I. The upper control limit for the TI control chart is equal to Make a change in the vector of 10,000 times repeat 2 The covariance matrix of the logistic regression parameters ( ) in Phase II is computed by Equation (12) as follows: Estimate the logistic parameters and depict control chart until issuing an outof control signal Calculate for all profiles Determine the largest as the change point Figure II. The flowchart of the simulation procedure (X T WX) 1 0.06627 0.07693 0.07693 0.1179 5 Table I. Expected number of samples until the signal and standard deviations of the change point estimators with 10,000 simulations runs when 50 b E T se(ˆ) 0.003 56.13 51.82 2.39 0.005 55.01 51.21 1.73 0.007 53.86 50.86 1.32 0.01 53.02 50.58 1.01 0.015 52.43 50.44 0.74 0.02 52.04 50.29 0.51 0.025 51.72 50.21 0.41 Figure III. The performance of the MLE estimator Table II shows the results of proportion of 10,000 simulation runs that the estimator lies within a specified tolerance of the real change point value. For example if b 0.01, the estimated probability that ˆ lies within 1 or less from the real change point is 0.81. Also in this case, in 46% of the simulation runs the estimator correctly identifies the real time of the change. As shown in Table I, for drift rate parameter equal to 0.003 the expected number of samples taken until the signal is 56.13, the average change point estimate is 51.82 and the standard deviation of the change point estimator is 2.39. Moreover, the performance of the estimator improves significantly with increases in magnitude of the drift rate parameter b . Figure III also indicates this issue. As perceived from this figure, with increasing b , the expected number of samples taken until the signal and average change point estimator are becoming near to 51 and 50, respectively. Furthermore, the standard deviation of the change point estimator, that is shown shady, is becoming smaller. Table II shows that the performance of the MLE estimator improves significantly with increases in magnitude of the shift in the slope, b . Table II. Estimated precision performances over a range of i with 10,000 simulations runs when 50 b 0.003 0.005 0.007 0.01 0.015 0.02 0.025 p̂ ˆ 0 0.21 0.30 0.36 0.46 0.57 0.72 0.83 p̂ ˆ 1 0.47 0.59 0.70 0.81 0.85 0.97 0.99 p̂ ˆ 2 0.64 0.81 0.89 0.94 0.99 1.00 1.00 p̂ ˆ 3 0.79 0.92 0.98 1.00 1.00 p̂ ˆ 4 0.90 0.98 1.00 p̂ ˆ 5 0.97 1.00 p̂ ˆ 6 1.00 6 [10] Mahmoud, M. A., Parker, P. A., Woodall, W. H., Hawkins, D. M. (2007) A change point method for linear profile data. Quality and Reliability Engineering International, 23(2), 247-268. 5- Conclusions In this paper, we provided the MLE method to estimate the change point in phase II monitoring of logistic regression profiles, when the type of change is drift. The results of this study showed that the performance of the proposed estimator is fine to identify the real time of change point under different magnitude of drift rate. [11] Kim, K., Mahmoud, M. A., Woodall, W. H. (2003). On the monitoring of linear profiles. Journal of Quality Technology, 35(3), 317-328. Developing this method to the other distributions of the exponential family such as Poisson and Gama would be future researches in this area. Furthermore the other types of the change, including step changes and isotonic changes could be investigated by researchers. [12] Zou C, Zhang Y, Wang Z. Control chart based on change point model for monitoring linear profiles. IIE Transactions 2006; 38(12), 1093-1103. References [14] Yeh, B., Huwang, L., Li, Y. M. (2009). Profile monitoring for a binary response. IIE Transaction, 41(11), 931-941. [13] Kazemzadeh, R. B., Noorossana, R., Amiri, A. (2008). Phase I monitoring of polynomial profiles. Communications in Statistics - Theory and Methods, 37(10), 1671-1686. [1] Pignatiello, J. J., Samuel, T. R. (1998a). Identifying the time of a step change with X control charts. Journal of Quality Engineering, 10(3), 521-527. [15] Shang, Y., Tsung, F. and Zou, C. (2011). Phase II profile monitoring with binary data and random predictors. Journal of Quality Technology, 43(3), 196-208. [2] Pignatiello, J. J., Samuel, T. R. (1998b). Identifying the time of a step change in a normal process variance. Journal of Quality Engineering, 10(3), 529-538. [16] Sharafi, A., Aminnayeri, M. and Amiri, A. (2012). Identifying the time of step change in binary profiles. To appear in the International Journal of Advanced Manufacturing Technology, DOI: 10.1007/s00170-0123899-4. [3] Pignatiello, J. J., Samuel, T. R. (1998c). Identifying the time of a step change in a poisson rate parameter. Journal of Quality Engineering, 10(3), 673-681. [4] Pignatiello, J. J., Samuel, T. R. (2001). Estimation of the change point of a normal process mean in SPC applications. Journal of Quality Technology, 33(1), 82-95. [17] Albert, A., Anderson, J. A. (1984). On the existence of maximum likelihood estimates in logistic regression models. Biometrika, 71(1), 1-10. [5] Perry, M. B., Pignatiello, J. J. (2005). Estimation of the change point of a poisson rate parameter with a linear trend disturbance. Quality and Reliability Engineering International, 22(4), 371-384. [18] McCullagh, P., Nelder, J. A. (1989). Generalized linear models, 1nd ed. London, UK: Chapman & Hall. [6] Perry, M. B., Pignatiello, J. J. (2006). Estimation of the change point of a normal process mean with a linear trend disturbance. Quality Technology and Quantitative Management, 3(3), 101-115. [7] Kang, L., Albin, S. L. (2000). Online monitoring when the process yields a linear profile. Journal of Quality Technology, 32(4), 418-426. [8] Mestek, O., Pavlik, J., Suchnek, M. (1994). Multivariate control charts: Control charts for calibration curves. Freseniusí Journal of Analytical Chemistry, 350, 344-351. [9] Mahmoud, M. A., Woodall, W. H. (2004). Phase I analysis of linear profiles with calibration applications. Technometrics, 46(4), 380-391. 7