On a Robust Test for SETAR-Type Nonlinearity in Time Series Analysis

Journal of Forecasting J. Forecast. 28, 445–464 (2009) Published online 7 January 2009 in Wiley InterScience (www.interscience.wiley.com) DOI: 10.1002/for.1122 On a Robust Test for SETAR-Type Nonlinearity in Time Series Analysis KING CHI HUNG,1 SIU HUNG CHEUNG,1,2 WAI-SUM CHAN1* AND LI-XIN ZHANG3 1 2 3 Chinese University of Hong Kong, Hong Kong National Cheng Kung University, Taiwan Zhejiang University, PR China ABSTRACT There has been growing interest in exploiting potential forecast gains from the nonlinear structure of self-exciting threshold autoregressive (SETAR) models. Statistical tests have been proposed in the literature to help analysts check for the presence of SETAR-type nonlinearities in observed time series. However, previous studies show that classical nonlinearity tests are not robust to additive outliers. In practice, time series outliers are not uncommonly encountered. It is important to develop a more robust test for SETAR-type nonlinearity in time series analysis and forecasting. In this paper we propose a new robust nonlinearity test and the asymptotic null distribution of the test statistic is derived. A Monte Carlo experiment is carried out to compare the power of the proposed test with other existing tests under the influence of time series outliers. The effects of additive outliers on nonlinearity tests with misspecification of the autoregressive order are also studied. The results indicate that the proposed method is preferable to the classical tests when the observations are contaminated with outliers. Finally, we provide illustrative examples by applying the statistical tests to three real datasets. Copyright © 2009 John Wiley & Sons, Ltd. key words additive outliers; GM estimation; nonlinearity tests; robustness; threshold time series INTRODUCTION Owing to the deficiency of linear models in capturing some commonly observed features of time series data, many nonlinear time series models have been proposed in the literature. One popular class is the self-exciting threshold autoregressive model (SETAR). It was first proposed by Tong (1978) and is discussed in detail in Tong (1983, 1990). If a time series Yt follows the SETAR(k; p; d) model, then * Correspondence to: Wai-Sum Chan, Department of Finance, The Chinese University of Hong Kong, Shatin, Hong Kong. E-mail: chanws@cuhk.edu.hk Copyright © 2009 John Wiley & Sons, Ltd. 446 K. C. Hung et al. p Yt = φ0( j) + ∑ φi( j)Yt − i + ε t if rj −1 ≤ Yt − d < rj (1) i =1 i.i.d. N (0,s2); and s2 < ∞. where j = 1,2, . . . , k, t = p + 1, . . . , n, d ≤ p are both positive integers, et ∼ Parameters p and d denote the autoregressive (AR) order and the delay parameter, respectively. Furthermore, k is the number of regimes, rjs are the threshold parameters such that −∞ = r0 < r1 < . . . < rk = ∞, and n is the number of observations. In this way, different regimes possess different (s) AR(p) models. If f (j) i = f i for all i = 0,1, . . . , p and j ≠ s = 1, 2, . . . , k, then the model reduces to a linear AR(p) process. A more generalized version of the SETAR model allows different orders of AR models inside different regimes. With the introduction of threshold parameters, rjs, the process switches among different linear autoregressive models. This class of piecewise linear models can effectively capture jump phenomena, amplitude-dependent frequency, and limit cycles. In application, testing problems for the SETAR model against a linear model has aroused considerable interest and many tests have been proposed in the literature (e.g., De Gooijer and Kumar, 1992; Petruccelli, 1990). Their power and size properties are shown to be satisfactory. However, realizations from the SETAR model can have an asymmetric distribution of observations among regimes. Thus, it may look like a linear time series contaminated with outliers. Owing to this similarity, standard testing procedures might not be able to distinguish between a threshold-type nonlinear time series and a linear time series with an outlier contamination structure. This conjecture is supported in previous research. Moeanaddin and Tong (1988) report that some nonlinearity tests are sensitive to outliers. An analysis of a real dataset by Chan and Cheung (1994) further confirms this possibility. They apply Tsay’s (1989) F test for SETAR-type nonlinearity to the monthly average wholesale prices of regular leaded gasoline in the United States between 1973 and 1987. It is found that outliers can induce aberrant F statistics. Furthermore, the simulation results of Chan and Ng (2004) also stress the need for a more robust test for SETAR-type nonlinearity in time series analysis and forecasting. Regarding this insufficiency, Van Dijk et al. (1999) suggest a robust Lagrange multiplier (LM) test for the smooth transition autoregressive (STAR) model. The class of STAR models contains the class of SETAR processes as a special case. The generalized-M estimate is used to ensure the robustness of the standard LM test (Luukkonen et al., 1988) for STAR models. Although simulations indicate that the robustified LM test is often erroneously oversized, it is a first attempt to construct a robust nonlinearity test in time series analysis. In this paper, we propose a new robust nonlinearity test for SETAR-type models. It does not have the oversized problem as does the robustified LM test, and it offers reasonable power in detecting SETAR-type nonlinearity in the presence of outliers. The paper proceeds as follows. In the next section we propose a new robust test for SETAR-type nonlinearity. The asymptotic null distribution of the test statistic is derived. The third section compares the power of the proposed test with other existing tests under the influence of time series outliers via Monte Carlo experiments. The effects of additive outliers on nonlinearity tests with misspecification of the autoregressive order are also studied. The fourth section applies the tests to three real examples. A summary of the findings is given in the final section. Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity 447 TESTS FOR SETAR-TYPE NONLINEARITY Various statistical tests for linearity (against SETAR-type nonlinearity) are available in the literature. Petruccelli and Davies (1986) propose a CUSUM test for threshold nonlinearity, which is based on the concept of arranged autoregression. The CUSUM test is modified by Petruccelli (1990) to achieve a higher power. Tsay (1989) derives an F test, which is based on one-step-ahead forecast errors from the arranged autoregression. Luukkonen et al. (1988) consider LM tests. Chan and Tong (1990) employ the likelihood ratio approach for testing SETAR-type nonlinearity. Chan and Ng (2004) conduct a small-scale simulation study on the outlier robustness of these tests. They conclude that no single test is able to resist satisfactorily large outlying values. In this section we first briefly describe the arranged autoregression approach of Tsay’s (1989) F test. Tsay’s test is commonly used owing to its simple and regression-type structure. A robustified version of Tsay’s F test is then proposed and its asymptotic null distribution is derived. Tsay’s F test Under Tsay’s arranged regression approach, the linear AR(p) model is regarded as the null model against the alternative SETAR model. We consider an observed time series {Yt, t = 1, 2, . . . , n}. Let h = max(1, p + 1 − d ) and denote pi as the index of the ith smallest value among {Yt : t = h, h + 1, . . . , n − d}. The observations {Yt : t = h, h + 1, . . . , n − d} are sorted in ascending order. Both the dependent vector Y and the corresponding lagged design matrix X are arranged according to the threshold parameter Yt−d. Under the null hypothesis of linearity, the arranged autoregression is written in the form Y = XΦ + e, where Φ = {f0, f1, . . . , fp} is a vector of the AR parameters and e is a vector of noise, i.e., Yπ1 + d −1  Yπ1 + d  1   Yπ 2 + d  Yπ 2 + d −1 1   =     Y   1 Y π n− d −h +1 + d −1 π n− d −h +1 + d Yπ1 + d − p   φ0   ε π1 + d  Yπ 2 + d − p   φ1   ε π 2 + d    +              Yπ n−d −h+1 + d − p φ p επ n−d −h+1 + d  (2) Stepwise autoregression of the first j rows of Y on the first j rows of X is performed successively for j = m, m+ 1, . . . , n − d − h, where m > p + 1 is the startup value. Let xj+1 be the ( j + 1)th row of the X matrix and Xj be the submatrix containing the first j row of X. The corresponding one-stepahead prediction residuals ê pj+1+ d are obtained successively: εˆ π j+1 + d = Yπ j +1 + d − Yˆπ j +1 + d 1+ x −1 ( X X j ) x j +1 T j +1 T j (3) Tsay’s F test is developed based on the orthogonality property between the predictive residuals given in (3) and the regressors {Ypj+1+ d−vv = 1, . . . , p, j = 1, . . . , n − d − h − m} under the null hypothesis of linearity. This property will be destroyed if observations that are lying in other regimes are involved, which in turn indicates nonlinearity. Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 448 K. C. Hung et al. The orthogonality (thus the SETAR-type nonlinearity) can be tested by considering the regression model p ε̂π j+1 + d = ω 0 + ∑ ωυ Yπ j+1 + d −υ + eπ j+1 + d (4) υ =1 for j = m, m + 1, . . . , n − d − h, i.e.,  εˆ π m+1 + d   1 Yπ m+1 + d −1  εˆ π m+2 + d   1 Yπ m+2 + d −1   =     εˆ   1 Y π n− d −h +1 + d −1 π n− d −h +1 + d Yπ m+1 + d − p   ω 0   eπ m+1 + d  Yπ m+2 + d − p   ω1   eπ m+2 + d    +      Yπ n−d −h+1 + d − p   ω p   eπ n−d −h+1 + d  (5) The usual F statistic of the regression in (5) is F= MSS ( p + 1) RSS ( n − d − h − m − p) (6) where RSS = Σêt2 and MSS = Σε̂ t2 − RSS. Under the null hypothesis, the test statistic is asymptotically distributed as Fp+1,n−d−h−m−p. The proposed test To enhance the discriminative power of the F test in the presence of additive outliers, the Schweppe type of generalized-M (GM) estimator is considered (Handschin et al., 1975). For simplicity, we shall explain the concept of this type of GM estimation procedure for a simple linear AR(1) model without intercept. However, the method can be easily generalized to any ordinary AR(p) models (see, for example, Maronna et al., 2006, Ch. 8). Following Franses and Van Dijk (2000, p. 67), we consider a simple AR(1) process: Yt = φYt −1 + ε t (7) Given the observations {Yt : t = 1, . . . , n}, the ordinary least squares (OLS) estimate of f can be obtained by minimizing the sum of the squared residuals under the first-order condition: ∑Y t −1 ( Yt − φYt −1) = 0 (8) If an outlying observation appears at t = s, it creates two outliers in the regression. The point (Ys−1, Ys) becomes a vertical outlier and (Ys, Ys+1) becomes a bad leverage point. In this situation, the OLS estimator will be biased towards zero (Rousseeuw and Van Zomeren, 1990). The Schweppe-type regression (with GM estimator) in computing the estimate of f has the advantage of utilizing the information that is given by good leverage points while reducing the weights of the vertical outliers and bad leverage points (Hampel et al., 1986). Under the Schweppe-type regression with a given y-function (e.g., Maronna et al., 2006, p. 149), the first-order condition for the simple AR(1) model becomes Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity ∑ω y (Yt −1 ) Yt −1ψ ( zt ) = 0 449 (9) where zt = εt σ ε ω y (Yt −1 ) (10) is the standardized residual series, ε t = Yt − φYt −1 (11) is the tth residual, and se is a robust measure of the scale of the residuals et. The function wy(·) in (9) and (10) downweights the outlying observations in the regressor Yt−1. As proposed by Van Dijk et al. (1999), the weight function for the regressor is chosen as ψ ([δ ( Yt −1)] 2 ω y ( Yt −1) = [δ ( Yt −1)] ) 2 (12) where δ ( Yt −1) = Yt −1 − m y σy (13) and my and sy are the robust measures of location and the scale of Yt−1, respectively. The y-function must be specified under some regularity conditions such that the corresponding GM estimator is asymptotically consistent (Hampel et al., 1986). We will discuss the choices of the y-function in the next subsection. Define the weight function for the standardized residuals as  ψ ( zt )  zt ω z ( zt ) =   1  for zt ≠ 0 (14) otherwise Substituting (14) into the first-order condition (9), we have ∑ω y (Yt −1 ) Yt −1 ⋅ω z ( zt ) Z t = 0 (15) Finally, combining (10), (11) and (15), we obtain an equivalent form of the first-order condition, which is nonlinear in f, as ∑ ω ( z )Y z Copyright © 2009 John Wiley & Sons, Ltd. t t −1 ( Yt − φYt −1) = 0 (16) J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 450 K. C. Hung et al. It should be noted that once a y-function has been chosen, the weight function for the regressor wy(·) and the weight function for the standardized residuals wz(·) are fixed by equations (12) and (14), respectively. Given an initial estimate of f, denoted by f̂ (0), the corresponding fitted residuals ê t(0) can be computed via equation (11). In this paper we employ the median absolute deviation (MAD) measure for estimating the se in (10), i.e., σˆ ε(0) = 1.483 median ( εˆ t(0) − mˆ ε(0) ) (17) with mˆ ε(0) = median ( εˆ t(0) ) (18) for t = 2, . . . , n. When et is normally distributed, the constant 1.483 is used to adjust the MAD measure to become a consistent estimator of the scale of the residuals. Analogous to (17) and (18), we employ the estimators ŝ y = 1.483 median (Yt−1 − m̂y) and m̂y = median(Yt−1) for the estimation of sy and my in (13). Finally, the corresponding standardized residual series ẑ (0) t can be computed. Equation (16) indicates that the GM estimate of f can be obtained using the iterative weighted least squares (IWLS) algorithm n ∑ ω ( zˆ ( ) )Y i z φˆ (i +1) = t t =2 n Y t −1 t (19) ∑ ω ( zˆ )Y (i ) t z 2 t −1 t =2 where ẑ t(i)s are the fitted standardized residuals that are computed using the estimate f(i) at the ith iteration. For the starting value f̂ (0), we suggest using the least median of squares estimator that is proposed by Rousseeuw (1984). For a general pth-order autoregression with intercept, as in (2) and (5), we need a multivariate version of the IWLS algorithm in (19). In particular, we replace the d (Yt−1) value in equation (13) by δ ( Yt(−p1) ) = (Yt(−p1) − M y( p) ) T Σ y( p) (Yt(−p1) − M y( p) ) (20) p) = (Yt−1, Yt−2, . . . , Yt−p)T with the multivariate which is the Mahalanobis distance for regressors Ỹ (t−1 p) location My(p) and scatter matrix Σy(p). The value of d (Y (t−1 ) in (20) can be efficiently computed by using the minimum volume ellipsoid (MVE) estimator that is discussed in Rousseeuw and Van Zomeren (1990). The corresponding FORTRAN subroutine MINVOL for computing MVE estimators is available at Statlib (www.stat.cmu.edu). For our proposed test, we suggest replacing the OLS estimates with the GM estimates in the arranged regressions in (2) and (4). Given p and d, we first compute the GM estimates and predictive residual ê pj+1+ d successively for j = m, m+1, . . . , n − d − h. A set of n − d − h − m + 1 predictive residuals are obtained. The Schweppe-type regression (4) is performed between the predictive residuals ê pj+1+ d and the last n − d − h − m + 1 rows of the design matrix X. The proposed FGM statistic is formulated as FGM = Copyright © 2009 John Wiley & Sons, Ltd. MSS ( p + 1) RSS ( n − d − h − m − p) (21) J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity 451 where ˆ T WX ( X T WWX )−1 X T W Ψ ˆ; MSS = Ψ ˆ TΨ ˆ − MSS; RSS = Ψ ˆ = (ψ ( zˆπ + d ) ,…,ψ ( zˆπ Ψ ))T ; m +1 n − d − h +1 + d X = (Yπ(mp+)1 + d ,…, Yπ(np−)d −h+1 + d ) ; T W = diag (ω y (Yπ(mp+)1 + d ) ,…, ω y (Yπ(np−)d −h+1 + d )) ; Yπ( jp+)1 + d = (1, Yπ( jp+)1 + d −1 ,…, Yπ( jp+)1 + d − p ) ; T and the standardized residual is defined as zˆπ j +1 + d = εˆ π j +1 + d σˆ ε (22) for j = m, m + 1, . . . , n − d − h. Theorem 1. Let Yt be a linear stationary AR(p) process. That is, Yt follows the model in (1) with k = 1. Then, for large n, the statistic FGM that is defined in (21) follows approximately an F distribution with degrees of freedom (p + 1) and (n − d − h − m − p). Proof. This theorem can be proved by using a similar argument to that used by Tsay (1986, p. 463). For t = pm+1 + d, pm+2 + d, . . . , pn−d−h+1 + d, we denote ˆ − Φ ) and Yt( p) = Yt(−p1) Φ + ε t εˆ t = ε t + Yt(−p1) (Φ It can be shown that Φ̂ and ŝe are n -consistent estimates of Φ and se, respectively. With g = (y(epm+1+ d/se), . . . , (yepn−d−h+1+ d /se))T taking the place of Ψ̂, the asymptotic distribution of FGM in (21) does not change. Now, X T W ϒ n = ∑ t Yt(−p1)ω y (Yt(−p1) ) ψ (ε t σ ε ) n satisfies a central limit theorem (Billingsley, 1961) and converges in distribution to a (p + 1)-dimensional normal random vector with a mean 0 and covariance matrix k1, with κ 1 = E [ψ 2 (ε t σ ε )] E ω y2 (Yt(−p1) ) Yt(−p1)  T Also, X T WWX n = ∑ t ω y2 (Yt(−p1) ) Yt(−p1) n converges in probability to k2 with T κ 2 = E ω y2 (Yt(−p1) ) Yt(−p1)  T 2 × E[y2(et /se)]. Finally, the large-sample F distribuThus, the numerator of (21) converges to c p+1 tion of the testing statistic FGM in (21) follows from an argument which is similar to that of the usual analysis of variance. The choice of the y-function The outlier robust properties of the proposed FGM test depend on the choice of y-function. Table I summarizes several commonly used y-functions (Hoaglin et al., 2000; Van Dijk et al., 1999). The efficiencies for these y-functions for robust estimation of the location parameter u depend on their tuning constants. They are often chosen with reference to the asymptotic efficiency of the Gaussian Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 452 K. C. Hung et al. Table I. Some commonly used y-functions Estimator with tuning constant(s) Least squares (LS) Huber1 (k) k>0 Tukey’s biweight (c) c>0 Polynomial2 (c1, c2) c2 > c1 > 0 y-Function y(u) Range of u u u ksgn(u) u[1 − (u/c)2]2 0 u 0 g(u) (−∞; ∞) u ≤ k u > k u ≤ c u > c u ≤ c1 u > c2 otherwise Huber reduces to the LS (i.e., mean) estimator when k → ∞. y(u) = u[1 − H(u − c1)] + H(u − c1)[1 − H(u − c2)]g(u); where H(.) is the Heaviside function. H(z) = 1 if z > 0, and H(z) = 0 if z ≤ 0. g(u) is a fifthorder polynomial such that y(u) is twice continuously differentiable. 1 2 Table II. Tuning constant(s) of selected y-functions Estimator Tuning constant(s) 1.345 or 3.291 4.685 or 15 2.576, 3.291 Huber (k) Tukey’s biweight (c) Polynomial (c1, c2) model and the square root of the quantiles of the c2 distribution with one degree of freedom (Spath, 1991, p. 195). At 95% and 99.95% efficiency of the Gaussian model, the tuning constants of the Huber y-function are 1.345 and 3, respectively, while those of Tukey’s biweight y-function are 4.685 and 15, respectively. We also have χ12 (0.99) = 2.576 and χ12 (0.999) = 3.291. The tuning constants for the y-functions that are considered in this paper are listed in Table II. Their impact on the power of the proposed FGM test will be studied via simulation experiments in the next section. MONTE CARLO RESULTS In the first experiment, we consider a simple SETAR (2;1,1) model as the underlying outlier-free data-generating process (DGP): (1) (1) φ0 + φ1 Yt −1 + ε t Yt =  (2) ( 2) φ0 + φ1 Yt −1 + ε t if Yt −1 ≤ r if Yt −1 > r i.i.d. where et ∼ N(0, 1). A straightforward generalization of the definition of the additive outlier (AO) in a linear time series by Denby and Martin (1979), in which both the number and the location of the outliers are determined by a random mechanism, is given as follows: Z t = Yt + ηt Copyright © 2009 John Wiley & Sons, Ltd. (23) J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity 453 where Zts are the observations, hts are i.i.d. with density (1 − b)∆0(·) + bp, p follows N(0, w2), ∆0(·) represents a degenerate density at 0, w is the size parameter of the contamination, and b is the percentage of contamination (0 ≤ b ≤ 1). Chan and Ng (2004) extend the definition of an innovational outlier (IO) by Denby and Martin (1979) to the SETAR models, and find that the existence of IOs seems to cause no harm to most SETAR-type nonlinearity tests. Furthermore, Chan (1992) and Wei and Wei (1998) suggest that the existence of innovational outliers is not an obstacle in model specification and estimation for linear time series processes. Therefore, we do not consider IO in this section. Time series data {Z1, . . . , Zn} are generated from (23). We consider sample sizes of n = 100 and n = 200. The startup value, Y0, is set to zero. It is important to discard a sufficiently large number of observations to remove transient effects in generating a SETAR time series. Therefore, the first 1500 observations are omitted in each replication. The proposed FGM test with various choices of y-functions is considered. When the choice of yfunction is LS (see Table II), it is actually the ordinary least squares method with equal weighting, and the FGM test reduces to the original Tsay (1989) F test. Following Denby and Martin (1979), we set the size of contamination w = 0, 3, 6, and 10. The percentage of contamination is fixed at b = 5%. We consider the level of significance a = 1%, 5% and 10%. In each test, p = 1 and d = 1 are used. The startup value for the arranged regression is set at m = n/10 + p. Table III gives the empirical relative frequencies of rejecting the linear null hypothesis based on (1) (2) 1000 replications. We consider seven parameter combinations (r, f(1) 0 , f1 , f0 ) = (1.0, 0.5, 0.5, 0.5), (2) and f1 varies from −0.8 to + 0.8. Our experience has shown that the results are not sensitive to the choice of AR(1) coefficient in regime 1; hence we only report the case of f 1(1) = 0.5 here. When f1(2) = 0.5, the model reduces to the linear AR(1) process (i.e., the null model). Some of the parameter combinations are taken from Tsay (1989). Other combinations are chosen such that there are adequate observations in both regimes for efficient parameter estimation and nonlinearity testing. Table IV shows results that correspond to Table III but with a threshold parameter r = 0 and both intercept (2) terms f (1) 0 and f 0 set to zero. Chan and Ng (2004) find that the intercept term is related to the level of a process and it has a significant impact on the power of SETAR-type nonlinearity testing. First, we examine the effect of the size of the outlier contamination (w). Under the null model of linearity (f1(2) = 0.5), then the empirical sizes of the tests are indicated by the relative rejection frequencies in the tables. If there are no outliers (w = 0), the empirical sizes of the tests are consistently smaller than (but all satisfactorily close to) the nominal levels (1%, 5% and 10%). However, when w increases, the empirical sizes of the LS test (i.e., the original Tsay F test) become significantly larger than the corresponding nominal levels. Unfortunately, the adverse effects of the outliers on the LS test cannot be diluted by increasing the sample size. On the other hand, the proposed FGM tests can keep the sizes steady at around the nominal level without being affected by the outliers. For example, in Table III (sample size n = 100 with a = 5%), under the null model (f1(2) = 0.5), the empirical sizes for the LS test are 0.031 (w = 0), 0.176 (w = 3), and 0.312 (w = 6). For the FGM test (with k = 3.291 Huber y-function) the empirical sizes are 0.025 (w = 0), 0.056 (w = 3), and 0.057 (w = 6). The results show that the original F test is sensitive to outlying observations. The test suffers the erroneous oversized problem when the data are generated from the null model with outliers. However, the proposed FGM tests are robust with respect to outliers in the null model. As usual, the robust test encounters some power loss when there is no outlier in the data, especially when the sample size is not large (see Tables III and IV for the case w = 0). When the degree of SETAR-type nonlinearity gets stronger (e.g., f1(2) moves from 0.5 to −0.8), the powers of the proposed Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for Copyright © 2009 John Wiley & Sons, Ltd. 0.5 0.5 0.5 1.0 1.0 1.0 0.5 0.5 1.0 1.0 0.5 1.0 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.8 0.5 0.2 0.0 −0.2 −0.5 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 f 1(1) f (2) f 1(2) w a = f (1) 0 0 (a) Sample size n = 100 1.0 0.5 0.5 0.5 −0.8 r Parameters 0.969 0.685 0.336 0.166 0.810 0.442 0.204 0.122 0.414 0.205 0.123 0.090 0.181 0.108 0.087 0.076 0.047 0.059 0.066 0.058 0.006 0.052 0.132 0.116 0.014 0.195 0.479 0.498 1% 0.996 0.836 0.494 0.275 0.927 0.679 0.350 0.209 0.648 0.410 0.248 0.162 0.398 0.271 0.204 0.142 0.152 0.154 0.145 0.116 0.034 0.191 0.307 0.257 0.066 0.349 0.689 0.722 1.000 0.879 0.572 0.341 0.964 0.767 0.449 0.268 0.767 0.530 0.344 0.222 0.535 0.382 0.289 0.206 0.224 0.250 0.220 0.179 0.079 0.283 0.428 0.371 0.138 0.448 0.783 0.801 0.721 0.609 0.601 0.586 0.518 0.438 0.419 0.406 0.232 0.190 0.176 0.178 0.090 0.064 0.062 0.059 0.023 0.019 0.019 0.015 0.002 0.005 0.003 0.003 0.008 0.016 0.015 0.016 5% 10% 1% LS 0.870 0.817 0.807 0.808 0.754 0.661 0.646 0.648 0.469 0.396 0.383 0.369 0.257 0.193 0.199 0.198 0.079 0.061 0.072 0.067 0.027 0.027 0.024 0.022 0.045 0.058 0.083 0.070 0.934 0.881 0.873 0.877 0.838 0.772 0.766 0.757 0.600 0.509 0.506 0.492 0.391 0.312 0.299 0.293 0.146 0.133 0.123 0.124 0.042 0.050 0.046 0.042 0.087 0.104 0.151 0.135 0.948 0.784 0.700 0.643 0.767 0.548 0.448 0.417 0.371 0.237 0.196 0.173 0.171 0.094 0.083 0.067 0.046 0.032 0.025 0.024 0.005 0.007 0.008 0.014 0.015 0.048 0.073 0.070 0.990 0.911 0.850 0.813 0.911 0.783 0.684 0.648 0.633 0.450 0.385 0.368 0.380 0.269 0.224 0.215 0.132 0.101 0.087 0.088 0.035 0.044 0.044 0.045 0.068 0.154 0.214 0.203 0.998 0.948 0.906 0.886 0.952 0.864 0.793 0.751 0.746 0.585 0.519 0.495 0.520 0.392 0.342 0.327 0.210 0.167 0.161 0.157 0.073 0.099 0.105 0.087 0.125 0.240 0.339 0.304 0.501 0.439 0.461 0.467 0.322 0.298 0.295 0.298 0.140 0.134 0.135 0.138 0.065 0.051 0.051 0.055 0.014 0.014 0.013 0.010 0.003 0.006 0.003 0.006 0.019 0.016 0.012 0.015 0.717 0.662 0.677 0.687 0.542 0.509 0.508 0.530 0.309 0.285 0.279 0.294 0.162 0.139 0.145 0.145 0.058 0.054 0.060 0.058 0.024 0.024 0.021 0.021 0.046 0.053 0.056 0.051 0.786 0.753 0.763 0.778 0.650 0.619 0.618 0.647 0.443 0.398 0.410 0.410 0.266 0.221 0.241 0.240 0.113 0.097 0.105 0.096 0.056 0.043 0.048 0.056 0.084 0.092 0.094 0.092 0.948 0.771 0.670 0.649 0.756 0.551 0.452 0.444 0.368 0.238 0.200 0.191 0.163 0.108 0.090 0.093 0.039 0.032 0.026 0.022 0.003 0.008 0.014 0.012 0.015 0.044 0.037 0.033 5% 10% 1% c = 4.685 Tukey 5% 10% 1% k = 3.291 Huber 5% 10% 1% k = 1.345 Huber The FGM test with various y-functions 0.990 0.909 0.847 0.835 0.908 0.764 0.701 0.674 0.621 0.455 0.398 0.397 0.382 0.274 0.231 0.225 0.132 0.098 0.077 0.087 0.032 0.035 0.038 0.038 0.065 0.118 0.116 0.111 0.996 0.952 0.917 0.897 0.950 0.863 0.803 0.785 0.738 0.577 0.508 0.526 0.523 0.400 0.343 0.332 0.204 0.175 0.150 0.161 0.077 0.077 0.087 0.068 0.120 0.185 0.194 0.174 0.625 0.527 0.550 0.559 0.414 0.340 0.350 0.339 0.175 0.143 0.148 0.136 0.063 0.051 0.052 0.044 0.014 0.012 0.014 0.012 0.003 0.005 0.003 0.003 0.006 0.012 0.012 0.016 0.798 0.737 0.752 0.764 0.638 0.573 0.568 0.574 0.355 0.297 0.311 0.313 0.179 0.152 0.158 0.162 0.063 0.059 0.068 0.056 0.024 0.026 0.021 0.025 0.044 0.045 0.054 0.050 0.857 0.811 0.826 0.848 0.757 0.678 0.687 0.694 0.476 0.407 0.420 0.432 0.286 0.236 0.249 0.258 0.122 0.104 0.114 0.108 0.056 0.052 0.052 0.056 0.082 0.091 0.111 0.095 5% 10% c1 = 2.576 c2 = 3.291 Poly 5% 10% 1% c = 15 Tukey Table III. The empirical relative frequencies of rejecting a linear model based on 1000 replications of a SETAR (2;1,1) with intercept model 454 K. C. Hung et al. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 0.5 0.5 0.5 0.5 0.5 0.5 1.0 0.5 1.0 0.5 Copyright © 2009 John Wiley & Sons, Ltd. 1.0 0.5 1.0 0.5 1.0 0.5 1.0 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.8 0.5 0.2 0.0 −0.2 −0.5 (b) Sample size n = 200 1.0 0.5 0.5 0.5 −0.8 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 1.000 0.942 0.502 0.184 0.998 0.819 0.346 0.135 0.854 0.489 0.179 0.091 0.485 0.275 0.143 0.093 0.123 0.141 0.146 0.098 0.007 0.154 0.392 0.298 0.043 0.432 0.899 0.918 1.000 0.977 0.680 0.293 0.998 0.912 0.526 0.230 0.941 0.706 0.357 0.167 0.708 0.495 0.301 0.186 0.310 0.318 0.342 0.215 0.037 0.350 0.621 0.531 0.145 0.621 0.949 0.982 1.000 0.984 0.766 0.377 1.000 0.941 0.631 0.309 0.966 0.795 0.471 0.249 0.801 0.617 0.421 0.268 0.435 0.441 0.448 0.323 0.089 0.477 0.733 0.651 0.239 0.694 0.973 0.986 0.974 0.944 0.948 0.946 0.895 0.841 0.831 0.830 0.622 0.524 0.503 0.492 0.289 0.212 0.213 0.204 0.062 0.043 0.039 0.033 0.001 0.001 0.003 0.001 0.015 0.019 0.023 0.025 0.994 0.987 0.982 0.979 0.974 0.943 0.931 0.940 0.827 0.741 0.742 0.725 0.515 0.424 0.428 0.417 0.179 0.145 0.144 0.140 0.007 0.016 0.017 0.010 0.059 0.075 0.101 0.097 0.996 0.993 0.995 0.990 0.987 0.973 0.973 0.973 0.889 0.830 0.824 0.820 0.651 0.563 0.564 0.538 0.277 0.234 0.231 0.223 0.037 0.040 0.044 0.031 0.099 0.140 0.177 0.184 1.000 0.989 0.976 0.957 0.991 0.927 0.864 0.830 0.822 0.596 0.495 0.451 0.452 0.272 0.230 0.218 0.106 0.072 0.054 0.044 0.005 0.015 0.014 0.016 0.029 0.088 0.153 0.136 1.000 0.999 0.995 0.988 0.997 0.970 0.952 0.937 0.926 0.788 0.724 0.684 0.683 0.487 0.433 0.405 0.283 0.194 0.180 0.160 0.032 0.062 0.067 0.050 0.108 0.239 0.362 0.351 1.000 0.999 0.997 0.995 0.999 0.985 0.970 0.965 0.955 0.865 0.824 0.790 0.778 0.602 0.549 0.526 0.415 0.302 0.263 0.253 0.071 0.097 0.129 0.104 0.184 0.347 0.494 0.494 0.833 0.819 0.824 0.836 0.684 0.641 0.660 0.670 0.410 0.358 0.360 0.358 0.162 0.123 0.133 0.129 0.041 0.029 0.027 0.020 0.003 0.001 0.001 0.001 0.011 0.013 0.010 0.006 0.921 0.921 0.924 0.930 0.828 0.820 0.832 0.837 0.635 0.597 0.604 0.604 0.370 0.309 0.304 0.312 0.124 0.097 0.100 0.107 0.019 0.014 0.014 0.020 0.052 0.047 0.045 0.041 0.954 0.952 0.957 0.958 0.888 0.876 0.893 0.904 0.743 0.689 0.700 0.708 0.488 0.422 0.440 0.447 0.192 0.165 0.164 0.165 0.049 0.041 0.043 0.045 0.077 0.081 0.088 0.088 0.999 0.986 0.971 0.961 0.991 0.927 0.867 0.861 0.820 0.589 0.506 0.502 0.439 0.284 0.244 0.228 0.110 0.070 0.055 0.054 0.006 0.014 0.009 0.006 0.030 0.053 0.046 0.042 1.000 0.999 0.995 0.989 0.998 0.967 0.947 0.950 0.923 0.801 0.716 0.714 0.675 0.493 0.439 0.430 0.278 0.186 0.157 0.157 0.036 0.044 0.034 0.026 0.108 0.158 0.153 0.129 1.000 1.000 0.996 0.996 0.999 0.984 0.969 0.974 0.954 0.866 0.815 0.799 0.779 0.618 0.558 0.550 0.402 0.287 0.249 0.246 0.068 0.081 0.068 0.067 0.178 0.243 0.238 0.192 0.908 0.872 0.886 0.888 0.781 0.707 0.723 0.730 0.490 0.390 0.396 0.417 0.193 0.147 0.158 0.149 0.037 0.022 0.028 0.025 0.005 0.003 0.002 0.004 0.012 0.013 0.014 0.011 0.969 0.958 0.956 0.969 0.908 0.871 0.879 0.890 0.708 0.623 0.632 0.627 0.391 0.323 0.319 0.337 0.138 0.106 0.101 0.106 0.018 0.020 0.019 0.023 0.040 0.051 0.062 0.050 0.980 0.972 0.976 0.985 0.942 0.927 0.931 0.940 0.787 0.725 0.739 0.743 0.518 0.448 0.457 0.452 0.202 0.171 0.166 0.177 0.046 0.047 0.046 0.044 0.080 0.091 0.107 0.101 A Robust Test for SETAR-Type Nonlinearity 455 J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for Copyright © 2009 John Wiley & Sons, Ltd. 0.5 0.5 1.0 0.5 1.0 0.5 0.5 1.0 0.5 0.5 0.5 1.0 0.5 1.0 0.5 0.5 1.0 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.8 0.5 0.2 0.0 −0.2 −0.5 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0.560 0.307 0.160 0.109 0.330 0.190 0.115 0.082 0.137 0.105 0.099 0.082 0.062 0.076 0.095 0.100 0.021 0.063 0.091 0.077 0.006 0.052 0.132 0.116 0.029 0.161 0.319 0.302 f (1) f 1(1) f (2) f 1(2) w a = 1% 0 0 (a) Sample size n = 100 1.0 0.5 0.5 0.5 −0.8 r Parameters 0.765 0.539 0.315 0.202 0.587 0.400 0.247 0.164 0.339 0.265 0.217 0.160 0.206 0.211 0.208 0.178 0.100 0.176 0.201 0.167 0.034 0.191 0.307 0.257 0.112 0.303 0.529 0.501 0.850 0.636 0.401 0.266 0.692 0.514 0.343 0.237 0.459 0.374 0.325 0.230 0.333 0.333 0.331 0.255 0.160 0.270 0.312 0.256 0.079 0.283 0.428 0.371 0.194 0.398 0.638 0.631 0.194 0.171 0.164 0.160 0.103 0.093 0.087 0.084 0.045 0.041 0.043 0.043 0.022 0.020 0.023 0.015 0.009 0.012 0.011 0.011 0.002 0.005 0.003 0.003 0.023 0.022 0.018 0.015 5% 10% 1% LS 0.415 0.352 0.347 0.343 0.262 0.236 0.227 0.224 0.148 0.134 0.130 0.122 0.092 0.075 0.069 0.079 0.044 0.052 0.044 0.048 0.027 0.027 0.024 0.022 0.081 0.076 0.071 0.067 0.533 0.474 0.456 0.456 0.398 0.340 0.333 0.332 0.233 0.222 0.221 0.221 0.175 0.151 0.146 0.150 0.086 0.090 0.091 0.085 0.042 0.050 0.046 0.042 0.140 0.138 0.136 0.140 0.441 0.298 0.248 0.226 0.248 0.168 0.137 0.119 0.103 0.068 0.066 0.062 0.047 0.031 0.040 0.040 0.018 0.022 0.018 0.012 0.005 0.007 0.008 0.014 0.030 0.054 0.057 0.038 0.662 0.515 0.443 0.412 0.511 0.367 0.325 0.317 0.277 0.216 0.191 0.160 0.176 0.128 0.114 0.113 0.075 0.084 0.075 0.071 0.035 0.044 0.044 0.045 0.115 0.155 0.158 0.132 0.765 0.619 0.550 0.529 0.616 0.492 0.443 0.415 0.396 0.310 0.293 0.253 0.285 0.219 0.205 0.191 0.145 0.156 0.146 0.132 0.073 0.099 0.105 0.087 0.184 0.242 0.270 0.232 0.109 0.096 0.093 0.097 0.062 0.059 0.057 0.057 0.036 0.029 0.033 0.029 0.023 0.017 0.016 0.018 0.010 0.002 0.009 0.010 0.003 0.006 0.003 0.006 0.023 0.023 0.020 0.018 0.240 0.212 0.219 0.230 0.164 0.150 0.163 0.156 0.098 0.093 0.105 0.095 0.069 0.058 0.052 0.063 0.042 0.035 0.036 0.035 0.024 0.025 0.021 0.021 0.074 0.070 0.067 0.063 0.347 0.322 0.327 0.346 0.238 0.240 0.238 0.244 0.153 0.168 0.176 0.180 0.122 0.100 0.103 0.113 0.078 0.078 0.068 0.067 0.056 0.043 0.048 0.056 0.111 0.113 0.103 0.113 0.445 0.311 0.244 0.240 0.251 0.167 0.134 0.127 0.105 0.067 0.066 0.068 0.048 0.037 0.040 0.041 0.016 0.022 0.017 0.022 0.003 0.008 0.014 0.012 0.028 0.048 0.028 0.022 5% 10% 1% c = 4.685 Tukey 5% 10% 1% k = 3.291 Huber 5% 10% 1% k = 1.345 Huber The FGM test with various y-functions 0.662 0.514 0.439 0.442 0.505 0.374 0.318 0.303 0.274 0.212 0.187 0.180 0.177 0.127 0.112 0.120 0.080 0.081 0.065 0.071 0.032 0.035 0.038 0.039 0.113 0.131 0.105 0.087 0.762 0.625 0.555 0.548 0.621 0.493 0.442 0.420 0.399 0.314 0.273 0.272 0.279 0.218 0.190 0.191 0.134 0.136 0.138 0.114 0.077 0.077 0.087 0.068 0.184 0.213 0.171 0.153 0.154 0.117 0.119 0.127 0.080 0.068 0.072 0.069 0.036 0.034 0.038 0.029 0.018 0.019 0.015 0.015 0.006 0.005 0.009 0.007 0.003 0.005 0.003 0.003 0.014 0.012 0.012 0.014 0.314 0.267 0.277 0.284 0.195 0.170 0.168 0.169 0.112 0.107 0.121 0.110 0.070 0.057 0.067 0.065 0.033 0.033 0.038 0.038 0.024 0.026 0.021 0.025 0.056 0.061 0.062 0.057 0.412 0.373 0.371 0.388 0.315 0.256 0.269 0.267 0.192 0.178 0.189 0.189 0.133 0.100 0.111 0.118 0.080 0.070 0.074 0.072 0.056 0.052 0.052 0.056 0.105 0.108 0.107 0.105 5% 10% c1 = 2.576 c2 = 3.291 Poly 5% 10% 1% c = 15 Tukey Table IV. The empirical relative frequencies of rejecting a linear model based on 1000 replications of a SETAR (2;1,1) without intercept model 456 K. C. Hung et al. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 0.5 0.5 0.5 0.5 0.5 0.5 1.0 0.5 1.0 0.5 Copyright © 2009 John Wiley & Sons, Ltd. 1.0 0.5 1.0 0.5 1.0 0.5 1.0 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.8 0.5 0.2 0.0 −0.2 −0.5 (b) Sample size n = 200 1.0 0.5 0.5 0.5 −0.8 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0 3 6 10 0.926 0.673 0.288 0.127 0.761 0.471 0.230 0.130 0.412 0.295 0.199 0.111 0.203 0.205 0.197 0.128 0.067 0.164 0.219 0.144 0.007 0.154 0.392 0.298 0.096 0.391 0.722 0.693 0.976 0.836 0.501 0.242 0.897 0.706 0.418 0.227 0.650 0.523 0.394 0.237 0.420 0.423 0.413 0.286 0.196 0.358 0.456 0.330 0.037 0.350 0.621 0.531 0.244 0.553 0.884 0.876 0.991 0.908 0.593 0.334 0.943 0.808 0.554 0.318 0.752 0.644 0.531 0.333 0.532 0.553 0.534 0.387 0.287 0.476 0.601 0.460 0.089 0.477 0.733 0.651 0.357 0.649 0.925 0.926 0.506 0.464 0.448 0.450 0.329 0.261 0.261 0.265 0.144 0.117 0.119 0.111 0.063 0.058 0.058 0.051 0.018 0.019 0.021 0.020 0.001 0.001 0.003 0.001 0.044 0.032 0.027 0.025 0.715 0.675 0.680 0.680 0.550 0.495 0.501 0.496 0.317 0.298 0.289 0.284 0.199 0.194 0.187 0.179 0.069 0.079 0.073 0.063 0.007 0.016 0.017 0.010 0.120 0.115 0.120 0.103 0.814 0.783 0.779 0.777 0.659 0.621 0.605 0.605 0.458 0.429 0.414 0.413 0.294 0.281 0.285 0.266 0.139 0.139 0.125 0.127 0.037 0.040 0.044 0.031 0.198 0.199 0.197 0.186 0.842 0.686 0.599 0.569 0.618 0.427 0.366 0.350 0.314 0.231 0.181 0.161 0.149 0.105 0.081 0.071 0.046 0.049 0.043 0.038 0.005 0.015 0.014 0.016 0.087 0.108 0.114 0.089 0.932 0.856 0.803 0.772 0.828 0.665 0.601 0.570 0.570 0.439 0.384 0.363 0.356 0.249 0.251 0.223 0.150 0.134 0.110 0.114 0.032 0.062 0.067 0.050 0.210 0.265 0.293 0.253 0.973 0.909 0.876 0.847 0.899 0.780 0.728 0.689 0.697 0.555 0.512 0.488 0.451 0.371 0.362 0.335 0.240 0.230 0.207 0.189 0.071 0.097 0.129 0.104 0.338 0.382 0.421 0.377 0.258 0.241 0.249 0.271 0.164 0.136 0.147 0.156 0.078 0.082 0.070 0.066 0.032 0.037 0.030 0.031 0.016 0.013 0.014 0.013 0.003 0.001 0.001 0.001 0.025 0.016 0.016 0.024 0.476 0.445 0.462 0.464 0.351 0.314 0.326 0.334 0.200 0.194 0.185 0.186 0.129 0.134 0.132 0.123 0.063 0.059 0.044 0.045 0.019 0.014 0.014 0.020 0.072 0.066 0.064 0.072 0.575 0.566 0.580 0.576 0.449 0.431 0.435 0.447 0.296 0.277 0.271 0.269 0.206 0.207 0.201 0.202 0.102 0.098 0.093 0.097 0.049 0.041 0.043 0.045 0.121 0.111 0.118 0.122 0.841 0.693 0.600 0.590 0.619 0.432 0.376 0.368 0.329 0.225 0.171 0.156 0.150 0.098 0.077 0.077 0.050 0.036 0.028 0.029 0.006 0.014 0.009 0.006 0.085 0.067 0.056 0.048 0.938 0.855 0.798 0.787 0.838 0.664 0.610 0.604 0.576 0.435 0.367 0.360 0.350 0.253 0.218 0.206 0.152 0.122 0.105 0.088 0.036 0.044 0.034 0.026 0.211 0.198 0.146 0.133 0.971 0.900 0.872 0.855 0.900 0.776 0.734 0.725 0.688 0.545 0.488 0.487 0.452 0.360 0.330 0.309 0.247 0.194 0.166 0.144 0.068 0.081 0.068 0.067 0.322 0.284 0.232 0.204 0.364 0.294 0.318 0.334 0.215 0.172 0.185 0.190 0.089 0.095 0.082 0.078 0.045 0.049 0.049 0.039 0.012 0.008 0.012 0.011 0.005 0.003 0.002 0.004 0.021 0.013 0.020 0.018 0.573 0.526 0.552 0.547 0.419 0.376 0.370 0.378 0.219 0.204 0.199 0.216 0.146 0.148 0.149 0.138 0.067 0.050 0.052 0.050 0.018 0.020 0.019 0.023 0.075 0.060 0.061 0.066 0.673 0.642 0.660 0.656 0.511 0.498 0.482 0.493 0.325 0.311 0.298 0.313 0.249 0.234 0.223 0.218 0.123 0.107 0.103 0.099 0.046 0.047 0.046 0.044 0.135 0.130 0.120 0.119 A Robust Test for SETAR-Type Nonlinearity 457 J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 458 K. C. Hung et al. tests increase. In general, an increase in (w) will slightly lower the power of the proposed FGM tests, but the effects can often be diluted by increasing the sample size. For example, in Table III (n = 100 with a = 5%), when f1(2) = −0.8, the powers for the FGM test (with c = 15 Tukey y-function) are 0.988 (w = 0), 0.899 (w = 3), and 0.860 (w = 6). For n = 200, the powers are 1.000 (w = 0), 0.996 (w = 3), and 0.990 (w = 6). Nevertheless, our simulations show that the proposed FGM tests can retain a reasonable level of power in the presence of outliers. In Tables III and IV, when f 1(2) = 0.8, the proposed tests have low power. This is not surprising because the nonlinearity is not strong. On the other hand, the LS test seems to have some power, even if one uses the empirical critical values. We perform a simple simulation to study the ‘spurious’ power of the LS test in that situation. The DGP is a contaminated AR(1) model: Yt = 0.5 + fYt−1 + et, and the contamination mechanism is according to equation (23). Table V shows the empirical rejection rates using the LS test with a = 5%, b = 5%, w = 0, 3, 6, 10 and various values of f. The results indicate that the rejection rates of the LS test will normally be inflated when w increases. The degree of inflation becomes quite extreme when f is close to 1 and w is large. The ‘spurious’ power of the LS test when f 1(2) = 0.8 in Tables III and IV might be due to this kind of erroneous inflation of rejection rates. As for the choice of the y-functions and tuning constants for the FGM tests, the simulation results show that the Huber function (with k = 3.291) and Tukey’s biweight function (with c = 15) often have higher powers than the other tests. For practical applications, we recommend these two tests because they are robust with respect to outlying observations and do not result in large Type I errors in the presence of outliers. The results in Tables III and IV assume that the model is known. It might be interesting to study the effects of additive outliers on SETAR-type nonlinearity tests, if the AR order is misspecified. Another issue of interest is the effects of a patch of additive outliers (Justel et al., 2001). External events may affect the underlying time series in a finite number of consecutive time periods. This situation is less relevant in linear regression, but is important in time series analysis. Therefore, it is important to examine the robustness of the proposed FGM test with respect to a patch of additive outliers. In order to examine the above two issues, we conduct another simulation experiment. The first outlier-free DGP is a linear AR(3) model: Yt = 2.1Yt −1 − 1.46Yt − 2 + 0.336Yt − 3 + ε t (24) Table V. The empirical relative frequencies of rejecting a linear model based on 1000 replications of an AR(1) process, the LS test with n = 100 w 0 3 6 10 f 0.95 0.8 0.5 0.2 0.038 0.263 0.668 0.853 0.032 0.313 0.676 0.742 0.034 0.191 0.307 0.257 0.043 0.074 0.079 0.079 Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity 459 with s2a = 1. The roots of the autoregressive polynomial are 0.6, 0.7 and 0.8, so that the model is stationary. This model has been considered by Justel et al. (2001). The second outlier-free DGP is a nonlinear SETAR(2;3,1) process: Yt = { 2.1Yt −1 − 1.46Yt − 2 + 0.336Yt − 3 + ε t 1.2Yt −1 − 0.74Yt − 2 + 0.060Yt − 3 + ε t if Yt −1 ≤ −1.5 if Yt −1 > −1.5 (25) with AR roots (0.6, 0.7, 0.8) in regime 1, and (0.3, 0.4, 0.5) in regime 2. The threshold parameter r = −1.5 is chosen such that there are adequate observations in both regimes for efficient parameter estimation and nonlinearity testing. Time series data (with sample size n = 100) are generated from the two DGPs. We consider two outlier contamination situations: (i) no outliers, and (ii) a patch of four consecutive additive outliers, from t = 68 to t = 71, with sizes (11, 10, 9, 10). The proposed FGM tests with a = 5% and various AR orders (p = 1 to 5) are applied to the simulated series. It should be noted that the correct AR specification is p = 3. Table VI reports the empirical relative frequencies of rejecting the linear null hypothesis based on 1000 replications. When the underlying DGP is a linear AR(3) process, the rejection rates in Table VI are just the empirical sizes of the tests. The problem of AR order misspecification does not seem to create adverse effects on the sizes of the tests when there are no outliers. As can be seen from the first panel of results in Table VI, all the figures are reasonably close to the nominal 5% level. However, when a patch of additive outliers is introduced, the empirical sizes of the LS test become significantly larger than the nominal level, except for the case of p = 1. We will attempt to explain this exception later. On the other hand, the proposed FGM tests can keep the sizes steady at around the nominal level without being affected by the patch of outliers (see the second panel of results in Table VI). When the underlying DGP is a nonlinear SETAR(2;3,1) process, the rejection rates in Table VI (the third and the fourth panels) indicate the empirical powers of the tests. In general, the powers of the proposed FGM tests are robust with respect to outliers, as well as to the specification of the AR order. Analogous to the linearity case, the rejection rates of the LS test are significantly inflated due to the adverse effects of the patch of outliers. On the other hand, when the under-specification of the AR order is significant, the LS test might create a sudden drop in rejection rates. For example, in the third panel of Table VI, the rejection rates for the LS test are 0.428 (p = 3), 0.457 (p = 2) and 0.013 (p = 1). This phenomenon can provide an explanation for the exception mentioned above, where the empirical size of the LS test is not inflated by the effect of additive outliers for the case of p = 1. The inflation effect by the outliers might be neutralized by the pushdown effect of the under-specification of the AR order. REAL EXAMPLES In this section, we apply the new robust tests to analyze the linearity behavior (against the SETARtype nonlinearity alternative) of three well-known real datasets. In addition to applying FGM tests on the data, we would like to investigate the consistency of the results with the commonly used SETARCopyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 460 K. C. Hung et al. Table VI. The empirical relative frequencies of rejecting a linear model based on 1000 replications for the FGM test (a = 5%) with various specifications of the AR order p and a patch of additive outliers, n = 100 The FGM test with various y-functions AR specification k = 1.345 Huber k = 3.291 Huber c = 4.685 Tukey c = 15 Tukey c1 = 2.576 c2 = 3.291 Poly 0.010 0.065 0.046 0.045 0.037 0.014 0.073 0.039 0.039 0.033 0.009 0.089 0.067 0.049 0.056 0.010 0.075 0.039 0.041 0.032 0.002 0.081 0.053 0.047 0.034 (ii) With a patch of AO’s from t = 68 to 71, with sizes (11, 10, 9, 10) 1 0.016 0.014 0.018 0.014 2 0.410 0.057 0.061 0.095 3 0.399 0.039 0.038 0.065 4 0.390 0.027 0.031 0.047 5 0.543 0.028 0.013 0.050 0.017 0.044 0.019 0.015 0.007 0.010 0.092 0.056 0.041 0.039 (b) DGP: Nonlinear SETAR(2;3,1) process (i) No outliers 1 0.013 0.004 2 0.457 0.367 3 0.428 0.299 4 0.375 0.217 5 0.340 0.193 0.002 0.217 0.167 0.133 0.096 0.005 0.456 0.381 0.280 0.206 0.008 0.234 0.178 0.128 0.106 (ii) With a patch of AO’s from t = 68 to 71, with sizes (11, 10, 9, 10) 1 0.091 0.016 0.046 0.002 2 0.746 0.318 0.406 0.188 3 0.665 0.241 0.302 0.167 4 0.630 0.166 0.207 0.106 5 0.801 0.159 0.130 0.099 0.028 0.196 0.137 0.068 0.028 0.006 0.191 0.141 0.085 0.068 p (a) DGP: Linear AR(3) (i) No outliers 1 2 3 4 5 LS process 0.012 0.057 0.038 0.030 0.033 0.012 0.517 0.446 0.338 0.281 type nonlinearity tests, which include Tsay’s F test (Tsay, 1989); the CUSUM test (Petruccelli and Davies, 1986); the RC test (Petruccelli, 1990); the bootstrap version of the likelihood ratio LR–BS test (Hansen, 1996), and the Lagrange multiplier LM test (Luukkonen et al., 1988). Chan and Ng (2004) provide a brief review of these five tests. We first consider the Australian blowfly data (BLOWFLY). These are the bi-daily population sizes of blowflies obtained by Nicholson (1957). The dataset, together with a detailed description and nonlinear time series analysis, are given in Tong (1990). We shall only use the first 206 observations, because many researchers found SETAR-type nonlinearity and/or aberrant observations in this portion of the data (e.g., Tsay, 1988; Tong, 1990; Wong and Kohn, 1996). Following Tsay (1988), we employ Yt = Zt/1000 transformation, where Zt is the original series. Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity 461 Table VII. SETAR-type nonlinearity test results for the real datasetsa Test Datasets (a) Non-robust tests F CUSUM RC LR–BS LM (b) Robust FGM tests Huber (k = 1.345) Huber (k = 3.291) Tukey (c = 4.685) Tukey (c = 15) Poly (c1 = 2.576, c2 = 3.291) a BLOWFLY GAS UORDER NL (0.000) L (0.317) NL (0.030) NL (0.022) NL (0.000) NL (0.000) NL (0.000) NL (0.003) NL (0.000) NL (0.000) L (0.222) L (0.133) L (0.085) L (0.424) L (0.085) NL (0.000) NL (0.000) NL (0.007) NL (0.000) L (0.060) L (0.320) L (0.116) L (0.361) L (0.120) L (0.543) L (0.235) L (0.332) L (0.604) L (0.294) L (0.907) L, linear; NL, nonlinear; p-values are in parentheses. For the second dataset, we consider the monthly average wholesale prices of regular leaded gasoline in the United States between January 1973 and December 1978 (GAS). The data are listed in Liu (1991). The studies by Chan and Cheung (1994) and Chen (1997) conclude that nonlinear and/or outlier models should be employed for this time series. We shall use the first differenced series of the logarithmic transformed data in our analysis. The third real dataset is the monthly US industry-unfilled orders for radio and TV, in millions of dollars, from January 1958 to October 1980 (UORDER). We use the logged data and focus on the seasonally adjusted series, where the seasonal component was removed by the well-known X11ARIMA procedure. The original data are listed in Zellner (1983). Justel et al. (2001) detect a patch of additive outliers in the seasonally adjusted series. For the practical application of the tests, we need to determine p and d. Following Tsay (1989) and Sarantis (2001), we estimate the model order p by the cutoff pattern of the sample partial autocorrelation function (SPACF) of the observed series. The value of d is then searched among {1, 2, . . . , p} such that its corresponding p-value of the nonlinearity test is the smallest. Table VII provides the test results at the 5% level. Both the existing non-robust tests and the proposed FGM tests indicate SETAR-type nonlinearity for the BLOWFLY dataset in most cases. After downweighting the aberrant observations in the dataset, the p-values of the proposed FGM tests are in general less than those of the non-robust tests. The results for the GAS dataset are interesting. All the traditional non-robust tests identify the data as NL (nonlinear), while all of the FGM tests support the linearity (L) conclusion. The robust and non-robust tests give completely different conclusions in this dataset. Finally, all tests favor linearity conclusion of the UORDER dataset. The p-values for the non-robust tests are much less than those of the proposed FGM tests, which might be due to the potential effect of the patch of additive outliers in the data. CONCLUSION We have investigated the robustness properties of the classical Tsay (1989) F test for SETAR-type nonlinearity. The test is not robust to extreme additive outliers, and becomes very unreliable when Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 462 K. C. Hung et al. the data contain large outlying values. In practice, time series outliers are not encountered very often, so it is important to develop a more robust test for SETAR-type nonlinearity in time series analysis and forecasting. In this paper we propose a new robust nonlinearity test and the asymptotic null distribution of the test statistic is derived. A Monte Carlo experiment is carried out to compare the power of the proposed test and other existing tests under the influence of time series outliers. The effects of additive outliers on nonlinearity tests with misspecification of the autoregressive order are also studied. The results indicate that the proposed method is preferable to the classical tests when the observations are contaminated with outliers. In the empirical study we applied the tests to three well-known datasets, which have been judged to be outlier-contaminated by many time series analysts. The proposed FGM tests give consistent conclusions for each dataset under different choices of y-functions and they do not seem to be adversely affected by the outliers. The overall performance of the proposed tests is supported by the real examples. ACKNOWLEDGEMENTS The authors thank the departmental editor and an anonymous referee for their valuable suggestions, which helped to shape the final version of this paper. This work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region (Competitive Earmarked Research Grant Project No. CUHK7413/05H). REFERENCES Billingsley P. 1961. The Lindeberg–Lévy theorem for martingales. Proceedings of the American Mathematical Society 12: 788–792. Chan KS, Tong H. 1990. On likelihood ratio tests for threshold autoregression. Journal of the Royal Statistical Society B52: 496–476. Chan WS. 1992. A note on time series model specification in the presence of outliers. Journal of Applied Statistics 19: 117–124. Chan WS, Cheung SH. 1994. On robust estimation of threshold autoregressions. Journal of Forecasting 13: 37–49. Chan WS, Ng MW. 2004. Robustness of alternative nonlinearity tests for SETAR models. Journal of Forecasting 23: 215–231. Chen CWS. 1997. Detection of additive outliers in bilinear time series. Computational Statistics and Data Analysis 24: 283–294. De Gooijer JG, Kumar K. 1992. Some recent developments in nonlinear time series modelling, testing and forecasting. International Journal of Forecasting 8: 135–156. Denby L, Martin D. 1979. Robust estimation of the first-order autoregressive parameter. Journal of the American Statistical Association 74: 140–146. Franses PH, Van Dijk D. 2000. Non-linear Time Series Models in Empirical Finance. Cambridge University Press: Cambridge, UK. Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA. 1986. Robust Statistics: The Approach Based on Influence Functions. Wiley: New York. Handschin E, Kohlas J, Fiechter A, Schweppe F. 1975. Bad data analysis for power system state estimation. IEEE Transactions on Power Apparatus and Systems 2: 329–337. Hansen BE. 1996. Inference when a nuisance parameter is not identified under the null hypothesis. Econometrica 64: 413–430. Hoaglin DC, Frederick M, Tukey JW. 2000. Understanding Robust and Exploratory Data Analysis. Wiley: New York. Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for A Robust Test for SETAR-Type Nonlinearity 463 Justel A, Peña D, Tsay RS. 2001. Detection of outlier patches in autoregressive time series. Statistica Sinica 11: 651–673. Liu LM. 1991. Dynamic relationship analysis of US gasoline and crude oil prices. Journal of Forecasting 10: 521–547. Luukkonen R, Saikkonen P, Terasvirta T. 1988. Testing linearity against smooth transition autoregressive models. Biometrika 75: 491–499. Maronna RA, Martin RD, Yohai VJ. 2006. Robust Statistics: Theory and Methods. Wiley: New York. Moeanaddin R, Tong H. 1988. A comparison of likelihood ratio test and CUSUM test for threshold autoregression. The Statistician 37: 213–225. Nicholson AJ. 1957. The self-adjustment of populations to change. Cold Spring Harbour Symposia on Quantitative Biology 22: 153–173. Petruccelli JD. 1990. A comparison of tests for SETAR-type nonlinearity in time series. Journal of Forecasting 9: 25–36. Petruccelli JD, Davies N. 1986. A Portmanteau test for self-exciting threshold autoregressive-type nonlinearity in time series. Biometrika 73: 687–694. Rousseeuw PJ. 1984. Least median of squares regression. Journal of the American Statistical Association 79: 871–880. Rousseeuw PJ, Van Zomeren BC. 1990. Unmasking multivariate outliers and leverage points (with comments and rejoinder). Journal of the American Statistical Association 85: 633–651. Sarantis N. 2001. Nonlinearities, cyclical behaviour and predictability in stock markets: international evidence. International Journal of Forecasting 17: 459–482. Spath H. 1991. Mathematical Algorithms for Linear Regression. California Academic Press: Millbrae, CA. Tong H. 1978. On a threshold model. In Pattern Recognition and Signal Processing, Chen CH (ed.), Sijhoff and Noordoff: Amsterdam; 101–141. Tong H. 1983. Threshold Models in Non-linear Time Series Analysis. Springer: New York. Tong H. 1990. Non-linear Time Series: A Dynamical System Approach. Clarendon Press: Oxford. Tsay RS. 1986. Nonlinearity tests for time series. Biometrika 73: 461–466. Tsay RS. 1988. Nonlinear time series analysis of Blowfly population. Journal of Time Series Analysis 17: 247–263. Tsay RS. 1989. Testing and modelling threshold autoregressive processes. Journal of the American Statistical Association 84: 231–240. Van Dijk D, Franses PH, Lucas A. 1999. Testing for smooth transition nonlinearity in the presence of additive outliers. Journal of Business and Economic Statistics 17: 217–235. Wei WWS, Wei S. 1998. Comparison of some estimates of autoregressive parameters. Journal of Applied Statistical Science 8: 51–58. Wong CM, Kohn R. 1996. A Bayesian approach to estimating and forecasting additive nonparametric autoregressive models. Journal of Time Series Analysis 17: 203–220. Zellner A. 1983. Time Series Analysis of Economic Data. US Bureau of the Census, Government Printing Office: Washington, DC. Authors’ biographies: King Chi Hung gained her MPhil degree in Statistics from the Chinese University of Hong Kong in 2008. Her research interests include robust statistics and nonlinear time series analysis. Siu Hung Cheung is a professor in the department of Statistics at the Chinese University of Hong Kong and an adjunct professor in the Department of Statistics at National Cheng Kung University. He received a PhD in Statistics from Temple University, USA. His current research interests include multiple comparisons, adaptive designs and nonlinear time series analysis. Wai-Sum Chan is a professor in the Department of Finance at the Chinese University of Hong Kong. He received a PhD in Statistics from Temple University, USA. His current research interests include nonlinear time series analysis, clinical statistics and actuarial modeling. Li-Xin Zhang is a Professor in the Institute of Statistics and Department of Mathematics at Zhejiang University. He received a PhD in Statistics from Fudan University, PR China. His current research interests include clinical statistics, limit theorems, stochastic processes and time series analysis. Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for 464 K. C. Hung et al. Authors’ addresses: King Chi Hung, Department of Statistics, the Chinese University of Hong Kong, Shatin, Hong Kong. Siu Hung Cheung, Department of Statistics, The Chinese University of Hong Kong, Shatin, Hong Kong; Department of Statistics, National Cheng Kung University, Tainan, Taiwan. Wai-Sum Chan, Department of Finance, the Chinese University of Hong Kong, Shatin, Hong Kong. Li-Xin Zhang, Department of Mathematics, Zhejiang University, Zheda Road No. 38, Hangzhou 310027, PR China. Copyright © 2009 John Wiley & Sons, Ltd. J. Forecast. 28, 445–464 (2009) DOI: 10.1002/for

On a Robust Test for SETAR-Type Nonlinearity in Time Series Analysis

Related documents

Products

Support

On a Robust Test for SETAR-Type Nonlinearity in Time Series Analysis

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib