vii TABLE OF CONTENTS CHAPTER TITLE DECLARATION ii DEDICATION iii ACKNOWLEDGEMENT iv ABSTRACT v ABSTRAK vi TABLE OF CONTENTS vii LIST OF TABLES x LIST OF FIGURES xi LIST OF ABBREVIATIONS 1 PAGE xiii INTRODUCTION 1 1.1 Introduction 1 1.2 Background Study 3 1.3 Problem Statement 7 1.4 Research Goal 9 1.5 Research Objective 9 1.6 Research Scope 9 1.6 Significance of The Study 10 viii 2 3 LITERATURE REVIEW 11 2.1 Introduction 11 2.2 Autoregressive Integrated Moving Average (ARIMA) 11 2.3 Artificial Neural Network 13 2.4 Least Square Support Vector Machine 17 2.5 Self Organizing Map 20 2.6 Hybrid Model 24 2.7 Input Determination 25 2.8 Summary 26 RESEARCH METHODOLOGY 28 3.1 Introduction 28 3.2 River Flow Forecasting 25 3.2.1 29 River Flow Forecasting Model 3.3 Autoregressive Integrated Moving Average Model 29 3.3.1 ARIMA Tentative Model 31 3.3.1.1 The ARIMA (p, 0, 0) Model 31 3.3.1.2 The ARIMA (0, 0, q Model 33 3.3.1.3 The ARIMA (p, d, q) Model 35 3.3.1.4 The SARIMA (p, d, q) x (P, D,Q)s) Model 36 3.3 Artificial Neural Network Model 37 3.3.1 FeedForward Network 38 3.3.1.1 39 Multilayer Perceptron 3.4.2 Backpropagation Algorithm 40 3.4.3 The ANN Architecture 41 3.4.3.1 The Number of Input Nodes 41 3.4.3.2 The Number of Hidden Layers and Hidden Nodes 41 3.4.3.3 The Number of Output Nodes 42 3.4.3.3 Activation Function 42 ix 3.5 Least Square Support Vector Machine Model 44 3.5.1 Karush-Kuhn-Tucker 45 3.5.2 Validation Technique 46 3.5.2.1 46 3.5.3 LSSVM Algorithm 46 3.5.1 Kernel 50 3.5.2.1 3.6 4 Kernel Function 50 Self Organizing Map (SOM) Model 52 3.6.1 Train The Network 53 3.6.1.1 Input Determination 53 3.6.1.2 Weight Vector 54 3.6.1.3 Learning Rate 55 3.6.1.4 Determining The Winning Node 55 3.6.1.5 Topological Neighbourhood 56 3.6.1.6 Neighbourhood Function 57 3.6.1.7 Updating The Weight 58 3.6.2 3.7 Cross Validation SOM Algorithm Framework of The Study 58 60 DATA COLLLECTION AND ANALYSIS 63 4.1 Introduction 63 4.2 Data Source 63 4.2.1 Case Study 1 (Muda River) 65 4.2.2 Case Study 2 (Selangor River) 66 4.2.3 Case Study 3 (Bernam River) 68 4.3 Statistical Measure 69 4.4 Input Determination 71 4.4.1 Stepwise Regression Analysis 71 4.4.2 Step In Stepwise Regression Analysis 72 x 5 4.4.3 Input determination For Muda River 74 4.4.3 Input determination For Selangor River 76 4.4.3 Input determination For Bernam River 77 EXPERIMENT AND RESULT ANALYSIS 79 5.1 Introduction 79 5.2 Experiment Setup 79 5.3 Fitting The ARIMA Model To The Monthly River Flow 80 5.3.1 Case Study 1 – Muda River 81 5.3.1.1 Identification Step 81 5.3.1.2 Estimation and Diagnostic Checking Step 5.3.1.3 Forecasting Step 5.3.2 85 89 Case Study2 – Selangor River 90 5.3.2.1 Identification Step 91 5.3.2.2 Estimation and Diagnostic Checking Step 5.3.3 94 5.3.2.3 Forecating Step 98 Case Study 3 – Bernam River 99 5.3.3.1 Identification Step 100 5.3.3.2 Estimation and Diagnostic Checking Step 5.3.3.3 Forecasting Step 5.4 5.5 Fitting The ANN Model To The Monthly River Flow 104 104 109 5.4.1 Case Study 1 – Muda River 110 5.4.2 Case Study 2 – Selangor River 113 5.4.3 Case Study 3 – Bernam River 116 Fitting The LSSVM Model To The Monthly River Flow 119 5.5.1 Case Study 1 – Muda River 120 5.5.2 Case Study 2 – Selangor River 121 5.5.3 Case Study 3 – Bernam River 123 xi 5.6 Fitting The SOM-LSSVM Model To The Monthly River Flow 6 7 125 5.6.1 Case Study 1 – Muda River 126 5.6.2 Case Study 2 – Selangor River 128 5.6.3 Case Study 3 – Bernam River 131 COMPARISON AND DISCUSSION 134 6.1 Introduction 134 6.2 Muda River 134 6.3 Selangor River 139 6.4 Bernam River 143 CONCLUSION AND RECOMMENDATION 148 7.1 Introduction 148 7.2 Conclusion 148 7.3 Contributions 150 7.4 Suggestions For Future Research 151 REFERENCES 153 Appendices A-I 166 xii LIST OF TABLES TABLE NO. TITLE PAGE 4.1 List of River Name 64 4.2 The Input Data Structures For Muda River 75 4.3 The Input Data Structures For Selangor River 77 4.4 The Input Data Structures For Bernam River 78 5.1 List of MSE and Tentative ARIMA Models for Muda River 85 5.2 The Performance Evaluation for The Tentative Models 86 5.3 List of MSE and Tentative ARIMA Models for Selangor River 94 5.4 The Performance Evaluation for The Tentative Models 95 5.5 List of MSE and Tentative ARIMA Models for Bernam River 104 5.6 The Performance Evaluation for The Tentative Models 105 5.7 The Network Architecture of ANN For Muda River ( n =input nodes) 5.8 The Result for The Training and Testing for Muda River Using ANN Model 5.9 115 The Network Architecture of ANN for Bernam River ( n =Input Nodes) 5.12 114 The Result for The Training and Testing for Selangor River Using ANN Model 5.11 112 The Network Architecture of ANN for Selangor River ( n =Input Nodes) 5.10 110 117 The Result for The Training and Testing Bernam River Using ANN Model 118 xiii 5.13 The Result for The Training and Testing for Muda River Using LSSVM Model 5.14 The Result for The Training and Testing for Selangor River Using LSSVM Model 5.15 122 The Result for The Training And Testing for Bernam River Using LSSVM Model 5.16 120 124 The Result for The Training and Testing Using A Hybrid Model of SOM-LSSVM With Different of Map Sizes for Muda River 5.17 127 The Result for The Training And Testing Using A Hybrid Model of SOM-LSSVM With Different of Map Sizes for Selangor River 5.18 129 The Result for The Training and Testing Using A Hybrid Model of SOM-LSSVM With Different of Map Sizes for Bernam River 6.1 132 The Comparative Performance of ARIMA, ANN, LSSVM and SOM-LSSM Models for Muda River. 6.2 135 The Comparative Performance of ARIMA, ANN, LSSVM and SOM-LSSM Models for Selangor River. 6.3 139 The Comparative Performance of ARIMA, ANN, LSSVM and SOM-LSSM Models for Bernam River. 144 xiv LIST OF FIGURES FIGURE NO. 2.1 TITLE PAGE Architecture Of A Three-Layer Back Propagation Neural Network 15 2.2 The Self Organizing Maps Architecture 21 3.1 The Feed Forward Network 38 3.2 Multilayer Perceptron Architecture 40 3.3 The Sigmoid Function 44 3.4 The Linear Function 44 3.5 An Illustrated Of The Proposed Model 60 3.6 The Framework Of Study 62 4.1 Monthly River Flow For Muda River From January 1962 – December 1989 4.2 Monthly River Flow For Selangor River From January 1960 – December 2006 4.3 67 Monthly River Flow For Bernam River From January 1966 – December 2008 5.1 66 69 Non-Stationary Time Series Plot of Muda River From January 1962 – December 1989 81 5.2(a) The ACF of Monthly Time Series for Muda River, Kedah 81 5.2(b) The PACF of Monthly Time Series for Muda River, Kedah 83 5.3 The Stationary Time Series Plot for Muda River After Differencing From January 1962 – December 1989 5.4 (a) 83 The ACF After Differencing for Monthly Time Series for Muda River, Kedah 84 xv 5.4(b) The PACF After Differencing for Monthly Time Series for Muda River, Kedah 5.5(a) 84 The ACF of Residuals for ARIMA (0, 1, 2) x (3, 0, 2)12 Without Constant for Monthly Time Series for Muda River, Kedah 5.5(b) 88 The PACF of Residuals for ARIMA (0, 1, 2) x (3, 0, 2)12 Without Constant for Monthly Time Series for Muda River, Kedah 5.6 The Observed and Predicted ARIMA (0, 1, 2) x (3, 0, 2)12 for Monthly River Flow of Muda River 5.7 88 89 Non-Stationary Time Series Plot of Selangor River From January 1960 – December 2006 90 5.8(a) The ACF of Monthly Time Series for Selangor River, Selangor 92 5.8(b) The PACF of Monthly Time Series for Selangor River, Selangor 92 5.9 The Stationary Time Series Plot of Selangor River After Differencing From January 1960 – December 2006 5.10(a) The ACF After Differencing for Monthly Time Series for Selangor River, Selangor 5.10(b) 93 The PACF After Differencing for Monthly Time Series for Selangor River, Selangor 5.11(a) 93 94 The ACF of Residuals for ARIMA (1, 0, 2) x (0, 1, 1) 12 Without Constant for Monthly Time Series for Selangor River, Selangor 5.11(b) 97 The PACF of Residuals for ARIMA (1, 0, 2) x (0, 1, 1) 12 Without Constant for Monthly Time Series for Selangor River, Selangor 5.12 The Observed and Predicted ARIMA (1, 0, 2) x (0, 1, 1) 12 for Monthly River Flow of Selangor River 5.13 98 99 Non-Stationary Time Series Plot of Bernam River From January 1966 – December 2008 100 5.14(a) The ACF of Monthly Time Series for Bernam River, Selangor 101 5.14(b) The PACF of Monthly Time Series for Bernam River, Selangor 102 xvi 5.15 Stationary Time Series Plot of Bernam River After Differencing From January 1966 – December 2008 5.16(a) The ACF After Differencing for Monthly Time Series for Bernam River, Selangor 5.16(b) 103 The PACF After Differencing for Monthly Time Series for Bernam River, Selangor 5.17(a) 102 103 The ACF of Residuals for ARIMA (2, 0, 0) x (2, 0, 2)12 Without Constant for Monthly Time Series for Bernam River, Selangor 5.17(b) 107 The PACF of Residuals for ARIMA (2, 0, 0) x (2, 0, 2)12 Without Constant for Monthly Time Series for Bernam River, Selangor 5.82 The Observed and Predicted ARIMA (2, 0, 0) x (2, 0, 2)12 for Monthly River Flow of Bernam River 5.19 128 The Observed and Predicted SOM-LSSVM for Monthly River Flow of Selangor River 5.27 125 The Observed and Predicted SOM-LSSVM for Monthly River Flow of Muda River 5.26 123 The Observed and Predicted LSSVM for Monthly River Flow of Bernam River 5.25 121 The Observed and Predicted LSSVM for Monthly River Flow of Selangor River 5.24 119 The Observed and Predicted LSSVM for Monthly River Flow of Muda River 5.23 116 The Observed and Predicted ANN for Monthly River Flow of Bernam River 5.22 113 The Observed and Predicted ANN for Monthly River Flow of Selangor River 5.21 108 The Observed and Predicted ANN for Monthly River Flow of Muda River 5.20 107 130 The Observed and Predicted SOM-LSSVM for Monthly River Flow of Bernam River 133 xvii 6.1(a) The Observed and Predicted River Flow During Testing Period By ARIMA, For Muda River 6.1(b) The Observed and Predicted River Flow During Testing Period By ANN For Muda River 6.1(c) 146 The Observed and Predicted River Flow During Testing Period By SOM-LSSVM For Bernam River 6.6 145 The Observed and Predicted River Flow During Testing Period By LSSVM For Bernam River 6.5 (d) 145 The Observed and Predicted River Flow During Testing Period By ANN For Bernam River 6.5 (c) 143 The Observed and Predicted River Flow During Testing Period By ARIMA, For Bernam River 6.5 (b) 142 The Boxplot for Predicted River Flow Using ARIMA, ANN, LSSVM And SOM-LSSVM For Selangor River 6.5 (a) 141 The Observed and Predicted River Flow During Testing Period By SOM-LSSVM, For Selangor River 6.4 141 The Observed and Predicted River Flow During Testing Period By LSSVM, For Selangor River 6.3 (d) 140 The Observed and Predicted River Flow During Testing Period By Ann, For Selangor River 6.3 (c) 138 The Observed and Predicted River Flow During Testing Period By ARIMA, For Selangor River 6.3 (b) 137 The Boxplot For Predicted River Flow Using ARIMA, ANN, LSSVM And SOM-LSSVM For Muda River 6.3 (a) 137 The Observed and Predicted River Flow During Testing Period By SOM-LSSVM For Muda River 6.2 136 The Observed and Predicted River Flow During Testing Period By LSSVM For Muda River 6.1 (d) 136 146 The Boxplot for Predicted River Flow Using ARIMA, ANN, LSSVM And SOM-LSSVM For Bernam River 147 xviii LIST OF SYMBOLS p - The non seasonal auto regression d - The regular differencing q - The non seasonal moving average P - The seasonal auto regression D - The seasonal differencing Q - The seasonal moving average S - The length of season xt - The current value of the time series - The constant at - The random errors 1 , 2 ,...., p - The AR parameters of the model 1 , 2 , q - The MA parameters of the model - The mean of time series t 1 , t 2 ,..., t q - The errors of the model yt - The output layers, xt i - The input of the network, wj - The connection weights between nodes of input and hidden layers wi - The connection weights between nodes of hidden and output layers, g (.) and f (.) - The activation function. (.) - Map from input space to feature space xix J - The loss function w, e - The output layer vector or parameter vector in primal space wT - The transpose of w - The regularization constants b - Bias term L - The Lagrangian (t ) - The learning rate (t 1) - updated learning rate j - The updated neighbourhood function || X W j || - Represent Euclidean distance (t ) - The bandwidth of the Gaussian RBF (t ) - The learning rate at time t W j (t ) - The old weight vector. h ji - The neighbourhood function W j (t 1) - The The adjusted weight vector at time (t 1) . xx LIST OF ABBREVATIONS ACF - Autocorrelation Function ANN – Artificial Neural Network AR – Autoregressive ARIMA – Autoregressive Integrated Moving Average BP - Backpropagation FFN - Feed Forward Network LSSVM – Least Square Support Vector Machine MA – Moving Average MLP - Multilayer Perceptron MAE - Mean Absolute Error MSE - Mean Square Error PACF - Partial Autocorrelation Function R - Correlation Coefficient RMSE - Root Mean Square Error SARIMA - Seasonal Autoregressive Integrated Moving Average SOM – Self Organizing Map SVM – Support Vector Machine SVR - Support Vector Regression xxi LIST OF APPENDICES APPENDIX TITLE PAGE A1 Monthly River Flow Data For Muda River 166 A2 Monthly River Flow Data For Selangor River 168 A3 Monthly River Flow Data For Bernam River 171 B1 Input Determination For Muda River (K7) 174 B2 Input Determination For Selangor River (L7) 180 B3 Input Determination For Bernam River (M7) 190 C1 Input Determination For Selangor River (L8) 199 C2 Input Determination For Bernam River (M8) 208 D1 Comparison Between The Observations And Predictions Using ARIMA For Monthly River Flow In Muda River D2 Comparison Between The Observations And Predictions Using ANN For Monthly River Flow In Muda River D3 220 Comparison Between The Observations And Predictions Using ANN For Monthly River Flow In Selangor River E3 219 Comparison Between The Observations And Predictions Using ARIMA For Monthly River Flow In Selangor River E2 218 Comparison Between The Observations And Predictions Using SOM-LSSVM For Monthly River Flow In Muda River E1 217 Comparison Between The Observations And Predictions Using LSSVM For Monthly River Flow In Muda River D4 216 221 Comparison Between The Observations And Predictions Using LSSVM For Monthly River Flow In Selangor River 222 xxii E4 Comparison Between The Observations And Predictions Using SOM-LSSVM For Monthly River Flow In Selangor River F1 Comparison Between The Observations And Predictions Using ARIMA For Monthly River Flow In Bernam River F2 I 231 The Predicted Monthly River Flow In Selangor River Using ARIMA H3 230 The Predicted Monthly River Flow In Muda River Using ARIMA H2 229 The Estimated Parameter For Tentative Models For Bernam River H1 228 The Estimated Parameter For Tentative Models For Selangor River G3 227 The Estimated Parameter For Tentative Models For Muda River G2 226 Comparison Between The Observations And Predictions Using SOM-LSSVM For Monthly River Flow In Bernam River G1 225 Comparison Between The Observations And Predictions Using LSSVM For Monthly River Flow In Bernam River F4 224 Comparison Between The Observations And Predictions Using ANN For Monthly River Flow In Bernam River F3 223 232 The Predicted Monthly River Flow In Bernam River Using ARIMA 233 The List of Publication 234