Prediction of Urban Ozone Concentration with Artificial Neural Networks Negar Banan, Mohd Talib Latif and LiewJuneng School of Environmental and Natural Resource Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor, Malaysia Abstract. Artificial neural network model was used to predict the daily maximum of ozone concentration and we have applied it to the case of four monitoring stations in the urban namely (Gombak, Shah Alam, Putrajaya and Petaling) and predict values of meteorological variables. Artificial neural network was trained using samples of daily maximum data provided by the Malaysian Department of the Environment (DOE) over a period of nine year (2003-2011). Based on the results, it can be deduced that the relationship between the parameters and the ozone concentrations are highly complex and non-linear. Evaluation of the regression based model results between the four stations were analysied using the ANN. Based on the sample results it was confirmed that Putrajaya has the highest regression result of R= 0.64 in comparison with other stations. This study shows that one hidden layer more adroit in predicting the daily maximum ozone concentration. Keywords: Prediction, Artificial neural network, time series, urban ozone INTRODUCTION Ozone (O3) is considered to be secondary pollutant, photochemical oxidant and the main component of smog. It is also regarded as a crucial air pollutant in the atmosphere because O3 is capable of causing damage to human health via respiratory disease (Ho et al., 2007; Karakatsani et al., 2010; Neidell and Kinney, 2010; Mills et al., 2011; Huang et al., 2012). In addition, exposure to O 3 can lead to a decrease in lung function (Highfill and Costa, 1995). High concentrations of surface ozone also affect vegetation and forests due to the phytotoxic nature of O3. Ozone concentrations greater than 40 ppbv may be harmful to the crop yield, biomass production, vitality and stress tolerance of forest trees (Fuhrer et al., 1997). Excessively, high levels of O3 may be an obstacle to a forests’ capacity to seize carbon should there be an excess of carbon dioxide in the future (Karnosky et al., 2003; Banan et al., 2013). Modeling of O3 concentrations is not a trivial procedure due to the intrinsic relationship between pollutants and meteorological variables (Sousa et al., 2007). Widespread review of the statistical modeling of surface ozone based mostly on the basis of meteorological data has been presented by Thompson et al. (2001). Wang et al. (2003) studied statistical characteristics of ozone to enable selection of appropriate predictors of daily maximum ozone levels and emphasized the inclusion of factors that affect both photochemical production and atmospheric accumulation of ozone when predicting ozone levels. ANN models are more suitable amid the data-based models without needing explicit mathematical representations. ANN performs better in prediction cases rather than statistical regression analysis (Tasadduq et al., 2005; Yigit and Ertunc, 2006). This paper artificial neural network is used to predict daily maximum ozone concentration in urban areas using the precursors and meteorological variables. MATERIALS AND METHODS 1. Data Sets This experiment is performed on four datasets collected from the air quality monitoring sites by the Department of the Environment (DOE) in Malaysia and managed by a private company, Alam Sekitar Sdn Bhd (ASMA). Daily maximum concentration of air pollutants (O3, NO, NO2, NOx, CO, PM10 and NMHC) and daily maximum meteorological variables (Ambient temperature (AT), Humidity (H), Wind speed (WS) and wind direction (WD)) were used in this research. The logarithm of the daily maximum ozone concentrations of four stations, namely Gombak, Shah Alam, Putrajaya and Petaling Jaya stations, which are located in Kalang Valley in the middle of the Malaysian Peninsula, were extracted and used for modeling. The concentration data were obtained during the nine-year period from (2003-2011). All 2 the data were used to evaluate the predicting performance of the modeling data. The characteristics of the datasets used are summarized in Table 1. TABLE1. Characteristics of the datasets 2. Datasets No. of attributes Training set 70% Testing set 20% Validation set 10% Gombak 10 1019 291 146 Shah Alam 10 1732 495 247 Putrajaya 10 1468 419 210 Petaling Jaya 10 1988 568 284 ANN Model for Air Quality Prediction ANN is modeled as the neurons connected or functionally-related to each other; imitate the behavior of human biological neurons. Neural networks have found extensive application in diverse areas such as; pattern recognition and classification, time series prediction and modeling (Behrang et al., 2010). Neurons which are the basic components of the neural network are interconnected through different layers such as input, hidden and output layers. The degree of interconnection is expressed by the weight which means the impact of a neuron on a neuron. The representation of neural network is shown in Figure 1. NO NO2 NOx CO PM10 O3 NMHC Output layer AT H WS Hidden layer WD Input layer FIGURE 1. Schematic of a neural network The topology of ANN plays a critical role in its performance evaluation. A multilayer feed forward network model with one hidden layer has the inherent capability to forecast any complex nonlinear function. In as much as there are adequate numbers of hidden layer neurons in the model. Hence, in this literature, multilayer feed forward networks models consisting of one hidden layer were used. The choices of optimum number of hidden layers are very crucial to ensure optimal performance accuracy of ANN. In the meantime, there has not been any acceptable theory to determine the adequate number of hidden layer neuron need to be used for a particular problem. A rule of thumb to determine the optimal numbers of neurons is to start with a few numbers of neurons and increase in a stepwise direction. In the course of the experiment, the performance of the each hidden neuron was monitored against a chosen performance criteria. Lastly, in order to reduce the training difficulty and balance the importance of each parameter during the training process, the data inputs were normalized. It is recommended that the data be 3 normalized between slightly offset values such as 0.1 and 0.9. One way to scale the input and output variables in the interval [0.1, 0.9] as Pn= 0.1+ (0.9-0.8) (P − Pmin) / (Pmax − Pmin) Xu et al. ( 2007). The data set is randomized into three distinct sets called training, testing and validation sets. The training set constitutes the largest data inputs of about 70% and is used by the neural network to acquire the patterns present in the data. Next is the testing set making up of about 20% of the data set and is used to evaluate the generalization ability of supposedly trained network. While the remaining 10% of the data is for checking on the performance of the trained network. The model of employed ANN is shown in Figure 1. It consists of three layers of neurons including input, hidden, output layers. The input layer is the first layer of neurons. Each input neuron represents a separate attribute in the train/test datasets station (for example from NO to WD in Figure 1). The number of the inputs is equal to the number of attributes in the dataset. The number of nodes for other hidden layers is equal to the half of the number of nodes in the previous layer. Experimental Results Artificial neural network have been applied in this study. The input and target values were normalized into the range of [0.1and 0.9] in the pre-processing phase. The weights and biases were adjusted based on gradient-descent method in the training phase. The correlation coefficient (R) was chosen as the statistical criteria for measuring the network performance. The results are value was shown in Table 2. TABLE2. Correlation coefficient Datasets Correlation coefficient (R) Gombak 0.62945 Shah Alam 0.63819 Putrajaya 0.64199 Petaling Jaya 0.57657 Figure 2 represents the network performance versus the number of epochs. The network performance starts by a large value at the first epochs and due to training, the weights are adjusted to minimize this function which makes it decreasing. Moreover, a black dashed line is plotted representing the best validation performance of the network. The training stops when the green line which represents the validation training set (network performance) intersects with the black line. Gombak Shah Alam 4 Putrajaya Petaling Jaya FIGURE 2.Performance function of the network during training Network structure is, 10-5-1 Regression analysis was performed to investigate the correlation between the actual and predicted results based on the value of correlation coefficient (R). The perfect fit between the training data and the produced results was indicated by the value of R which is equal to 1. Figure 3 shows the regression analysis plots of the network structure. In a regression plot, the perfect fit which shows the perfect correlation between the predicted and targets is indicated by the solid line. The dashed line indicates the best fit produced by the algorithm. Gombak Putrajaya Shah Alam Petaling Jaya FIGURE 3.Regression plot analysis Network structure, 10-5-1 From Table 2, it can be seen that model with network structure 10-5-1 is the best models for the air quality prediction as it yields the highest values of correlation coefficient (R). As shown in Figure3, a good agreement between predicted and measured values based on the value of correlation coefficient (R). 5 In order to get a sense of the solutions distribution, we examined the minimum, maximum, median, lower quartile and upper quartile using a box and whisker plots as presented in Figure 4,a ( one hidden layer for Putrajaya , Petaling Jaya, Shah Alam, Gombak stations) and Figure 4,b (two hidden layer for Putrajaya , Petaling Jaya, Shah Alam, Gombak stations). From Figure4,a based on one hidden layer on ANN algorithm, it can be seen that most of the observations are concentrated in the low end of the scale. This shows that the solution’s distribution is skewed to the lower end. However, there is a case where the solution’s distribution is symmetric, where the observation is evenly split at the median (i.e., Shah Alam at left side). However, based on the two hidden layer on ANN algorithm (Figure4,b), there is a case that the observation is concentrated on the upper end of the scale which shows that the solution’s distribution is skewed to the upper end (i.e., Gombak at the left side and Putrajaya at the right side) and there is a case that the observation is concentrated in the lower end of the scale which shows that the solution’s distribution is skewed to the lower end (i.e., Putrajaya at the left side and Petaling Jaya at the right side). Based on the patterns presented for the distribution of the result, generally we can say that the one hidden layer on ANN algorithm is better than two hidden layer on ANN algorithm when tested on the Putrajaya, Petaling Jaya, Shah Alam, Gombak stations, but of course a statistical test should be carried out to support this claim. 4 0.675 0.625 0.650 0.600 5 Correlation coefficient Correlation coefficient 0.625 3 0.600 0.575 1 0.550 0.575 7 7 10 0.550 0.525 8 Gombak Shah Alam Putrajaya petaling Jaya Stations Gombak Shah Alam Putrajaya Petaling Jaya Stations 1 hidden layer (a) 2 hidden layers (b) FIGURE 5(a) and 5(b). Box-and-whisker plots on the four datasets Conclusion Artificial neural network employed to predict ozone concentrations as a perform of meteorological conditions and precursor concentrations. The network was trained utilizing daily maximum data that provided from the Malaysian department of the Environment throughout a nine year (2003-2011) into the cities of Gombak, Shah Alam, Putrajaya and Petaling Jaya in Malaysia. In this research, urban ozone concentrations are estimated utilizing surface meteorological variable as predictors by an artificial neural network for four areas in Malaysia. The experimental results using four stations have exposed that the proposed algorithms effectively to predict ozone concentrations. Further studies will be devoted to validate the artificial neural network for the function of predict ozone concentrations as a function of meteorological conditions and precursor concentrations for monthly scales. ACKNOWLEDGEMENT The authors would like to thank Universiti Kebangsaan Malaysia for Research University Grant (DIP2012-020 and LRGS/TD/2011/UKM/PG01) and the Malaysian Department of the Environment (DOE) for providing all the necessary investigative information and air quality data in the process of conducting research. REFFERENCES 6 Banan, N. Latif, M.T. Juneng, L. & Ahmad, F. 2013. Characteristics of surface ozone concentrations at stations with different backgrounds in the Malaysian Peninsula. Aerosol Air Qual. Res.13: 10901106. Behrang, M. A. Assareh, E. Ghanbarzadeh, A. & Noghrehabadi. A. R. 2010. The potential of different artificial neural network (ANN) techniques in daily global solar radiation modeling based on meteorological data. Solar Energy. 84: 1468-1480. Fuhrer, J., Skarby, L. & Ashmore, M.R. 1997. Critical Levels for Ozone Effects on Vegetation in Europe. Environ. Pollut. 97: 91-106. Highfill, J.W. & Costa, D.L. 1995. Statistical response models for ozone Exposure: Their generality when applied to human spirometric and animal permeability functions of the Lung. J. Air Waste Manage. Assoc. 45: 95-102. Ho, W.C., Hartley, W.R., Myers, L., Lin, M.H., Lin, Y.S., Lien, C.H. & Lin, R.S. 2007. Air Pollution, Weather, and Associated Risk Factors Related to Asthma Prevalence and Attack Rate. Environ. Res. 104: 402-409. Huang, H.L., Lee, M.G. & Tai, J.H. 2012.Controlling Indoor Bioaerosols Using a Hybrid System of Ozone and Catalysts. Aerosol Air Qual. Res. 12: 73-82. Karakatsani, A., Kapitsimadis, F., Pipikou, M., Chalbot, M.C., Kavouras, I.G., Orphanidou, S. D., Papiris, S. &Katsouyanni, K. 2010. Ambient Air Pollution and Respiratory Health in Mail Carriers.Environ. Res. 110: 278-285. Karnosky, D.F., Zak, D.R. & Pregitzer, K.S. 2003. Tropospheric O3 Moderates Responses of Temperate Hardwood Forests to Elevated CO2: A Synthesis of Molecular to Ecosystem Results from the Aspen FACE Project. Funct. Ecol. 17: 289-304. Mills, G., Hayes, F., Simpson, D., Emberson, L., Norris, D. Harmens, H. &Buker, P. 2011. Evidence of Widespread Effects of Ozone on Crops and (Semi-) Natural Vegetation in Europe (1990-2006) in Relation to AOT40-and Flux-Based Risk Maps.Global Change Biol. 17: 592-613. Neidell, M. & Kinney, P.L. 2010. Estimates of the Association between Ozone and Asthma Hospitalizations that Account for Behavioral Responses to Air Quality Information. Environ. Sci. Policy 13: 97-103. Sousa, S.I.V., Mrtins, F.G., Alvim-Ferraz, M.C.M. & Pereira, M.C. 2007. Multiple linear regression and artificial neural networks based on principle components to predict ozone concentrations. Environmental Modeling & Software. 22: 97-103. Tasadduq, I., Rehman, S. & K. Bubshait. 2005. Application of neural networks for the prediction of hourly mean surface temperatures in Saudi Arabia. Renewable Energy. 25: 545-554. Thompson, M.L., Reynolds, J., Cox, L.H, Guttorp. P. & Sampson.P.D. 2001. A review of statistical methods for the meteorological adjustment of tropospheric ozone. Atmos. Environ. 35: 617-630. Wang, W., Lu, W., Wang, X. & Leung, A.Y.T. 2003. Prediction of maximum daily ozone level using combined neural network and statistical characteristics. Environmental International. 29: 555562. Xu, L., Jiandong, X., Shizhong, W., Yongzhen, Z. &Rui, L. 2007. Optimization of heat treatment technique of high-vanadium high-speed steel based on back-propagation neural networks. Materials & Design. 28: 1425-1432. Yigit, K. S. & Ertunc, H. M. 2006. Prediction of the air temperature and humidity at the outlet of a cooling coil using neural networks. International Communications in Heat and Mass Transfer. 33: 898907. 7 8 9