The Management Science Process

advertisement
Time Series Forecasting - Final Paper
BUS/461 - Applied Management Science
Introduction
When we observe events on a timely manner, this is called time series prediction
also known as a part of forecasting, or predicting an outcome in the future based on the
information from the past (Lawrence & Pasternack, 2002, p. 387). Time series then
involves basic information and data from the collective past and these are used, analyzed
and reviewed to predict the future of a specific product or say service. Time series
forecasting is important in making business decisions hence it is important that the
forecasting must be made precisely and correctly. Among the things we are hearing being
forecasted are the weather, future sale of a particular product or the coming temperature.
There are different time series forecasting modelling and in this paper, we will discuss the
importance of such information and how it is being used. In the end, we will be providing a
recommendation based on the conclusion that we will have on this paper.
The Importance of Time Series Data
The most important use of time series forecasting is to be able to predict a certain
future involving the product or service that the company is providing, in case a prediction
is inline, then it means that the company can now see possible problem or issues in the
future that they can address in advance. Besides, if they can address issues in advance, the
company can avoid losing more because they can have a backup plan all the time. One
company that uses forecasting all the time are BPO or call centers where in the volume of
calls that the company might need to answer over a period of time is being forecasted in
advance so the company can provide enough number of people to answer it. This means
effectively utilizing the resources and manpower that the company has while at the same
time being able to identify if each employee got all the tools they need in order to do their
work. Tools like computers, chairs and tables are all necessary in order to make each
individual employee function properly.
Aside from industry needs and corporate needs, there are other sector of the
community that uses forecasting and time series to be a useful part of humanity. Take for
example the scientific discoveries that we have in the past, the different diseases and
medical conditions that we have recorded and the different medications and researches
done to cure such medical condition. All of these are being reviewed and analyzed to get
better information about different current medical conditions that we have at the moment.
In cases where in gaps are visible, there is a big chance that filling such gap can actually
resolve the problem or the current condition that the doctors are trying to identify.
If there are statistics and data that need to be reviewed, time series can be used to
make sure that these data do correlate with each other in any way (Brillinger, 2000).
Besides, time series data is not an ordinary set of number but is in fact a data that means to
communicate and provide information.
Components of Time Series Data
Time series can be computed in different ways using different values that are
present for the study. Among the things that can be looked at are long term trends of a
particular product like sale in ten years, seasonal variations, random effects and cyclical
variation (Lawrence & Pasternack, 2002, p. 387). (Lawrence & Pasternack, 2002).
Long Term trend
When there is a slow evolving process in a certain period of time, long term trend
analysis can be used. The result on this analysis can be a good one or a bad one (upward or
downward trend) or sometimes it can be no change at all. When a trend is not showing any
changes, it is called stationary trend but when there are a lot of change, it is called nonstationary. For non-stationary data, it means that there are variables that changes from
time to time and that the variance varies as well. To check the validity of the variables,
correlation is being used. On the other hand, for non-stationary long trend time series, it
can be that the variables are not moving or it goes down to 0 (Pankratz, 1983). If the data
result in a time series forecasting is stationary, the autocorrelation must be estimated to
test the result's reliability. Using the Dickey-Fuller regression test is also possible where in
the model as follow can be used (Dickey & Fuller, 1979):
t = 1, 2, n. (1)
In this equation, the variable of interest will be Yt and the real number is
represented by Y0 = 0 with an error of 0 and at least 2 points variance. This equation can
be rewritten as (2).
H0: = 0 = null hypothesis
Seasonal Variation
When a change occurs on any of the variables that we have on the time series, this is
called a seasonal variation. The changes occurred must be on different seasons and time
series to call it seasonal variation. Seasonal variation can happen regardless of how long
the time frame is between the changes and when it occurred so it can be between a few
days or weeks or even longer. Longer variations are like demands for turkey which is very
high during thanksgiving where in almost every American family celebrates it with a turkey
hence a lot of order for a turkey meat needs to be met. Some of the seasonal variations that
we are experiencing are also for demands in cars which according to the data of GM
happens to be high for some time of the year. A graph is normally used to show all data.
From the graph, the entire data can be seen hence a particular movement at a specific date
of the year or month can be seen leading to the conclusion of the time when a particular
product sale would go high or low.
Cyclical Variation
In this variation, it means there are no patterns as to when change will happen.
Random Effects
When time series variables changes without any specific pattern, we call random
change. During a random change pattern, there are variables that need to be constant and
independent while one specific variable changes all the time. The use of small-shuffle
surrogate (SSS) is used to test for any random effects in time series data (Nakamura &
Small, 2007). Using the SSS though is not dependent on the way the information is being
distributed (Nakamura & Small, 2007).
Forecasting Using Time Series Data
Forecasting is a two-step method where in the first step to work on the data is
needed for the time series. Prior to collecting the data, the time or the length of time for the
data collection must be set first whether it will be a weekly or a monthly data collection.
With the data at hand, you can now determine the type of time series method to use. With
the present data, certain hypothesis must be given as well. Once data is gathered, the next
important step to take is to select the best model of time series to use for forecasting
purposes. There are variables that you need to consider prior to selecting the time series
method that you will use and there are models that varies because of certain factors. If you
have stationary time series, then you will require a mathematical model to make it work
while seasonal time series requires a different model. Lastly, once you have the model and
the data at hand, you can now begin with your forecasting work.
Forecasting Methods
There are different forecasting methods that we have and each method varies from
one another and can be used depending on some variables that we currently have. Let us
take a look at the most common forecasting methods we have.
Chatfield (2001) identified three forecasting methods that are commonly used =.
They are judgmental forecasts, univariate forecasts and multivariate forecasts.
When we say judgmental forecasts, the forecasts will be based on the intuition or
tacit knowledge of the person or the subjective input of the person. Relevant information is
used as well.
Time series is also called as univariate method of forecasting where in the data from
the past is being used believing that there are similarities from the past to the current
trends in the market.
If there are more values or factors being considered in the time series predictions,
then multivariate methods are being used. In this model, there are different models being
used and different equations yet the variables used are dependent on each other (Chatfield,
2001, p.12). (Chatfield, 2001).
In this paper, we will focus on the methods of univariate time series.
One sample is the long term trend within a stationary time series where in there is
just three or less equations used (Lawrence & Pasternack, 2002, p. 390).
(3)According to Lawrence & Pasternack (2002, p. 390), t can represent the value of
time series with a random error if the mean is stationary. Random error here are 0 and are
independent. If you will be doing a long term analysis here, the trend doesn't change and is
constant.
(4) In equation 4, there is a linear equation because of the coefficient used, because
there are different values used aside from 0, the time series is no longer stationary. When
there are errors randomly distributed, the trends on the time series can be checked using a
linear regression.
For seasonal time series forecast, it should be first identified whether it is an
additive or a multiplicative meaning additive seasonality is when the mean doesn’t affect
the variation while multiplicative variation is when the mean is proportion or doesn't
change (Chatfield, 2001, p. 30). According to (Chatfield, 2001, p. 30). Equation 7 can be
used to forecast additive seasonality.
(5) There represents the value of the time series while seasonal index is the mean of
the series. There is a random error as well. Using equation five, additive series can be
forecasted too if some values gets modified through logarithmic transformation.
(6) If you are doing a time series forecast with something of a random effects then
you will use a random walk model given with the following formula x(t+1) = x(t) + (t) (7)
(Chatfield, 2001; Spitzer, 2001).
There will be the value of t as the independent variable and the IID or the identically
distributed noise. When the value of t is represented by zero, then the drift is considered to
be random. Model 7 is not stationary but it can be transformed into a non-stationary by
putting in different values (Chatfield, 2001, p. 38). Differencing is a method of making
stationary series a non-stationary and a second differencing is normally encouraged to
attain such effect. The Box-Jenkins methodology which uses Autoregressive Moving
Average (ARMA) or Autoregressive Integrated Moving Average (ARIMA) is one of the most
common forecasting methods being used today. With the use of ARIMA, there are things
that can be performed like estimating model parameters and doing residual analysis using
the two terms of ARIMA methods (Hyndman, 2001). The two terms of ARIMA are the
autoregressive term and the moving average term. The first term means that the data used
for predictions are based from the past while the second term uses errors from random
data (Hyndman, 2001). There are 4 steps involved in the Box-Jenkins model. The first step
is identification where in relationship of data and the statistics they represent are being
checked. From there, data that will fit the model will be looked at and choose from. The
second step is estimation where in the models is being identified and estimated here. The
third step is diagnostic check to figure the adequacy of the statistical analysis done to the
values and the last one is forecasting or giving predictions based on the adequate analysis
made from step 3.
Forecasting method using the Box-Jenkins method is too popular among business
and healthcare providers and companies. There are programs and software applications
that use such and there are so many references that you can to review more information
about Box-Jenkins method (Pankratz, 1983).
Issues
Time series forecasting is not a 100% accurate that is why it is called forecasting
because there can be differences with the values being used like the time plotted and the
season used. There is also a problem if there are missing links or data necessary to make
the forecast. Accuracy can be sacrificed if there are missing data from the values being
used.
Forecasting is not just a prediction but it also lies on assumptions based on data
from the past and from estimations of the future. It means that using different methods for
estimation can mean different results and predictions. If there are new information coming
out, then it means that the forecast needs to be changed or modified as well (Chatfield,
2001, p. 16).
The data being used itself can be a problem as well when making forecasts. There
should be a random yet standardized collection of data during the data collection period.
Daily data for example should be collected at the same time of the day but sometimes when
people knew they are being observed and data are collected, they might tend to change
their behavior causing the data collection to change results.
Future Enhancements
The use of Box-Jenkins methodology can be a good enhancement for me since
almost every companies and corporation out there rely on it today when it comes to
forecasting. Also, there are a lot of new programs that can be used to make the Box-Jenkins
methodology more effective depending on the need of the company. Most programs today
just uses a software that can do the analysis and suggestion of model for the time series
then the forecast will be provided automatically. All you have to do with Box-Jenkins
method is to upload the data as needed.
Conclusion
Forecasting using the time series method is essential to almost all businesses out
there that I know. Forecasting and time series depends on different data gathered and used
and through time series, it is expected that companies can make good decisions that will
benefit the company. With different methods to use for time series, it is important to be
aware of the advantages and disadvantages of it before using any.
References
Brillinger, D. (2000). Time Series: General. 1-22. Retrieved from
http://www.stat.berkeley.edu/~brill/Papers/encysbs.pdf
Chatfield, C. (2001). Time-series forecasting (1st ed.). CRC Press.
Dickey, D. A., & Fuller, W. A. (1979). Distribution of the Estimators for Autoregressive Time
Series with a Unit Root. Journal of the American Statistical Association, 74(366), 427-431.
Hyndman, R. J. (2001, May 25, 2001). Box-Jenkins Modeling. Retrieved from
http://www.statistical-solutions-software.com/BMDP-documents/BMDP-2T.pdf
Lawrence, J., & Pasternack, B. (2002). Applied Management Science: Modeling, Spreadsheet
Analysis, and Communication for Decision Making (2nd ed.). Hoboken, NJ: John Wiley &
Sons.
Nakamura, T., & Small, M. (2007). Testing for Random Walk. Physics Letters A, 362(2-3),
189-197.
Pankratz, A. (1983). Forecasting with Univariate Box-Jenkins Models (Vol. 3): Wiley Online
Library.
Spitzer, F. (2001). Principles of Random Walk (Second Ed.). New York: Springer Verlag.
Download