Freshman Meteo forecasting comparison project

advertisement
Faith Eherts, Erin Kreher, Ryan Dreedon, Eric Donten
Since the city under observation is Louisville, KY, temperatures were expected to range
from the mid-80s (at the commencement of the project, in early September) to about 50 degrees
(beginning of December). It was found that, when the mean absolute errors and mean errors of
each provider (The Weather Channel/TWC, Accuweather, and Weather Underground) were
determined and compared (see Chart 1), The Weather Channel very clearly comes out on top –
with the smallest error values reported virtually every day ahead in the extended forecast.
However, these charts do not display the standard deviations on any given day – say, day five and, based on a statistical one-way ANOVA test, the differences between the errors of the
providers on this day are not significant enough to draw any valid conclusions. On the other
hand, when the one-way ANOVA test is done using the values from day 10, there is sufficient
evidence to determine that one piece of data significantly differs from the others. Therefore,
although one could not legitimately conclude that TWC puts out the most accurate forecasts, it
can be said that Weather Underground puts out the least accurate. Each provider’s biases can
also be unveiled using these error calculations. It was observed that TWC harbors a very slight
but very consistent cold bias, since every forecast date, on average, has negative error values.
Weather Underground shares with TWC a cold bias trend, but one that is far stronger – and gets
stronger as the forecast day goes out - than that of TWC. On the other hand, Accuweather is more
variable, having days where error is slightly negative and others where it is slightly positive,
making it more difficult to discern any patterned bias. Climatology, interestingly, usually underpredicted the temperature, suggesting that this autumn was warmer than average. Based on
Graph 1, it would be expected that climatology becomes more accurate than TWC forecasts after
day 12 (and for Accuweather and Weather Underground after day nine, graphs not shown).
Frequency distributions (graphs two through five) provide proof that mean absolute error is not
always the most efficient way to compare data. In comparing these skewed-left histograms, it is
easier to understand which provider has the most accurate and consistent forecasts. The accuracy
of the forecasts, specifically for seven days ahead, also shifts dramatically depending on the
season. By performing a difference of means test on the data gathered from September 16th
through October 16th and from November 1st through November 30th, there is sufficient statistical
evidence to suggest a fairly dramatic shift in absolute value error as the season changes – and
that forecast accuracy was higher during the beginning of autumn than at the end. Another error
measure that can be utilized to assess the accuracy of forecasts is the squared error. The squared
error is able to convey information about the dispersion of the forecasted temperatures about the
actual recorded temperature. The mean squared errors at the bottom of each column in the
spreadsheet are equal to the average variance of the forecasted temperatures for each particular
day, and indicate that providers with large variances are inconsistent. A good way to measure the
consistency of a provider as a given day approaches is to find the sum of the absolute values of
the differences between forecasted temperatures of adjacent days (See Example 1). The results
are as follows: TWC, 10.9 F; Accuweather, 10.4 F; and Weather Underground, 13.7 F. A oneway ANOVA test indicates that there is, once again, not enough statistical evidence to definitely
conclude that one of these means stands out from the others (even though it appears that
Accuweather is the most consistent). All of these conclusions may have turned out much
differently (and possibly more accurately) if the data was gathered over a larger period of time
(covering more seasons and creating more accurate averages) or in more locations besides just
Louisville. Also, if more than only the temperature was record (such as rainfall or cloud cover),
the prediction accuracy of each provider could be better determined.
Chart 1
Forecast
Provider
Mean Errors Corresponding to Forecast Date
The Weather
Channel
Accuweather
Weather
Underground
Climatology
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
-0.5
-0.4
-1.1
-0.5
-0.5
-1.0
-1.4
-1.5
-0.6
-0.7
---
---
---
---
---
-0.3
-0.5
0.08
0.07
0.66
0.98
1.50
0.95
2.32
2.03
0
-0.1
-0.7
0.29
0.93
-0.3
-0.4
-0.6
-0.3
-0.4
0.17
0.67
-2.1
-1.6
-5.5
---
---
---
---
---
-1.1
---
---
---
---
---
---
---
---
---
---
---
---
---
---
Forecast
Provider
Mean Absolute Errors Corresponding to Forecast Date
The Weather
Channel
Accuweather
Weather
Underground
Climatology
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
1.76
2.26
2.74
3.11
3.63
4.32
4.91
5.68
6.69
6.43
---
---
---
---
---
1.76
2.31
2.73
3.04
3.76
4.30
4.76
5.59
7.77
7.84
8.54
8.91
9.18
9.84
9.44
1.97
2.52
2.95
3.47
4.47
4.20
5.59
5.22
7.37
10.9
---
---
---
---
---
7.71
---
---
---
---
---
---
---
---
---
---
---
---
---
---
Graph 1
The figure below plots mean absolute forecast error as a function of forecast date for The Weather Channel.
Graph 2
The Weacther Channel:
7-Day Absolute Error Distribution
14
12
10
8
6
Frequency
4
2
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
More
0
Graph 3
Accuweather:
7-Day Absolute Error Distribution
14
12
10
8
6
4
2
0
Frequency
Graph 4
Weather Underground:
7-Day Absolute Error Distribution
14
12
10
8
6
Frequency
4
2
0
Graph 5:
Climatology:
7-Day Absolute Error Distribution
10
8
6
4
Frequency
2
0
Example 1
For example, if a forecast for seven days away in State College was 32F and the forecast for that
same day as we made our way through the week became 33F, 36F, 34F, 38F, 36F, 40F, and 41F
(for each day between the today and seven days from now), then the consistency of the
forecasting provider would be abs(32F – 33F) + abs(33F – 36F) + abs(34F – 36F) + abs(38F –
34F) + abs(36F – 38F) + abs(40F – 36F) + abs(41F – 40F) = 17F.
Download