
Figure 1: Previous day’s temperature (persistence) used as a forecast of 24hr
temperature, Salt Lake City airport data, 1979 to 2001. Red line is the fit for the central
tendency (mean) using standard linear regression; middle black line is the fit of the
median (0.5 quantile); upper black line is the 0.9 quantile; lower black line is the 0.1
quantile. Note that the median and mean fits are similar but diverge noticeably at higher
temperatures. Note also the heteroscedastic behavior of the persistence fit, seen in the
convergence of the 0.1 and 0.9 quantile lines at higher temperatures.
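The difference between the mean and quantile fits in Figure 1 comes from the loss being minimized: squared error for the mean, and the asymmetric "check" (pinball) loss for a quantile. The sketch below is only an illustration under simplifying assumptions: it fits a constant rather than a regression line, and the function names `pinball_loss` and `best_constant` are hypothetical, not taken from the paper.

```python
def pinball_loss(residual, tau):
    """Check loss for quantile level tau (0 < tau < 1).

    Positive residuals (observation above the fit) are weighted by tau,
    negative residuals by (1 - tau).
    """
    if residual >= 0:
        return tau * residual
    return (tau - 1.0) * residual


def best_constant(values, tau):
    """Return the data value that minimizes the summed pinball loss.

    For a constant fit the minimizer is a sample quantile, so it is
    enough to search over the observed values themselves.
    """
    def total_loss(candidate):
        return sum(pinball_loss(v - candidate, tau) for v in values)

    return min(values, key=total_loss)


# One warm outlier pulls the mean upward but leaves the median alone.
temps = [1.0, 2.0, 3.0, 4.0, 100.0]
median_fit = best_constant(temps, 0.5)
mean_fit = sum(temps) / len(temps)
```

With these illustrative numbers the median fit stays at 3.0 while the mean is 22.0, which is the kind of divergence between the red and middle black lines that the caption points out.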
Figure 2: Time-series of the daily uncalibrated 15-member ensemble temperature
forecasts (colors) versus the observation (black) at station KSLC over the period of 1990-2001 for: a) 24hr lead-time January forecasts; b) 24hr July; c) 360hr January; d) 360hr
July. Note the strong underbias of the forecasts for both seasons and lead-times. (Red
oval in panel b is discussed in the text and Figure 8.)
Figure 3: Rank histograms for the same data shown in Figure 2, but for the complete
data set (period of 1979-2001) although sub-sampled to remove temporal autocorrelations
(see text). Red dotted lines show 95% confidence limits for a perfectly calibrated
forecast. Note the strong underbias of the forecasts for both seasons and lead-times.
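A rank histogram such as those in Figure 3 can be sketched as follows. This is a minimal illustration, not the paper's procedure: ties between members and the observation are ignored, and the 95% limits use a normal approximation to the binomial count in each bin, which is an assumption (the source does not state how its limits were computed). The names `rank_histogram` and `flat_histogram_limits` are hypothetical.

```python
import math


def rank_histogram(ensembles, observations):
    """Count how often the observation lands in each ensemble interval.

    An n-member ensemble defines n + 1 intervals; the rank is the number
    of members strictly below the observation (ties are not handled).
    """
    n_members = len(ensembles[0])
    counts = [0] * (n_members + 1)
    for members, obs in zip(ensembles, observations):
        rank = sum(1 for m in members if m < obs)
        counts[rank] += 1
    return counts


def flat_histogram_limits(n_forecasts, n_bins, z=1.96):
    """Approximate 95% limits on the relative frequency of one bin.

    Assumes a perfectly calibrated forecast (each bin equally likely)
    and independent forecast cases, hence the need for sub-sampling.
    """
    p = 1.0 / n_bins
    half_width = z * math.sqrt(p * (1.0 - p) / n_forecasts)
    return max(0.0, p - half_width), p + half_width


# Toy example: three 3-member forecasts and their verifying observations.
counts = rank_histogram([[1, 2, 3], [1, 2, 3], [1, 2, 3]], [0, 2.5, 10])
```

For an under-biased ensemble, observations tend to exceed every member, so counts pile up in the highest bin.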
Figure 4: Schematic of the logistic regression (LR) ensemble fitting procedure. Step 1:
prescribe the climatological temperature thresholds to estimate (99 chosen); step 2: fit the
LR model and generate out-of-sample conditional probabilities (CDF) of being less than
or equal to each threshold; step 3: use the CDF to estimate (by linear interpolation) an
evenly spaced 15-member ensemble for each day and each lead-time. The final result is a
“sharper” posterior forecast PDF than the climatological prior, which is then used as an
independent regressor set in the QR procedure.
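Step 3 of Figure 4 (turning the threshold CDF into an ensemble) might look like the sketch below. It assumes that "evenly spaced" means probability levels k/(n+1) for k = 1..n, which matches the 16 intervals of a 15-member ensemble but is not stated explicitly in the source, and that the CDF values are non-decreasing. The function name `ensemble_from_cdf` is hypothetical.

```python
def ensemble_from_cdf(thresholds, cdf, n_members=15):
    """Linearly interpolate a threshold CDF to evenly spaced quantiles.

    thresholds: increasing temperature thresholds.
    cdf:        non-decreasing P(T <= threshold) for each threshold.
    Levels outside the range covered by cdf are clamped to the first
    or last threshold rather than extrapolated.
    """
    levels = [k / (n_members + 1) for k in range(1, n_members + 1)]
    members = []
    for p in levels:
        if p <= cdf[0]:
            members.append(thresholds[0])
            continue
        if p >= cdf[-1]:
            members.append(thresholds[-1])
            continue
        # First threshold whose CDF value reaches the target level.
        for i in range(1, len(cdf)):
            if cdf[i] >= p:
                frac = (p - cdf[i - 1]) / (cdf[i] - cdf[i - 1])
                members.append(
                    thresholds[i - 1] + frac * (thresholds[i] - thresholds[i - 1])
                )
                break
    return members


# Toy CDF that is uniform between 0 and 10 degrees, 3-member ensemble.
toy_members = ensemble_from_cdf([0.0, 10.0], [0.0, 1.0], n_members=3)
```

In the toy case the three members fall at the 0.25, 0.5 and 0.75 quantiles of the uniform CDF.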
Figure 5: Schematic of the QR post-processing procedure. See text for details.
Figure 6: Same as Figure 2, but for the spread-interval-post-processed time-series. See
text for details. (Red oval in panel b is discussed in the text and Figure 8.)
Figure 7: Rank histograms of the July 24-hr lead-time 15-member (16-interval)
post-processed ensemble using logistic regression (LR) and 2-month training periods (panel
a), dispersion-selected quantile regression (QR) and 2-month training periods (panel b), LR
and 22-month training periods (panel c), and QR and 22-month training periods (panel d). Red
dotted lines show 95% confidence limits for a perfectly calibrated forecast (upper line in
panels b and c not shown).
Figure 8: Kernel fitting creates PDFs from the original uncalibrated (black line) and
calibrated (blue) 24-hr ensemble forecasts for one day (July 3, 1995), as highlighted by the
red oval in Figure 6 panel b. Comparison to the observation (red) shows the bias shift and
increase in dispersion that calibration produces. Also shown is the tail of the
climatological PDF (dashed), showing that the forecasts for this anomalously cold event are
significantly sharper than climatology. Is the non-Gaussian behavior of the calibrated
forecast PDF consistent across other forecasts?
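A kernel fit of the kind shown in Figure 8 can be sketched as below. The source does not say which kernel or bandwidth was used, so the Gaussian kernel and the hand-picked bandwidth here are assumptions, and `kernel_pdf` is a hypothetical name. The member values are made up for illustration only.

```python
import math


def kernel_pdf(members, bandwidth):
    """Return a function estimating the PDF of an ensemble.

    Places a Gaussian kernel of width `bandwidth` on each member and
    averages them (assumed kernel; not confirmed by the source).
    """
    n = len(members)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def pdf(x):
        return norm * sum(
            math.exp(-0.5 * ((x - m) / bandwidth) ** 2) for m in members
        )

    return pdf


# Made-up 5-member ensemble; the density should integrate to about 1.
pdf = kernel_pdf([10.0, 11.0, 11.5, 13.0, 16.0], bandwidth=1.0)
step = 0.01
area = sum(pdf(-10.0 + i * step) for i in range(int(40.0 / step))) * step
```

Because the estimate is a mixture of kernels centered on the members, an unevenly spread ensemble yields a skewed or multi-modal PDF, which is one way non-Gaussian shapes can arise.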
