Monte Carlo Methods in Forecasting the Demand for Electricity
Frank S. McGowan
Market Forecast Department
October 26, 2007
Contents
• BC Hydro
• Load Forecasting at BC Hydro
• Stochastic Forecasting
• Monte Carlo Simulation
• BC Hydro’s Monte Carlo Model
• Model Results
• Description of the Model
• Comparison with Other Methods
• Conclusion
BC Hydro
• BC Hydro is the major provider of electricity in the province of British Columbia.
• It is one of the largest electric utilities in Canada.
• 1.7 million customers in an area containing over 94 per cent of BC's population
(4,364,565 in 2007).
• Generating capacity of over 11,000 megawatts (MW).
• 90.3% from hydroelectric sources.
• BC Hydro's various facilities generate between 43,000 and 54,000 gigawatt hours of
electricity annually, depending on prevailing water levels.
• Domestic electric sales volume reached 52,440 gigawatt hours in fiscal 2006 and
52,911 gigawatt hours in fiscal 2007.
• Net income was $266 million in fiscal 2006 and $407 million in fiscal 2007.
• 4,546 employees in March 2007, including its subsidiaries and British Columbia
Transmission Corporation.
• New Energy – Green Emphasis
> Independent Power Producers
> Site-C
Load Forecasting at BC Hydro
• The starting point for the Monte Carlo Model is the Reference Case Forecast.
• Major categories - Residential, Commercial and Industrial.
• Regions - Lower mainland (including the city of Vancouver), Vancouver
Island, the southern interior and the northern region.
• The time horizon - 21 years from 2007 to 2028.
• Key Drivers - number of accounts, GDP, retail sales, employment,
housing starts, population, weather, price of electricity and price of
natural gas.
• Detailed End Use Models
• High Level Regression Models
• Judgment
• Statistically Adjusted End Use Models (SAE models by ITRON).
Stochastic Forecasting
• The future is uncertain. A deterministic forecast
provides an incomplete picture. The appropriate model
is a stochastic process.
• Definition (informal): A stochastic process is a
collection of random variables X, indexed by time,
together with their probability distributions:
$\{ F(X, t) \}$
Stochastic Forecasting
• Let $y(t)$ and $x_j(t)$ $(j = 1, \ldots, k)$ be random variables defined by
stochastic processes.
• Suppose there is a model that relates these variables.
$y(t) = L(x_1(t), x_2(t), \ldots, x_k(t))$
• If the probability distributions of the independent variables are
known, then the model specifies the probability distribution of y(t).
• But it would usually be very difficult to work out this distribution
analytically.
• Monte Carlo Simulation allows us to bypass this difficulty. @RISK
facilitates these simulations.
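The idea of bypassing the analytical derivation can be sketched in a few lines of Python. This is a minimal illustration using the standard library rather than @RISK. The two-input model, its distributions, and the helper names `simulate_output` and `percentile` are hypothetical and chosen only to show the mechanics.

```python
import random
import statistics

def simulate_output(model, samplers, n_draws=10_000, seed=0):
    """Draw each input from its sampler and return realizations of y = model(x1, ..., xk)."""
    rng = random.Random(seed)
    return [model(*(draw(rng) for draw in samplers)) for _ in range(n_draws)]

def percentile(values, p):
    """Empirical percentile (nearest rank) of a list of realizations."""
    ordered = sorted(values)
    index = min(len(ordered) - 1, max(0, round(p / 100 * (len(ordered) - 1))))
    return ordered[index]

# Hypothetical inputs: x1 ~ Normal(100, 5), x2 ~ Triangular(min 0.9, mode 1.0, max 1.1).
samplers = [
    lambda rng: rng.gauss(100.0, 5.0),
    lambda rng: rng.triangular(0.9, 1.1, 1.0),  # arguments are (low, high, mode)
]

# Hypothetical model relating the inputs to the output: y = x1 * x2.
realizations = simulate_output(lambda x1, x2: x1 * x2, samplers)

p10, p50, p90 = (percentile(realizations, p) for p in (10, 50, 90))
print("mean:", statistics.mean(realizations))
print("P10, P50, P90:", p10, p50, p90)
```

The empirical percentiles of the realizations stand in for the probability distribution of y(t) that would be hard to derive analytically.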
Monte Carlo Simulation
Monte Carlo Model:
• Probability distributions are specified for each input variable.
• The model draws samples from these distributions and calculates many realizations of the
output variables, which are then used to determine the output variables' probability distributions.
Input Variables with Probability Distributions → Samples → Model → Realizations →
Output Variables with Probability Distributions
BC Hydro’s Monte Carlo Model
• What it Does
> Provides a stochastic forecast by calculating a probability distribution
for Load in each year of the forecast period and for each category.
• How it Does It
> Key Drivers
• GDP of British Columbia
• Weather (Heating Degree Days)
• Price of Electricity (Rates)
> Probability Distributions are Specified for Each Driver using @RISK
> The Model calculates output variables
> Probability distributions for output variables are calculated by @RISK
Model Results
Similar Table for Each Category
Model Results
A probability distribution for each year and for each category.
Model Results
Width of Distributions Increases With Time
Description of the Model
• The stochastic forecast is generated by multiplying the
reference case forecast by Stochastic Impact Factors.
• $L(t) = L_0(t) \, F_G \, F_W \, F_P \, F_U$
• $L_0$ = Base Case Load
• $F_G$ = GDP factor
• $F_W$ = Weather factor
• $F_P$ = Own Price factor
• $F_U$ = Residual factor
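As a small sketch of this multiplicative structure, the function below applies one draw of the four impact factors to the reference load for a single year. The function name `stochastic_load` and all numeric values are hypothetical and are not taken from BC Hydro's model.

```python
def stochastic_load(base_load, f_gdp, f_weather, f_price, f_residual):
    """One realization of L(t) = L0(t) * F_G * F_W * F_P * F_U for a single year."""
    return base_load * f_gdp * f_weather * f_price * f_residual

# Hypothetical reference load (GWh) and one hypothetical draw of each factor.
print(stochastic_load(50_000.0, 1.02, 0.99, 0.995, 1.001))
```

When every factor equals 1 the stochastic forecast reproduces the reference case, which is why the factors can be read as relative deviations from it.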
GDP Impact Factor
• $G_t$ - GDP at time t.
• ${}^{0}G_t$ - Reference case GDP forecast (from CFB Canada).
• $g_t$ - growth rate of reference case forecast.
• Perturbed GDP forecast grows by the equation:
$G_t = G_{t-1} (1 + g_t + u_t)$
• $u_t$ - perturbation that is $N(0, \sigma)$ with $\sigma = 1.54\%$
• $F_G$, the GDP impact factor, is given by:
$F_G = \exp[\alpha \ln(G_t / {}^{0}G_t)] = (G_t / {}^{0}G_t)^{\alpha}$, where
$\alpha = 0.65$ is the elasticity of Load with respect to GDP.
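The recursion and the impact factor can be sketched as follows. This is an illustrative reading of the slide, not BC Hydro's implementation: it assumes the perturbed and reference paths start from the same base-year level, and the constant 2.5% reference growth rate is hypothetical. The elasticity 0.65 and the 1.54% standard deviation are the values quoted above.

```python
import random
import statistics

ALPHA = 0.65      # elasticity of Load with respect to GDP (from the slide)
SIGMA_U = 0.0154  # standard deviation of the growth perturbation (from the slide)

def gdp_impact_path(ref_growth, rng):
    """One realization of the GDP impact factor for each forecast year."""
    ratio = 1.0  # perturbed GDP / reference GDP; assumed equal in the base year
    path = []
    for g in ref_growth:
        u = rng.gauss(0.0, SIGMA_U)
        # Perturbed path grows by (1 + g + u), reference path by (1 + g).
        ratio *= (1.0 + g + u) / (1.0 + g)
        path.append(ratio ** ALPHA)
    return path

rng = random.Random(0)
ref_growth = [0.025] * 21  # hypothetical reference-case growth rates, 21 years
paths = [gdp_impact_path(ref_growth, rng) for _ in range(5_000)]

spread_first = statistics.pstdev([p[0] for p in paths])
spread_last = statistics.pstdev([p[-1] for p in paths])
print("std of F_G in year 1: ", spread_first)
print("std of F_G in year 21:", spread_last)
```

Because the perturbations compound through the recursion, the spread of the factor is wider in the last year than in the first.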
Price Impact Factor
• ${}^{0}P_t$ - Base Case Price forecast.
• $P_t$ - Perturbed Price forecast at time t.
• $g_t$ - growth rate of reference case Price forecast.
• Perturbed Price forecast grows by the equation:
$P_t = P_{t-1} (1 + g_t + u_t)$
• $u_t$ - perturbation that has as its probability distribution
RiskTriang(-2.5%, 0, +2.5%).
• $F_P$, the Price impact factor, is given by:
$F_P = \exp[\varepsilon \ln(P_t / {}^{0}P_t)] = (P_t / {}^{0}P_t)^{\varepsilon}$, where
$\varepsilon$ is the elasticity of Load with respect to Price.
17
Delayed Impact of Price Change
• Short run price elasticity: -0.10 (Residential)
• Long run price elasticity: -0.30 (Residential)
• The impact of a price change is felt over a period of n years afterwards.
• The phased-in elasticity applicable y <= n years after a price change is:
$\varepsilon_y = \varepsilon_S + (\varepsilon_L - \varepsilon_S) (y / n)$
• For each year, the model aggregates the effects of all previous price
changes using appropriate phased-in elasticities.
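One way to read the phase-in and the aggregation step is sketched below. The slide does not say how the effects are combined, so the multiplicative aggregation here is an assumption, as is holding the elasticity at its long-run value once more than n years have passed. The phase-in length n = 5 used in the example is hypothetical, and the function names are illustrative.

```python
E_SHORT = -0.10  # short-run residential price elasticity (from the slide)
E_LONG = -0.30   # long-run residential price elasticity (from the slide)

def phased_elasticity(years_since, n_years, e_short=E_SHORT, e_long=E_LONG):
    """Elasticity y years after a change: e_S + (e_L - e_S) * (y / n).

    Assumption: the elasticity stays at e_L once y exceeds n.
    """
    y = min(years_since, n_years)
    return e_short + (e_long - e_short) * (y / n_years)

def price_impact_factor(relative_changes, n_years, e_short=E_SHORT, e_long=E_LONG):
    """Aggregate the effect of every price change up to the latest year.

    relative_changes[s] is perturbed price growth over reference price growth
    in year s, that is (1 + g_s + u_s) / (1 + g_s). Each change is raised to
    the elasticity phased in over the years elapsed since it occurred, and the
    effects are multiplied together (an assumed aggregation rule).
    """
    latest = len(relative_changes) - 1
    factor = 1.0
    for s, change in enumerate(relative_changes):
        factor *= change ** phased_elasticity(latest - s, n_years, e_short, e_long)
    return factor

# A single 10% price increase relative to the reference case, hypothetical n = 5.
print(price_impact_factor([1.10], 5))                           # year of the change
print(price_impact_factor([1.10, 1.0, 1.0, 1.0, 1.0, 1.0], 5))  # five years later
```

With equal short-run and long-run elasticities this aggregation collapses to the ratio of perturbed to reference price raised to a single elasticity, which matches the form of the price impact factor.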
18
Price Change Scenarios
• Different price change scenarios can be specified by
choosing the reference case growth rate $g_t$.
• Probability distributions are then calculated around
whatever scenario is used as a reference case.
Weather Impact Factor
• Weather’s effect on energy consumption in BC is primarily
due to the use of electricity for heating.
• This effect is modeled using Heating Degree Days (HDD).
> Daily HDD = max(0, 18 - average daily temperature)
• Annual HDD = sum over the year of Daily HDD.
• @RISK’s fitting features were used to estimate a probability
density function for Annual HDD.
• The result was BetaGeneral(1.9531, 2.3592, 2550.00,
3425.00), with Min = 2550 and Max = 3425.
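The HDD definitions translate directly into code. The sketch below uses the base temperature of 18 from the slide. The function names and the four sample temperatures are hypothetical.

```python
BASE_TEMPERATURE = 18.0  # base temperature used in the slide's HDD definition

def daily_hdd(avg_daily_temperature):
    """Daily HDD = max(0, 18 - average daily temperature)."""
    return max(0.0, BASE_TEMPERATURE - avg_daily_temperature)

def annual_hdd(daily_temperatures):
    """Annual HDD is the sum of daily HDD over the year."""
    return sum(daily_hdd(t) for t in daily_temperatures)

# Hypothetical average temperatures for four days: contributions are 8, 0, 0 and 20.
temperatures = [10.0, 20.0, 18.0, -2.0]
print(annual_hdd(temperatures))
```

Days at or above the base temperature contribute nothing, so only cold days add to the annual total.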
Weather Impact Factor
• The pdf for HDDs is used to define the weather impact factor.
• $F_W = \exp[\varepsilon \ln(HDD_t / {}^{0}HDD_t)]$
> ${}^{0}HDD_t$ is normal weather HDD.
> $HDD_t$ is Beta General distributed HDD with mean = normal
weather HDD.
> $\varepsilon$ is the elasticity of Load with respect to HDD
(0.37 for Residential, 0.05 for Commercial, not applied to Industrial).
• This weather impact factor does not expand with time as do the
factors for GDP and Price.
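A sketch of the weather factor, using Python's standard library in place of @RISK: a general beta variable on [min, max] is drawn by rescaling a standard beta draw. The shape parameters and bounds are the fitted values quoted earlier in the deck, and normal weather HDD is taken as the mean of that fitted distribution, following the statement that the two coincide. The function names are illustrative.

```python
import random

# Fitted BetaGeneral parameters quoted in the deck: shapes, minimum, maximum.
ALPHA1, ALPHA2, HDD_MIN, HDD_MAX = 1.9531, 2.3592, 2550.0, 3425.0

# Mean of the fitted distribution, used here as normal weather HDD.
NORMAL_HDD = HDD_MIN + (HDD_MAX - HDD_MIN) * ALPHA1 / (ALPHA1 + ALPHA2)

def sample_annual_hdd(rng):
    """Draw annual HDD by rescaling a standard beta draw to [HDD_MIN, HDD_MAX]."""
    return HDD_MIN + (HDD_MAX - HDD_MIN) * rng.betavariate(ALPHA1, ALPHA2)

def weather_impact_factor(hdd, elasticity):
    """F_W = exp(e * ln(HDD / normal HDD)) = (HDD / normal HDD) ** e."""
    return (hdd / NORMAL_HDD) ** elasticity

rng = random.Random(0)
residential = [weather_impact_factor(sample_annual_hdd(rng), 0.37) for _ in range(10_000)]
print("normal weather HDD:", NORMAL_HDD)
print("residential factor range:", min(residential), max(residential))
```

Each year's draw is independent of the previous year's, so the spread of this factor is the same in every forecast year, unlike the GDP and Price factors, whose perturbations accumulate.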
Demand Side Management and Residual
• ${}^{0}DSM(t)$ = Reference case forecast of DSM savings.
• The Monte Carlo Model gives these savings the following
probability distribution:
$DSM(t) = \mathrm{RiskTriang}(75\%, 100\%, 125\%) \times {}^{0}DSM(t)$
Load after DSM is then calculated by subtraction:
$LOAD_{after}(t) = LOAD_{before}(t) - DSM(t)$
• The residual impact factor is:
$F_U(t) = F_U(t-1) \times (1 + \mathrm{RiskTriang}(-0.2\%, 0.0\%, 0.2\%))$
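The DSM adjustment and the residual factor can be sketched with the standard library's triangular sampler. Two points are assumptions rather than statements from the slide: the residual factor starts at 1 before the first forecast year, and a fresh DSM multiplier is drawn on each call. The load and savings figures are hypothetical.

```python
import random

def dsm_savings(ref_dsm, rng):
    """DSM(t) = Triangular(75%, 100%, 125%) * reference DSM savings."""
    return rng.triangular(0.75, 1.25, 1.00) * ref_dsm  # arguments are (low, high, mode)

def load_after_dsm(load_before, dsm):
    """Load after DSM is the load before DSM minus the DSM savings."""
    return load_before - dsm

def residual_factors(n_years, rng):
    """F_U(t) = F_U(t-1) * (1 + Triangular(-0.2%, 0.0%, 0.2%)), assuming F_U = 1 initially."""
    factor = 1.0
    path = []
    for _ in range(n_years):
        factor *= 1.0 + rng.triangular(-0.002, 0.002, 0.0)
        path.append(factor)
    return path

rng = random.Random(0)
savings = dsm_savings(1_000.0, rng)           # hypothetical reference savings, GWh
print(load_after_dsm(52_000.0, savings))      # hypothetical load before DSM, GWh
print(residual_factors(21, rng)[-1])
```

Like the GDP factor, the residual factor is a cumulative product, so its spread grows over the forecast horizon even though each annual perturbation is small.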
Comparison With Other Methods
• Standard Linear Regression Model
$y = X\beta + u$
• where:
• $y$ is the vector of data for the dependent variable (N×1).
• $X$ is the data matrix for the independent variables (N×k).
• $u$ is normal with zero mean and st. dev. $\sigma$ (N×1).
• In the Estimation Period X is non-stochastic and there are N data points.
• Estimator of $\beta$ is $b = (X'X)^{-1} X'y = \beta + (X'X)^{-1} X'u$
• $b$ is normally distributed because $u$ is.
• Estimator of $y$ is $\hat{y} = X b$
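For one predictor and a constant, the OLS estimator can be written out by inverting the 2×2 matrix X'X by hand. The sketch below does this with the standard library only. The function name `ols_simple` and the data are hypothetical, with the data constructed to lie exactly on the line y = 1 + 2x.

```python
def ols_simple(x, y):
    """OLS for y = b0 + b1 * x + u via the normal equations b = (X'X)^-1 X'y."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    sxx = sum(v * v for v in x)
    sxy = sum(a * b for a, b in zip(x, y))
    # With X = [1, x]: X'X = [[n, sx], [sx, sxx]] and X'y = [sy, sxy].
    det = n * sxx - sx * sx
    b0 = (sxx * sy - sx * sxy) / det
    b1 = (n * sxy - sx * sy) / det
    return b0, b1

# Hypothetical data lying exactly on y = 1 + 2x.
x = [1, 2, 3, 4]
y = [3, 5, 7, 9]
b0, b1 = ols_simple(x, y)
fitted = [b0 + b1 * v for v in x]  # estimator of y
print(b0, b1, fitted)
```

With noisy data the same function returns the least-squares line, and the fitted values play the role of ŷ in the slide.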
Regression Model and Monte Carlo Compared
• A regression of Sales on GDP was estimated by OLS.
• The results are graphed on the next slide.
• BC Hydro’s Base Case Forecast is shown for
comparison.
Regression Model Predicts Sales Well
Forecasting Using Regression Model
• In the Standard Regression Model, the predictor variables
are assumed to be non-stochastic.
• This assumption is O.K. for estimating the model in the
historical estimation period. Here data are “actuals”.
• But in the Forecast Period, predictor variables are
uncertain. They need to be modeled by random variables.
• What is needed is a formula for prediction intervals that is
valid when predictor variables are random variables.
• The following three slides outline how one such formula
can be derived.
Forecasting Using Regression Model
• In the Forecasting Period (t = 1, …, T):
• Predictor Variables X are Stochastic.
• Let $X = Z + V$ where $Z = E[X]$, the mean of X.
• The model is
$y = X\beta + u = Z\beta + (V\beta + u)$.
• Assume that Z is a matrix of given non-stochastic forecasts
of the predictor variables and
• V is modeled so that $V_{tj} \sim N(0, \sigma_{tj})$ and
• Predictor variables and the error term { V, u } are a set of
independent random variables.
> That is, $V_{tj}$, $V_{sk}$ and $u_t$ are all independent
unless indices t = s and j = k.
Forecasting Using Regression Model
• The estimator (Forecast) of y is:
$\hat{y} = Z b$ where $b = (X'X)^{-1} X'y$
• Under our strong assumptions the prediction error
$y - \hat{y}$ is normally distributed with
• $\mathrm{Var}(y_0 - \hat{y}_0) = \sigma_u^2 Z (X'X)^{-1} Z' + \sigma_u^2 I + \mathrm{diag}\{ \sum_{j=1}^{k} \beta_j^2 \sigma_{tj}^2 + \sigma_u^2 \sum_{j=1}^{k} g_j \sigma_{tj}^2 \}$
• Prediction Intervals (P10 to P90) are:
$\hat{y} \pm 1.28 \sqrt{\mathrm{Var}(y_0 - \hat{y}_0)}$
Forecasting Using Regression Model
• In the case of one predictor variable (and constant) the
prediction variance reduces to:
• $\mathrm{Var}(y_0 - \hat{y}_0) = \sigma_u^2 \{ 1 + 1/n + (Z - X_{sm})^2 / SST \} + \beta^2 \sigma_t^2 + \sigma_u^2 \sigma_t^2 / SST$
> Where $X_{sm}$ is the mean of X in the estimation period (non-stochastic)
• This equation was used to construct the Confidence
Bands graphed on a subsequent slide.
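The one-predictor formula is easy to evaluate numerically. The sketch below splits it into the part due to coefficient estimation and residual error (the first three terms) and the part due to uncertainty in the predictor. All input values are hypothetical, the function names are illustrative, and in practice the slope would have to be replaced by its estimate, which is an assumption about how the formula is applied.

```python
import math

def estimation_variance(z, sigma_u2, n, x_mean, sst):
    """First three terms: residual error plus coefficient estimation error."""
    return sigma_u2 * (1.0 + 1.0 / n + (z - x_mean) ** 2 / sst)

def prediction_variance(z, sigma_u2, n, x_mean, sst, slope, sigma_t2):
    """Full prediction variance when the predictor itself has variance sigma_t2."""
    predictor_terms = slope ** 2 * sigma_t2 + sigma_u2 * sigma_t2 / sst
    return estimation_variance(z, sigma_u2, n, x_mean, sst) + predictor_terms

def p10_p90_half_width(variance):
    """Half-width of the P10 to P90 interval: 1.28 standard deviations."""
    return 1.28 * math.sqrt(variance)

# Hypothetical inputs for one forecast year.
args = dict(z=10.0, sigma_u2=4.0, n=20, x_mean=10.0, sst=100.0)
narrow = estimation_variance(**args)
full = prediction_variance(slope=2.0, sigma_t2=1.0, **args)
print(p10_p90_half_width(narrow), p10_p90_half_width(full))
```

Comparing the two half-widths shows how much of the band comes from uncertainty in the predictor rather than from the regression itself.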
Comparison of Confidence Bands
• Confidence Bands were also produced by the Monte Carlo
methodology explained previously.
• A third set of Confidence Bands was produced using only the first 3
terms of the prediction variance equation. These bands are due to
coefficient estimation and residual error only.
• Note that:
> The Monte Carlo and Prediction Formula Bands are Very Close.
> The third set of bands is narrow compared to the others, showing that
uncertainty in the predictor variables in the forecast period is the main
source of error in forecasted Load.
• This is true for a “good” regression, like the one considered here, where in the
prediction interval formula, SST is large compared to the other terms.
Conclusion
• BC Hydro’s Monte Carlo Model and its implementation
using @RISK were described.
• It provides a method for generating a stochastic forecast
of the various components of BC Hydro’s Load.
• The results are reasonably accurate and have survived
regulatory scrutiny for several years.
• Confidence bands resulting from the Monte Carlo Model
and from an econometric prediction interval formula
were calculated and compared. They are close to each
other.