A point process model for rainfall with truncated Gaussian rain cells

advertisement
A point process model for rainfall with
truncated Gaussian rain cells
By Paul J. Northrop, Tom M. Stone
A stochastic model for rainfall at a single point in space is studied in which storms
arrive in a Poisson process, each storm consisting of a cluster of a random number
of rain cells, each cell having random maximum intensity and duration. A common
feature of existing models of this kind is that the intensity of each rain cell is constant over its duration, thus forming a rectangular pulse of rainfall. In this paper
rain cells are given a more realistic structure in which their intensity increases after
birth and decreases before death. This enables better reproduction of rainfall properties at the short timescales required for some important hydrological applications.
The model is fitted to rainfall data from south west England.
Keywords: rainfall modelling; point processes; raingauge data
1. Introduction
Rainfall is the primary input for most hydrological systems and therefore models
of point rainfall time series have potential application to a range of hydrological
problems. For example, one approach to evaluating flood risk routes synthetic rainfall sequences generated from a rainfall model through a rainfall-runoff model to
obtain simulated flood responses (Wheater, 2002).
There is extensive literature on the use of Poisson cluster processes in the
stochastic modelling of rainfall, steming largely from Rodriguez-Iturbe et al. (1987).
The general structure of these models is motivated by observational studies such as
Austin & Houze (1972) and Petterssen (1956) which identified areas of relatively
intense rainfall, rain cells. These studies found that rain cells tend to cluster to
form larger structures which, for the purposes of this paper, we call storms.
Various extensions of Rodriguez-Iturbe et al. (1987) have been proposed. They
include: Rodriguez-Iturbe et al. (1988), in which cell durations are randomised
between storms in such a manner that storms have a common structure but occur
on different timescales; Cowpertwait (1994), in which there are two cells types
- short duration, high intensity cells and long duration, low intensity cells; and
Kakou (1997) in which rain cell intensity and duration are stochastically dependent,
producing a continuum of rain cell types.
A common feature of all these models is that rain cells are represented as rectangular pulses of rainfall. This structure is convenient mathematically but physically
unrealistic. Realizations generated from such models will have an unrealistic appearance at low levels of temporal aggregation and may have undue influence on
subsequent analyses. The properties of rainfall totals over short time periods may
be important in some applications, for example, a 15 minute time-step is commonly
used in the UK for flow forecasting purposes (Wood et al., 2000).
Research Report No.251, Department of Statistical Science, University College London
Date: March 2005
2
Paul J. Northrop, Tom M. Stone
The assumption of rectangular pulses is not supported by empirical studies of
rain cell structure (Willems, 2001) which suggest that rain cells have a spatialtemporal structure which can be represented adequately by a Gaussian density
function. In addition, it is unlikely that rain cell intensity and duration are independent. In fact, we might expect rain cell intensity and duration to be negatively
correlated - very intense cells tending to have short durations and cells with a long
duration tending to be relatively less intense.
In this paper we consider two closely-related generalizations of the rectangular
pulses model in which rain cells increase in intensity after their birth and then
dissipate before their death. In both cases each rain cell has the shape of a truncated
Gaussian density function. A parameter controls the rate of increase and decrease
of cell intensity. The rectangular pulses models may be recovered as a limiting case.
These rain cells are anticipated to have increased realism. The rain cells have the
property that the average intensity over the course of their lifetime is stochastically
dependent on their duration. For one of the cases the average intensity is negatively
related to the duration.
The aims of this paper are to derive expressions for the main properties of
the new model and to demonstrate, by fitting the model to rainfall data from the
HYREX (Moore et al., 2000) raingauge network in the catchment of the river Brue
in south west England, that the model is better able to reproduce properties of
sub-hourly rainfall data than the equivalent rectangular pulses model.
2. Bartlett-Lewis Gaussian Pulses Model (BLGPM)
(a) Specification of the model
We consider an extension of the Bartlett-Lewis Rectangular Pulses Model (BLRPM)
proposed by Rodriguez-Iturbe et al. (1987). Storm origins arrive in a Poisson process of rate λ. Each origin is followed by a Poisson process of rate β of cell origins.
After a time, exponentially distributed with rate γ, the process of cell origins terminates. With each cell origin (and with each storm origin) is associated a rain cell
that deposits rainfall at a rate which varies over time as specified in equation (2.1)
below. The rain cell deposits rainfall for a time D, its duration, and then it ‘dies’.
The durations of distinct rain cells are assumed to be exponentially distributed
random variables with parameter η, that is, with mean µD = 1/η. The intensity
X(s) of a rain cell of duration d a time s after its birth is given by
1
1 2
2
X(s) = X exp − 2 (s − d/2) − I exp − 2 d
,
2σ
8σ
0 < s < d,
(2.1)
where X is a random variable determining the peak intensity of the rain cell (attained midway through its duration) and σ determines the rate of increase in intensity to this peak and the rate of dissipation after the peak. The total intensity Y (t)
at time t is the sum of contributions from all cells alive at t. It is assumed that the
different peak intensities and durations are mutually independent and independent
of the process of cell origins. We consider two cases: I = 0 and I = 1. The latter
case has the property that X(0) = X(d) = 0, that is, the cell intensity is zero at its
birth and at its death. While this is obviously a physically appealing property we
Rainfall modelling
3
rainfall
intensity
x
d/2
d
Figure 1. The form of a truncated Gaussian rain cell with a duration of d and a peak
intensity of x.
also consider the case I = 0, in which rectangular pulses are recovered in the limit
as σ tends to ∞. Figure 1 shows the structure of these cells.
The average rainfall intensity, over the duration d, of a rain cell of the form (2.1)
is given by
d
√
1 2
,
X(s) ds = X σ 2π [2Φ(d/2σ) − 1] /d − I exp − 2 d
(1/d)
8σ
0
where Φ(·) is the distribution function of a standard Gaussian random variable. For
I = 0 this quantity decreases with cell duration. For I = 1 this relationship is not
monotonic, with low average intensities for short and for long durations. Figure 2
illustrates how average rainfall intensity decays with cell duration in the two cases.
The two types, I = 0 and I = 1, of Gaussian rain cell differ substantially for short
durations only.
(b) Some properties of the model
We note that the generalization of rectangular cells to truncated Gaussian
cells does not affect the expressions for rainfall occurrence properties derived in
Rodriguez-Iturbe et al. (1987) and Onof & Wheater (1993). For example, the probability that an interval of length h hours is entirely dry is given by
h = exp −λ(h + µT ) + λG∗P (0, 0) γ + ηe−(β+γ)h /(β + γ) ,
where, defining κ = β/η and φ = γ/η,
G∗P (0, 0) = η −1 e−κ
0
1
tφ−1 (1 − t)eκt dt
time
Paul J. Northrop, Tom M. Stone
0.6
0.4
0.2
0.0
average cell intensity
0.8
1.0
4
0.0
0.5
1.0
1.5
cell duration (hours)
Figure 2. Average cell intensity against cell duration for a rain cell with a peak intensity
of 1 and σ = 1. Top: I = 0, bottom: I = 1.
and
µT = φη −1
1
dv
0
0
1
dt v−1 tφ−1 1 − (1 − vt) e−κv(1−t) + (φη)−1 .
The latter quantity is the expected time from the start of a storm until its last cell
dies, that is, the expected storm duration.
The mean and autocovariance of the BLGPM involve various normal integrals,
that is, integrals involving the density and/or distribution function of the standard
normal distribution. We can evaluate the required integrals using some of the normal integrals collected in Owen (1980). The relevant results are collected in the
appendix.
The rainfall intensity at time t is given by
t
Y (t) =
Xu (t − u) dN (u),
(2.2)
−∞
where Xu (t − u) is the (random) intensity of a pulse originating at time u, a time
t − u later and where {N (u)} counts occurrences in the process of rain cell origins.
Equation (2.2) sums the contributions to the rainfall intensity at time t from cells
Rainfall modelling
5
born before t. A cell born at time u(< t) contributes to the total rainfall intensity
at time t if its duration is greater than t − u. Therefore,
X(t − u) if D > t − u,
Xu (t − u) =
(2.3)
0
if D < t − u.
For the truncated Gaussian pulses considered here X(t − u) is given by (2.1).
It follows that
t
E {Xu (t − u)} E {dN (u)} ,
E {Y (t)} =
−∞
where the intensity E {dN (u)} of the process of cell occurrences is λµC and µC =
1 + β/γ is the mean number of cells per storm. In the current example,
E {Y (t)}
2
2
− Ie−l /8σ ηe−ηl dl du,
−∞ l=t−u
∞
∞
√
−ηl
−2σηz
= λµC µX σ 2π 2
Φ (l/2σ) ηe
dl − I 1 + 4ση
zφ(z)e
dz
,
0
0
√
2 2
2 2
1
= 2 2π λµC µX σ e2σ η Φ(−2ση) − 2σηI √ − 2σηe2σ η Φ(−2ση) ,
2π
= λµC µX
t
∞
e−(t−u−l/2)
2
/2σ2
where µX = E(X).
The autocovariance at lag τ is given by
c(τ ) =
=
cov {Y (t), Y (t + τ )}
t
Xu (t − u) dN (u),
cov
t
−∞
t+τ
=
−∞ −∞
∞ ∞
=
0
0
t+τ
−∞
Xv (t + τ − v) dN (v)
E {Xu (t − u)Xv (t + τ − v)} cov {dN (v), dN (u)} ,
E {Xt−w (w)Xt+τ−z (z)} cov {dN (t − w), dN (t + τ − z)}(2.4)
,
where the change of variables w = t − u, z = t + τ − v has been used. The covariance
density γ(u) = cov{dN (t), dN (t + u)} can be expressed (Cox & Isham, 1980) in
terms of the conditional intensity function h(·) for the point process of rain cell
origins as
γ(u) = λµC {δ(u) + h(u) − λµC } ,
where δ(·) is the Dirac delta function
1
δ(x) =
0
if x = 0,
if x =
0.
The conditional intensity function of the Bartlett-Lewis process considered here is
h(u) = λµC + βe−γ|u| .
6
Paul J. Northrop, Tom M. Stone
Therefore, γ(u) = λµC {δ(u) + β exp(−γ|u|)}. Substitution in (2.4) gives
∞ ∞
c(τ ) = λµC
E {Xt−w (w)Xt+τ−z (z)} δ(τ + w − z) + βe−γ|τ+w−z| dw dz,
0 ∞ 0
= λµC
E {Xt−w (w)Xt−w (w + τ )} dx
0
∞ ∞
E {Xt−w (w)Xt+τ−z (z)} e−γ|τ+w−z| dw dz,
+ λµC β
0
0
= cw (τ ) + cb (τ ).
The autocovariance c(τ ) is the sum of two terms, the first, cw (τ ), is a ‘within cell’
contribution resulting from the possibility that a single rain cell may live longer
than τ hours. The second term is a ‘between cell’ contribution resulting from the
possibility that two distinct cells in the same storm cover points separated by τ
hours in time.
The first term is given by
∞ ∞ 2
2
2
2
2
2
2
2
cw (τ ) = λµC E(X 2 )
e−(w−l/2) /2σ −Ie−l /8σ
e−(w+τ−l/2) /2σ − Ie−l /8σ ηe−ηl dldw,
0
w+τ
√
√
2 2
2
2
= 2 πλµC E(X 2 )σeσ η Φ(− 2ση)e−τ /4σ e−η|τ|
|τ |
|τ |+4σ 2 η
|τ |+4σ 2 η
2
2 2σ2 η 2
− 8πλµC E(X )σ ηe
I G −2ση − , 1,
−G 2ση, −1,
σ
2σ
2σ
2
2
√
√
|τ |+2σ η
|τ |+2σ η
2
σ2 η2
2
√
√
−(|τ |+2σ η) Φ −
,
+ 2 πλµC E(X )σηe
I σ 2φ
σ 2
σ 2
where the function G(·, ·, ·) is given in the appendix. The decay in the first term of
2
2
cw (τ ) with time lag τ is governed by two terms: e−τ /4σ and e−η|τ| . The former
corresponds to rapid decay in covariance due to changing cell intensity. Large values
of σ result in highly peaked cells and in rapid decay of autocovariance. The latter
corresponds to exponential decay due to cell death. Large values of η result in
(stochastically) long-lived cells and in rapid decay of the autocovariance.
The second term is given by
∞ ∞
E {Xt−w (w)Xt+τ−z (z)} e−γ|τ+w−z| dz dw,
cb (τ ) = λµC β
0
0
∞ ∞ ∞ ∞ 2
2
2
2
2
2
2
2
e−(w−l1 /2) /2σ −Ie−l /8σ
e−(z−l2 /2) /2σ −Ie−l /8σ
= λµC βµ2X
0
=
−
−
+
0
l1 =w l2 =z
×η 2 e−ηl1 e−ηl2 e−γ|τ+w−z| dl2 dl1 dz dw,
1 1
1 1
B e−γτ C1 −2ση,
,
, 2η +γ, 2η −γ +eγτ C2 −2ση,
,
, 2η −γ, 2η +γ
2σ 2σ
2σ 2σ
1
1
1
1
−γτ
γτ
e
C1 −2ση,
, − , 2η + γ, −γ − e C2 −2ση,
, − , 2η − γ, γ
2σ
2σ
2σ
2σ
1 1
1 1
−γτ
γτ
e
C1 −2ση, − ,
, γ, 2η − γ − e C2 −2ση, − ,
, −γ, 2η + γ
2σ 2σ
2σ 2σ
1
1
1
1
−γτ
γτ
e
C1 −2ση, − , − , γ, −γ + e C2 −2ση, − , − , −γ, γ
,
2σ
2σ
2σ
2σ
(2.5)
Rainfall modelling
where B = 8πλµC µ2X βσ 2 η 2 e4σ
2 2
η
∞
,
Φ (a + b1 w) e−cw
C1 (a, b1 , b2 , c, d) =
7
w=0
w+τ
Φ (a + b2 z) e−dz dz dw
z=0
and
∞
−cw
C2 (a, b1 , b2 , c, d) =
∞
Φ (a + b1 w) e
w=0
Φ (a + b2 z) e−dz dz dw.
z=w+τ
These integrals can be expressed in terms of the distribution functions of univariate and bivariate standard Gaussian random variables. The details and resulting
expressions are given in the appendix.
(c) Time aggregation
Rainfall data are often utilized in time aggregated form, for example in the form
of hourly rainfall totals. Therefore, we consider the cumulative rainfall totals
(h)
Yi
ih
=
Y (v) dv,
(i−1)h
in disjoint time intervals of fixed length h hours. The second-order properties of the
aggregated process are given by
(h)
E(Yi
(h)
var(Yi
)
= hE{Y (t)},
h
= 2
(h − τ )c(τ ) dτ,
)
(h)
(h)
cov(Yi , Yi+k )
0
h
=
−h
(h)
(2.6)
(h − |τ |) c(kh + τ ) dτ,
(h)
(h)
(h)
k = 1, 2, . . . .
(h)
(2.7)
We define ρ(h) (k) = corr(Yi , Yi+k ) = cov(Yi , Yi+k )/var(Yi ). For the BLGPM
it does not appear possible to evaluate the integrals in (2.6) or (2.7) algebraically.
Therefore, we evaluate them numerically using quadrature. For small h and k this
does not pose a problem. However, for large τ the covariance c(τ ) is difficult to
evaluate reliably because it involves quantities which are either too large or too
small to be stored accurately by a computer. Therefore, in cases where (2.6) and
(2.7) involves evaluating c(τ ) for large τ , we note that for τ >> 1/η, c(τ ) is very
(h)
closely approximated by a constant multiplied by exp(−γτ ). We obtain var(Yi )
as
z
h
(h)
var(Yi ) = 2
(h − τ )c(τ ) dτ + 2
(h − τ )c(τ ) dτ
0
≈
=
z
z
h
(h − τ )c(z) e−γ(τ−z) dτ
z
0 z
2
1
1
(h − τ )c(τ ) dτ + c(z) h − z − + e−γ(h−z) ,
2
γ
γ
γ
0
2
(h − τ )c(τ ) dτ + 2
8
Paul J. Northrop, Tom M. Stone
where z is chosen to be sufficiently large that c(τ ) ≈ c(z) exp{−γ(τ − z)}, yet sufficiently small that c(z) may be evaluated reliably. Similarly, using the approximation
c(kh + τ ) = c(r) exp{−γ(kh + τ − r)}, we obtain
(h)
(h)
cov(Yi , Yi+k )
z
=
−h
(h−|τ |) c(kh+τ ) dτ + g(r, z)
where


g(r, z) =
h − z − γ1 e−γz + γ1 e−γh
 1 c(r)e−γ((k−1)h−r) h + z + 1 e−γ(h+z) − 2 e−γh + 1 e−2γh
γ
γ
γ
γ
1
−γ(kh−r)
γ c(r)e
The value of r is chosen so that c(kh + τ ) ≈ c(r) exp{−γ(kh + τ − r)} and c(r) can
be evaluated reliably. The value of z is chosen, subject to z ≥ −h, so that c(kh + z)
may be evaluated reliably.
3. Model fitting and validation
Rainfall models of this form are generally fitted using a generalised method of
moments approach in which the parameters of the model are chosen to reproduce
selected properties of the data as closely as possible. This may take the form of
selecting as many properties as parameters, seeking to reproduce all these properties
exactly, or selecting a larger number of properties and minimizing some measure of
the discrepancy between model and data, for example, a sum of (weighted) squared
differences. The properties chosen should not be strongly mutually correlated and
should have relatively small sampling variances. The fit of the model to the data
is judged by its ability to reproduce rainfall properties over a range of scales of
temporal aggregation, in particular those properties not used to fit the model.
This method has some drawbacks: the choice of fitting properties is subjective,
the resulting parameters estimates can be very sensitive to the properties chosen,
there may be quite different set of parameter estimates that give similar fits to the
data and inefficient use is made of the data. Nevertheless, if the fitting properties
are selected carefully, to provide information about the parameters of the model,
physically meaningful estimates are likely to be obtained. More formal methods
of fitting are possible. However, the physically appealing short-term dependencies
in the model make formulating a likelihood difficult. Chandler (1997) has developed a fitting method in which an approximate likelihood is constructed based on
the sample Fourier coefficients of the data rather than the data themselves. This
provides an objective fitting method and a means of comparing formally the fit of
different rainfall models. The method is, however, based entirely on the mean and
the second-order properties of the data and has been found to reproduce properties
relating to rainfall occurence poorly (Wheater et al., 2000).
Following Rodriguez-Iturbe et al. (1987) we approximate µT and G∗P (0, 0) using
expansions of the integrals involved, assuming that κ and φ are small. The resulting
apprximations are
1
1
µT ≈ (φη)−1 {1 + φ(κ + φ) − φ(κ + φ)(κ + 4φ) + φ(κ + φ)(4κ2 + 27κφ + 72φ2 )}
4
72
z > 0,
z < 0.
Rainfall modelling
9
λ
µC
µD
µL
µT
µX
σ
BLGPM0
0.043
11.71
0.51
1.64
3.34
2.57
0.035
BLGPM1
0.044
13.70
0.46
1.65
3.42
2.68
0.031
BLRPM
0.045
9.07
0.25
2.49
2.67
1.00
Table 1. Parameter estimates for models fitted to January Brue data. Line 1: BLGPM
with I = 0, line 2: BLGPM with I = 1, line 3: BLRPM.
and
G∗P (0, 0)
≈
−1
(φη)
3
1
1 − κ − φ + κφ + φ2 + κ2 − φ3 −
2
2
15
89
25
+φ4 + κφ3 + κ2 φ2 + κ3 φ +
8
72
72
7 2 11 2
1
κφ − κ φ − κ3
4
12
6
1 4
κ .
24
(a) Empirical results
We present results of fitting the BLGPM and BLRPM to data recorded by
raingauge 19 in the HYREX network, located at East Hill Lane Fields. The dataset
consists of 15 minute rainfall totals, recorded in millimetres, over a period from 10th
October 1993 to 30th September 1998. The tipping bucket mechanism of the gauge
limits the temporal resolution of the raingauge data. The median time between
bucket tips varies from approximately 6 minutes in the winter to approximately 11
minutes in the summer. Consequently, for aggregation periods of less than about 15
minutes, the data discretisation due to the tipping mechanism has a serious effect.
Clearly it is not appropriate to fit the model to the entire dataset as there will
be strong seasonal effects. Therefore, we separate the data into months and take the
month of January as an example. We assume that X is exponentially distributed.
The model fits reported in tables 1 and 2 are based on E(Y (1)), var(Y (1)), P (Y (1) =
0), ρ(1)(1), ρ(1) (3) and ρ(1/2)(1). The autocorrelations have been chosen to give information about the ‘temporal’ parameters µD , µL = 1/γ (the expected time for
which a storm is able to produce rain cells) and σ. The remaining properties give
information about λ, µC and µX . For the January data, and for other months not
presented here, the BLGPM (with either I = 0 or I = 1) is able to reproduce
the observed values of these statistics extremely closely. Therefore the exact form
of the objective function is unimportant. The BLRPM has only five parameters
and is unable to reproduce the six properties chosen exactly. We choose to reproduce E(Y (1) ), var(Y (1) ) and P (Y (1) = 0) exactly and use an (unweighted) sum of
squared differences for the remaining properties. Neither of these choices affects
the qualitative conclusions drawn. Table 1 contains the parameter estimates obtained under the two BLGPMs and the BLRPM. As we would expect the fitted
BLGPMs have rain cells which are, on average, longer-lasting and with a greater
peak intensity than the fitted BLRPM. This is illustrated in figure 3. The shape of
the Gaussian rain cells depends on the duration of the cell. For the BLRPM0 cells
of short duration have a large instantaneous change in intensity at their birth and
death, whereas BLGPM1 cells start and end at zero intensity. For sufficiently large
2.0
1.5
intensity
0.0
0.5
1.0
1.5
1.0
0.0
0.5
intensity
2.0
2.5
Paul J. Northrop, Tom M. Stone
2.5
10
0.0
0.5
1.0
1.5
2.0
2.5
3.0
0
2
4
2.5
2.0
1.5
intensity
0.0
0.5
1.0
2.0
1.5
1.0
0.5
0.0
intensity
8
time / minutes
2.5
time / minutes
6
0
5
10
time / minutes
15
20
0
5
10
15
20
25
time / minutes
Figure 3. Fitted cell profiles under the BLGPM0 (———), the BLGPM1 (- - - - -) and the
BLRPM (− · − · − · −), for cell durations corresponding to to the 10th percentile (top left),
the 25th percentile (top right) the median (bottom left) and the mean (bottom right) of
the respective fitted distribution of cell duration.
durations (greater than the median cell duration, say) cells in the two BLGPMs
are very similar.
Table 2 summarises the fits of the BLGPM and the BLRPM. The crucial aspect
is the extent to which properties not used in the fitting procedure are reproduced.
Overall the BLGPMs are better able to reproduce sub-hourly properties than the
BLRPM. There is very little to choose between the BLGPMs. The autocorrelation
(h)
(h)
corr(Yi , Yi+k ) of the BLRPM decreases in h for fixed k whereas the autocorrelation of the BLGPM more closely follows the non-monotonic pattern of the observed
autocorrelations. For all three models there is close agreement between observed
and fitted probabilities of zero rainfall up to the 12 hour level of aggregation. The
proportion of dry days is underestimated by the models, to a slightly lesser extent
by the BLGPMs than the BLRPM. There is poor agreement between the models
and the data in autocorrelations at large levels h of temporal aggregation - the fitted
autocorrelations decay more quickly than the observed autocorrelations. However,
for large h the sample autocorrelation do not decay monotonically in h and in time
lag as would be expected. This is probably as result of the fact that sample prop-
30
Rainfall modelling
11
level of
aggregation
(h)
1/4
mean
(mm)
0.0256
variance
(mm2 )
0.0169
0.0165
0.0165
0.0141
autocorrelation
lag-1
lag-2
lag-3
probability
of zero rain
0.512
0.466
0.464
0.682
0.353
0.373
0.374
0.430
0.298
0.327
0.326
0.320
0.928
0.931
0.926
0.897
1/2
0.0513
0.0502
0.0484
0.0484
0.0475
0.525
0.525
0.525
0.553
0.337
0.388
0.387
0.321
0.254
0.288
0.287
0.245
0.897
0.879
0.877
0.877
3/4
0.0769
0.0963
0.0925
0.0926
0.0930
0.494
0.535
0.535
0.495
0.319
0.341
0.341
0.285
0.189
0.217
0.217
0.207
0.873
0.863
0.862
0.864
1
0.1025
0.1025
0.1025
0.1025
0.1475
0.1475
0.1475
0.1475
0.521
0.521
0.520
0.464
0.265
0.285
0.285
0.259
0.155
0.155
0.156
0.172
0.852
0.852
0.852
0.852
6
0.6152
2.1907
2.1902
2.1903
2.1175
0.274
0.161
0.162
0.208
0.155
0.028
0.028
0.018
0.176
0.001
0.001
0.002
0.650
0.686
0.684
0.679
12
1.2303
5.5712
5.0856
5.0889
5.1164
0.278
0.083
0.084
0.102
0.228
0.001
0.001
0.001
0.207
0.000
0.000
0.000
0.523
0.528
0.525
0.517
24
2.4606
15.7924
10.9137
10.9245
11.2743
0.282
0.052
0.052
0.047
0.251
0.000
0.000
0.000
0.303
0.000
0.000
0.000
0.394
0.313
0.310
0.300
Table 2. Assessment of models fitted to January Brue data (10th October 1993 to 30th
September 1998). Historical values and fitted values under the BLGPMs and the BLRPM.
Line 1: historical values, line 2: BLGPM0 , line 3: BLGPM1 , line 4: BLRPM. Entries in
bold indicate properties used to fit the model.
erties are relatively imprecise for large h. A lengthier data record is likely to have
produced the expected pattern in the autocorrelations.
4. Concluding remarks
Generalisation of the rectangular pulses of rainfall to truncated Gaussian form has
the desired effect of providing greater flexibility to reproduce short-term properties
of rainfall data. In addition the truncated Gaussian rain cells have greater physical
12
Paul J. Northrop, Tom M. Stone
realism than rectangular cells. The main drawback is the reduction in mathematical tractability which means that evaluation of (co)variances between rainfall totals
requires numerical integration. A potentially useful further generalisation is to incorporate truncated Gaussian cells into the random η BLRPM of Rodriguez-Iturbe
et al. (1988), which may further improve the ability of the model to reproduce the
properties of rainfall over a wide range of temporal scales, and the spatial-temporal
rainfall model of Northrop (1998). An alternative rain cell profile we considered
is X(s) = Xsσ (d − s)σ , for some positive parameter σ. However, this proved less
amenable to mathematical analysis than the Gaussian cell profiles.
Appendix A.
We express the various normal integrals appearing in the mean and autocovariance of the BLGPM in terms of univariate and bivariate Gaussian densities and
distribution functions.
The mean E{Y (t)} involves an integral of the form xφ(x)ecx dx. Integrating
by parts with u = ecx and dv = xφ(x) gives
xφ(x)ecx dx = −φ(x)ecx + φ(x)cecx dx
−φ(x)ecx + cec
=
2
/2
Φ(x − c),
where φ(x)cecx dx follows from equation 100,010 of Owen (1980).
The mean, cw (τ ) and cb (τ ) involve integrals of the form
∞
D(a, b, c) =
Φ(a + bx) c e−cx dx.
0
It follows directly from
1
1 2 2
Φ(bx)eax dx = eax Φ(bx) − ea /2b Φ(bx − a/b),
a
a
(Owen, 1980, equation 101,000) that
2
2
Φ(a + bx) c e−cx dx = −Φ(a + bx) e−cx + eca/b ec /2b Φ(a + bx + c/b).
Thus,
D(a, b, c) =
2
2
Φ(a) − eca/b ec /2b Φ(a + c/b) + eca/b ec
2
2
Φ(a) − eca/b ec /2b Φ(a + c/b)
2
/2b2
if b > 0
(A 1)
if b < 0.
Direct application of equations 10,010.1 and 10,010.8 of Owen (1980) gives
∞
a
a
b
G(a, b, c) =
−BvN √
,
Φ(a+bx)φ(x) dx = Φ √
, c; − √
1 + b2
1 + b2
1 + b2
c
where
BvN [h, k; ρ] =
2π
1
1 − ρ2
k
−∞
2
x − 2ρxy + y2
exp −
dx dy
2(1 − ρ2 )
−∞
h
Rainfall modelling
13
is the distribution function of a standard bivariate Gaussian distribution with correlation ρ. The function G(a, b, c) appears in cw (τ ) and cb (τ ).
If c + d = 0 the integrals C1 and C2 defined in section b can be expressed, using
(A 1), as
1 Φ(a)
e−dτ
C1 (a, b1 , b2 , c, d) =
D(a, b1 , c) −
F (a, a + b2 τ, b1 , b2 , c + d)
d
c
c+d
2
eda/b2 ed
+
c
and
e−dτ
C2 (a, b1 , b2 , c, d) =
c
/2b22
[F (a, a + b2 τ + d/b2, b1 , b2 , c) − Φ(a + d/b2 )D(a, b1 , c)]
1
1
Φ(a)D(a + b2 τ, b2 , d) −
F (a, a + b2 τ, b1 , b2 , c + d)
d
c+d
eca/b1 ec
+
d
2
/2b21
[F (a+c/b1 , a+b2 τ, b1 , b2 , d)−Φ(a+c/b1 )D(a+b2 τ, b2 , d)]
where
F (a1 , a2 , b1 , b2, c)
∞
=
0
Φ(a1 + b1 x)Φ(a2 + b2 x) c e−cx dx.
(A 2)
The only instances where c + d = 0 are in the last two terms of cb (τ ) in (2.5). In
both these cases b1 = b2 < 0. Therefore for these two terms we replace the second
terms in C1 and C2 by
−
e−dτ
J(a, a + bτ, b)
d
respectively, where,
J(a1 , a2 , b) =
0
and
−
e−dτ
J(a, a + bτ, b),
c
∞
Φ(a1 + bx)Φ(a2 + bx) dx.
(A 3)
Integrating (A 2) by parts with u = Φ {a1 + b1 x} Φ {a2 + b2 x} and dv = ce−cx
we obtain
∞
−cx ∞
F (a1 , a2 , b1 , b2 , c) = − Φ(a1 +b1 x)Φ(a2 +b2 x)e
+ b1
Φ(a2 +b2 x)φ(a1 +b1 x) e−cx dx
0
0
∞
Φ(a1 +b1 x)φ(a2 +b2 x) e−cx dx.
+b2
0
Now we integrate the second and third terms by parts with u = Φ(a2 + b2 x) and
dv = φ(a1 + b1 x) e−cx for term two, and similarly for term three. Using
2
φ(x) ecx dx = ec /2 Φ(x − c),
(A 4)
(Owen, 1980, equation 100,010) we find that
1 ca1 /b1 c2 /2b21
φ(a1 + b1 x) e−cx dx =
e
e
Φ {a1 + b1 x + c/b1 } .
b1
14
Paul J. Northrop, Tom M. Stone
Therefore,
F (a1 , a2 , b1, b2 , c) = Φ(a1 )Φ(a2 )
∞
∞
ca1 /b1 c2 /2b21
+e
[Φ(a2 +b2 x)Φ(a1 +b1 x+c/b1 )]0 −b2
e
φ(a2 +b2 x)Φ(a1 +b1 x+c/b1 ) dx
0
∞
∞
ca2 /b2 c2 /2b22
[Φ(a1 +b1 x)Φ(a2 +b2 x+c/b2 )]0 −b1
e
φ(a1 +b1 x)Φ(a2 +b2 x+c/b2 ) dx .
+e
0
In order to simplify the first of the remaining integrals we let y = a2 + b2 x and find
that

1

∞

G(d1 , b1 /b2 , a2 ) if b2 > 0,
b
2
φ(a2 + b2 x)Φ(a1 + b1 x + c/b1 ) dx =
1

0
 − G (d1 , b1 /b2 , a2 ) if b2 < 0,
b2
where d1 = a1 − a2 b1 /b2 + c/b1 . Defining d2 = a2 − a1 b2 /b1 + c/b2 and evaluating
the second integral in a similar manner we get
F (a1 , a2 , b1, b2 , c) = Φ(a1 )Φ(a2 )
c
ca1 /b1 c2 /2b21
+ e
I12 −Φ(a2 )Φ a1 +
− I2 G d1 ,
e
b1
c
ca2 /b2 c2 /2b22
I12 −Φ(a1 )Φ a2 +
− I1 G d2 ,
e
+ e
b2
b1
, a2 + (1 − I2 )G d1 ,
b2
b2
, a1 + (1 − I1 )G d2 ,
b1
b1
, a2
b2
b2
,
, a1
b1
where I12 = I(b1 > 0, b2 > 0), I1 = I(b1 > 0), I2 = I(b2 > 0) are ordinary indicator
functions and where
c
a
b
G (a, b, c) =
Φ(a + bx)φ(x) dx = BvN √
, c; − √
1 + b2
1 + b2
−∞
is derived in a similar manner to G(a, b, c) and
c
1
a
a
H(a, b, c) =
Φ c 1 + b2 + √
φ(x)φ(a+bx) dx = √
φ √
1 + b2
1 + b2
1 + b2
−∞
follows directly from equation 110 of Owen (1980).
In the evaluation of J(a1 , a2 , b) we need only consider the case b < 0. Making
the substitution y = bx and integrating (A 3) by parts with dv=1 we obtain
0
0
1
yφ(a1 + y)Φ(a2 + y) dy +
yφ(a2 + y)Φ(a1 + y) dy .
J(a1 , a2 , b) =
b
−∞
−∞
Integrating by parts with u = Φ(a2 + y) and dv = yφ(a1 + y) for the first integral,
and similarly for the second integral, gives
1
J(a1 , a2 , b) = −
φ(a1 )Φ(a2 ) + φ(a2 )Φ(a1 ) + (a1 + a2 )Φ(a1 )Φ(a2 ) − H(a1 −a2 , 1, a2)
b
− H(a2 −a1 , 1, a1) − a1 G (a1 −a2 , 1, a2) − a2 G (a2 −a1 , 1, a1) .
Rainfall modelling
15
References
Austin, P. M. and Houze, R. A. (1972) Analysis of structure of precipitation patterns
in New England. J. Appl. Meteorol. 11, 926–935.
Chandler, R. (1997) A spectral method for estimating parameters in rainfall models.
Bernoulli 3(3), 301–322.
Cowpertwait, P. S. P. (1994) A generalised point process model for rainfall. Proc.
R. Soc. Lond. A. 447, 23–37.
Cox, D. R. and Isham, V. (1980) Point Processes. London: Chapman and Hall.
Kakou, A. (1997) Point process based models for rainfall. Ph.D. thesis, University
of London. Unpublished.
Moore, R., Jones, D., Cox, D. and Isham, V. (2000) Design of the HYREX raingauge
network. Hydrol. Earth. Syst. Sc. 4(4), 523–530.
Northrop, P. (1998) A clustered spatial-temporal model of rainfall. Proc. R. Soc.
Lond. A 454, 1875–1888.
Onof, C. and Wheater, H. S. (1993) Modelling of British rainfall using a random
parameter Bartlett-Lewis rectangular pulse model. J. Hydrology 149, 67–95.
Owen, D. B. (1980) A table of normal integrals. Communications in Statistics
B9(4), 389–419.
Petterssen, S. (1956) Weather Analysis and Forecasting. Volume 2. New York:
McGraw Hill.
Rodriguez-Iturbe, I., Cox, D. R. and Isham, V. (1987) Some models for rainfall
based on stochastic point processes. Proc. R Soc. Lond. A 410, 269–288.
Rodriguez-Iturbe, I., Cox, D. R. and Isham, V. (1988) A point process model for
rainfall : further developments. Proc. R Soc. Lond. A 417, 283–298.
Wheater, H. S. (2002) Progress in and prospects for fluvial flood modelling. Phil.
Trans. R. Soc. Lond. A 360(1796), 1409–1431.
Wheater, H. S., Isham, V. S., Cox, D. R., Chandler, R. E., Kakou, A., Northrop,
P. J., Oh, L., Onof, C. and Rodriguez-Iturbe, I. (2000) Spatial-temporal rainfall
fields: modelling and statistical aspects. Hydrol. Earth. Syst. Sc. 4(4), 581–601.
Willems (2001) A spatial rainfall generator for small spatial scales. J. Hydrol. 252,
126–144.
Wood, S. J., Jones, D. A. and Moore, R. J. (2000) Accuracy of rainfall measurement
for scales of hydrological interest. Hydrol. Earth. Syst. Sc. 4(4), 531–543.
Download