esses c Pro

advertisement
Stat 330 (Spring 2015): slide set 18
2
5. Exponential race: P (min(S, T ) > t) = P (S > t, T > t) = e−(λ+μ)t if
T, S independent. What about P (min(T1, · · · , Tn) > t)?
(this is key for Poisson process later)
P (T > t + s|T > t) = P (T > s)
4. Lack of memory property for Exponential:
3. pdf of Exponential distribution: fT (t) = λe−λt for t ≥ 0, and λ is the
rate, 1/time. What is E(T ), and Var(T )? What is E(X) and Var(X))
2. Poisson: P (X = k) = e−λλx/x! where X is the number of observations
of rare event during certain time period (or space).
1. Exponential: P (T ≤ t) = 1 − e−λt for all t ≥ 0 where T is waiting time
for rare event to happen (once).
Review: What is Exponential distribution? and Poisson distribution?
Poisson Process
Last update: February 16, 2015
Stat 330 (Spring 2015)
Slide set 18
Stat 330 (Spring 2015): slide set 18
Poisson process
3
Jargon: X(t) is a “counting process” with independent Poisson increments.
X(t2) − X(t1) is independent from X(t4) − X(t3)
3. non-overlapping intervals are independent for any 0 ≤ t1 < t2 ≤ t3 < t4
X(t2) − X(t1) ∼ P oλ(t2−t1)
2. distribution depends only on length of interval for any 0 ≤ t1 < t2:
1. for t > 0, X(t) takes values in {0, 1, 2, 3, . . .}.
Definition: A stochastic process X(t) is called homogenous Poisson process
with rate λ, if
Stat 330 (Spring 2015): slide set 18
1
The example about ’hits on a webpage’ is a typical example of stochastic
process, and it has a special name: Poisson Process.
3. Values of X(t) are called states, the set of all possible values for X(t) is
called the state space.
2. Modeling usually requires somehow specifying the joint distribution
(X(t1), · · · , X(tk )) or P (X1 ∈ A1, · · · , Xk ∈ Ak )
1. Stochastic process is a mathematical model of reality.
Some remarks:
Definition: A stochastic process is a set of random variables indexed by
some indices, particularly time t, and is usually denoted by X(t).
Review: What is a Random variable?
Stochastic Processes
Stat 330 (Spring 2015): slide set 18
6
The time until the k th hit Ok is therefore given as the sum of inter-arrival
times Ok = I1 + . . . + Ik .
• What is a nonhomogeneous process?
7
• We mention homogeneous Poisson process; What is meant by
homogeneous?
• Why Poisson so important?!
Note: This theorem is very important! - it links the Poisson, Exponential,
and Erlang distributions tightly together! Some thoughts:
The time until the kth hit Ok is an Erlangk,λ distributed variable, ⇐⇒
X(t) is a Poisson process with rate λ.
and the inter-arrival time between successive hits:
Ij = Oj − Oj−1 for j = 1, 2, . . .
Corollary:
O0 = 0, Oj = time of the j thoccurrence = the first t for which X(t) ≥ j
X(t) is a Poisson process with rate λ iff the inter-arrival times I1, I2, . . .
are i.i.d. Expλ.
For a given Poisson process X(t) we define occurrences
Stat 330 (Spring 2015): slide set 18
Stat 330 (Spring 2015): slide set 18
Equivalence theorem:
5
Equivalence theorem
X(t) = X(t) − X(0) ∼ P oλ(t−0)
4. The distribution of X(t) is Poisson with rate λt, since:
3. With the same argument, X(0) = 0 - ALWAYS!
2. Similarly, X(t2) − X(t1) is the number of occurrences in the interval
(t1, t2].
Based on the last example:
Example (Cont’d)
Stat 330 (Spring 2015): slide set 18
1. X(t) can be thought of as the number of occurrences until time t.
Remarks
Example (cont’d)
4
♦ For example, we find that X(t) = 3 for t ∈ [5, 8] minutes; i.e., only 3
hits upto any time within 5 to 8 minutes.
♥ Here arrival times are generated from Exp(2). X(t) counts numbers of
hits until time t min.
♣ A counter of the number of hits on our webpage is an example for a
Poisson process with rate λ = 2/min.
Example
Stat 330 (Spring 2015): slide set 18
+ 10 · e
−10
+ 10 /2e
2
−10
= 0.0028.
= 0.1889.
8
Stat 330 (Spring 2015): slide set 18
)=e
−5/3
P (X > 650)
Then:
Z≤
650 − 600
√
600
N (0, 1).
≈
10
= 1 − 0.9798 = 0.0202. Webpage table, p.789 or Baron p. 386
≈ 1 − Φ(2.05)
approx
X−600
√
∼
600
= 1 − P (X ≤ 650) = 1 − P
Then X ∼ N (600, 600) → Z :=
approx
Recall that a Poisson distribution with large rate λ can be approximated
by a normal distribution with mean μ = λ and variance σ 2 = λ.
5. The number of hits in the first hour is Poisson with mean 600.
You would like to know the probability of more than 650 hits. Exact
calculation isn’t really feasible. So approximate this probability and justify
your approximation.
P (Y ≥ 10) = 1 − P (Y ≤ 10) = 1 − (1 − e
−10·1/6
Let Y be the time until the first hit - then Y has an Exponential
distribution with parameter λ = 10 per minute or λ = 1/6 per second.
2. Evaluate the probability that the time till the first hit exceeds 10 seconds.
(You may also check the Poisson cdf table).
P (X ≤ 2) = P o10(2) = e
−10
Let X be the number of hits in the first minute, then X is a Poisson
variable with λ = 10:
1. Evaluate the probability of 2 or less hits in the first minute.
Hits on a website: Hits on a popular Web page occur according to a Poisson
Process with a rate of 10 hits/min. One begins observation at exactly noon.
Example
Stat 330 (Spring 2015): slide set 18
k
4
=
= 0.4 minutes
λ 10
k
4
=
= 0.04minutes2.
λ2 100
where X ∼ P oi(λ · t)
9
11
♠ This tells us a way to simulate a Poisson process with rate λ on the
interval (0, T ).
♣ In other word, given that there were k arrivals, the set of arrival times is
the same as the locations of k darts thrown at random on the interval [0, t].
Theorem: Let X(t) be a Poisson process. Given that X(T ) = k, the
conditional distribution of the time of the k occurrences O1, . . . , Ok is the
same as the distribution of k ordered independent standard uniform variables
U(1), U(2), . . . , U(k).
Poisson process possesses an interesting property that is consistent with
thinking of it as ”random occurrences” in time t, which leads to the
conditioning theorem
Poisson Process: Conditioning
Stat 330 (Spring 2015): slide set 18
= P o4(3) = 0.433 Website table,p.786 or Baron p.384
= P (X ≤ 3) where X ∼ P oi(10 · 0.4)
P (T > 0.4) = P (X < 4)
4. Evaluate the probability that the time till the 4th hit exceeds 24 seconds.
Need P (T > 24/60) where T ∼ Erlang(4, 10) and T is in minutes; so
we’ll use the Gamma-Poisson formula:
V ar[T ] =
E[T ] =
Let T be the time till the 4th hit. Then T has an Erlang distribution with
stage parameter k = 4 and λ = 10 per minute.
3. Evaluate the mean and the variance of the time till the 4th hit.
Stat 330 (Spring 2015): slide set 18
second, generate w many standard uniform values u1, . . . , uw
define oi = T ·u(i), where u(i) is the ith smallest value among u1, . . . , uw .
•
•
12
♣ There, we have departures as well and, related to that, the time each
surfer stays - which we will call service time (from the perspective of the
web server).
♦ So far, we are looking only at arrivals of events. Besides that, we could,
for example, look at the number of surfers that are on our web site at the
same time.
♥ The above theorem tells us, that, if we pick k values at random from
an interval (0, t) and order them, we can assume that the distance between
two successive values has an exponential distribution with rate λ = k/t.
first, draw a Poisson value w from P oλT . ( This tells us, how many
uniform values Ui we need to simulate )
•
Simulating a Poisson Process
Download