Quickest Detection and Its Applications

advertisement
Quickest Detection and Its Allications
Zhu Han
Department of Electrical and Computer Engineering
University of Houston, Houston, TX, USA
Outline

Introduction
– Basics
– Markov stopping time

Quickest Detection
– Sequential detection
– Bayesian detection
– CUSUM test

Applications
–
–
–
–
Cognitive radio network
Multiuser detection for memory
Medical applications
Smart grid

Conclusions
Classic Hypothesis Test

Probability Space (Ω, F, P)
– Ω is a set, a sample space
– F is a event
– P is the probability measure assign to the event

Detection: “Spot the Money”
Hypothesis Testing

Let the signal be y(t), model be h(t)
Hypothesis testing:
H0: y(t) = n(t)
(no signal)
H1: y(t) = h(t) + n(t) (signal)

The optimal decision is given by the Likelihood ratio test
(Nieman-Pearson Theorem), g is a threshold.
Select H1 if L(y) = log(P(y|H1)/P(y|H0)) > g;
otherwise select H0.
Signal detection paradigm
Receiver operating characteristic (ROC) curve
Tradeoff between false alarm and detection probability
Basics of Quickest Detection


A technique to detect distribution changes of a sequence of
observations as quick as possible with the constraint of
false alarm or detection probability.
Classification
1.
2.
3.

Sequential detection: determine asap between two known
distributions, starting from time 0.
Bayesian detection: at random time (known distribution),
distribution changes between two known distribution.
CUSUM test: at random time (unknown distribution),
distribution changes to known/unknown distribution.
Applications
1.
2.
3.
4.
Cognitive Radio: Primary user reappear
Multiuser Detection: Memory
Network Monitoring:
Medical Device: Fall or not
Markov Stopping Time

For Markov process: memoriless property

likelihood of a given future state, at any given moment,
depends only on its present state, and not on any past states

Random variable YT: a reward that can be claimed at time T
Optimal stopping time that maximizes the reward

S is finite or infinite.
For finite time S case




backward induction
dynamic programming for Markov Case
For infinite time S case

Define

Stopping time
Outline

Introduction
– Basics
– Markov stopping time

Quickest Detection
– Sequential detection
– Bayesian detection
– CUSUM test

Applications
–
–
–
–
Cognitive radio network
Multiuser detection for memory
Medical applications
Smart grid

Conclusions
Sequential Detection

How to reach a decision between two hypotheses after minimal
average trails?

A real sample sequence, {Zk;K=1,2…} that obey one of two
hypotheses:

Stop the observation as soon as the decision is made
Trade off between probability of error and decision time. More
accurate, more decision time. Quicker decision, less accurate


A sequential decision rule (s.d.r.) as the pair (T,δ), in which T
declares the time to stop sampling and then δ takes the value 0
or 1 declaring which one of H1 , H0
Performance indices of interest

Average cost of errors
– False Alarm
– Missing Probability
– Average cost of errors,  is the probability of event
– c0 and c1 are constants to balance the tradeoff
The cost of sampling
 s.d.r. to solve the optimization problem

Equivalent Rule
Optimal Detection Rule

We can rewrite the problem

Optimal stopping time

Optimal cost
S() and thresholds

An illustration of s(π)

The thresholds are found from s(π)
– One is for false alarm
– The other is for missing prob.
Sequential probability ratio test
Sequential probability ratio test (SPRT) with boundaries
A and B : (SPRT(A, B))
– It exhibit minimal expected stopping time among all
s.d.r.’s having given error probability.
– The stopping time T is equivalently be written as

Example

At the 1st exit of ∧k from (A,B), decides H1 if the exit is to
the right of this interval and H0 if the exit is to the left.
Outline

Introduction
– Basics
– Markov stopping time

Quickest Detection
– Sequential detection
– Bayesian detection
– CUSUM test

Applications
–
–
–
–
Cognitive radio network
Multiuser detection for memory
Medical applications
Smart grid

Conclusions
Baysesian quickest detection





The distribution changes with unknown time (but known
distribution for the changing time). The objective of
observer is to detect such a random change, if one occurs,
as quickly as possible.
The difference from the sequential detection
The design of quickest detection procedures involves the
optimization of a tradeoff between two types of
performance indices: detection delay vs. false alarm.
For example, network from WIFI to Bluetooth
Approaches



Shiryaev’s problem for Bayesian quickest detection
Bojdecki’s quickest detection problem and other constraints
Ritov’s quickest detection problem: Game theory approach
Shiryaev’s problem
for Bayesian quickest detection

Random sequence, {Zk ; k=1,2,…} suppose there is a
change point, t, such that given {Z1 , Z2…, Zt-1} with
marginal distribution Q0 , and {Zt , Zt+1…, ZT} with
marginal distribution Q1

Two performance indices
– The expected detection delay:
– The false alarm probability:

The determination of optimal stopping time, T,
– It was a first posted by Shiryaev. It considers
– C>0, is a constant controlling the balance between 2
indices.
Geometric distribution assumption

To find the optimal stopping time, it need to assume a
specific prior distribution for the change pint, t,
– π and ρ are the constant lying in the interval (0,1)
– π, probability that a change already occurred when
the sequence observation start.
– ρ, the conditional probability that the sequence will
transition to the post-change state at any time, given
that it has not done so prior to that time
Optimal Solution
Example


How to find optimal threshold
Detection vs. time example
Other penalty functions

The penalty parameters act like an optimal constraints (i.e.
penalize combination of false alarms and detection delay)
but the solutions ideally converge to the solution or the
original one.
1. an example is a delay penalty of polynomial type (T-t)p for fixed p>0
2. The exponential penalty. (replace P(T<t) with P(T<t-ε) for fixed ε >0)
3. A alterative delay penalty
Bojdecki’s problem

A different approach to detecting the change point t within
Bayesian framework by maximizing the probability of
selected the right estimator for t based on the observation.

B is an approx. measurable set and XT depends the observed
Zk . If T* is existed, will be called optimal.

Let if maximizing the probability of stopping within m
units of the change point t.

Omit other details
A game theoretic formulation


An alternative approach: Ritov’s gametheoretic quickest detection problem
A game consists two player.
– Player#1: “the statistician” is attempting to quickly
detect a random change point as in the preceding
section
– Player#2: “nature” is attempting to choose the
distribution of the change point and foil the
Player#1.
– Given the probability of the change point
Is allowed to be a function of the past observation
{Z1~Zk-1}, which is selected by “nature”.
Outline

Introduction
– Basics
– Markov stopping time

Quickest Detection
– Sequential detection
– Bayesian detection
– CUSUM test

Applications
–
–
–
–
Cognitive radio network
Multiuser detection for memory
Medical applications
Smart grid

Conclusions
Non-Bayesian quickest detection

Previously, Shiryaev’s problem for Bayesian quickest
detection assumed the change point t, which is a random
variable with given, prior distribution.
– How to solve if the system has no pre-existing
statistical model for occurrence of event, like in
surveillance or inspection system?

Lorden’s problem for non-Bayesian quickest detection
– Problem definition
– Page’s CUSUM test
– Performance of Page’s test

Asymptotic results
– Lorden’s approach

The false-alarm constraints
Lorden’s problem

The detection delay is penalized by its worst case
value :
– Where d(T) is the worst case delay, and dt (T) is the
average delay under Pt
Constraint and Problem Formulation

The rate of false alarms can be quantified by the
mean time between false alarms

The design criterion is then given by:
– is positive, finite constant, and is the stopping time for
minimizing the worst-case delay within lower-bound
constraint in the mean time between the false alarms.
Cusum test (Page, 1966)
Likelihood of composite hypothesis Hn against H 0 :
Hv: sequence has
density f0 before v, and f1 after
max 0£ k£ n (Sn - Sk ) = Sn - min 0£ k£ n Sk ,
where
H0: sequence is
stochastically homogeneous
k
f (x )
S0 = 0; Sk = å log 1 j
f 0 (x j )
j=1
Stopping rule :
N = min{n ³ 1: gn = Sn - min 0£ k£ n Sk ³ b}
gn
for some threshold b
gn can be written in recurrent form
g0 = 0;gn = max(0,gn-1 + log
f1 (x n )
)
f 0 (x n )
This test minimizes the worst-average
detection delay (in an asymptotic sense)
b
Stopping time N
Outline

Introduction
– Basics
– Markov stopping time

Quickest Detection
– Sequential detection
– Bayesian detection
– CUSUM test

Applications
–
–
–
–
Cognitive radio network
Multiuser detection for memory
Medical applications
Smart grid

Conclusions
Example: Cognitive Radio
Lane reserved
for military
Licensed
Spectrum
Or
Primary
Users
Public Traffic
Lane congested!
Unlicensed
Spectrum
Or
Secondary Users
Treated as
Harmful
Interference
Spectrum Sensing

Secondary users must sense the spectrum to
– Detect the presence of the primary user for reducing interference on
primary user
– Detect spectrum holes to be used for transmission

Spectrum sensing is to make a decision between two
hypotheses
– The primary user is present, hypothesis H0
– The primary user is absent, hypothesis H1

Quickest detection for spectrum sensing
– A distribution change in frequency domain is detected in
observations to quit from or join into the licensed frequency band
– There exist unknown parameters after the primary radio emerges
Collaborative Spectrum Sensing

Collaborative spectrum sensing
3- Fusion Center makes
final decision: PU present
or not
Secondary User
Common Secondary
Fusion Center
Primary User
(Licensed user)
2- the SUs send their
Local Sensing bits to a
common fusion center
Secondary User
Secondary User
Secondary User
1- the SUs perform Local
Sensing of PU signal
Collaborative Quickest Spectrum Sensing

The collaborative quickest spectrum sensing without
communication coordination
– An node made own broadcast decision
– The random time-slot selection
– The limited time slots for the secondary users to exchange
information

The key issue is to determine
whether to broadcast based on the
current observation and the local
population of secondary user.
– A threshold broadcast scheme
is proposed
Medical Applications

Patient falling



CUSUM test




Quickest detection to detect as soon as possible to prevent or
report
False alarm limitation
No prior information
How to train the threshold
Need real data
Computation



Bluetooth between sensor and google phone Android
Computation in Android using JAVA
Communication through 3G or WIFI for reporting
Outline

Introduction
– Basics
– Markov stopping time

Quickest Detection
– Sequential detection
– Bayesian detection
– CUSUM test

Applications
–
–
–
–
Cognitive radio network
Multiuser detection for memory
Medical applications
Smart grid

Conclusions
Power System State Estimation Model

Transmitted active power from bus i to bus j
– High reactance over resistance ratio
– Linear approximation for small variance
– State vector
, measure noise e with covariance Ʃe
– Actual power flow measurement for m active power-flow branches
– Define the Jacobian matrix
– We have the linear approximation
– H is known to the power system but not known to the attackers
State Estimation (SE)
•
•
•
z=Hx+e, for n power lines and m measurement, m<n
H: Jacobean Matrix (n×n)
x: State variable (n×1)
z: Measurements (m×1), m<n
e: noise vector (n×1)
Goal of system is to estimate x from z
SE is a key function in building real-time models of
electricity networks in Energy Management Centers (EMC)
Real-time models of the network can be used by
Independent System Operator (ISO) to make optimal
decisions with respect to technical constraints (such as
transmission line congestion, voltage and transient stability)
Bad Data Injection and Detection

Inject Bad data c: z=Hx+c+e

Bad data detection
– Residual vector
– Without attacker
where
– Bad data detection (with threshold )
without attacker:
with attacker:

otherwise
Stealth (unobservable) attack
– Hypothesis test would fail in detecting the attacker, since the
control center believes that the true state is x + x.
QD System Model

Assuming Bayesian framework:
– the state variables are random with

The binary hypothesis test:

The distribution of measurement z under binary hyp: (differ
only in mean)

We want a detector
– False alarm and detection probabilities
Detection Model - NonBayesian

Requiring a non-Bayesian approach due to unknown
prior probability, attacker statistic model

The unknown parameter exists in the post-change
distribution and may changes over the detection
process.
– You do not know how attacker attacks.

Minimizing the worst-case effect via detection delay:
Detection
delay

Detection
time
Actual time of
active attack
We want to detect the intruder as soon as possible
while maintaining PD.
Multi-thread CUSUM Algorithm

CUSUM Statistic:
How about the
unknown?
where Likelihood ratio term of m measurements:

By recursion, CUSUM Statistic St at time t:

Average run length (ARL) for declaring the attack:
Declare the attacker is existing!
Otherwise, continuous to the process.
Linear Solver for the Unknown

Rao test – asymptotically equivalent model of GLRT:

The linear unknown solver for m measurements:
– Omitting the necessity of [J-1]
– Simplifying Quadratic form

solo-parameter envir.
the unknown > 0
Recursive CUSUM Statistic w/ linear unknown
parameter
The unknown
is nosolve:
long
involved
Simulation: Adaptive CUSUM algorithm

2 different detection tests: FAR: 1% and 0.1%

Active attack starts at time 6

Detection of attack at time 7 and 8, for different FARs
Conclusion



Different from the other detection techniques that minimize
error, quickest detection minimizes the decision time.
Trade off between decision time and error probability (false
alarm and error probabilities)
Depending on the different scenarios




Sequential detection
Bayesian detection
Non-Bayesian detection
Applications




Wireless network
Medical applications
Smart grid
Other applications?
Questions?
Download