Optimal Distributed State Estimation with Communication

advertisement
Optimal Distributed State Estimation with Communication
Cost: A Majorization Theory Approach
Gabriel M. Lipsa and Nuno C. Martins
Majorization Theory:
xk+1 = axk + wk
Problem Formulation:
Estimator
Pre-processor
2 2 )
wk ∼ N 0, σW
(NσW
x0 = 0
vk = Pk {xt }kt=0
k
x̂k = Ek {{vt }t=0
Pk : Rk+1 → R {∅}
Ek : (R {∅}
{∅})k+1 → R
The symmetric nonincreasing rearrangement of the function f is given by:
σ
f (x) =
rk =
0, vk = ∅
∅
1, vk =
Proof of Theorem 2:
∞
0
I(z∈Rn :f (z)>ρ)σ (x)dρ
Let {τk }Tk=1 be a sequence of positive real numbers. We define the policy
P o as it follows:
• if vt = ∅ for all t ∈ {{1, . . . , k − 1},
} let vk = ∅ if xk ∈ [−τ
− k , τk ], otherwise
let vk = xk ;
• after a perfect sample was sent, adopt the optimal threshold policy for the
remaining time.
T
2
2
Jβ α, σW
, C, P, E = lim
β k E (xk − x̂k ) + Crk
T →∞
2
Jβ α, σW
,C
=
k=0
2
min Jβ α, σW
, C, P, E
P,E
f
Optimal solution:
f g if:
k
E(α),k {vt }t=0 = x̂k =
P(τ ),k
k
{ t }t=0
{x
=
αx̂k−1 , vk = ∅
, k≥1
vk ,
ow
∅
∅,
xk
fK (x) =
−τk < xk − x̂k < τk
−
,k ≥ 1
ow
Theorem 1: Let the following parameters be given: the variance of the
2
process noise σW
, the system’s dynamic constant α, the communication cost C,
and the discount factor β. There exist a positive real constant ν, such that P(τ )
∞
2
and E(α) are an optimal solution to min Jβ α, σW , C, P, E , where τ = {τk }k=0
P ,E
is a sequence of positive real numbers such that τk = ν for all positive integers k.
JT,β
2
α, σW
, C, P, E
2
JT,β α, σW
,C
=
=
T
k=0
k
2
β E (xk − x̂k ) + Crk
2
min JT,β α, σW
, C, P, E
P,E
Theorem 2: Let the following parameters parameters be given: the variance
2
, the system’s dynamic constant α, the communication
of the process noise σW
cost C, the discount factor β and the time horizon T . There exists a sequence
T
of positive real numbers τ = {τk }k=0 , such that P(τ ) and E(α) are an optimal
2
solution to min JT,β α, σW , C, P, E .
P,E
Remark: The optimal cost JT,β
condition x0 .
Theorem 2 can be proved using an induction argument. Choose an arbitrary
pre-processor policy P, this policy will define ω k , γk|k−1 and γk for all k ∈
{1, . . . , T }
2
α, σW
,C
does not depend on the initial
fσ
σ
||x||≤ρ
f (x)dx ≥
K
f (x)
,x
f (x)dx
||x||≤ρ
g σ (x)dx, for all ρ ≥ 0
The cost obtained by adopting the policy P can be lower bounded as follows:
JT,β
∈K
0, x ∈
/K
A nonnegative function f : Rn → R+ is symmetric and nonincreasing if
it can be expressed via a nonincreasing function g : R+ → R+ such that
f (x) = g(||x||).
Lemma: Let f, g : R → R be two probability distribution functions, such
that f is symmetric and nonincreasing and f g. Let κ be a real number in
interval κ ∈ (0, 1). Let K = [−τ, τ ] be the symmetric interval,
such that
the
τ
f (x)dx = 1 − κ. For any function λ : R → [0, 1] satisfying R g(x)λ(x)dx =
−τ
1 − κ, the following holds:
fK g·λ
1−κ
f ∗bg∗b
Lemma: Let f be a neat and even probability distribution function on the
real line. Let µ, be a probability
distribution function on the real line, such that
µ ≺ f . If we define x̂µ = R xµ(x)dx, then the following holds:
2
x
f (x)dx ≤
R
2
α, σW
, C, P
P, E
T
2
(P)
P ≥ k=1 Eω k (xk − x̂k ) γk
2
+ C + JT −k,β α, σW , C (1 − γk|k−1 )γk−1
The cost obtained by adopting the policy P o is given by:
JT,β
2
o
(P ) = k=1 Eωok (xk − x̂k ) γk
2
+ C + JT −k,β α, σW , C (1 − γk|k−1 )γk−1
2
α, σW
, C, P o , E
o
T
It follows then that:
2
JT,β α, σW
, C, P, E
Proof of Theorem 1:
(P) ≥
2
JT,β α, σW
, C, P o , E
o
(P )
Let the time horizon T go to infinity and the results follow.
Lemma [Hajek]: Let f and g be two probability distribution functions
on Rn , with f is symmetric and nonincreasing and f g. For a symmetric
nonincreasing probability distribution function b the following holds:
Preliminary notations:
Remark:For a given
pre-processor policy P, the estimtor which minimizes
2
, C is the conditional expectation of the state xk given the
the cost JT,β α, σW
entire past, i.e. the estimator which computes x̂k = E [xk |vt , 1 ≤ t ≤ k].
o
The policy P o will define ω ok , γk|k−1
and γko for all k ∈ {1, . . . , T }. Choose
o
the thresholds τk such that γk|k−1 = γk|k−1
for all k ∈ {1, . . . , T }. It can be
shown that ω ok ω k for all k ∈ {1, . . . , T }, and moreover ω ok is symmetric and
nonincreasing.
2
(x
−
x̂
)
µ(x)dx
µ
R
ω k = f (xk |vt = ∅, 1 ≤ t ≤ k, x0 )
γk = P (vt = ∅, 1 ≤ t ≤ k, x0 )
γk|k−1 = P (vk = ∅|vt = ∅, 1 ≤ t ≤ k − 1, x0 )
Conclusions:
We have shown that for the given problem, the decision to send or not to send
a sample of the underlying stochastic process depends only on the estimation
error and that there exists an optimal threshold policy.
References:
Bruce Hajek, Kevin Mitzel and Sichao Yang, “Paging and Registration in
Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm,” IEEE
Transactions on Information Theory, Feb 2008
O. C. Imer and T. Basar. “Optimal estimation with limited measurements,”
in Proc. IEEE CDC/ECC 2005, Seville, Spain submitted to IEEE Transactions
on Automatic Control
Albert W. Marshall, Ingram Olkin, Barry Arnold, ”Inequalities: Theory of
Majorization and Its Applications (Springer Series in Statistics),” Academic
Press, New York
Download