Symbolic dynamics of Markov chains

advertisement
Symbolic dynamics of Markov
chains
P S Thiagarajan
School of Computing
National University of Singapore
Joint work with:
Manindra Agrawal, S Akshay, Blaise Genest
Acknowledgements
• Samarjit and Javier
• IAS
• TÜV SÜD Foundation
Agenda
• Study complex dynamical systems
– verification of temporal logic specification
• High dimensional
• Continuous time; continuous value domains.
– Differential equations
– Hybrid automata
• Networks of such dynamical systems
Strategy
• Approximate the dynamics.
– Discretize the time and value domains
• Today’s first talk will have this flavor.
– Sample the dynamics and do a statistical analysis.
• Today’s second talk
• To handle large networks: Deterministic synchronizations
• Tomorrow’s talk
Probabilistic dynamics
• Illustrate these ideas in the setting of probabilistic
dynamical systems.
– Yield useful approximations in the presence intractability and
lack of knowledge.
– Scalability can be achieved through sampling (simulations)
based statistical analysis techniques.
• Statistical model checking
• Key application: (Networks) of biochemical networks.
Study I: Symbolic dynamics of Markov
chains
•
•
•
•
Well-known probabilistic dynamical systems.
Rich theory
Widely applied.
Our focus (finite state, discrete time):
– Symbolic dynamics
• Discretize the probability value space [0, 1].
Symbolic dynamics of Markov chains
• Analyze the symbolic dynamics via:
– classical (linear) temporal logic specifications.
• Probabilities are sneaked in through the atomic propositions.
– Model checking methods
• Details in:
– LICS’2012
– Journal paper under review.
Two views of Markov chains
• View I
– a finite state probabilistic transition system.
• View II
– a linear transform of probability distributions over the states.
The transition system view
1
3
1
2
1
1
2/5
3/5
4
The transition system view
0
1
1
3
1
2
3/5
1
1
0
2/5
4
0
The transition system view
3
1
3
1
1
1
2
1
4
3/5
2/5
1
2/5
4
2/5
3/5
1
2
3
3
4
4
2/5
3/5
The transition system view
• Outcomes: Infinite paths
• Basic cylinder: The set of infinite paths that have a
common finite prefix.
• Path space: -algebra generated by the basic cylinders.
• Probabilities are assigned to basic cylinders in a natural
way.
• Extends canonically to a probability measure over the
path space.
The transition system view
3
1
3
1
1
1
2
4
3/5
2/5
1
2/5
1
4
1
Pr(B) = 1  2/5  1  1
= 2/5
2
1
1
B – The set of all paths that
have the prefix 3 4 1 3 4
3/5
3
3
1
1
4
B
4
Measurable properties
• The state 3 will be visited infinitely often.
PCTL, CTL, ….
Baier, C. and Katoen, J.-P. Principles of
Model Checking
VIEW II
• The Markov chain transforms (linearly) a given
distribution over its nodes to a new one.
• The graph of the chain is represented as a stochastic
matrix
– (all row sums are 1)
• The Perron-Frobinius theory (for non-negative matrices)
applies.
VIEW II
• Leads to natural sub-classes:
– Irreducible and aperiodic,
– periodic, …
• transient states, recurrent states….
• Stationary distributions….
VIEW II
1
3
1
2
1
1
2/5
3/5
4
The second view
0
1
3
1
2
1
1
1
0
2/5
3/5
4
0
Trajectory of distributions
• Markov chain as (linear)transformer of probability
distributions
3
1
1
1
2 0
3/5
1
1
0
2/5
4
0
Transition matrix M
Trajectory of distributions
0
0
3
1
1 1
1
2
3/5
1
2/5
0
4
Transition matrix M
• Each initial distribution generates a trajectory of distributions:
Trajectory of distributions
0
0
3
1
1 1
1
2
3/5
1
2/5
0
4
Transition matrix M
• Each initial distribution generates a trajectory of distributions:
Trajectory of distributions
• Dynamics:
– The set of trajectories generated from an initial set of
distributions
– This set can be infinite
• We can ask whether all (none) of the trajectories satisfy
some desired dynamical property
• What we can ask in this setting is incomparable with
View I specifiable properties
Trajectory of distributions
• Eventually the probability of being in 1 is always greater
than being in 3.
• There is a future time at which 90% of the probability
mass lies on {1, 4}.
Symbolic dynamics
• A trajectory:
– an infinite sequence of probability distributions (over a finite set
of nodes).
– The alphabet of probability distributions -over a finite set of
nodes- is (uncountably) infinite.
• Hence a trajectory is an infinite sequence over an infinite
alphabet,
• But exactly tracking probability distributions may be
neither necessary nor possible.
– Bio-chemical networks
– Sensor networks…
Symbolic dynamics
• We propose to reason about sequences of distributions
in a symbolic way.
– Using a finite alphabet of discretized distributions.
Symbolic dynamics
• A classical tradition dynamical systems
theory.
– Jacques Hadamard(1898), Morse and
Hedlund (1921), ….
– Shannon, Smale…
– Significant applications in data storage
and transmission (via coding theory),
linear algebra.
Symbolic dynamics
• We know how to do this for:
– Timed automata
– In some restricted settings for hybrid automata.
• For finite state Markov chains this has been done under
very restricted setting and unnatural restrictions (Chada
et.al 2011, Korthikanti et.all 2010.)
• We shall instead give up on bisimulation-based
equivalence classes:
• Instead, fix a discretization of [0, 1]
– With no restrictions handle all finite state Markov chains .
Symbolic dynamics
c
d
F2(x)
F(x)
F3(x)
e
a
b
x
Finite alphabet!
Symbolic dynamics
• Each block is a letter.
• Each trajectory is now recorded as a
sequence of letters taken from a finite
alphabet.
• If each block bisimualtion equivalence
class it is called a Markov partition!
• Study the system dynamics in terms of
these sequences.
– Sophic shift sequences
– Shift sequences of finite type.
Symbolic dynamics
• We know how to get “Markov partitions”) do this for:
– Timed automata
– For hybrid automata, in a few settings.
• For finite state Markov chains this has been done under
very restricted setting and unnatural restrictions (Chada
et.al 2011, Korthikanti et.all 2010.)
• We do not look for bisimulations of finite index.:
• Instead, we fix a discretization of [0, 1]
– Handle all finite state Markov chains ; no restrictions.
Symbolic dynamics of Markov chains
• Discretization
– We partition [0,1] into a finite set of intervals,
a
b
c
Symbolic dynamics of Markov chains
a
b
c
• Each (probability) value is mapped to
(identified with) the interval it falls in.
Symbolic dynamics of Markov chains

Symbolic dynamics of Markov chains
•
Each distribution 𝜇 maps to D, a unique n-tuple of intervals.
• 𝜇 ∈D means Γ(𝜇) = D

= D
Symbolic dynamics of Markov chains
• Each distribution 𝜇 maps to D, a unique n-tuple of intervals.
• Several distributions can map to the same D,
• in fact Γ-1(D) can be infinite

= D

Symbolic dynamics of Markov chains
• Discretization
– We partition [0,1] into a finite set of intervals,
• Γ-1(D) can be empty
?

= D
Symbolic dynamics of Markov chains
• Discretized distribution
– A tuple of intervals D is discretized distribution ( 𝒟- distribution
for short ) if Γ-1(D) ≠ ø
ø

= D

= D ×
Symbolic dynamics of Markov chains
• The set of discretized distributions 𝒟 is finite
Symbolic dynamics of Markov chains
• A Trajectory of M starting from a distribution 𝜇 :
• Induces a symbolic trajectory
– a word over 𝒟ω
• ξ 𝜇 = (Γ(𝜇) Γ(𝜇1) Γ(𝜇2) ….)
Symbolic dynamics of Markov chains
3
0.45
1
2
0.12
1
1
1
0.35
2/5
3/5
4
0.08
.

.
Symbolic dynamics of Markov chains
• Fix a set of initial distributions IN.
• We use a 𝒟-distribution Din to specify IN. That is IN = Din
This can be an infinite set .
– IN =
=
,
………..
• We can fix the set of initial distributions in many other
ways.
Symbolic dynamics of Markov chains
• The symbolic dynamics of (M , Din ) is:
• We wish to reason about this -language.
Symbolic dynamics of Markov chains
• The discretization need not be uniform
• Can be different for each dimension (node).
• {[0, 1]} can be used to mask out “don’t care” nodes.
– Dimension-reduction.
A temporal logic to reason about the
symbolic dynamics
• I, a discretization of [0, 1] :
– < i , d > is an atomic proposition
– node i interval d
– “In the current distribution 𝜇, 𝜇(i) falls in the interval d ”.
– In the current discretized distribution D, D(i) = d
• Probabilistic linear-time temporal logic LTL I
The verification problem
• Given,
–
–
–
–
a Markov chain M,
a discretization I,
an initial set of distributions represented as Din ,
and a specification φ as an LTL I formula,
• Determine if M, Din ⊨ φ. In other words,
– (0) ⊨  for every  in LM
– Does every symbolic trajectory of M satisfy φ, i.e.,
– is it the case LM ⊆ L φ?
Example formulas
• Whenever probability of node i is “high”, the probability of
node j is “low” :
• The 𝒟 -distribution (d1, d2, . . . , dn) appears infinitely often:
Example formulas
• Extending with FO theory of reals, we can express much more:
– e.g., Infinitely often, the probability of node i is at least twice the sum
of probabilities of all other nodes.
• Logics based on path spaces (PCTL etc.) and logics based
sequences of probability distributions are incomparable
Examples
• We have modeled a simple version of the Google
pageranking algorithms.
• We have also modeled a small pharmacokinetics system
for drug delivery.
The verification problem
• Given, M, I, Din , φ, determine if M, Din ⊨ φ , i.e., LM ⊆ L φ
– If LM is ω-regular and effectively computable then we can use standard
model checking techniques.
– But LM is not always ω-regular !
The verification problem
LM is not always ω-regular.
𝜖-approximations
• Fix a small 𝜖 > 0 (fraction of length of an interval of I )
.
𝜇
.
𝜇’
𝜖-approximations
• Fix a small 𝜖 >0 (fraction of length of an interval of I )
.
𝜇
D
𝜖
.
𝜇
.
𝜇’
𝜖-approximations
• Fix a small 𝜖 >0(fraction of length of an interval of I )
.
𝜇
D
𝜖
.
𝜇
.
𝜇’
• The 𝜖-nbhd of a concrete distribution 𝜇 is the set of all 𝒟distributions which are atmost 𝜖-away from it.
𝜖-approximations
• The 𝜖-nbhd of a concrete distribution 𝜇 is the set of all 𝒟distributions which are atmost 𝜖-away from it.
– Nbr()
• 𝓕 is an -neighborhood iff there exists  such that
Nbr() = 𝓕
𝜖-approximations
Suppose D, D’ belong to a final class. Then there exist 
in D and ’ in D’ such that (, ’)  2𝜖
• Suppose D, D’ belong to a final class. Then for every 
in D and ’ in D’ , (, ’)  2𝜖 +  where  depends
only on the discretization.
•
𝓕
D

DD
2
’ D’
DD
A basic property of the symbolic dynamics
• There exists a computable constant K𝜖 and a computable
ordered collection of 𝜖-nbhds { 𝓕0, 𝓕1, . . . , 𝓕𝜃} called
final classes such that:
– After K𝜖 steps, ξ 𝜇 will forever cycle through the ordered final
classes. In other words, ξ 𝜇(k) ∈ 𝓕k mod 𝜃 for k > K𝜖
A basic property of the symbolic dynamics
• There exists a computable constant K𝜖 and an ordered
collection of 𝜖-nbhds { 𝓕0, 𝓕1, . . . , 𝓕𝜃} called final
classes such that:
– After K𝜖 steps, ξ 𝜇 will forever cycle through the ordered final
classes. In other words, ξ 𝜇(k) ∈ 𝓕k mod 𝜃 for k > K𝜖
𝜇
M
𝜇1
M
𝜇2
K𝜖 steps
𝓕0
.
𝜖
𝓕1
M
𝓕𝜃-1
M
M
𝓕2
A basic property of the symbolic dynamics
• There exists a computable constant K𝜖 and an ordered
collection of 𝜖-nbhds { 𝓕0, 𝓕1, . . . , 𝓕𝜃} called final
classes such that:
– After K𝜖 steps, ξ 𝜇 will forever cycle through the ordered final
classes. In other words, ξ 𝜇(k) ∈ 𝓕k mod 𝜃 for k > K𝜖
𝜇
M
𝜇1
M
𝜇2
K𝜖 steps
𝓕0
.
M
𝓕1
M
𝓕𝜃-1
M
M
𝓕2
𝜖-approximations of trajectories
• A symbolic trajectory ξ’ is an 𝜖- approximation of ξ 𝜇 (the
symbolic trajectory gen. by 𝜇) iff:
– ξ 𝜇(k)= ξ’ 𝜇(k) for all 0≤ k ≤ K𝜖 ,
– For all k> K𝜖 , ξ’ (k) and ξ (k) belong to the same final class.
Two approximate verification problems
• Determine if
– M, Din ⊨𝜖 φ , i.e.,
for every 𝜇 ∈ Din , there exists an 𝜖-approximation of ξ 𝜇 in L φ.
– M, Din ⊨𝜖 φ , i.e.,
for every 𝜇 ∈ Din , every 𝜖-approximation of ξ 𝜇 is in L φ.
Two approximate verification problems
• Proposition
– M, Din ⊨𝜖 φ implies LM ⊆ L φ
– M, Din ⊭𝜖 φ implies LM ⊈ L φ
Two approximate verification problems
• If M, Din ⊨𝜖 φ and M, Din ⊭𝜖 φ then M 𝜖- approximately
meets φ.
• If we want to do better we can set 𝜖’ = 𝜖/2 and iterate
(incremental overhead).
The main result
• Theorem:
Checking M, Din ⊨𝜖 φ and M, Din ⊨𝜖 φ are both
effectively solvable problems.
The main result
• The proof first assumes a single initial distributions and
proceeds to establish the theorem for:
– Irreducible aperiodic chains
– Irreducible periodic chains.
– General chains
• This is then extended to a set of initial distributions Din.
• The main step is the transient + steady state
characterization of the symbolic dynamics.
The main result
The main result
Additional results
• Convex hull of a finite set of distributions can also be
handled.
• The atomic propositions can be constraints expressed in
the first order theory of reals.
• Restrictions on the eigenvalues can yield -regular
behaviors (egs.: distinct and real)
Summary
• Discretizing the values space [0, 1] leads to an interesting
and useful symbolic dynamics of finite state Markov
chains.
• Extensions:
–
–
–
–
Optimizations
Implementations
Case studies
Interval Markov chains
Download