A Modal View on Abstract Learning and Reasoning Henry Soldano

Proceedings of the Ninth Symposium on Abstraction, Reformulation and Approximation
A Modal View on Abstract Learning and Reasoning
Henry Soldano
L.I.P.N, Université Paris-Nord, UMR-CNRS 7030
99 Avenue J.-B. Clément, 93430, Villetaneuse, France
sentence. The starting point has to be found in Formal Concept Analysis (Ganter and Wille 1999) and Galois lattice
theory (Caspard and Monjardet 2003). In such approaches,
both L and P(W ) are lattices, and the relation between
L and P(W ) is materialized as an extension-intension lattice. This lattice is the structure of the definable elements of
P(W )(Ben-David and Ben-Eliyahu-Zohary 2000), i.e. the
subsets of W that each are the extension of some sentence
of L. In such a lattice each definable set e represents the
equivalence class of all the sentences whose extension is e,
and a node, also called a concept, is made of a pair (e, t)
where t is the most specific sentence of the class, denoted
as the intension of the concept. Note that from a Machine
Learning perspective, we only have access to a learning set
of instances W , and so the definable sets, with respect to W
are arguably the only subsets of interest: searching hypotheses in L comes down to searching the space of definable sets.
In (Pernelle et al. 2002) particular self-maps, denoted as
projections, are used to reduce L or P(W ) in such a way
that the relation between the language and the extensional
space is still materialized as a lattice, denoted as a projected
extension/intension lattice. In other words, projections ensure that we have a coarser, still structure preserving, view
of concepts representatives of the universe we deal with, or
at least on the data we have on this universe. More recently,
projections on a lattice have been shown to be in one to one
correspondence with parts of the lattice that are closed under
the least upper bound operator. These structures are denoted
as abstractions and so to each abstraction A is associated the
corresponding projection pA . As a consequence projected
extension/intension lattices are also denoted as abstract extension/intension lattices (Soldano and Ventos 2011).
To summarize, the set of the definable subsets of P(W ),
i.e the extensional space with respect to the language L,
has, under some conditions, a complete lattice structure. Abstractions of either L or P(W ), preserve the lattice structure
while reducing the size of the structure, i.e. while simplifying our representations.
We investigate here more specifically abstractions of
P(W), denoted as extensional abstractions i.e. parts of
P(W ) that are closed under set theoretic union, and simply
call them abstractions of W in what follows.. This leads us
to a class of modal logics whose semantics relies on extensional abstractions, and we will denote them modal logics
Abstract
We present here a view on abstraction originating from the relation between formulas in a partially ordered language L and
their extension on a set of instances W . In Formal Concept
Analysis, this relation is materialized as a lattice G. Particular self-maps on either L or the powerset P(W ) are known
to ensure structure-preserving reductions of the lattice G and
have been shown to be in one to one correspondence with
abstractions, defined as subsets of either L or P(W ) closed
under union. We investigate specifically extensional abstractions (subsets of P(W )). Such an abstraction, a special case
of domain abstraction, comes down to a change in granularity: extensions are now considered as union of abstract instances, i.e. union of predefined subsets of instances.
The main contribution of the paper is the investigation of
the class of (non normal) monotonic modal logics whose semantics relies on such abstractions, and that we call abstract
modal logics.
1
Introduction
Abstraction has been extensively investigated in Artificial
Intelligence, and was demonstrated as useful in particular in
Problem Solving (see for instance (Yang et al. 2008)). There
were various attempts to formalize abstraction and characterize desirable properties with respect to induction(Zucker
2003). An early statement about abstraction in Machine
Learning was that abstraction should be order-preserving
with respect to the partially ordered language in which hypotheses are searched for. One of the purpose of the program
INTHELEX (Esposito et al. 2004) is to induce such semantic abstractions. The model KRA, proposed by J-D Zucker
and L. Saitta (Zucker 2003; Saitta and Zucker 2001) starts
from a grounded representation of a perceived world, using
various representation layers, from a universe of configurations to the highest level language. However, the question of
what classes of abstractions are to be investigated for learning and reasoning is far of being exhausted.
We present here a view on abstraction that originates from
the relation between a sentence t of a partially ordered language L and the extension of the sentence t on a set of instances W , i.e. the subset of W whose elements satisfy the
c 2011, Association for the Advancement of Artificial
Copyright Intelligence (www.aaai.org). All rights reserved.
99
where 2A t means ”Abstractly t“ with respect to the abstraction A. Then, so starting from the semantics, we show that
the corresponding modal logics are particular cases of classical modal logics (Chellas 1980) and relate properties of the
abstractions A to axioms and inference rules of such modal
logics, further denoted as abstract modal logics. This also
leads to propose an abstract worlds semantics, relying on
the abstraction A, of abstract modal logics as a particular
and intuitive case of neighborhood semantics.
Furthermore, consider the set A of all the abstractions of
W . We will see that A is a lattice with respect to the following abstraction order: if an abstraction A is a subset of
an abstraction A, then A is more abstract than A. As we
will see whenever p →A q is valid and the abstraction A
is more abstract than A, then p →A q is also valid. This
ordering of abstractions leads to introduce multimodal logics of abstraction, in which a modal connector is associated
to each abstraction Ai . In such a multimodal logic we can
reason with different abstractions.
of abstraction. The main purpose of this paper is to motivate, introduce and investigate these logics that turn to be in
general non normal monotonic modal logics.
Defining an extensional abstraction consists in restricting
the extensional space to a part A of P(W ). This means that
we will then no more consider instances as unitary elements
of the extensional space, but rather particular subsets of W ,
denoted as abstract instances. The extension of a given sentence t is then transformed in an abstract extension and is
written as extA (t) = pA ◦ ext(t) that turns to be the union
of all the abstract instances whose elements all satisfy t.
As an example consider W = {w1 , w2 , w3 , w4 },
and let A be obtained by closing under union the part
{{w1 , w2 }, {w1 , w3 }}. As a result, {w1 , w2 , w3 } belongs to
A but {w2 , w3 } does not. Now, consider the smallest elements of A that contain a given instance w. We call these
elements the minimal abstractions of w and denote them as
abstract instances. So such an abstraction corresponds to a
change in the granularity of the instance space. In the previous example, {w1 , w2 } is the unique minimal abstraction
of w2 , {w1 , w3 } is the unique minimal abstraction of w3 ,
{w1 , w2 }, {w1 , w3 } are the two minimal abstractions of w1 ,
and w4 has no abstraction in A.
Suppose furthermore that ext(t) = {w1 , w2 , w3 , w4 },
then the abstract instances included in ext(t) are {w1 , w2 }
and {w1 , w3 } and therefore extA (t) = {w1 , w2 , w3 }.
Clearly the abstract extension of a given term is always included in its original extension. In other words, we reduce
the extension of the term by excluding any instance that has
no minimal abstraction included in the original extension. In
the current example, w4 is such an instance. The intuition
here is a change in granularity: the new objects we deal with
are the minimal abstractions of the original instances.
Consequently we define an abstract implication as valid
on an abstraction A whenever the abstract extension of the
antecedent is included in the abstract extension of the consequent. Going back to our example, suppose ext(p) =
{w1 , w2 , w4 } and ext(q) = {w1 , w2 , w3 }, then the implication p → q is not valid on W . Consider now the
corresponding abstract implication p →A q. As we have
extA (p) = {w1 , w2 } and extA (q) = {w1 , w2 , w3 }, p →A q
is valid. As a matter of fact, this turns to be a general property of abstractions: they preserve validity of implications.
In other words we have that whenever p → q is valid, then
p →A q is also valid.
An interesting case is the Alpha abstraction that corresponds to Alpha Lattices (Ventos and Soldano 2005). Such
an abstraction is obtained by considering an a priori categorization of the instances together with a parameter α
that corresponds to some degree of influence of the categorization. As we will see in section 4, this allows for instance, when considering animal species divided into mammals, insects and birds, to obtain valid abstract implications
as apt to fly →A oviparous, when the corresponding logical implication apt to fly → oviparous is falsified by few
mammals as bats.
We then define modal logics of abstraction by stating that
abstract implications are in fact implications between abstract sentences, i.e. rewriting t1 →A t2 as 2A t1 → 2A t2 ,
2
Preliminaries
In a recent paper, extensional and intensional abstractions
have been introduced as reductions of either the language
L or the extensional space P(W ) that preserve the original latticial structure of the relation between the language
and the instance universe (Soldano and Ventos 2011). We
present in this section the necessary definitions and various
results about abstractions, as intended here. We introduce
the sufficient conditions to obtain a Galois connection relating L and P(W ) and discuss the corresponding extension/intension lattice G. The question of a structure preserving way to reduce G is first answered using self-maps on
either P(W ) or L, and then in showing the one to one correspondence between these self-maps and particular subsets
of the domain, further denoted as abstractions.
Extension-intension lattices
We call here instances the elements of W but later denote
them as worlds in the section discussing modal logics of abstraction. We also call intensional representations the terms
of L and extensional representations the elements of the
powerset P(W ). In this presentation we suppose that we
know whenever an instance w satisfies the term t, and we
define accordingly its extension:
Definition 1 extW (t) = {w ∈ W | w satisfies t}.
In what follows we consider W as a fixed set of instances
and we simply write ext(t) the extension in W .
The language L is partially ordered by a general-tospecific relation. We write t1 t2 whenever t1 is less specific than t2 , or equivalently t1 is more general than t2 . Conversely, there is also a semantic order relation representing
the inclusion order of extensions. We further consider that
t1 t2 ⇒ extW (t1 ) ⊆ extW (t2 ) is always true but that
the converse is false. This simply means that some representable instances do not belong to the universe we modelise. For instance, as all birds are blue in our universe, the
100
Let M be a lattice and p a projection of M , then p(M )
is also a lattice with join operator ∨ and meet operator ∧
defined as, for any pair m1 , m2 ∈ p(M ), m1 ∧ m2 =
p(m1 ∧M m2 ) and m1 ∨ m2 = m1 ∨M m2 .
When considering respectively M = L and M = P(W )
we obtain respectively intensional and extensional projections and both lead to projected extension-intension lattices
(Pernelle et al. 2002).
Recently, abstractions, as defined hereunder, have been
proposed as an equivalent, and more intuitive, view of projections (Soldano and Ventos 2011).
terms blue bird and bird have the same extension in our universe of worlds. As a consequence there is an equivalence
relation on terms related to the extensional order:
Definition 2 t1 ≡ t2 iff ext(t1 ) = ext(t2 )
The following proposition relates L and P(W ) by a Galois connection. The proposition 1 directly follows from, for
instance, theorem 2 in (Diday and Emilion 2003).
Proposition 1 Let L be a finite language, a partial order
on L denoted as specificity, W be a finite set of instances
and ext : L → P(W ) be a mapping such that t1 t2 ⇒
ext(t1 ) ⊇ ext(t2 ). We consider the following conditions:
Definition 4 An abstraction of a lattice M is a subset1 of M ,
closed under ∨M .
• (condition 2.1) For any instance w, there is a unique most
specific term, denoted as the instance description d(w),
among all terms t such that w ∈ ext(t)
• (condition 2.2) L has a greatest element and is a lower
semi lattice, i.e. each pair of terms t1 , t2 of L has a
unique greatest lower bound t1 ∧L t2 in L , also called
the least general generalisation (for short lgg) of t1 and
t2 .
Building abstractions therefore simply means to choose any
subset of M and close it by ∨M . We note hereunder that
abstractions are in a one to one correspondence with projections.
Proposition 2 Let A be an abstraction of a lattice M , then
pA defined as pA (x) = c∈A,c≤x c , is a projection of M .
Let p be a projection then A = p(M ) is an abstraction of
M , and p is the projection pA associated to A.
Whenever conditions 2.1 and 2.2 are satisfied, (L, ) is a
lattice, i.e. two terms t1 and t2 also have a least upper
bound denoted t1 ∨L t2 , and the pair (int, ext) where
d(w)
int(e) =
The equivalence between interior systems p(M ) and subsets
A of M closed under union is a known property of interior
operators (Erné et al. 1993) i.e projections. As abstractions
are closed under ∨M , we have that ∨M = ∨A . From now on
we simply write ∨ when no confusion is possible. An important point is that we only need the ∨-irreducible elements of
A. These are defined as the elements of A that cannot be
obtained by applying ∨M to other elements.
w∈eL
is a Galois connection.
This means that whenever the previous conditions are satisfied, i) an equivalence class of L with respect to W corresponds to a given subset e of W and is represented by a
unique most specific term t such that e = ext(t), ii) the set
G of the pairs (e, t) is structured as a lattice, denoted as an
extension-intension lattice,
Finally G is considered as the structure of W as perceived
through L and can be also represented by the set TW of all
the implications p → q which are valid on W , i.e. such that
ext(p) ⊆ ext(q) is true.
Proposition 3 Let A be an abstraction of M , and AI be
the set of ∨-irreducible elements of A, the projection pA is
obtained by only considering elements of AI :
pA (x) =
c
c∈AI ,c≤x
3
Ordered abstractions
We discuss here both intensional and extensional abstractions, and define an order on abstractions.
Projections and Abstractions
Now, consider the following problem; how to reduce P(w),
L, or both, in such a way that the extension-intension lattice is smaller that G ? An answer to this question is given
in (Pernelle et al. 2002) through the use of intensional and
extensional projections whose images are also lattices:
Intensional abstractions An intensional abstraction is
simply obtained by selecting part of the language L and closing it by the join operator ∨L . Therefore, given a pair of
terms (t1 , t2 ) belonging to an abstraction A of the language
L, the minimal specialization in L of t1 and t2 , namely
t1 ∨L t2 , also belongs to A.
For instance, consider the lattice L whose elements are intervals [a, b] such that a, b ∈ {1, 2, 3, 4}. Here applying the
minimal specialization operator we obtain [1, 3] ∨L [2, 4] =
[2, 3]. The ∨L -irreducibles are then [1, 3], [1, 2], [1], [2, 4],[3, 4], [4]. Consider first the abstraction A whose elements
Definition 3 (Projection) p is a projection of a lattice
(M, ≤) iff for each pair (x, y) of elements of M , we have
the following properties:
• If x ≤ y then p(x) ≤ p(y) (monotonicity)
• p(x) ≤ x (minimality)
• p(x) ≤ p(p(x)) (semi-idempotence)
1
Note that ⊥M , the smallest element of M , belongs to all abstractions.
101
• The least upper bound A1 ∨A A2 is obtained by closing
A1 ∪ A2 under ∨M .
• The greatest lower bound A1 ∧A A2 simply is A1 ∩ A2 .
are the intervals containing 3. A is closed by ∨L as intersecting two intervals containing 3 results in an interval also
containing 3.
As a counterexample, consider now L = L −
{[1], [2], [3], [4]}. L is obtained by simply deleting the most
specific terms and clearly still is a lattice. However the minimal specialization in L of [2, 3] and [3, 4] is now ∅ and so
clearly L is not closed by ∨L , and therefore is not an abstraction.
Note that such a restricted view of language reduction
has nice properties. For instance, the possibility to check
whether an instance w satisfies a term through a simple
subsumption test is preserved when reducing the language
through intensional abstractions:
The ordering of extensional abstractions will further be
the basis for the definition of multimodal logics of abstraction.
4
Extensional abstractions
An extensional abstraction is obtained by considering part
of the powerset of W , and closing it by the union operator
∪. This means that we do not consider any more instances
but rather sets of instances, called abstract instances.
Proposition 4 Let L and extW satisfy condition 2.1 and
2.2, and A = pA (L) be an intensional abstraction, then for
all h in pA (L), {x} ⊆ ext(h ) iff h pA (d(x))
Definition 6 Let A be an extensional abstraction A and pA
the associated projection on P(W ), then let t be a term of
L, extA (t) = pA ◦ ext(t) is called the abstract extension of
t.
Let AI be the set of ∪-irreducibles elements of A, from
Proposition 3 we also have
u
extA (t) =
Extensional abstractions They abstract instances rather
than abstracting the language of terms. Note that when considering extensional abstractions, ∨ is the set theoretic union
∪, and that the order relation is the set theoretic inclusion.
u∈AI ,u⊆ext(t)
Example 1 As an example consider W = {1, 2, 3, 4},
and the abstraction A obtained by closing under union
the part {{1, 2}, {1, 3}} of P(W ), so adding {1, 2, 3}and
∅ to build A. The set of ∪-irreducible elements of A is
AI = {{1, 2}, {1, 3}}. From Proposition 3, pA ({1, 2, 3}) =
{1, 2} ∪ {1, 3} = {1, 2, 3} as both elements of AI are included in {1, 2, 3}, and pA ({2, 3}) = ∅ because neither
{1, 2} nor {1, 3} is included in {2, 3}.
Hereunder we define abstract instances and minimal abstractions of a given instance.
Definition 7 Let A be an extensional abstraction and AI
the set of the ∪-irreducible elements of A.
• An element of AI is called an abstract instance.
• We denote as minimal abstractions of w the minimal
elements of A that contains w. More precisely, the
set Am (w) of minimal abstractions of w is defined as
Am (w) = {u ∈ AI | w ∈ u and if w ∈ u ⊂
u, then u ∈ A}.
In section 4 we will investigate that particular kind of abstraction.
The lattice of abstractions
There is a partial order on projections of a lattice M (Pernelle et al. 2002):
An interesting point is that AI is the set of all the minimal
abstractions of instances in W :
Definition 5 Let M be a lattice and p1 et p2 two projections
on M , we will state that p2 ≤ p1 , i.e. p2 is less concrete,
and so more abstract than p1 , iff there is some projection p
defined on p1 (M ) such that for all c in M , p2 (c) = p◦p1 (c).
Proposition 6
AI =
Am (w)
{w∈W }
So A2 is more abstract than A1 iff A2 is an abstraction
of A1 . This is also equivalent to say that any ∨-irréductible
element i2 of A2 may be written as a join of ∨-irreducible
elements of A1 , i.e. i2 = i11 ∨ · · · ∨ in1
In the particular case where A is closed under intersection,
each element has at most one minimal abstraction. In the
example hereunder we consider instances that may belong
to more than one minimal abstract instance.
Example 2 (Extensional abstractions)
Let M = P({1, 2, 3, 4}), AI1 = {{1, 2}, {1, 3}} and
AI2 = {{1, 2}, {1, 2, 3}}, A2 is more abstract than A1 as
{1, 2} ∈ AI1 and {1, 2, 3} = {1, 2} ∪ {1, 3}.
Example 3 Let W ={plastic chair, wood table, wooden
box} and AI = {{plastic chair, wood table}, {wood table,
wooden box}}. Here the objects are abstracted as furnitures
or wooden objects. The wood table has here two minimal
abstractions, first as a furniture, second as a wooden object.
When closing AI under intersection we add {wood table}
as an abstract instance, and all objects have now a unique
minimal abstraction in the resulting abstraction.
The set A of abstractions on M is a lattice.
Proposition 5 (Lattice of abstractions) Let M be a lattice, the set A of the abstractions of M is a lattice. Let then
A1 = p1 (M ) and A2 = p2 (M ) be elements of A:
102
5
Definition 8 (Abstract implication) Let t1 et t2 be terms of
L: whenever extA (t1 ) ⊆ extA (t2 ) we say that the abstract
implication t1 →A t2 is valid on A.
We will formalize the nature of abstract implications by interpreting them as classical implications relating modalized
terms. So we rewrite t1 →A t2 as 2A t1 → 2A t2 , and
consider that from an observed set of worlds W , in some
model M , we extract a set of modal valid implications that
we translate as modal axioms M 2A t1 → 2A t2 for further
reasoning. Hereunder we discuss the corresponding modal
logics of abstraction, denoted as abstract modal logics.
Now as t1 → t2 means that ext(t1 ) ⊆ ext(t2 ), we
have that whenever some abstract instance u is included in
ext(t1 ), we also have that u is included in ext(t2 ). This also
means, by definition, that extA (t1 ) is included in extA (t2 ).
Proposition 7
• Let A be an abstraction, then t1 → t2 ⇒ t1 →A t2
• Let A be an abstraction more abstract than A, then
t1 →A t2 ⇒ t1 →A t2
Neighborhood semantics
A modal logic, in its simplest form, is a propositional logic
to which is added at least one unary modal connector 2,
generally referred to as a necessity operator. They differ following the axiom schemata and inference rules in which
appear the modal connectors. Axiom schemata are written
as φ where φ is a modal formula, Axioms are theorems,
and inference rules states that whenever we have some antecedent theorem, we can infer a consequent theorem. Classical modal logics are the modal logics in which formulas
are given truth values through neighborhood semantics, also
known as minimal models semantics (Chellas 1980). This
wide class of modal logics includes in particular normal
modal logics relying on Kripke possible world semantics.
We will define hereunder abstract modal logics as particular
classical modal logics, usually non normal, and discuss their
properties. We hereunder recall neighborhood semantics.
The modal language Lmod is obtained by adding the
modal connector 2 to a propositional language L. A model
M = (W, N , ext) is defined through i) a set of worlds W ,
ii) a neighboring function N relating each world w to a set
of subsets of W , iii) a valuation function ext, relating each
atomic proposition P to the set of worlds in which P is true,
and we write for such a world w that M, w |= P . The valuation function is then straightforwardly extended to L (for
instance, for any pair of formulas φ and ψ, we have that
M, w |= φ ∧ ψ iff M, w |= φ and M, w |= ψ) and is then
extended to Lmod , in the following way :
The partial order of abstractions, as defined above, means
that A is more abstract than A whenever any element of
AI may be written as the union of elements of AI . This
means in particular that considering a categorization of animal species on mammals, birds and insects is more abstract
than also considering that mammals have subclasses made
of terrestrial and non terrestrial mammals.
Alpha abstractions
An interesting case is Alpha abstractions. Starting from a
subset C whose union closure is an initial abstraction A,
more concrete abstractions Aα are built, where α represents some degree of influence of the initial categorization
C. More precisely Aα is obtain by deriving from each initial
category C the set C α of frequent enough parts of C 2 , joining the C α ’s of all the categories, and closing under union
the resulting set C α . We have then that Aα is more abstract
than Aα iff α ≥ α . In a previous work the corresponding
Alpha Galois lattices were defined through projections and
experimented in order to extract abstract implications (Ventos and Soldano 2005). As an example, consider the categorization of species in ”mammals“, ”insects“ and ”birds“
and a value α = 0.1. The implication ”Animals that fly are
oviparous” is not valid on W because of the bat that belongs to extW (flies) but not to extW (oviparous). However
consider the corresponding 0.1-abstract implication. The bat
belongs to the category ” mammals“ and the proportion of
mammals that fly is less than 0.1, and so there is no frequent enough part of the category ”mammals“ included in
extW (flies), and as a consequence the bat does not belong
to ext0.1 (flies). Therefore, the abstract implication is not falsified. Note that flying birds and flying insects do belong to
ext0.1 (flies) and also to ext0.1 (oviparous), and as a consequence do not falsify the abstract implication.
Now, what is the use of abstract implications as t1 →A
t2 ? The basic idea, is that when considering some particular
instance w, whether w is in extA (t1 ), i.e. t1 is abstractly
true, we may deduce that t2 is also abstractly true. So such an
implication is useful as far as there are instances in which the
premise t1 is abstractly true. When abstraction increases, the
following effects occur: i) There are more valid implications,
ii) as extA (t1 ) decreases with abstraction (second property
of projections) each implication tends to be less useful.
2
Modal logics of abstraction
∀φ ∈ Lmod , M, w |= 2φ iff ext(φ) ∈ N (w)
(1)
We have then that M, w |= φ holds for all models M and
worlds w, iff φ is a theorem of the corresponding classical
modal logic.
A mapping m from P(W ) to P(W ) is associated to the
neighborhood function N and defined as m(E) = {w ∈
W | E ∈ N (w)}. As a consequence we have
ext(2φ) = {w ∈ W | ext(φ) ∈ N (w)} = m ◦ ext(φ)
and as a consequence Equation 1 rewrites as:
M, w |= 2φ iff w ∈ m ◦ ext(φ)
(2)
These mappings m are in one-to-one correspondence with
the neighborhood functions N as we have E ∈ N (w) iff
w ∈ m(E). Properties of classical modal logics, in terms of
axioms and valid inference rules, follow then from properties of the mapping m.
Let c ⊆ C, then c ∈ C α iff |c| ≥ α ∗ |C|
103
Abstract modal logics
Proposition 8
Now recall that we defined abstract extension extA as p◦ext
where p is a projection. We will so naturally define abstract
modal logics as classical modal logics in which the mapping
m is a projection. To characterize abstract modal logics we
have so to translate the properties of projections as axioms
and inference rules. We have the following correspondence:
1. X ⊆ Y ⇒ p(X) ⊆ p(Y ) corresponds to:
• 2True (axiom N ) iff the abstraction A is complete,
i.e. every world w has at least one minimal abstraction
in AI . The smallest logic for complete abstractions is then
EMT4N .
• (2P ∧ 2Q) → 2(P ∧ Q) (axiom C) iff A is closed
under ∩, i.e every world w has at most one minimal abstraction in AI . The smallest regular abstract logic is then
EMCT4 .
• Normal abstract modal logics, i.e. abstract modal logics
with N and C, are at least EMCNT4 , known as S4 . S4
obeys to the Kripke possible worlds semantics relying on
a reflexive and transitive accessibility relation R (Chellas
1980).
• Normal abstract logic with ¬2¬P → 2¬2¬P , (axiom
5) is EMCNT5 , known as S5 in which R is an equivalence relation and AI is a partition of W .
From (P → Q) infer (2P → 2Q)
the inference rule (RM ) that characterizes monotonic
modal logics (Hansen and Kupke 2004)
2. p(X) ⊆ X corresponds to the axiom schema T :
2P → P
3. p(X) ⊆ p ◦ p(X) corresponds to the axiom schema 4 :
2P → 22P
The smallest3 monotonic modal logic EM is the smallest
classical logic E equipped with RM as an inference rule.
To define the class of abstract modal logics we have to add
axioms T and 4, and so the smallest abstract modal logic is
EMT4 .
Note that the smallest normal abstract logic S4 is obtained
when each world have exactly one minimal abstraction. See
Example 3 how closing by intersection the set of minimal
abstractions changes the nature of the underlying modal
logic, at the price that additional abstract instances are imposed. However is it practically useful to consider nonnormal abstract logics? Or, in other words, is it useful to consider abstractions that are not closed under intersection ? As
an example, alpha abstractions discussed at the end of section 4 are not closed under intersection. Consider the alpha
abstraction Aα on animal species previously described and
two different minimal abstractions c1 and c2 in the category
mammal. Each contains α ∗ |mammal | elements, but clearly
(c1 ∩c2 ) has strictly less than α∗|mammal | elements4 and so
does not belong to Aα . When closing Aα under intersection
we obtain as minimal abstractions all the single species, and
so there is no more abstraction. More generally, as soon has
we want to express a notion of granularity in which granules
overlap, as in example 3, the underlying abstract modal logic
is non-normal.
Definition 9 The class of abstract modal logic is defined as
the class of monotonic modal logics with T and 4 as axioms.
The abstract worlds semantics
Considering the abstraction A associated to a projection pA ,
we have then that:
ext(2φ) = {w ∈ W | ∃c ∈ Am (w), s.t c ⊆ ext(φ)}
and Equation 1 and 2 both rewrites also as:
M, w |= 2φ iff ∃c ∈ Am (w), s.t c ⊆ ext(φ)
(3)
and we say that φ is abstractly true in the world w.
This new semantics, simply obtained by replacing the
neighborood function N by the abstraction A, and equation
1 by equation 3, is denoted as the abstract worlds semantics,
and (W, A, ext) is denoted as an abstract model. To summarize, the abstract worlds semantics states that φ is abstractly
true in w iff there is some minimal abstraction c of w whose
worlds all satisfy φ.
Note that, any element φ of Lmod can then be stated as
either true or false. For instance, consider a model M whose
worlds represents animal species, and let A0.1 be the alpha
abstraction described at the end of Section 4. Consider then
the formula φ = viviparous ∧ 2 flies. This sentence is intended to be true for all animals that both are viviparous
and abstractly fly. Let w represent the bat. We have then
that both viviparous and flies are true in w, and therefore
the formula viviparous ∧ flies is also true in w. However,
as noticed before, 2flies is false in w as few mammals (i.e.
the category to which w belongs) do fly. As a consequence
φ = viviparous ∧ 2flies is also false in w.
Now we relate the properties of the abstraction A to properties of modal logics (omitting proofs). We summarize the
derived classification in table 1.
3
Abstract reasoning
Modeling We consider that the worlds (or instances) are
objects of a universe from which we intend to extract knowledge using some modeling or learning process. We observe
then a finite set of objects W , considering some a priori abstraction A. We have also a set of atomic propositions (properties) whose truth values are observed on each object, and
therefore their extensions on W are known.
Suppose for instance that we have observed that on the
learning set W the implication φ1 → φ2 , where φ1 and φ2
are formulas of Lmod , holds. We write then M φ1 → φ2
to state that this implication is supposed to be true in all the
worlds of some intended model M . This means that we consider now that the implication holds for all objects of the
universe and that for any object w of the universe, we can
always state for any property P whether one of its minimal
4
Except when α = 0 or when α = 1 and that the categorization
is a partition of the universe W .
i.e. with the minimal set of axiom schemata and inference rules
104
• p2 (X) ⊆ p1 (X)
• p2 (X) ⊆ p2 ◦ p1 (X)
From these properties it follows that in our multimodal logic
RM , T and 4 are generalized whenever A2 is more abstract
than A1 in:
• From 2A1 φ → 2A1 ψ infer 2A2 φ → 2A2 ψ (RM’)
• 2A2 φ → 2A1 φ (T’)
• 2A2 φ → 2A2 2A1 φ (4’)
Furthermore as A is a sublattice the greatest lower bound
∧ and least upper bound ∨ are as defined in section 3 and
so for any pair of abstractions (A, A ) in A, A ∧ A is more
abstract than both A and A . As a result, multiple abstract
reasoning is possible.
As an example, consider two abstractions A and A and
suppose that we have both 2A φ → 2A ψ and 2A ψ →
2A λ. Then, from 2A φ → 2A ψ we infer by (RM )
2A∧A φ → 2A∧A ψ. In the same way from 2A ψ →
2A λ, we infer 2A∧A ψ → 2A∧A λ. From these we deduce 2A∧A φ → 2A∧A λ.
This means that we may compose two abstract implications associated to two distinct and possibly incomparable
abstractions, and obtain an abstract implication.
This may be useful for instance when modeling an agent
that have multiple points of view, each corresponding to an
abstraction, on the universe. Another interesting case is the
multi-agent one, where each agent has its own point of view
on the universe. In the later case, we may interpret the previous multimodal example as a way for two agents to find
a sufficiently high level of abstraction to perform a kind of
cooperative reasoning.
Table 1: Properties of the set of abstraction A versus classical modal logic classification
A is a partial covering of W
closed by ∪
A is complete
|Am (w)| ≥ 1
A is closed under intersection
|Am (w)| ≤ 1
A is closed under intersection
and complete
|Am (w)| = 1
AI is a partition
EM T 4 :
monotonic + T 4
EM N T 4 :
monotonic +T 4N
EM CT 4 :
regular +T 4
EM CN T 4 = S4 :
normal+T 4
(R reflexive and transitive)
EM CN T 5 = S5 :
normal+T 5
(R equivalence relation))
abstraction have this property. For instance in the previous
examples on alpha abstraction about animal species, we suppose that we will always be able to categorize an animal as a
mammal, a bird or an insect, and that we can always state for
each property P the proportion of animals in each category
that possess the property. Of course, these proportions are
estimated on the objects of W during the modeling process.
From a practical point of view, abstract implications as
defined in Section 4 are of the form 2φ1 → 2φ2 where
φ1 and φ2 are conjunction of properties, and are straighforwardly extracted from abstract concept lattices.
Simple abstract reasoning We consider here reasoning
in a modal logic as drawing consequences from a set Γ of
antecedents, and from a set of axioms and inference rules.
Inference is performed as in propositional logic, as far as
no modal inference rules are used, and the modal inference
rules, as RM , are used only on axioms. For sake of clarity
we write then the modus ponens rule MP as: from Γ φ and
Γ φ → ψ infer Γ ψ, and the modal rule RM as : from
φ → ψ infer 2φ → 2ψ.
Now let us suppose that we intend to use M φ1 → φ2 .
In our reasoning process such statements act as axioms and
we omit the subscript M .
Now consider 2φ1 as an antecedent. We have then the
following reasoning:
From φ1 → φ2 and RM we infer 2φ1 → 2φ2 ,
then from Γ 2φ1 and 2φ1 → 2φ2 we infer Γ 2φ2 ,
and so to summarize, from abstractly φ1 we have deduced
abstractly φ2 .
6
Related work and conclusion
In Imielisnski(Imielinski 1987) domain abstraction was proposed to query databases. Domain abstractions, that corresponds to our extensional extensions, are also used in reinforcement learning: in order to find good policies with a reasonable amount of exploration it is necessary to reduce the
state space. Loscalzo, Lin and Wright (Loscalzo and Wright
2010; Lin and Wright 2010) investigate in this context tile
coding as an abstraction method: the space state is dynamically discretized into variable size tiles. Each tile covers
range of values for each feature in the state space.
Regarding extensional abstractions, the formal work closest to ours concerns rough sets logics. Rough sets basically
relies on an indiscernability relation, first considered as an
equivalence relation, and further relaxed to any binary relation in generalized rough sets theory (see (Yao and Lin
1996) for a review relating rough sets to normal modal
logics). More recently generalization of rough sets using
coverings, corresponding to our complete abstractions, has
been investigated from an algebraic perspective (Cattaneo
and Ciucci 2009; Davvaz and Mahdavipour 2008). A recent
overview of rough set semantics in a modal logic perspective
goes beyond our abstractions by considering any neighborhood system (Liau 2000).
Our main purpose here, though formal, was to investigate
which restriction in abstracting intensional or extensional
A multimodal abstract logic
We consider here a sublattice A = {A1 , . . . } of abstractions
and we define a multimodal logics using the set of modalities
{2A | A ∈ A}. Now, recall that the partial order states that
A2 is more abstract than A1 whenever there exists a projection p such that A2 = p(A1 ). As a consequence, given any
subset of worlds X, we have that p2 (X) = p ◦ p1 (X). As p
is a projection, the following properties hold:
• p1 (X) ⊆ p1 (Y ) ⇒ p2 (X) ⊆ p2 (Y )
105
spaces were of interest in a learning and reasoning perspective, and to propose practical guidelines to incorporate abstraction, in the sense given here, to learning and reasoning
methods in AI. We have here introduced, starting from semantics, modal logics of abstraction, by considering minimal abstractions of worlds. and shown how a multimodal
logic allows to formalize reasoning at various levels of abstractions. An interesting result is that these logics are non
normal when some world may have various minimal abstractions. Furthermore we have defined a new semantics for
these abstract modal logics, i.e. for classical modal logics
with at least EMT4. Because it simply relies on a covering of
the universe, this semantics is more intuitive than the neighborhood semantics of classical modal logics. In the normal
case, i.e. S4, the abstract models introduced here are topological models as proposed by J. C. C. Mc Kinsey and A.
Tarski in the early 40’s (McKinsey and Tarski 1944) (see for
instance (Parikh, Moss, and Steinsvold 2007) for recent developments). Interestingly, a recent line of work started by
Á. Czászár (Császár 2002) introduces a notion of generalized topology exactly defined as our notion of abstraction.
From a practical point of view, we have tools to learn abstract implications and association rules from instances (Soldano et al. 2010), and we are currently investigating supervised abstract concept learning.
Hansen, H. H., and Kupke, C. 2004. A coalgebraic perspective on monotone modal logic. Electronic Notes in Theoretical Computer Science 106:121–143.
Imielinski, T. 1987. Domain abstraction and limited reasoning. In IJCAI, 997–1003.
Liau, C.-J. 2000. An overview of rough set semantics for
modal and quantifier logics. Int. J. Uncertain. Fuzziness
Knowl.-Based Syst. 8(1):93–118.
Lin, S., and Wright, R. 2010. Evolutionary tile coding:
An automated state abstraction algorithm for reinforcement
learning. In AAAI Workshops.
Loscalzo, S., and Wright, R. 2010. Automatic methods for
continuous state space abstraction. In AAAI Workshops.
McKinsey, J. C. C., and Tarski, A. 1944. The algebra of
topology. Annals of Mathematics 45(1):141–191.
Parikh, R.; Moss, L. S.; and Steinsvold, C. 2007. Topology
and Epistemic Logic. Springer Verlag. chapter in M.Aiello,
I.Pratt-Hartmann and J. F. A. K.Van Benthem (eds), Handbook of Spatial Logics, 299–341.
Pernelle, N.; Rousset, M.-C.; Soldano, H.; and Ventos, V.
2002. Zoom: a nested Galois lattices-based system for conceptual clustering. J. of Experimental and Theoretical Artificial Intelligence 2/3(14):157–187.
Saitta, L., and Zucker, J.-D. 2001. A model of abstraction in
visual perception. Applied Artificial Intelligence 15(8):761–
776.
Soldano, H., and Ventos, V. 2011. Abstract Concept Lattices. In Valtchev, P., and Jäschke, R., eds., International
Conference on Formal Concept Analysis (ICFCA), volume
6628 of LNAI, 235–250. Springer, Heidelberg.
Soldano, H.; Ventos, V.; Champesme, M.; and Forge, D.
2010. Incremental construction of alpha lattices and association rules. In Setchi, R.; Jordanov, I.; Howlett, R.; and
Jain, L., eds., Knowledge-Based and Intelligent Information
and Engineering Systems, volume 6277 of Lecture Notes in
Computer Science. Springer Berlin / Heidelberg. 351–360.
Ventos, V., and Soldano, H. 2005. Alpha galois lattices:
An overview. Proceedings of ICFCA’05, Lecture Notes on
Computer Science 3403:298–313.
Yang, F.; Culberson, J.; Holte, R.; Zahavi, U.; and Felner, A.
2008. A general theory of additive state space abstractions.
,JAIR 32:631–662.
Yao, Y. Y., and Lin, T. Y. 1996. Generalization of rough sets
using modal logics. Intelligent Automation and Soft Computing, an International Journal 2:103–120.
Zucker, J.-D. 2003. A grounded theory of abstraction in
artificial intelligence. Philos Trans R Soc Lond B Biol Sci
358(1435):1293–309. 0962-8436 Journal Article.
References
Ben-David, S., and Ben-Eliyahu-Zohary, R. 2000. A modal
logic for subjective default reasoning. Artif. Intell. 116(12):217–236.
Caspard, N., and Monjardet, B. 2003. The lattices of closure
systems, closure operators, and implicational systems on a
finite set: a survey. Discrete Appl. Math. 127(2):241–269.
Cattaneo, G., and Ciucci, D. 2009. Lattices with interior
and closure operators and abstract approximation spaces. T.
Rough Sets 10:67–116.
Chellas, B. F. 1980. Modal Logic: an introduction. Cambridge University Press.
Császár, Á. 2002. Generalized topology, generalized continuity. Acta Mathematica Hungarica 96(4):351–357.
Davvaz, B., and Mahdavipour, M. 2008. Rough approximations in a general approximation space and their fundamental properties. International Journal of General Systems
37(3):373 – 386.
Diday, E., and Emilion, R. 2003. Maximal and stochastic
galois lattices. Discrete Appl. Math. 127(2):271–284.
Erné, M.; Koslowski, J.; Melton, A.; and Strecker, G. E.
1993. A Primer on Galois Connections. Annals of the New
York Academy of Sciences 704(Papers on General Topology
and Applications):103–125.
Esposito, F.; Ferilli, S.; Fanizzi, N.; Basile, T. M. A.; and
Mauro, N. D. 2004. Incremental learning and concept drift
in inthelex. Intell. Data Anal. 8(3):213–237.
Ganter, B., and Wille, R. 1999. Formal Concept Analysis:
Mathematical Foundations. Springer Verlag.
106