NON-MONOTONIC REASONING USING DEMPSTER’S

advertisement
From: AAAI-84 Proceedings. Copyright ©1984, AAAI (www.aaai.org). All rights reserved.
NON-MONOTONIC
REASONING
Matthew
Department
USING
St,anford
lahelletl
that
so as to reflect
represent
California
are thought
esi$ting
to combine
to he a natural
deduction,
the arcs
of semantic
detail.
of acceptable
con-
can be used
eflectively
developed
also seems
one in which
to describe
about
higher
and d is the extent
.98 = .02 above).
d, etc.,
so that
Rich
nets
[Rich,
[Quillian,
the deplh
sent.
SEMANTIC
birds
19681 with
of convict,ion
The non-monotonic
represented
factors”
rule
“birds
flyers
and
respectively.
semantic
c + d 5
indicating
fly” would
Since
1, with
complete
thus be
sponds
fib
flyers
knowledge
birds
the certainty
%
factor
birds 11~. Monotonic
flyers,
of .95 indicates
rules have certainty
ostriches
is also written
7
that
factors
95%
[Shafer,
but
of
(2)
ezncfly
that
between
conditional
95%
90%
3
thought
It seems
of all birds
and
98%
to
so c + d = 0 corre-
that
to no knowledge
unreasonable
do.
better
Instead
p(flyer(z)lbird(z))
to believe
sition to be confirmed
to which
of having
the
y (2)
to evaluate
x 5
of an x being
the minimum
z.
probability
reasons.
From
Thus
(3)
z,
the fact
that
of an x being
a z is at
a z is at least
the value of the arc x s
ud) and we have, for example,
z
birds
(1 0)
z
that
-3
(.9
the
a y is a, it follows
of an x not being
Tweety
flyers
.02)
conclusion
5
(.Q
a given propo-
evidence
jsa
and
y
gives rise to the non-monotonic
range as a pair (c n),
by the available
this rcpresenta-
to believe
= .95, we take
we believe
using
probability
Tweety
probabilities
of not as specific
E [.9, .98].
c is the extent
reasoning
an d want
is (ac
fly-much
We will write such a probability
where
simple
least UC. The probability
flyers.
are better
probability
p(flyer(z)jbird(z))
As c + d = 1 corresponds
minimum
that
as
19761 has argued
above
as ranges.
that
have
interval
we have
isa
ad for similar
ostriches
values,
if the probability
[0, l] and therefore
x (iZ)
of 1, as
non-flyers,
by Rich
suppose
(1)
in
Shafer
c _< d; we will always
only
of a probability,
to the interval
To perform
tion,
such as those
we need
equality
at all.
but as
which
3
flyers
(0 1)
[c, C1]is in fact a single point.
they repre-
not as
birds
where
is [c, Z] in general.
levels of
labelling
“certainty
(J for
interval
.02)
NETS
held in the properties
the probability
2
(.9
reasoning”.
19831 h as suggested
it is dis-
(1) and (2) now become
ostriches
I
to which
We will also write
to in the last paragraph
The beliefs
probabil-
methods
(1 -
c, d for 1 -
referred
they
If these
framework
such as “reasoning
1 -
nets be
in the propertics
of as ranges
The
example),
confirmed
in greater
statistical
them.
94305
above
confidence
is investigated
Gclcnccs
ities,
suggestion
Science
University
ABSTRACT
Rich’s
RULE
L. Ginsberg
of Computer
Stanford,
DEMPSTER’S
(-9 in the
126
flyers.
.02)
(4
II
The
DEMPSTER’S
difficulties
ing applications
conclusions.
with
RULE
this scheme
arise
when
differ-
If we have
(?;I
ostriches
with
result
we need is some
way
e.
admit
reasoning.
this
[Dcmpster
situation
exceptions;
conflicting
1968
or Shafer
in depth,
and
what
(0
a.
problem
(or other)
plication
attractive
inferences
and associativity
be able to overcome
(a
in which
are drawn
b) + (0
0) = (a
b).
any attempt
to apply
is critical,
example,
generate
since
another.
0)
that
an arc
The point
E
elephants
(0 1)
rule.
(a
ranges
(a
0) and
the corrcspondin
probabilities
0) =
(a + c 0)
(c
g arcs;
combine
each
in this
UC
is undefined.
has
ditficult
of determining
Such
been
proven
a conflict
If we denote
1) or (1
us to easily retract
lier inference
rule
of
a combination
both
valid
and
in the database.
without
tion
by -,
range
(0
from
cd - bee - ud2
(7)
of an ear-
conclusions
drawn using
AND
METARULES
approach
to non-monotonic
by McCarthy’s
formulation
A labnormall(x)
--+ flies(x)
deduc-
[McCarthy,
19841:
bird(x)
0)
The
from
We might,
*TX).
the conclusion
influencing
RULES
is implied
we will
ostrich(x)
--+ abnormall(x)
ostrich(x)
A labnormal2(x)
effect
of these rules
is an ostrich
invalidate
-+ Tflies(x).
is to have the fact
not
the conclusion
for
can fly, but
Ihe rule which
the
formulation
we want to deactivate
%
(.9
but the rule corresponding
we draw
that
Tweety
that
Tweety
led to that conclusion.
Tweety
(should
the inverse
0),
means.
%
flyers.
(0 1)
here is that such an inference
0) + (c
(5) again.
a non-monotonic
This enables
the ap-
that
z
flyers
(0 0)
it) will have no effect on our eventual
c.
is simply
reasoning--that
d) # (0
pair
Tweety
the
invertible.
III
non-
at all and will result
the arc Tweety
that
we have, for (c
The commu-
The probability
an inapplicable
rule can
(4)
(a b)- (c d) = ( F$--e!;!;dd
by
non-monotonic
of (6) guarantees
to no knowledge
for
result
is no danger
rule to obtain
and as such represents
this difficulty.
corresponds
There
0);
such as (5) will bc invalidated;
to apply
1) + (1
prop-
In many
associative.
of one rule may invalidate
tativity
b.
and
the order
systems,
(1
This
is in
If we denote
has the following
It is commutative
=
1).
of a non-monotonic
the two results
A more efficient
monotonic
d)
(0
certainty.
f. + is (nearly)
other
formulation
d) =
conclusions
by corn bining the
(a b) + (c d) the inference obtained
two inferences
(a 6) and (c d), Dempster’s
rule gives us
This
0) + (c
a non-monotonic
conclusion
it is legitimate
invalid
19763 has dis-
our
case of his investigations.
a special
a logical
of non-monotonic
indicates
such as (4) and (5).
Dempster
(1
no application
of combining
when
of non-monotonic
to combine
l),
1) -t (c
inference.
(4).
rules by their very nature
(0
This allows us to avoid the most computationally
2
flyers,
(0 1)
is typical
This situation
(0
0),
applying
an eventual
%+ flyers,
(0 1)
d) #
(1
that
aspect
in contradiction
fact
(c
outweigh
when
Tweety
cussed
d) f
ever
we obtain
Default
For
(c
implies
of the rule used in (3) lead to different
Tweety
d.
In our
not the arc
flyers
.02)
to
conclusions.
0).
The
indicate
case,
the
no
birds
probability
disbelief
*%
(.9
flyers
(8)
.02)
in
itself.
(independent)
describe
in the usila.1 fashion.
127
In order
to see how
to do this,
the rule (8) in greater
detail.
we need
first t.o
We will think
of a rule as a triple
a is a list of antecedents,
(cz
probability
interval.
antecedents
are satisfied,
The
then the consequent
probability
range
example
p.
An
the best clarification:
intention
list consists
is (isa x flyers),
Thus
ensuing
Returning
to a subset.”
provide
(((isa
then z can
multiple
with confidence
at the same
antecedents,
if the value
(.9
a product
case,
(rule
(rule
should
((isa
Now suppose
we read an article
(11~) and (llb)
Rules
the metarule
article
1))
(12)
is now believed
repeats
cannot
fly. The second,
itself.
If the rule has been
accurate;
rule ensures
that
of the
rule
we will reach
we apply
(9)
PW
ostriches
the reversability
the conclusions
The
of
tially
we have been
ostriches
will remain
the same
t o an ostrich.
considering,
conclusion
(12),
But
the cer-
consider
or not
are true.
in the National
(.9
Enquirer
If I read something
probablity
such
applied
in the Nat,ionnl
and I will believe
.05)
for example,
range
interval
(.92
.07).
as (lob)
that
ensures
when
(llb)
Enquirer,
(.5
.4)
to be true,
appearance.
(a
maximum
extent
RULES
we have described
is probably
range
examples
in applying
weight
given
such as
as a way to assign
b), we should
poten-
we have
of a metarule
Thus
a proba-
a rule with
its conclusion
any other probabilities,
to which
since b is the
the rule may be applicable.
both
(11~)
not
Firstly,
it enables
without
obtaining
being
rules
do need
will
of this idea will require
a list of rules
rule assumes
which
have been
There
reapplying
information.
independence
combined,
multiple
us to mainused
are thrne advantages
us to avoid
new
either
or actito this.
a single
Since
rule
Dcmpster’s
of the probability
estimates
used of a single rule need to be
avoided.
to be true with
Here we really
that
FOR
the
to a rule itself.
v&ted by other rules.
(114
are true.
the story
Now rule (11~)
is believed
the following
W)
can be applied
if we later read
interpretation
Implementation
Articles
rule
best
updating
tain
articles
.4).
Times.
by the Enquirer
of the methods
by 6 before
example:
Newspaper
article
well beyond
The
probability
fly guarantees
whether
power
thus far.
WC will be saved
do not
the
The
(.5
happens
PROBABILITIES
extends
bility
that
is what
to some extent
IV
the rule (9)
so.
In the example
that
applied,
if the rule has not been applied,
the work of doing
tainty
deactivates
and
(11~).
1)).
the rule that
however,
with (116) activating
deactivating
in the New York
is applied
(12’)
Enquirer.
to be true with confidence
important
corroborated
.02))(0
are activated,
.05))
1)).
in the National
and therefore
.4)))
2) (.9
(0
(( (isa x ostriches))
simply
z) (.5
(accurate
.02a).
case, we have the rules
( isa 2 flyers) (.9
(accurate
x newspaper-article))
is (CL b), the
will be (.9a
alone
x birds))
newspaper-article)
((isa x Enquirer-article))
Equally
Dempster’s
(12)
1)).
we have
(( (isa Enquirer-article
.02).
level as the
of (isa x birds)
to (isa x flyers)
to the ostrich
first of these
this as
x y)
As a special
the same article
The
write
a
can
(9)
.OZ)).
(isa x flyers)(O
((isa
WC could
apply
rule which
(rule (a (isa z x) b) c d))
(( (isa x ostriches))
(rule
“Never
with
of the single arc (isa x birds).
is activated
increment
be the metarule,
x birds))
The antecedent
be used).
holds
would
(rule (u (isa z y) 6) c e)(O
The consequent
(for
bc applied
if all of the
will probably
(isa 5 flyers){.9
antecedent
is that
still
rule to a set when there is a corresponding
as
(((isa
rule itself
Better
p) where
and p is a
The rule “if z is a bird,
fly” will be represented
The
c
c is a consequent,
Secondly,
a
be
can be.
128
this approach
activate
a rule.
Returning
pcrlmps
all wc should
enables
to
us to
purtiully
our newspaper
say is that the rule (11~)
de-
example,
is not as
‘l‘he
(rule ((isa
5 IlcwsI)aI)er-n~t,iclc))
(accurate
x)( .9 .05))
(0
of (1 Ib)----note
If we use this rule instead
d i~~nppcars -- an article
believed
to be true with
Bote
also that
of Dempster’s
fore
in tllc
(1 la)
we store
probability
in order
will have no difficulty
invocation
(12’)
will
reversing
we
and
Izave used
the inference
that
(1 la),
advantage
For a rule which
the attention
has probability
range
attention
((isa
( isa x flyers)
a rule will itself need
(.9
If we are maintaining
they
a list of rules
are expected
rule such as (14) can be used to ensure,
system,
that
the inference
fly will be drawn
generally,
an element
to that
(((isa
Mike
stimulating
group,”
think
were the only
(1
(14)
to
any given
about
one in which
them
probabilistic
reasoning
can be
This framework
to describe
general
examining.
would
like to thank
Gencsereth
John McCarthy,
and Sally Greenhalgh
Ben
for many
discussions.
bird
PI
early.
about
McCarthy,
considering
properties
which
are
Quillian,
( isa x z) a) (-5
flyers,
so that
flyers
reproduce
As it stands,
0)).
%
birds
had
(14))
with
y =
the result
and flyers will be somewhat
of applying
weaker.
Sot.
of Bay&an
B, 30 (1968))
“Applications
Infer-
205--247
of Circumscription
Sense Knowledge,”
PI
“Semantic
Processing,
Press,
1968, pp.
Rich,
E. “Default
ing.”
In Proc.
PI Shafer,
x y))
Stat.
Common
M.R.
formation
William
(15)
J.
Formalizing
MIT
“n7hen
“A Generalization
A.P.
J. Roy.
to
to appear
(1984)
a
into the metarule
0)’ this would
and z = flyers.
PI
and
x y)
(rule ((isa
to birds
them
can be focussed.
and useful
ence.”
for any forward-
(isa 2 y))
(15)
the rules
fly,
to be useful,
that
we can translate,
of a group,
value
and
by including
assigning
of the type WC have been
The author
Grosof,
.02))( .5 0)).
a high level of activation
which
with
More
by allowing
both
Reasoning
to be a promising
PI Dempster,
x birds))
can probably
by
easily,
REFERENCES
the levels
birds
b),
the rule is likely
x birds))
chaining
and
own.
have
be of any use).
truth
of
(u
on the fact that birds
(rule
can be obtainctl
and attention
knowledge
of this
To focus
If birds
seems
rules
of their
discussed,
them.
n-ill1
described
counterparts.
thcmsclves to be treat t:d as arcs,
other
ire dealing
be
their monotonic
power
in sewhich
ACKNOWLEDGKMENTS
unexplored)
to be nscflrl.
(((isa
can
to arcs
of’ the pi-oblcnls
after a sub-
to which
unique
we
with
Further
rah:;es
sonic
11~: c~;countcrcrl
infcrcnccs
of (13).
of u as the extent
(Such
otjherwisc
weights
(13) be-
of conlidcnrc
sccn~s to solve
mesh neatly
within
invertability
we can think
we might
be
(.54 .03).
range
is that it may allow us to focus
the system.
(13)
now
this result--provided
that
A final (but currently
approach
that
that we need not apply
to obtain
would
assignnicnt
nets
Non-monotonic
4.
Enqllircr
the commutativity
rule mean
the information
sequent
Natioin~l
mantic.
Reasoning
Inc.,
G. A Mathematical
Princeton
in Semantic
M. Minsky.
In-
Cambridge:
216-270.
AAAI-83.
Kaufmann,
Princeton:
Memory,”
ed.
as Likelihood
Los
Altos,
1983, pp.
Theory
University
ReasonCalifornia:
348-351
of Evidence.
Press,
1976
Download