Discourse Understanding with
Discourse Representation Theory
and Belief Augmented Frames
Colin Tan,
Department of Computer Science,
School of Computing,
National University of Singapore.
Belief Augmented Frames
Motivation
• Frames
– Flexible, intuitive way of representing
knowledge.
– Frames represent an entity or a concept
– Frames consist of slots with values
• Represents relations between current frame and
other frames.
– Slots have events attached to them
• Can invoke procedures (“daemons”) whenever a
slot’s value is changed, removed etc.
Belief Augmented Frames
Motivation
• In the original definition of frames, slotvalue pairs are “definite”.
• One improvement is to introduce
uncertainties into these relations.
Belief Augmented Frames
Motivation
• Statistical representations are not always ideal:
– If we are p% certain that a fact is true, this doesn’t
mean that we are 1-p% certain that it is false.
• Various uncertainty reasoning methods introduced
to address this:
– Dempster-Schafer Theory
– Transferrable Belief Model
– Probabilistic Argumentation Systems
Belief Augmented Frames
Motivation
• By combining uncertainty measures with
frames:
– Uncertainties in slot-value pair assignments
provide frames with greater expressiveness and
reasoning ability.
– Frames offer a neat intuitive structure for
reasoning uncertain relations.
Modeling Uncertainty in
Belief Augmented Frames
• Uncertainty not only on slot-value pair
assignments, but also on the existence of the
concept/object represented by the frame.
• The belief mass Tf is called the
“Supporting Mass”, and it is the degree of
support that the fact f is true.
• Likewise Ff is the Refuting Mass, and it is
the degree of support that the fact f is false.
Modeling Uncertainty in
Belief Augmented Frames
• In general:
• 0 Tf , Ff 1
• Tf is fully independent of Ff
• Tf + Ff 1
• The Degree of Inclination Dif is defined as:
– Dif = Tf - Ff
• The Plausibility plf is defined as:
– Plf = 1 - Ff
Combining Belief Masses
• Fuzzy-logic style min-max functions are
used to combine belief masses from
different facts.
• Given two facts P and Q:
– Conjunctions
• TPQ = min(TP, TQ)
• FPQ = max(FP, FQ)
Combining Belief Masses
• Given two facts P and Q:
– Disjunctions
• TPQ = max(TP, TQ)
• FP Q = min(FP, FQ)
– Negation
• TP = FP
• FP = TP
Discourse Representation
Structures
• Discourse Representation Theory provides
the techniques and structures for resolving
important discourse processing issues like
anaphoric and ellipses references.
• The main structure in DRT is the Discourse
Representation Structure, or DRS.
Example
• An example DRS representing “Pedro owns
a donkey” is shown below:
– [u1: u2: pedro(u1)
donkey(u2)
owns(u1, u2)]
• The symbols u1 and u2 are known as
referent markers.
Embedded DRSs
• Embedded DRSs are used to model more
complex relations:
– Conditionals: If Pedro owns a donkey he will
beat it.
[u1: u2: [ pedro(u1)
donkey(u2)
owns(u1, u2)] ===> [u3=u1
u4=u2
beats(u3, u4)]]
Embedded DRSs
• Some, Few, Most, All etc are similarly
modeled. E.g.
– Some men who own donkeys love them.
[u1: u2: [ men(u1)
donkey(u2)
own(u1, u2)] =some=> [u3=u1
u4=u2
love(u3, u4)]]
From DRS to BAF
• Conversion from DRS to BAF is trivial:
– All nouns and objects are inserted as new frames in the
BAF:
• New frames for Pedro(u1) and Donkey(u2) are created.
– All relations between nouns and objects in the DRS are
modeled slot-value pairs in the BAF. E.g.
• beats(u1, u2)
• u1 is resolved to Pedro, u2 is resolved to donkey, a slot beats is
created in Pedro and the frame for donkey is assigned to it.
From DRS to BAF
• Uncertainties for slot-value assignments:
– For simple relations (e.g. Pedro owns a
donkey):
• Towns(pedro, donkey) =
• Fowns(pedro, donkey) = 1-
– Here is our degree of belief in the reliability
of the source that told us that Pedro owns a
donkey.
From DRS to BAF
• Alternatively, if person C says that Pedro
doesn’t own a donkey, then:
• Towns(pedro, donkey) =
• Fowns(pedro, donkey) =
– This example illustrates the expressive power
of making T and F separate and fully
independent.
From DRS to BAF
• “Fuzzy” relations like some, most, etc. can
be represented in BAF by using fuzzy-logic
style membership functions.
– E.g. Some boys beat their donkeys
• Let S be the set of boys who beat their donkeys.
– Tbeat(boys, donkeys) = f(|S|)
– Fbeat(boys, donkeys) = 1 - Tbeat(boys, donkeys)
– Here f(.) is a monotonically increasing function
definied in the range [0, 1], similar to the
fuzzy-logic S function.
From DRS to BAF
• Other “fuzzy” notions can also be similarly
expressed:
– All boys beat their donkeys
• Let S be the set of boys who beat their donkeys, and
let D be the set of boys who own donkeys. Then:
• Tbeat(boys, donkeys) = (|S| ==|D|)
• Fbeat(boys, donkeys) = 1 - Tbeat(boys, donkeys)
– Here is a proximity function, defined on [0,
1], that increases as |S| approaches |D|.
Applications
• Several applications of BAFs are currently being
developed:
– Q & A system
• Digests newswire articles and answers questions.
• Most direct application of the topics in today’s talk.
– Text Classification System
• Uses abstraction feature of BAFs to learn the features of
document classes.
• Use these features to classify unseen documents.
• Results are better than Naïve Bayes.
Difficulties
• Semantics of slots is ill-defined
– There is no fixed way to use the slots to
represent relations between frames.
– This complicates the modeling of real-world
English sentences. E.g.
• The black cat stole the purse.
– Should this be modeled as stole(subject: cat,
object: purse), cat(action:stole, object:purse),
purse(action:stolenby, subject:cat)?
Difficulties
• Many ways to derive a-priori Supporting
and Refuting masses.
– Some ways might be better than others.
• Separation of Supporting and Refuting
masses introduces additional problems that
can make modeling awkward and counterintuitive:
– E.g. plsf < Tf , when Tf , Ff > 0.5
Difficulties
• The range for the sum of Tf ,andFf falls in
[0, 2] instead of [0, 1]. This is again
counter-intuitive.
Future Work
• More work to be done in incorporating
linguistic hedges like very and somewhat.
• A model for measuring the quality of the
knowledge in the BAF knowledge base
should be developed.