FrameNet Meets the Semantic Web

advertisement
FrameNet Meets the Semantic Web
Srini Narayanan
Charles Fillmore
Collin Baker
Miriam Petruck
Outline of Presentation
•
•
•
•
•
Semantic Frames and the FrameNet Project
Status of FrameNet Data and Software
Details on the FrameNet process
Comparison to other ontologies/resources
Afternoon session: Going through the
annotation process demo.
The FrameNet Project
• Phase I (NSF, 1997-2000)
– ICSI, U-Colorado
– Conceptual basis, used existing tools, and perl
• Phase II (NSF, 2000-2003)
– ICSI, U-Colorado, SRI, SDSU
– Scaling up, uses SQL database and Java-based
in house tools. Pilot applications developed.
The FrameNet Project
C Fillmore PI (ICSI)
Co-PI’s:
S Narayanan (ICSI, SRI)
D Jurafsky (U Colorado)
J M Gawron (San Diego State U)
Staff:
C Baker Project Manager
B Cronin Programmer
C Wooters Database Designer
Applications
An important goal of our work is to present
information about the words in a form that will
prove usable in various NLP applications:
1. Question Answering (Berkeley, Colorado)
2. Semantic Extraction (Berkeley, SRI, Colorado)
3. Machine Translation (San Diego State)
Frames and Understanding
• Hypothesis: People understand things by
performing mental operations on what they
already know. Such knowledge is
describable in terms of information packets
called frames.
FrameNet in the Larger Context
• The long-term goal is to reason about the world in
a way that humans understand and agree with.
• Such a system requires a knowledge
representation that includes the level of frames.
• FrameNet can provide such knowledge for a
number of domains.
• FrameNet representations complement ontologies
and lexicons.
The core work of FrameNet
1.
2.
3.
4.
5.
6.
characterize frames
find words that fit the frames
develop descriptive terminology
extract sample sentences
annotate selected examples
derive "valence" descriptions
Lexicon Building
• We study words,
• describe the frames or conceptual structures
which underlie them,
• examine sentences that contain them
(from a vast corpus of written English),
• and record the ways in which information
from the associated frames are expressed in
these sentences.
The Core Data
The basic data on which FrameNet
descriptions are based take the form of a
collection of annotated sentences, each
coded for the combinatorial properties of
one word in it. The annotation is done
manually, but several steps are computerassisted.
The Process
• Sentences containing a given word are extracted
from the corpus and made available for
annotation.
• Student annotators select the phrases that identify
particular semantic roles in the sentences, and tag
them with the name of these roles.
• Automatic processes then provide grammatical
information about the tagged phrases.
SAMPLE ANNOTATIONS
Types of Words / Frames
o
o
o
o
o
o
o
events
artifacts, built objects
natural kinds, parts and aggregates
terrain features
institutions, belief systems, practices
space, time, location, motion
etc.
Event Frames
Event frames have temporal structure, and
generally have constraints on what precedes
them, what happens during them, and what
state the world is in once the event has been
completed.
Sample Event Frame:
Commercial Transaction
Initial state:
Vendor has Goods, wants Money
Customer wants Goods, has Money
Transition:
Vendor transmits Goods to Customer
Customer transmits Money to Vendor
Final state:
Vendor has Money
Customer has Goods
Sample Event Frame:
Commercial Transaction
Initial state:
Vendor has Goods, wants Money
Customer wants Goods, has Money
Transition:
Vendor transmits Goods to Customer
Customer transmits Money to Vendor
Final state:
Vendor has Money
Customer has Goods
(It’s a bit more complicated than that.)
Partial Wordlist for Commercial
Transactions
Verbs:
pay, spend, cost, buy, sell, charge
Nouns:
cost, price, payment
Adjectives: expensive, cheap
Meaning and Syntax
 The various verbs that evoke this frame
introduce the elements of the frame in
different ways.
 The identities of the buyer, seller, goods and
money
 Information expressed in sentences
containing these verbs occurs in different
places in the sentence depending on the
verb.
She bought some carrots from the greengrocer for a dollar.
Customer
Vendor
from
BUY
Goods
for
Money
She paid a dollar to the greengrocer for some carrots.
Customer
to
Vendor
PAY
Goods
for
Money
She paid the greengrocer a dollar for the carrots.
Customer
Vendor
PAY
Goods
for
Money
She spent a dollar on the carrots.
Customer
Vendor
SPEND
Goods
on
Money
The greengrocer sold some carrots to her for a dollar.
Customer
to
Vendor
SELL
Goods
for
Money
The greengrocer sold her some carrots for a dollar.
Customer
Vendor
SELL
Goods
for
Money
The greengrocer charged a dollar for a bunch of carrots.
Customer
Vendor
CHARGE
Goods for
Money
The greengrocer charged her a dollar for the carrots.
Customer
Vendor
CHARGE
Goods
for
Money
A bunch of carrots costs a dollar.
Customer
Vendor
COST
Goods
Money
A bunch of carrots cost her a dollar.
Customer
Vendor
COST
Goods
Money
It costs a dollar to ride the bus.
Customer
IT
Vendor
COST
Goods
to do X
Money
It cost me a dollar to ride the bus.
Customer
IT
Vendor
COST
Goods
to do X
Money
FrameNet Product
• For every target word,
• describe the frames or conceptual structures
which underlie them,
• and annotate example sentences that cover
the ways in which information from the
associated frames are expressed in these
sentences.
FN work: characterizing frames
• One of the things we do is characterize such
information packets - beginning with
informal descriptions.
• We can begin with Revenge.
The Revenge frame
The Revenge frame involves a situation in
which
a) A has done something to harm B and
b) B takes action to harm A in turn
c) B's action is carried out independently of
any legal or other institutional setting
FN work: finding words in frame
• We look for words in the language that
bring to mind the individual frames.
• We say that the words evoke the frames.
Vocabulary for Revenge
• Nouns: revenge, vengeance, reprisal,
retaliation
• Verbs: avenge, retaliate, revenge, get back
(at), get even (with), pay back
• Adjectives: vengeful, vindictive
FN work: choosing FE names
• We develop a descriptive vocabulary for the
components of each frame, called frame
elements (FEs).
• We use FE names in labeling the
constituents of sentences exhibiting the
frame.
FEs for Revenge
• Frame Definition: Because of some injury
to something or someone important to an
avenger, the avenger inflicts a punishment
on the offender. The offender is the person
responsible for the injury. The
injured_party may or may not be the same
individual as the avenger.
• FE List: avenger, offender, injury,
injured_party, punishment.
FN work: collecting examples
• We extract from our corpus examples of
sentences showing the uses of each word in
the frame.
Obviously we need to conduct a more regimented search,
grouping examples with related structures.
Examples of simple use are swamped by the idiomatic phrase
"with a vengeance".
FN work: annotating examples
• We select sentences exhibiting common
collocations and showing all major
syntactic contexts.
• Using the names assigned to FEs in the
frame, we label the constituents of
sentences that express these FEs.
FN work: summarizing results
• Automatic processes summarize the results,
linking FEs with information about their
grammatical realization.
• The output is presented in the form of
various reports in the public website, in
XML format in the data release.
I avenged my brother.
I avenged his death.
Querying the data: meaning to form
Through various viewers built on the FN
database we can, for example, ask how
particular FEs get expressed in sentences
evoking a given frame.
By what syntactic means is offender
realized?
• Sometimes as direct object:
we'll pay you back for that
• Sometimes with the preposition on
they'll take vengeance on you
• Sometimes with against
we'll retaliate against them
• Sometimes with with
she got even with me
• Sometimes with at
they got back at you
By what syntactic means is offender
realized?
• Sometimes as direct object:
we'll pay you back for that
• Sometimes with the preposition on
they'll take vengeance on you
• Sometimes with against
we'll retaliate against them
It's these word-by-word
• Sometimes with with
she got even with me specializations in
FE-marking that make
• Sometimes with at
automatic FE recognition
they got back at you
difficult.
Querying the data: form to meaning
Or, going from the grammar to the meaning,
we can choose particular grammatical
contexts and ask which FEs get expressed in
them.
What FE is expressed by the object of
avenge?
• Sometimes it's the injured_party
I've got to avenge my brother
• .Sometimes it's the injury
My life goal is to avenge my brother's
murder.
Evaluation
• Lexical coverage. We want to get all of the
important words associated with each
frame.
• Combinatorics. We want to get all of the
syntactic patterns in which each word
functions to express the frame.
Evaluation
• We do not ourselves collect frequency data.
That will wait until methods of automatic
tagging get perfected.
• In any case, the results will differ according
to the type of corpus - financial news,
children's literature, technical manuals, etc.
What do we end up with?
• Frames
• Lexical entries
• Annotations
Sample from frames list
Creating, Crime_scenario, Criminal_investigation,
Criminal_process, Cure. Custom, Damaging,
Dead_or_alive, Death, Deciding,
Deny_permission, Departing, Desirability,
Desiring, Destroying, Detaining, Differentiation,
Difficulty, Dimension, Direction, Dispersal,
Documents, Domain, Duplication, Duration,
Eclipse, Education_teaching,Emanating, Emitting,
Emotion_active, Emotion_directed, Emotion_heat,
Employing, Employment, Emptying, Encoding,
Endangering, Entering_of_plea, Entity, Escaping,
Evading. Evaluation, Evidence, Excreting,
Sample from lexical unit list
• * augmentation.N
(Expansion)
• * augur.V (Omen)
• * August.N (Calendric_unit)
• * aunt.N (Kinship)
• * auntie.N (Kinship)
• * austere.A (Frugality)
• * austerity.N (Frugality)
• * author.V (Text_creation)
• * authoritarian.A (Strictness)
• * authorization.N
(Documents)
• * autobahn.N (Roadways)
• * autobiography.N (Text)
• * automobile.N (Vehicle)
• * autumn.N (Calendric_unit)
• * avalanche.N (Quantity)
• * avenge.V (Revenge)
• * avenger.N (Revenge)
• * avenue.N (Roadways)
• * aver.V (Statement)
Added Value: frame relatedness
• We have ways of linking frames to each
other, through relations of
– inheritance
– subframe
– "using"
• We would like to explore how our frame
relationships can be mapped onto
ontological relations.
Frame-to-frame relations
• Revenge inherits Punishment/Reward
• Revenge uses the Hostile_encounter frame
• (see existing tentative frame hierarchy)
Added Value: semantic types
• We also have the means of adding semantic types
to words, frames and frame elements.
• Some of these:
– negative vs. positive
(disaster vs. bonanza),
– punctual vs. stative
(arrive vs. reside),
– artifact vs. natural kind
(building vs. tree).
Added Value: semantic types
• For the kinds of nouns that occupy particular FE
slots in given frames, we should be able to use the
WordNet noun taxonomies.
• This is done in some related work
Added Value: support verbs
• In the case of the event nouns, we keep track of
which verbs can combine with which nouns to
signal occurrences of the frame evoked by the
noun.
–
–
–
–
–
take a bath (bathe)
have an argument (argue)
wreak vengeance,
take revenge,
exact retribution.
Can annotation be automated?
Gildea, D & D Jurafsky, 2000, Automatic
labeling of semantic roles, Association for
Computational Linguistics, Hong Kong.
Mohit & Narayanan, 2003, Semantic
Extraction using Wide-coverage lexical
resources, HLT-NAACL 2003.
The Database
The information collected from the data
(and a certain amount of information
inserted manually by the lexicographers) is
stored in a MySQL database.
Current Status
• Current: 7700 Lexical Units
–
–
–
–
FN1: 1600 Lexical units
FN2: 4400 Lexical Units
Created (not yet annotated): 1280 LU
Other : in process, problems, etc.
Current Status
• 500 Frames
• 7700 Lexical Units
• 130,000 Annotated sentences
Data Distribution
Distributed as XML files with accompanying DTDs
Separate files and DTDs for
– Frame and FE data
– –Annotation data
– Frame relation data
• Easy to parse with standard XML tools.
– Approximately 100 research groups have been
authorized to download release 1.0 of the FN data
(Oct., 2002).
• Next release scheduled for August, 2003
FrameNet Software Distribution
• All software is pure Java, and can be run on any platform
for which a JVM is available
• Has been successfully run on Solaris, Linux, Mac OS X,
and Windows 9x/2000 with very minor modifications
• Server and clients currently being used in Barcelona for
annotation in Spanish FN.
• We will streamline the installation process if demand
warrants
• We plan to publicly release the full software suite in
August, 2003.
Multi-Lingual FrameNets
• Spanish FrameNet
– Prof. Carlos Subirats, U A Barcelona
– Parallel to English FrameNet, using same frames
• German FrameNet
– Prof. Manfred Pinkal, U Saarlandes
– Complete annotation of existing parsed corpus,
– using English frames where possible
• Japanese FrameNet
– Prof. Kyoko Ohara, Keio U
– Collecting own corpus, building search tools
Some Comparisons
Is FN an ontology?
• Not exactly, but some users use FN frames
as an ontology of event types.
Is FN a thesaurus?
Yes, because it groups words into meaning
categories, by way of shared membership in
frames.
How is FN different from WN?
FN does not explicitly display semantic
relations between words of the sort found in
WordNet. (synonymy, antonymy, hyponymy,
meronymy, etc.)
Furthermore, FN includes many opposing
pairs (hot, cold; tall, short) in the same
frame.
Are FN annotations a treebank?
• FrameNet accumulates annotations, but FN
annotations are mainly sentences in which
only one word is analyzed thoroughly.
• Unlike existing treebanks, e.g., U Penn's
PropBank, FN has a richer semantics.
Comparison with Dictionaries
American Heritage Dictionary
• avenge v.
1. To inflict a
punishment or penalty
in return for; revenge
2. To take vengeance
on behalf of
• revenge v.
1. To inflict
punishment in return
for (injury or insult)
2. To seek or take
vengeance for (oneself
or another person);
avenge
American Heritage Dictionary
• avenge v.
1. To inflict a
punishment or penalty
in return for; revenge
2. To take vengeance
on behalf of
• revenge v.
1. To inflict
punishment in return
for (injury or insult)
2. To seek or take
vengeance for (oneself
or another person);
avengeprepositionally;
The FEs of the direct objects are expressed
"in return for" marks the injury; "for" or "on behalf of" marks
the injured_party.
American Heritage Dictionary
• avenge v.
1. To inflict a
punishment or penalty
in return for [ ];
revenge
2. To take vengeance
on behalf of [ ]
• revenge v.
1. To inflict
punishment in return
for (injury or insult)
2. To seek or take
vengeance for (oneself
or another person);
avenge
revenge definer added qualifications
on the missing
argument, avenge definer didn't.
American Heritage Dictionary
• avenge v.
1. To inflict a
punishment or penalty
in return for; revenge
2. To take vengeance
on behalf of
• revenge v.
1. To inflict
punishment in return
for (injury or insult)
2. To seek or take
vengeance for (oneself
or another person);
avenge
avenge definer claims avenge and
revenge are
synonym in sense 1; the revenge definer claims avenge
and revenge are synonyms in sense 2.
American Heritage Dictionary
• avenge v.
1. To inflict a
punishment or penalty
in return for; revenge
2. To take vengeance
on behalf of
• revenge v.
1. To inflict
punishment in return
for (injury or insult)
2. To seek or take
vengeance for (oneself
or another person);
avenge
revenge definer included "seek vengeance", not supported
by FN examples.
American Heritage Dictionary
• avenge v.
1. To inflict a
punishment or penalty
in return for; revenge
2. To take vengeance
on behalf of
• revenge v.
1. To inflict
punishment in return
for (injury or insult)
2. To seek or take
vengeance for (oneself
or another person);
avenge
Both definers include "take vengeance" in their definitions, as
if that's more transparent than the simple verb.
Comparison with WordNet
We make fewer distinctions.
1. revenge, avenge, retaliate -- (take revenge for a
perceived wrong; "He wants to avenge the murder
of his brother")
2. retaliate, strike back -- (make a counterattack and
return like for like, esp. evil for evil; "The Empire
strikes back"; "The Giants struck back and won
the opener"; "The Israeli army retaliated for the
Hamas bombing")
We make fewer distinctions.
1. revenge, avenge, retaliate -- (take revenge for a
perceived wrong; "He wants to avenge the murder
of his brother")
2. retaliate, strike back -- (make a counterattack and
return like for like, esp. evil for evil; "The Empire
strikes back"; "The Giants struck back and won
the to
opener";
"The
army
retaliated two
for the
Hard
figure out
whatIsraeli
motivates
distinguishing
senses;
personal
institutional?
Hamasvs.bombing")
We make fewer distinctions.
1. revenge, avenge, retaliate -- (take revenge for a
perceived wrong; "He wants to avenge the murder of
his brother")
2. retaliate, strike back -- (make a counterattack and
return like for like, esp. evil for evil; "The Empire
strikes back"; "The Giants struck back and won the
Like opener
FrameNet,
theseIsraeli
entriesarmy
include
Definitions
and Hamas
Examples.
"; "The
retaliated
for the
FrameNet
limits
bombing
") its examples to attested sentences from a Corpus.
FN has more detailed syntax.
revenge, avenge, retaliate -- (take revenge for a
perceived wrong; "He wants to avenge the murder of
his brother")
*> Somebody ----s something
retaliate, strike back -- (make a counterattack and return
like for like, esp. evil for evil; "The Israeli army
retaliated for the Hamas bombing")
*> Somebody ----s
The WN
sentence templates
*> Somebody
----s PPare impoverished structurally
and do not indicate the semantic roles. In fact, retaliate is
wrongly described as taking a simple object.
FN has more detailed syntax.
revenge, avenge, retaliate -- (take revenge for a
perceived wrong; "He wants to avenge the murder of
his brother")
*> Somebody ----s something
retaliate, strike back -- (make a counterattack and return
like for like, esp. evil for evil; "The Israeli army
retaliated for the Hamas bombing")
*> Somebody ----s
The*>
identity
of the P----s
in PP
Somebody
PPis important: strike back at
marks the offender, as does retaliate against; retaliate
for marks the injury.
FN has more detailed syntax.
revenge, avenge, retaliate -- (take revenge for a
perceived wrong; "He wants to avenge the murder of
his brother")
*> Somebody ----s something
retaliate, strike back -- (make a counterattack and return
like for like, esp. evil for evil; "The Israeli army
retaliated for the Hamas bombing")
*> Somebody ----s
Where WordNet merely shows that the words in the
*>second
Somebody
synset can----s
occurPP
intransitively, FN would say
something about the anaphoric nature of the omitted
offender.
Comparison with ontologies
Switching frames
• Revenge is a simple frame, but neither
SUMO nor OpenCYC seem to have any
conceptual link to it.
• A particular family of frames that we have
concentrated on are those that make up the
steps and institutions of Criminal_process.
Complex Frames
• With Criminal_process we have, for
example,
– sub-frame relations (one frame is a component
of a larger more abstract frame) and
– temporal relations (one process precedes
another)
Inferencing
• These are the frames with which we are
trying to set up inferencing rules for texts
about crime reports. (Details in the
presentation later.)
In SUMO
• SUMO (Adam Pease) deals with only the upper
ontology, and moves toward our frame along this
path, stopping at legal action.
– entity
– process
– intentional process
–
social interaction
–
contest
–
legal action
In OpenCYC: ArrestingSomeone
ArrestingSomeone: "A specialization of
Social Occurrence and CapturingAnimal. In
each instances of ArrestingSomeone a law
enforcement officer arrests another person,
who is then taken into custody. See the
related constant #$HeldCaptive."
Trial
comment : [[Def]] "The subcollection of
#$LegalConflict events whose instances are
heard and decided by a court and are
officiated by a #$Judge."
requiredActorSlots : [[Mon]] plaintiffs
[[Mon]] defendants
Legal activities
comment : [[Def]] "The collection of all events
performed with the purpose of enforcing laws, that
are performed by people officially charged with
this this duty. Includes most activities of law
enforcement officials (such as police) including
detection of crime, identification of offenders, and
arrests."
LawEnforcementOfficer
comment : [[Def]] "An instance of
PersonTypeByOccupation, and a specialization of
PersonWithOccupation. Each instance of
LawEnforcementOfficer is a person whose job is to detect,
stop, and/or punish people engaged in illegal activities. The
collection LawEnforcementOfficer includes members of
local, state, and special police (e.g., transit police) forces, as
well as federal agents (e.g., members of border patrols,
national security agents). Consequently, a given instance of
Law EnforcementOfficer typically also belongs to one of
the following collections: #$StateEmployee,
#$LocalGovernment Employee, or
NationalGovernmentEmployee (see Public
SectorEmployee)."
FrameNet for Applications
• Semantic Web (http://www.semanticweb.org)
– FN database in DAML+OIL
(http://www.ai.sri.com/~narayana/frame-desc.daml)
• Semantic Extraction using FrameNet
• Frame Simulation and Inference
– Translation from frame structure to a simulation based
inference tool (KarmaSIM)
• (COLING 2002)
Talk Outline
• FrameNet
• A DAML + OIL Representation of
FrameNet
• An Example: Encoding the Criminal
Process Frame
• Web Applications of FrameNet.
• Summary and Future Work
Semantic Web
• The World Wide Web (WWW) contains a large
and expanding information base.
• HTML is accessible to humans but does not
formally describe data in a machine interpretable
form.
• XML remedies this by allowing for the use of tags
to describe data (ex. disambiguating crawl)
• Ontologies are useful to describe objects and their
inter-relationships.
• DAML+OIL (http://www.daml.org) is an markup
language based on XML and RDF that is
grounded in description logic and is designed to
allow for ontology development, transfer, and use
on the web.
FrameNet Entities and Relations
• Frames
– Background
– Lexical
• Frame Elements (Roles)
• Binding Constraints
– Identify
• ISA(x:Frame, y:Frame)
• SubframeOf (x:Frame, y:Frame)
• Subframe Ordering
– precedes
• Annotation
A DAML+OIL Frame Class
<daml:Class rdf:ID="Frame">
<rdfs:comment> The most general class </rdfs:comment>
<daml:unionOf rdf:parseType="daml:collection">
<daml:Class rdf:about="#BackgroundFrame"/>
<daml:Class rdf:about="#LexicalFrame"/>
</daml:unionOf>
</daml:Class>
<daml:ObjectProperty rdf:ID="Name">
<rdfs:domain rdf:resource="#Frame"/>
<rdfs:range rdf:resource="&rdf-schema;#Literal"/>
</daml:ObjectProperty>
DAML+OIL Frame Element
<daml:ObjectProperty rdf:ID= "role">
<rdfs:domain rdf:resource="#Frame"/>
<rdfs:range rdf:resource="&daml;#Thing"/>
</daml:ObjectProperty>
</daml:ObjectProperty>
<daml:ObjectProperty rdf:ID="frameElement">
<daml:samePropertyAs rdf:resource="#role"/>
</daml:ObjectProperty>
<daml:ObjectProperty rdf:ID="FE">
<daml:samePropertyAs rdf:resource="#role"/>
</daml:ObjectProperty>
FE Binding Relation
<daml:ObjectProperty rdf:ID="bindingRelation">
<rdf:comment> See http://www.daml.org/services </rdf:comment>
<rdfs:domain rdf:resource="#Role"/>
<rdfs:range rdf:resource="#Role"/>
</daml:ObjectProperty>
<daml:ObjectProperty rdf:ID="identify">
<rdfs:subPropertyOf rdf:resource="#bindingRelation"/>
<rdfs:domain rdf:resource="#Role"/>
<daml-s:sameValuesAs rdf:resource="#rdfs:range"/>
</daml:ObjectProperty>
Subframes and Ordering
<daml:ObjectProperty rdf:ID="subFrameOf">
<rdfs:domain rdf:resource="#Frame"/>
<rdfs:range rdf:resource="#Frame"/>
</daml:ObjectProperty>
<daml:ObjectProperty rdf:ID="precedes">
<rdfs:domain rdf:resource="#Frame"/>
<rdfs:range rdf:resource="#Frame"/>
</daml:ObjectProperty>
Talk Outline
• FrameNet
• A DAML + OIL Representation of
FrameNet
• An Example: Encoding the Criminal
Process Frame
• Applications of FrameNet.
• Summary and Future Work
The Criminal Process Frame
Frame Element
Description
Court
Defendant
The court where the
process takes place
The charged individual
Judge
The presiding Judge
Prosecution
FE indentifies the
attorneys’ prosecuting the
defendant
Attorneys’ defending the
defendant
Defense
The Criminal Process Frame in
DAML+OIL
<daml:Class rdf:ID="CriminalProcess">
<daml:subClassOf rdf:resource="#BackgroundFrame"/>
</daml:Class>
<daml:Class rdf:ID="CP">
<daml:sameClassAs rdf:resource="#CriminalProcess"/>
</daml:Class>
DAML+OIL Representation of the
Criminal Process Frame Elements
<daml:ObjectProperty rdf:ID="court">
<daml:subPropertyOf rdf:resource="#FE"/>
<daml:domain rdf:resource="#CriminalProcess"/>
<daml:range rdf:resource="&CYC;#Court-Judicial"/>
</daml:ObjectProperty>
<daml:ObjectProperty rdf:ID="defense">
<daml:subPropertyOf rdf:resource="#FE"/>
<daml:domain rdf:resource="#CriminalProcess"/>
<daml:range rdf:resource="&SRI-IE;#Lawyer"/>
</daml:ObjectProperty>
FE Binding Constraints
<daml:ObjectProperty rdf:ID="prosecutionConstraint">
<daml:subPropertyOf rdf:resource="#identify"/>
<daml:domain rdf:resource="#CP.prosecution"/>
<daml-s:sameValuesAs rdf:resource="#Trial.prosecution"/>
</daml:ObjectProperty>
• The idenfication contraints can be between
• Frames and Subframe FE’s.
• Between Subframe FE’s
• DAML does not support the dot notation for paths.
Criminal Process Subframes
<daml:Class rdf:ID="Arrest">
<rdfs:comment> A subframe </rdfs:comment>
<rdfs:subClassOf rdf:resource="#LexicalFrame"/>
</daml:Class>
<daml:Class rdf:ID="Arraignment">
<rdfs:comment> A subframe </rdfs:comment>
<rdfs:subClassOf rdf:resource="#LexicalFrame"/>
</daml:Class>
<daml:ObjectProperty rdf:ID="arraignSubFrame">
<rdfs:subPropertyOf rdf:resource="#subFrameOf"/>
<rdfs:domain rdf:resource="#CP"/>
<rdfs:range rdf:resource="#Arraignment"/>
</daml:ObjectProperty>
Specifying Subframe Ordering
<daml:Class rdf:about="#Arrest">
<daml:subClassOf>
<daml:Restriction>
<daml:onProperty rdf:resource="#precedes"/>
<daml:hasClass rdf:resource="#Arraignment"/>
</daml:Restriction>
</daml:subClassOf>
</daml:Class>
DAML+OIL CP Annotations
<fn:Annotation>
<tpos> "36352897" </tpos>
<frame rdf:about ="&fn;Arrest">
<time> In July last year </time>
<authorities> a German border guard </authorities>
<target> apprehended </target>
<suspect>
two Irishmen with Kalashnikov assault rifles.
</suspect>
</frame>
</fn:Annotation>
Current Status of DAML Encoding
• All FrameNet 1 data is available in DAML+OIL
– annotations
– frame descriptions.
• The translator has also been updated to handle the
more complex semantic relations (both frame and
frame element based) in FrameNet 2.
• We plan to release both the XML and the
DAML+OIL versions of all FrameNet 2 releases.
Talk Outline
• FrameNet
• A DAML + OIL Representation of
FrameNet
• An Example: Encoding the Criminal
Process Frame
• Applications of FrameNet.
• Summary and Future Work
FrameNet for Applications
• Semantic Web (http://www.semanticweb.org)
– FN database in DAML+OIL
(http://www.ai.sri.com/~narayana/frame-desc.daml)
• Semantic Extraction using FrameNet
• Or can FrameNet be automated
• Frame Simulation and Inference
– Translation from frame structure to a simulation based
inference tool (KarmaSIM)
• (COLING 2002)
Semantic Extraction
• Behrang Mohit and Srini Narayanan
– HLT-NAACL 2003.
Enhancing IE Techniques
• IE techniques currently use no inference (mostly!)
– Robert Pickett was charged with felony possession of a
handgun and sentenced to 5 years in a federal prison.
• Says Pickett was arrested
• Frame-based inferences can be useful for a variety
of applications including individual/topic tracking,
bridging inferences/co-reference resolution.
• FrameNet subframe structure and bindings can be
exploited for this purpose.
A Simulation Semantics for
Inference
• Frame Structure and bindings specify parameters
for a simulation/enactment of the event
• Based on previous work (IJCAI 99, AAAI 99, CogSci 2000,
COLING 2002, WWW 2002)
– using an “X-schema” based representation, we simulate
the temporal and inferential structure of the FrameElement and Frame/Subframe relations from FrameNet.
– Direct translation from both the mySQL FN database
and the DAML+OIL representation
Reasoning about Events for NL
applications (QA, NLU)
• Reasoning about dynamics
– Complex event structure
• Multiple stages, interruptions, resources, framing
– Evolving events
• Conditional events, presuppositions.
– Nested temporal and aspectual references
• Past, future event references
– Metaphoric references
• Use of motion domain to describe complex events.
• Reasoning with Uncertainty
– Combining Evidence from Multiple, unreliable sources
– Non-monotonic inference
• Retracting previous assertions
• Conditioning on partial evidence
Previous work
• Models of event structure that are able to deal with the
temporal and aspectual structure of events
• Models frame-based and metaphoric inference about event
structure.
• Based on an active semantics of events and a factorized
graphical model of complex states.
– Models event stages, embedding, multi-level perspectives and
coordination.
– Event model based on a Stochastic Petri Net representation with
extensions allowing hierarchical decomposition.
– State is represented as a Temporal Bayes Net (T(D)BN).
– The Event-State representation requires branching time bayes nets
with synchronization or Coordinated Bayes Nets (CBN)
States
• Factorized Representation of State uses
Dynamic Belief Nets (DBN’s)
– Probabilistic Semantics
– Structured Representation
States and Domain Knowledge
• Factorized
Representation
using Dynamic
Belief Nets
(DBN’s)
– Probabilistic
Semantics
– Structured
Representation
Active Event Representations
• Actions and events are coded in active representations
called x-schemas which are extensions to Stochastic Petri
nets.
• x-schemas are fine-grained and can be used for monitoring
and control as well as for inference.
• Badler’s (U Penn) group uses same idea for commanding
simulated robots (Jack). Nils Nilsson (SU) uses a similar
idea for robot planning called Teleo-Reactive programs.
• Semantic basis for DAML-S, process descriptions of the
Semantic Web
Compositional Primitives
process
atomic
process
inputs
(conditional) outputs
preconditions
(conditional) effects
composite
process
composedBy
control
constructs
sequence
If-then-else
fork
while
...
Sequence: P1;P2
start
Ready
finish
Atomic
Process
P1
Done(P1)
Atomic
Process
P2
Done(P1;P2)
Fork: P1|| P2
Done(P1 || P2)
start
finish
Ready(P1)
Atomic
Process
P1
Ready(P2)
Atomic
Process
P2
Concurrent-Sync
start
finish
Ready(P1)
Atomic
Process
P1
Done(P1)
Ready(P2)
Atomic
Process
P2
Done(P2)
Implementation
DAML-S translation to the modeling environment KarmaSIM [Narayanan, 97]
(http://www.icsi.berkeley.edu/~snarayan)
Basic Program:
Input: DAML-S description of Frame relations
Output: Network Description of Frames in KarmaSIM
Procedure:
• Recursively construct a sub-network for each control construct.
Bottom out at atomic frame.
• Construct a net for each atomic frame
• Return network
A Precise Notion of Contingency
Relations
Activation:
Executing one schema causes the enabling, start or continued execution
of another schema. Concurrent and sequential activation.
Inhibition:
Inhibitory links prevent execution of the inhibited x-schema by
activating an inhibitory arc. The model distinguishes between concurrent
and sequential inhibition, mutual inhibition and aperiodicity.
Modification:
The modifying x-schema results in control transition of the modified
xschema. The execution of the modifying x-schema could result in the
interruption, termination, resumption of the modified x-schema.
Results of Model
• Captures fine grained distinctions needed for
interpretation
– Frame-based Inferences (COLING02)
– Aspectual Inferences (Cogsci98, CogSci01, IJCAI 99,
CL03)
– Metaphoric Inferences (AAAI99)
– Biological Evidence (CogSci03, BL03)
• Sufficient Inductive bias for verb learning
(Bailey97, CogSci99), construction learning (Chang03,
to Appear)
• Model for DAML-S (ISWC02, WWW02, Computer
Networks 03)
Distributed OPErational
(DOPE) Semantics
Maps Situation Calculus action axiomatization to CBN Formalism
[Narayanan 99, NM2002, NM2003]
Features of CBN representation
 Can deal with quantitative information & resources
 Natural representation of stochastic actions (selection and effects)
 Variety of well established analysis and simulation techniques including
mappings to other logics of change.
 Natural representation of change, concurrency, and synchronization
 Execution semantics
Problems with T(D)BN
• Scaling up to relational structures
• Supports linear (sequence) but not
branching (concurrency, coordination)
dynamics
Structured
Probabilistic Inference
Probabilistic inference for Events
– Filtering
• P(X_t | o_1…t,X_1…t)
• Update the state based on the observation sequence and state set
– MAP Estimation
• Argmaxh1…hnP(X_t | o_1…t, X_1…t)
• Return the best assignment of values to the hypothesis variables given
the observation and states
– Smoothing
• P(X_t-k | o_1…t, X_1…t)
• modify assumptions about previous states, given observation
sequence and state set
– Projection/Prediction/Reachability
• P(X_t+k | o_1..t, X_1..t)
Open-Source FrameNet
• Use the idea of open source Linux development
– Frame hackers around the world
– Distributed vanguard and peer review process
– Pilot projects in large social networks (ICSI BCIS
project)
• Develop software and infrastructure
–
–
–
–
Frame Creation and Modification
Annotation structures
Common API for semantic resources.
Specialized domain FrameNets
Summary
 The FrameNet Project is making good progress
toward our goal of producing a lexicon for a
significant number of English words with uniquely
detailed information about their argument
structure and the semantics associated with it.
 We have an automatic translation from FrameNet
to computational representations that
 Are able to translate FN annotations and frame
structure for use by Semantic Web researchers and use
ontologies on the web for semantic typing of FE’s.
 Translates Frame representations to a simulation
semantics that can perform frame-based inference and
may provide a scalable semantics for NL systems.
Ongoing Work: Question Answering
• As part of the AQUAINT program (UCB, ICSI,
Stanford), we are tasked with
– coming up with a uniform formalism to encode frames,
schemas and metaphors (ScaNaLU 2002)
– Designing inference algorithms to reason with semantic
schemas.
– Others (UCB, Stanford) are tasked with trying to
identify semantic relations from text.
– One possible interchange language choice is DAMLS/OWL-S
• Hypothesis: Simulation based inference over
semantic relations is useful for question
answering.
http://www.icsi.berkeley.edu/NTL
http://www.icsi.berkeley.edu/framenet
http://www.icsi.berkeley.edu/NTL
http://www.icsi.berkeley.edu/framenet
Download