G22

advertisement
G22.2590 - Natural Language Processing - Spring 2001
Lecture 11 Outline
Prof. Grishman
April 4, 2001
Resolving Anaphoric Reference, cont’d (Allen Ch. 14)
Pronominal coreference
General strategy for resolving pronouns:
- generate discourse entities for prior sentences in a discourse
- apply constraints to select best candidate antecedent for pronoun
Computing discourse entities for noun phrases
- proper names and definite NPs (“Fred met the president of NYU.”)
- indefinite NPs (“Fred ate an apple.”)
- indefinite NPs in scope of universal quantifiers (“Every boy ate an apple.”)
Constraints:
recency (prefer most recent antecedent; exclude antecedents beyond a certain
distance)
focus / centering
prefer current focus of discourse
prefer argument of main clause of prior sentence; prefer subject to object
breadth-first search provides good approximation (Hobbs)
number (for it vs. they)
note complication for indefinite NPs inside universal quantifiers
animate (for he/she vs. it)
gender (for he vs. she)
requires determination of gender for antecedents
reflexive / non-reflexive pronouns
selectional constraints
does antecedent satisfy selectional constraints of context of anaphor?
Other forms of anaphora
Definite descriptions (Allen 14.4)
May refer to entities from earlier discourse contexts (not just local context), or even
to uniquely determined entities in the setting of the discourse.
In addition, we may make use of background knowledge (for example, about partwhole relations) to make definite references which introduce objects related to
objects in the discourse history. For example, “I bought a new frying pan. The
lid of the pan was made of glass.” Or “The winner of yesterday’s 5-mile race got a
new car.” The definite reference requires (or implies) that the object is unique.
In many cases, the reference to the prior object is implicit (Allen, p. 443): ““I bought
a new frying pan. The lid was made of glass.” Or “I saw an exciting 5-mile race
yesterday. The winner got a new car.” In such cases, if we can identify the
semantic class of the implicit element (null anaphor), we can use conventional
(pronominal) anaphora resolution methods to retrieve the referent.
‘One’ (surface) anaphora (Allen 14.7)
“I bought a red car. Mary bought a green one.” is an example of a quite different
anaphor phenomenon. Here we refer not to a specific entity evoked by the prior
discourse, but to (parts of) a description of such an entity. It is therefore sometimes
called “description anaphora.” While the process of identifying the antecedent is
generally similar, the way in which the antecedent is used is quite different: one
takes just those parts of the antecedent which do not conflict with the modifiers in the
anaphor (we do not use the discourse entities). This may be done at the surface level
(rather than at the logical form level).
Ellipsis (Allen 14.6)
Ellipsis involves input which is not a full clause / sentence, but can be ‘filled out’
with reference to prior sentences. Sometimes this is indicated by a bare auxiliary:
“John passed the course. Mary did too.” Sometimes it takes the form of a bare noun
phrase, particularly in dialog: “Would you like some chocolate crunch? No. Apple
walnut?”
Resolving ellipsis involves identifying the antecedent constituent, and then
determining which parts are to be replaced by the current phrase. Sometimes this can
be done syntactically; sometimes semantic constraints are essential, particularly for
analyzing bare NPs. Some ellipsis mechanism is essential for analyzing dialog.
Anaphora resolution mechanism in Jet (separate handout)
In the example patterns, we use the anaphora mechanism to recover the company
name for appoint events.
In contrast to the earlier pattern sets, we generate an explicit event annotation for each
appointment event; we fill the company slot with an empty noun group with head
‘company’, which will be resolved to the actual company name (usually) by reference
resolution.
Assignment #10 (the last): due April 11th:
.
Exercises #2, 7, and 8 from Chapter 14.
Download