Anaphora Resolution Sobha Lalitha Devi AU-KBC Research Centre MIT Campus of Anna University Chennai-44 sobha@au-kbc.org Contents Introduction to Anaphora and Anaphora Resolution Types of Anaphora Process of Anaphora Resolution Tools Applications References 05/24/2011 Summer School, IIIT HYderabad 2 What is Cohesion COHESION is the internal continuity or network of points of continuity within a text. Text is not just a string of sentences. It is not simply a large grammatical unit “something of the same kind as a sentence, but differing from it in size- a sort of super-sentence· A semantic unit” Halliday & Hassan 05/24/2011 Summer School, IIIT HYderabad 3 Cohesive Relationships Cohesive relationships between words and sentences have certain definable qualities that allow us to recognize the super sentence Nature of cohesive relation Type of cohesion Relatedness of form Substitution and ellipsis; Lexical collocation Relatedness of reference Reference, Lexical reiteration Semantic connection Conjunction 05/24/2011 Summer School, IIIT HYderabad 4 Relatedness of Form Substitution: "Nice teapots! I'll take one.“ Ellipsis: "Turn on. Tune in. Drop out." ['you' is elided] Collocation: "John went to the bank. He wanted to swim in the river." ['river' disambiguates 'bank'] 05/24/2011 Summer School, IIIT HYderabad 5 Relatedness of Reference Exophora: (extra linguistic feature: deitic markers , this that) “what is this?” Anaphora: "I used to have the key. But I lost it.” Cataphora: "It is your turn, John” Reiteration: "He speaks only to the Huxleys; the Huxleys speak only to the Darwins; and the Darwins speak only to God.“ 05/24/2011 Summer School, IIIT HYderabad 6 Semantic Connection Conjunction: "You tell me that you've got ev'rything you want, and your bird can sing, but you don't get me, you don't get me! You say you've seen seven wonders, and your bird is green, but you can't see me, you can't see me! When your prized possessions, start to tear you down, then look in my direction, I'll be round, I'll be round." [Beatles -- Lee Campbell] 05/24/2011 Summer School, IIIT HYderabad 7 Comparison Halliday & Hassan also classify comparison as a form of cohesion "She's more fun than a barrel of monkeys!" "He's as tall as a six foot four inch tree." 05/24/2011 Summer School, IIIT HYderabad 8 CONTEXT DEPENDENCE The interpretation of most expressions depends on the context in which they are used Developing methods for interpreting context dependent expressions useful in many applications We focus here on dependence of nominal expressions on context introduced LINGUISTICALLY, for which we will use the term ANAPHORA 05/24/2011 Summer School, IIIT HYderabad 9 Introduction What is Anaphora Antecedent Anaphora Resolution 1. Sabeer Bhatia arrived at Los Angeles International Airport at 6 p.m. on September 23, 1998. His flight from Bangalore had taken 22hrs and he was starving. [RD, NOV 2000] 05/24/2011 Summer School, IIIT HYderabad 10 Etymology of Anaphora ANA- Back, Upstream, Back upstream Phora- Act of Carrying Anaphora - Act of Carrying Back 05/24/2011 Summer School, IIIT HYderabad 11 What is Anaphora Anaphora, in discourse, is a device for making an abbreviated reference (containing fewer bits of disambiguating information, rather than being lexically or phonetically shorter) to some entity (or entities) in the expectation that the receiver of the discourse will be able to disabbreviate the reference and, thereby, determine the identity of the entity. (Hirst 1981) 05/24/2011 Summer School, IIIT HYderabad 12 Cataphora When “anphora” precedes the antecedent Because she was going to the departmental store, Mary was asked to pick up the vegetables. 05/24/2011 Summer School, IIIT HYderabad 13 Relevance from the Linguistics point of view Binding Theory is one of the major results of the principles and parameters approach developed in Chomsky (1981) and is one of the mainstays of generative linguistics. The Binding Theory deals with the relations between nominal expressions and possible antecedents. It attempts to provide a structural account of the complementarity of distribution between pronouns, reflexives and R-expressions. 05/24/2011 Summer School, IIIT HYderabad 14 Dichotomy Between Linguistic and NLP The Binding Theory (and its various formulations) deals only with intra-sentential anaphora, A very small subset of the anaphoric phenomenon that practical NLP systems are interested in resolving. A much larger set of anaphoric phenomenon is the resolution of pronouns inter-sententially. This problem is dealt with by Discourse Representation Theory and more specifically by Centering Theory (Grosz et al., 1995).. 05/24/2011 Summer School, IIIT HYderabad 15 Types of Anaphors The Prime Minister is yet to arrive and he is expected at the central hall at any time. [The Times of India, Feb 2001] This book is about Anaphora Resolution. The book is designed to help beginners in the field and its author hopes that it will be useful. VP Anaphor John screamed, as did Mary . 05/24/2011 Summer School, IIIT HYderabad 16 Pronominal anaphora Vajpayee hits back forcefully when he told the opposition today “sometimes we fall prey to the media and sometimes you do. [Indian Express 2001] Possessive Priyanka eats only chicken sandwiches before going to take any exam; nothing else goes down her gullet that day.[Indian Express, 13 March 2001] 05/24/2011 Summer School, IIIT HYderabad 17 Reflexive Pronoun Finally ,Danian heaved himself up and lay on a waiting stretcher. Demonstrative Pronoun John had lots of packing to do before he shifted his house. This was something he never liked…. Relative Pronoun Stumper Sameer Dige, who made his test debut, failed to show fast reflexives when it mattered. 05/24/2011 Summer School, IIIT HYderabad 18 Non Anaphoric Usage of Pronouns. Pleonastic It Cognative a. It is believed that….. b. It appears that….. Modal Adjectives c. It is dangerous…… d. It is important….. Temporal e. It is five o’clock f. It is winter Weather verbs g. It is raining f. It is snowing Distance h. How far it is to Chennai? 05/24/2011 Summer School, IIIT HYderabad 19 Non-anaphoric uses of pronouns He that plants thorns must never expect to gather roses. He who dares wins. Deictic He seems remarkably bright for a child of his age. 05/24/2011 Summer School, IIIT HYderabad 20 Noun Phrase Anaphora Definite descriptions and Proper names Roy Kaene has warned Manchester United he may snub their pay deal. United’s skipper is even hinting that unless the future Old Trafford Package meets his demands, he could quit the club in June 2000. Irishman Keane, 27, still has 17 months to run on his current 23,000 pound a week contract and wants to commit himself to United for life. Alex Ferguson’s No 1 player confirmed: If it’s not the contract I want, I won’t sign”. 05/24/2011 Summer School, IIIT HYderabad 21 Coreference Computational Linguists from many different countries attended the tutorial. The participants found it hard to cope with the speed of the presentation, nevertheless they manages to take extensive notes. 05/24/2011 Summer School, IIIT HYderabad 22 Coreference Sophia Loren says she will always be grateful to Bono. The actress revealed that the U2 singer helped her calm down when she became scared by a thunderstorm while travelling by a plane. She=> Sophia Loren The actress=> Sophia Loren The U2 Singer=> Bono Her=>Sophia Loren She=>Sophia Loren 05/24/2011 Summer School, IIIT HYderabad 23 Coreference chain Sophia Loren says she will always be grateful to Bono. The actress revealed that the U2 singer helped her calm down when she became scared by a thunderstorm while travelling by a plane. Coreference chains {Sophia Loren, she, the actress, her, she} {Bono, the U2 singer} 05/24/2011 Summer School, IIIT HYderabad 24 Chains of object mentions in text Toni Johnson pulls a tape measure across the front of what was once a stately Victorian home. A deep trench now runs along its north wall, exposed when the house lurched two feet off its foundation during last week's earthquake. Once inside, she spends nearly four hours measuring and diagramming each room in the 80-year-old house, gathering enough information to estimate what it would cost to rebuild it. While she works inside, a tenant returns with several friends to collect furniture and clothing. One of the friends sweeps broken dishes and shattered glass from a countertop and starts to pack what can be salvaged from the kitchen. (WSJ section of Penn Treebank corpus) 05/24/2011 Summer School, IIIT HYderabad 25 What is Anaphora Resolution The Process of finding the antecedent for an Anaphor is Anaphora resolution 05/24/2011 Anaphor-The reference that point to the previous item. Antecedent-The entity to which the anaphor refers Summer School, IIIT HYderabad 26 RESEARCH ON ANAPHORA RESOLUTION: A QUICK SUMMARY 1970-1995 Primarily theoretical Emphasis: commonsense knowledge, salience Exception: Hobbs 1977, Shalom Lappin 1995-2005 First annotated corpora to be used to develop, evaluate and compare systems (MUC, Geand Charniak, ACE) First robust systems Heuristic-based: Mitkov ML: Vieira & Poesio 1998, 2000; Soon et al 2001, Ng and Cardie2002 Emphasis: surface features Exceptions: Poesio & Vieira, Harabagiu, Markert 2005-present More sophisticated ML techniques (global models, kernels) Richer features –especiallySummer semantic information 05/24/2011 School, IIIT HYderabad First tools 27 Application of Anaphora Resolution Tasks that require determining the coherence of (segments of) text Segmentation Post-hoc coherence check in summarization (Steinberger et al, 2007) Tasks that require identifying the most important information ina text Sentence selection in summarization (Steinberger et al 2005, 2007) Indexing Information extraction: recognize which expressions refer to objects in the domain Relation extraction from biomedical text (Sanchez-Grailletand Poesio, 2006, 2007) Multimodal interfaces: recognize which objects in the visual scene are being referred to 05/24/2011 Summer School, IIIT HYderabad 28 Different Approaches In Anaphora Resolution Rule Based Statistical Based Machine Learning Based 05/24/2011 Summer School, IIIT HYderabad 29 Rule Based Hobbs system 05/24/2011 Summer School, IIIT HYderabad 30 Hard Constraints on Coreference Number agreement Person and case Gender Agreement Syntactic Agreement Selectional Restrictions 05/24/2011 Summer School, IIIT HYderabad 31 Number agreement Singular Plural Unspecifie d She,her, he, him, his, it We, us, they, them you John and Mary loaned Sue a cup of coffee. Little did they know the magnitude of her addiction. 05/24/2011 Summer School, IIIT HYderabad 32 Person and Case Agreement First Secon Third d Nominative I,we you he,she,they Accusative me,us you Him,her,the m Genitive my,ou your r His,her, their 05/24/2011 Summer School, IIIT HYderabad 33 Gender Agreement *John has a coffee machine. She loves it. 05/24/2011 Summer School, IIIT HYderabad 34 Syntactic Agreement Reflexives (himself, herself…) have strong constraints on what syntactic positions they can appear in John bought himself a cup of coffee. *John bought him a cup of coffee. 05/24/2011 Summer School, IIIT HYderabad 35 Selectional Constraints Jim bought a coffee from the store. He drank it quickly. 05/24/2011 Summer School, IIIT HYderabad 36 Also : Preferences Recency Grammatical Role Repeated Mention Parallelism Verb Semantics 05/24/2011 Based on Salience Summer School, IIIT HYderabad 37 Recency John had a pop-tart. Bill had a jelly donut. Mary wanted it. Recent Entities are more salient 05/24/2011 Summer School, IIIT HYderabad 38 Grammatical Role “Sue bought a cup of coffee and a donut from Jane. She met John as she left.” Entities in subject position are more salient 05/24/2011 Summer School, IIIT HYderabad 39 Repeated Mention John went to the store to buy coffee. He loves coffee. He drinks 5 cups a day. At the store, Bill sold him a cup. He was delighed. Entities mentioned more frequently are more salient 05/24/2011 Summer School, IIIT HYderabad 40 Parallelism John bought coffee from Jim in the morning. Sue bought coffee from him in the evening. Even with preferences to the contrary (grammatical role) the syntactic parallelism strongly prefers [him = Jim] 05/24/2011 Summer School, IIIT HYderabad 41 Verb Semantics John telephoned Bill. He was jonesing for coffee. John criticized Bill. He was jonesing for coffee. Perhaps salience of different elements in the sentence changes with respect to the verb used. 05/24/2011 Summer School, IIIT HYderabad 42 Algorithms --- How to integrate these preferences? Constraints are easy to use : reject all hypothesis which violate the hard constraints (if you can accurately detect the constraints!) Preferences more difficult – how can one integrate these different preferences? 05/24/2011 Summer School, IIIT HYderabad 43 Hobbs Tree Search Algorithm Given parse trees, search them in a specific order to find the most likely referent 05/24/2011 Summer School, IIIT HYderabad 44 Hobbs in Detail 1. Begin at NP 2. Go up tree to first NP or S. Call this X, and the path p. 3. Traverse all branches below X to the left of p. Propose as antecedent any NP that has a NP or S between it and X 4. If X is the highest S in the sentence, traverse the parse trees of the previous sentences in the order of recency. Traverse left-to-right, breadth first. When a NP is encountered, propose as antecedent. If not the highest node, go to step 5. 05/24/2011 Summer School, IIIT HYderabad 45 Hobbs cont. 5. 6. 7. 8. From node X, go up the tree to the first NP or S. Call it X, and the path p. If X is an NP and the path to X did not pass through the nominal that X dominates, propose X as antecedent Traverse all branches below X to the right of the path, in a left-to-right, breadth first manner. Propose any NP encountered as the antecdent If X is an S node, traverse all brnaches of X to the right of the path but do not go below any NP or S encountered. Propose any NP as the antecedent. 05/24/2011 Summer School, IIIT HYderabad 46 Lappin and Leass (1994) Anaphora Resolution Algorithm The Lappin and Leass(1994) anaphora resolution algorithm uses salience weight in determining the antecedent to the pronominals. It requires as input a fully parsed sentence structure and uses hierarchy in identifying the subject, object etc. This algorithm uses syntactic criteria to rule out noun phrases that cannot possibly corefer with it. The antecedent is then chosen according to a ranking based on salience weights. 05/24/2011 Summer School, IIIT HYderabad 47 The salience Factors and Weights A pronoun P is non-coreferential with a (non-reflexive or nonreciprocal) noun phrase N if any of the following conditions hold: P and N have incompatible agreement features. P is in the argument domain of N. P is in the adjunct domain of N. P is an argument of a head H, N is not a pronoun, and N is contained in H. P is in the NP domain of N. P is a determiner of a noun Q, and N is contained in Q. 05/24/2011 Summer School, IIIT HYderabad 48 Examples Condition 1: The woman said that he is funny. Condition 2: She likes her. John seems to want to see him. Condition 3: She sat near her. Condition 4: He believes that the man is amusing. This is the man he said John wrote about. Condition 5: John’s portrait Summer of him is interesting. 05/24/2011 School, IIIT HYderabad 49 Salience Factors and Weights Salience factor types with initial weights Factor type Initial weight Sentence recency 100 Subject emphasis 80 Existential emphasis 70 Accusative emphasis 50 Indirect object and oblique complement emphasis 40 Head noun emphasis 80 Non-adverbial emphasis 50 05/24/2011 Summer School, IIIT HYderabad 50 Kennedy 1996 The linguistic analysis for anaphora resolution includes The output of a part of speech tagger, Augmented with syntactic function annotations for each input token; Using LINGSOFT 05/24/2011 Summer School, IIIT HYderabad 51 A set of patterns are used for identifying The NP Chunking with position of the NP in the text: Nominal Sequencing in two subordinate syntactic environments: a. in an adverbial adjunct b. in an NP (i.e. containment in a prepositional or clausal complement of a noun, or containment in a relative clause) Expletive “it”: 05/24/2011 Summer School, IIIT HYderabad 52 Anaphora Resolution Uses Lappin and Lease algorithm SENT-S: CNTX-S: SUBJ-S: EXST-S: POSS-S: ACC-S: DAT-S: OBLQ-S: HEAD-S: ARG-S: 05/24/2011 100 iff in the current sentence 50 iff in the current context 80 iff GFUN = subject 70 iff in an existential construction 65 iff GFUN = possessive 50 iff GFUN = direct object 40 iff GFUN = indirect object 30 iff the complement of a preposition 80 iff EMBED = NIL 50 iff ADJUNCT = NIL Summer School, IIIT HYderabad 53 Mitkov 1997 No Parsing of the Input Sentence Boosting indicators First Noun Phrases: A score of +1 is assigned to the first NP in a sentence. Indicating Verbs: A score of +1 is assigned to those NPs immediately following a verb which is a member of a predefined set (including verbs such as discuss, present, illustrate, identify, summarise, examine, describe, define, show, check, develop, review, 05/24/2011 Summer School, IIIT HYderabad 54 MARS Cont…. 05/24/2011 Lexical Reiteration: A score of +2 is assigned to those NPs repeated twice or more in the paragraph in which the pronoun appears, a score of +1 is assigned to those NPs repeated once in that paragraph. Section Heading Preference: A score of +1 is assigned to those NPs that also occur in the heading of the section in which the pronoun appears. Summer School, IIIT HYderabad 55 Boosting indicators contd.. Collocation Match: A score of +2 is assigned to those NPs that have an identical collocation pattern to the pronoun. Immediate Reference: A score of +2 is assigned to those NPs appearing in constructions of the form “… (You) V1 NP … con (you) V2 it (con (you) V3 it)”, where con Є {and/or/before/after…}. Sequential Instructions: A score of +2 is applied to NPs in the NP1 position of constructions of the form: “To V1 NP1 V2 NP2. (Sentence). To V3 it, V4 NP4“ the noun phrase NP1 is the likely antecedent of the anaphor it (NP1 is assigned a score of 2). Term Preference: A score of +1 is applied to those NPs identified as representing terms in the genre of the text. 05/24/2011 Summer School, IIIT HYderabad 56 Impeding indicators Indefiniteness: Indefinite NPs are assigned a score of -1. Prepositional Noun Phrases: NPs appearing in prepositional phrases are assigned a score of -1. 05/24/2011 Summer School, IIIT HYderabad 57 Indian Language Types of anaphors Dravidian type with gender marked pronouns Aryan Language Type without gender marking in the pronouns 05/24/2011 avan, aval, atu (example) Telugu does not have all other Dravidian languages have. “us” as in Hindi (example) FROM CHAPTER Summer School, IIIT HYderabad 58 <ANT 1,2,3> uutti </ANT> oru alakiya malai nakaram. Ooty(N) one(Q) beautiful(ADJ) hill(N) town(N). (Ooty is a beautiful hill station.) <ANA 1> ithu </ANA> malaikalin raani. This hill(N)+GEN queen(N). (This is Queen of hills.) <ANA 2> anku </ANA> puungkaa, pataku cavaari, malai rayil untu. There park(N), boating(VBN) hill train(N) present(V+PRESENT+3S) (There are park, boating, hill train ) <ANA 3> athu </ANA> oru cirantha currulaath thalam. it(PN) one(Q) best(ADJ) tourist(N) place(N). (It is a best tourist place.) 05/24/2011 Summer School, IIIT HYderabad 59 “Vasisth” a Rule Based Anaphora Resolution System 1. mo:han(i) avanRe(i) kuttiye mohan he-poss child-acc (Mohan saw his child.) 2. mo:han(i) avanRe(i) kuttiye kantu mohan he-poss kantu. see-pst ennu kRisnan paRannu. child-acc see-pst compl krishnan say-pst (Krishnan said that Mohan saw his child.) 3. *mo:han(i) avane(i) aticcu. mohan he-acc beat-pst (Mohan beat him.) 4. mo:han avane(i) aticcu ennu kRisnan(i) paRannu. mohan he-acc beat-pst compl krishnan say-pst (Krishnan said that Mohan beat him.) 05/24/2011 Summer School, IIIT HYderabad 60 The Algorithm for Intra-sentential Anaphora A pronoun P is coreferential with an NP iff the following conditions hold: a. P and NP have compatible P, N, G features. b. P does not precede NP. c. If P is possessive, then NP is the subject of the clause which contains P. d. If P is non-possessive, then NP is the subject of the immediate clause which does not contain P. 05/24/2011 Summer School, IIIT HYderabad 61 Vasisth is a multilingual Anaphora Resolution system Rule based With minimum Parsing Exploit the Morphology of Indian Languages 05/24/2011 Summer School, IIIT HYderabad 62 “VASISTH” Using Salience Measure for Indian Languages No In-depth Parsing Exploit the Rich Morphology of the Language The analysis depends on the salience weight of the candidate (NP) for the antecedent-hood of an anaphor from a list of probable candidates. 05/24/2011 Summer School, IIIT HYderabad 63 The salience weight assignment a) The current sentence gets a score of 50 and it reduces by 10 for each preceding sentence till it reaches the fifth sentence. The system considers five sentences for identifying the antecedent. b) The current clause gets a score of 75 if the pronoun present in the clause is a possessive pronoun and if it is a nonpossessive pronoun it gets zero score. c) The immediate clause gets the score 70 in the case of Possessive pronoun and gets a score of 75 for nonpossessive pronouns. d) For non-immediate clause, the possessive pronoun gets a score of 30 and non-possessive pronoun gets a score of 65. 05/24/2011 Summer School, IIIT HYderabad 64 e)The analysis showed that the subject could be the most probable antecedent for the pronoun. The case markings the subject of a sentence could take are nominative and dative. A Nominative, a Dative and a Possessive NP with a nominative/Dative head could become a subject of a sentence. 05/24/2011 Summer School, IIIT HYderabad 65 f) The direct object of a sentence could be identified by the case markings and all the case markings other than the subject are considered for object. The next most probable NP for antecedent-hood is the direct object and hence it gets a score of 40. g) The third NP in a clause, which is not identified as the subject or object, is considered as the indirect object and gets a low score of 30. 05/24/2011 Summer School, IIIT HYderabad 66 Salience factor weights for Indian Languages Salience Factors Current sentence Weights 50- Reduced by 10 for preceding sentences upto 5th sentence Possessive Current clause Immediate clause Non-immediate clause 75 70 30 Non-Possessive Current clause Immediate clause Non-immediate clause Possessive and Non-Possessive N.Nom N.Poss N.Dat 05/24/2011 N.Acc, Loc, Instr… N.others(3rd NP) 0 75 65 80 50 50 Summer School, IIIT HYderabad 40 30 67 How it works The salience weight to an NP is assigned in the following way Identify the Pronoun Consider Four sentences above the sentence containing the Pronoun Consider all the NPs preceding the Pronoun ( This is the general rule) 05/24/2011 Summer School, IIIT HYderabad 68 Here we take some NPs which follow the the Pronoun since Tamil All Indian languages are relatively free word Order Assign Salience Weights. The NP which gets the maximum salience weight and agrees in png with the anaphor is considered as the antecedent to the anaphor 05/24/2011 Summer School, IIIT HYderabad 69 Machine Learning CONLL Task and its results 05/24/2011 Summer School, IIIT HYderabad 70 Introduction Goal of the task Automatically identify coreference chains in a document The coreference chains can include Names Nominal mentions Pronouns verbs that are coreferenced with a noun phrases. The data used is the Ontonotes English documents from Ontonotes This consists of five different types of genres 05/24/2011 Newswire (NW), Broadcast News (BN), Telephonic Conversation (TC), Broadcast conversation (BC), Web blogs (WB) Summer School, IIIT HYderabad 71 Introduction Coreferencing Coreferents are classified into two types Two referential entities when they exist in the real world Coreference analysis determines whether or not two entities refer to the same entity Pronominal referents Non-pronominal referents Pronominal referents are pronouns, which refer to other nouns in the text Non-pronominal referents are names, nominal mentions and other noun phrases 05/24/2011 Summer School, IIIT HYderabad 72 A Sample Text Eagle Clothes Inc. , which is operating under Chapter 11 of the Federal Bankruptcy Code , said it reached an agreement with its creditors . Under the accord , Albert Roth , chairman and chief executive officer , and Arthur Chase , Sam Beigel , and Louis Polsky will resign as officers and directors of the menswear retailer . Mr. Roth , who has been on leave from his posts , will be succeeded by Geoffrie D. Lurie of GDL Management Inc. , which is Eagle 's crisis manager . Mr. Lurie is currently co-chief executive. 05/24/2011 Summer School, IIIT HYderabad 73 Coreference Tagged Text <Coref_chain Id="1">Eagle Clothes Inc.</Coref_chainId="1"> , which is operating under Chapter 11 of the federal Bankruptcy Code , said <Coref_chain Id="1">it</Coref_chain Id="1"> reached an agreement with <Coref_chain Id="1">its</Coref_chain Id="1"> creditors . Under the accord , <Coref_chain Id="2">Albert Roth , chairman and chief executive officer</Coref_chain Id=“2"> , and Arthur Chase , Sam Beigel , and Louis Polsky will resign as officers and directors of the <Coref_chain Id="1">menswear retailer</Coref_chain Id="1"> . <Coref_chain Id="2">Mr. Roth , who has been on leave from his posts ,</Coref_chain Id="2"> will be succeeded by <Coref_chain Id="3">Geoffrie D. Lurie of GDL Management Inc. , which is <Coref_chain Id="1">Eagle 's</Coref_chain Id="1"> crisis manager </Coref_chain Id="3">. <Coref_chain Id="3">Mr. Lurie </Coref_chain Id="3"> is currently cochief executive . 05/24/2011 Summer School, IIIT HYderabad 74 Our Approach Two different approaches are used for the two types of coreferents Mainly two modules Pronominal resolution module Non-pronominal resolution module Pronominal resolution refers to identification of a Noun phrase (NP) that is referred by a pronominal Non-Pronominal resolution refers to identification of a NP referring to other NP. 05/24/2011 Summer School, IIIT HYderabad 75 Pronominal Resolution Module Pronominal Resolution is done using refined salience measure. All pronouns do not refer back to an entity For example in the sentence “It will rain today” The pronoun “It” does not refer to any entity Such an instance of “It” is called as pleonastic “it” Before we do pronominal resolution, we need to identify such non-anaphoric pronouns 05/24/2011 Summer School, IIIT HYderabad 76 Pronominal Resolution Module (Contd…) The first step in the pronominal resolution module is identification of non-anaphoric pronouns and filter the non-anaphoric pronouns from text The identification of non-anaphoric pronouns is done using CRFs, a machine learning approach 05/24/2011 Summer School, IIIT HYderabad 77 Pronominal Resolution Module (Contd…) Using CRFs we build a language model The features used for the training are Word Part-of-speech (POS) tag Window of five words Non-anaphoric pronoun filtering has been observed to improve the results 05/24/2011 Summer School, IIIT HYderabad 78 Pronominal Resolution Module (Contd…) The second step - core task of the pronoun resolution module the identification of antecedent for a pronoun Use a refined salience measure based approach WordNet and NE tag information are used to match the category of pronoun and the antecedent 05/24/2011 Summer School, IIIT HYderabad 79 Pronominal Resolution Module (Contd…) The table below shows the refined salience factors and the weights assigned for them Salience Factors Weights Current Sentence (sentence in which pronoun occurs) 100 For the preceding sentences up to four sentences from the current sentence Reduce sentence score by 10 Current Clause (clause in which pronoun occurs) 100 – for possessive pronoun 50 – for non-possessive pronouns Immediate Clause (clause preceding or following the current clause) 50 – for possessive pronoun 100 – for non-possessive pronouns Non-immediate Clause (neither the current or immediate clause) 50 Possessive NP 65 Existential NP 70 Subject 80 Direct Object 50 Indirect Object 40 Compliment of PP 05/24/2011 Summer School, IIIT HYderabad 30 80 Non-Pronominal Resolution Module Use CRFs, a machine learning method Preparing the Training data for CRFs learning All positive pairs of NPs, and negative pairs of NPs are taken for training Positive pairs are the NPs and the anaphor All NPs between an anaphor and antecedent are the negative NPs Do not consider the NPs containing the pronouns, We consider the NP on the left side as antecedent NP and NP on the right side as anaphor NP 05/24/2011 Summer School, IIIT HYderabad 81 Features – Non-pronominal Resolution Module The features used for training are Distance feature Definite NP If the antecedent NP is a definite NP, has value 1 else 0 Demonstrative NP the possible values are 0,1,2,…. Calculated based on number of sentences between NPs If the antecedent NP is demonstrative, has value 1 else 0 String match 05/24/2011 the possible values are between 0 and 1 calculated as ratio of the number of words matched between the NPs and the total number of words of the anaphor NP. Summer School, IIIT HYderabad 82 Features – Non-pronominal Resolution Module (Contd…) Number Agreement use the gender data file (Bergsma and Lin, 2006) provided by CoNLL Also use POS information Gender agreement use the gender data file (Bergsma and Lin, 2006) provided by CoNLL Alias feature the possible values are 0 or 1. this is obtained using three methods Comparing the head of the NPs, if both are same then scored as 1 If both the NPs start with NNP or NNPS POS tags, and if they are same then scored as 1 Looks for Acronym match, if one is an acronym of other it is scored as 1 Proper NP If both NPs are proper NPs then value of 1 else 0 NE Tag Info If NE tag present then value of 1 else 0 05/24/2011 Summer School, IIIT HYderabad 83 Non-Pronominal Resolution Module (Contd…) The semantic class information (noun category) obtained from the WordNet is used for the filtering purpose. 05/24/2011 The pairs which do not have semantic feature match are filtered out. Summer School, IIIT HYderabad 84 Complete Coreference Chain Building The Coreferring pairs obtained from pronominal resolution system and Non-pronominal system are merged to generate the complete coreference chains. The merging is done as follows: A member of a coreference pair is compared with all the members of the coreference pairs identified if it occurs in anyone of the pair, then the two pairs are grouped this process is done for all the members of the identified pairs and the members in each group are aligned based on their position in the document to form the chain We show the sample outputs for building chain in the next few slides 05/24/2011 Summer School, IIIT HYderabad 85 Evaluation The data used for training and testing is the English portion of the Ontonotes The training and development data consisted of 1876 files from different genres viz., Newswire, Broadcast news, Broadcast conversation, Web blogs, magzine articles The metrics used for evaluating the complete system are 05/24/2011 MUC B-Cubed CEAFE Summer School, IIIT HYderabad 86 Evaluation We have performed evaluation of the Pronominal resolution module separately using the development data We perform the evaluation of non-anaphor detection engine In the evaluation of pronominal resolution module we studied how the non-anaphoric pronoun detection improves 05/24/2011 Summer School, IIIT HYderabad 87 Results – Pronominal Resolution Module Evaluation of Non-anaphoric pronoun detection component Evaluation of Pronominal resolution module Actual (gold standar d) System identifie d Correctl y Accur acy (%) System type Total Anaph oric Prono uns System identifie d pronoun s System correctly Resolved Pronouns Prec (%) Anaphoric Pronouns 939 908 96.6 Without nonanaphoric pronoun detection 939 1099 693 63.1 Nonanaphoric pronouns 160 81 50.6 939 987 693 70.2 Total 1099 989 89.9 With nonanaphoric pronoun detection Type pronoun of 05/24/2011 Summer School, IIIT HYderabad 88 Results– Complete System Metric Mention Detection Coreference Resolution Recall Precision F1 Recall Precision F1 MUC 68.1 61.5 64.6 52.1 49.9 50.9 B-CUBED 68.1 61.5 64.6 66.6 67.6 67.1 CEAFE 68.1 61.5 64.6 42.8 44.9 43.8 05/24/2011 Summer School, IIIT HYderabad 89 Results Discussion On analysis of the output we found mainly three types of errors. They are Newly invented chains 05/24/2011 The system identifies new chains that are not found in the gold standard annotation. This reduces the precision of the system. This is because of the string match as one of the features. Summer School, IIIT HYderabad 90 Results Discussion (Contd…) Only head nouns in the chain system while selecting pair for identifying coreference, the pair has only the head noun instead of the full phrase. In the phrase “the letters sent in recent days”, the system identifies “the letters” instead of the whole phrase. This affects both the precision and recall of the system. 05/24/2011 Summer School, IIIT HYderabad 91 Conclusion Presented a coreference resolution system which combines the pronominal resolution using refined salience based approach with non-pronominal resolution using CRFs, machine learning approach. Non-anaphoric pronouns identification improves the precision. In non-pronominal resolution algorithm, the string match feature is an effective feature. But, this feature is found to introduce errors. We need to add additional contextual and semantic feature to reduce above said errors. The results on the development set are encouraging. 05/24/2011 Summer School, IIIT HYderabad 92 Problems in Anaphora Resolution Ambiguous Sentences We gave the bananas to the monkeys because they were hungry. We gave the bananas to the monkeys because they were ripe. We gave the bananas to the monkeys because they were here. Complement anaphora 05/24/2011 (1) Only a few of the children ate their ice-cream. They ate the strawberry flavour first. (2) Only a few of the children ate their ice-cream. They threw it around the room instead Summer School, IIIT HYderabad 93 The Prime Minister of New Zealand visited us yesterday. The visit was the first time she had come to New York since 1998. If the second sentence is quoted by itself, it is necessary to resolve the anaphor: The visit was the first time the Prime Minister of New Zealand had come to New York since 1998. Although of course, as The Prime Minister of New Zealand is an office of state and she would seem to refer to the person currently occupying that office, it could quite easily be that the Prime Minister of New Zealand had visited New York since 1998 and before the present day, whilst the present incumbent she had not. 05/24/2011 Summer School, IIIT HYderabad 94 Corpus Creation for ML Method Annotation Guide lines 05/24/2011 PALINKA MUC GATE Give examples Summer School, IIIT HYderabad 95 Tools GATE Java-RAP (pronouns) GUITAR (Poesio & Kabadjov, 2004; Kabadjov, 2007) BART (Versleyet al, 2008) 05/24/2011 Summer School, IIIT HYderabad 96 TASKS and Conferences CoNLL shared task Conference Discourse Anaphora and Anaphora Resolution Colloquium: DAARC 05/24/2011 Summer School, IIIT HYderabad 97 Where it is required? Machine Translation Information Extraction Summarization And in……….almost all NLU applications 05/24/2011 Summer School, IIIT HYderabad 98 References Massimo Poesio Slides: “Anaphora resolution for Practical task” Ruslan Mitkov: “MARS a Knowledge Poor anaphora resolution system” MORE FROM OTHER SOURSES 05/24/2011 Summer School, IIIT HYderabad 99 Thank You