ppt

Centering theory and its direct applications Lecture 2 Some definitions Discourse = coherent sequence of utterances  Several sentences following one another do not make a readable text  Defining specific computable measures of coherence is the goal of this seminar  Centering theory ingredients  Deals with local coherence What happens to the flow from sentence to sentence  Does not deal with global structuring of the text (paragraphs/segments)   Defines coherence as an estimate of the processing load required to “understand” the text Processing load  Upon hearing a sentence a person Cognitive effort to interpret the expressions in the utterance  Integrates the meaning of the utterance with that of the previous sentence  Creates some expectations on what might come next  Example (3) John met his friend Mary today. He was surprised to see her. He thought she is still in Italy.  Form of referring expressions (1) (2)    Anaphora needs to be resolved “Create” a discourse entity at first mention with full noun phrase Creating expectations Creating and meeting expectations (1) a. John went to his favorite music store to buy a piano. b. He had frequented the store for many years. c. He was excited that he could finally buy a piano. d. He arrived just as the store was closing for the day. (2) a. John went to his favorite music store to buy a piano. b. It was a store John had frequented for many years. c. He was excited that he could finally buy a piano. d. It was closing just as John arrived. Interpreting pronouns a. b. c. d. e. Terry really goofs sometimes. Yesterday was a beautiful day and he was excited about trying out his new sailboat. He wanted Tony to join him on a sailing expedition. He called him at 6am. He was sick and furious at being woken up so early. Basic center definitions  Centers of an utterance Set of entities serving to link that utterance to the other utterances in the discourse segment that contains it  Not words or phrases themselves  Semantic interpretations of noun phraes  Types of centers    Forward looking centers  An ordered set of entities  What could we expect to hear about next  Ordered by salience as determined by grammatical function  Subject > Indirect object > Object > Others John gave the textbook to Mary.  Cf = {John, Mary, textbook} Preferred center Cp  The highest ranked forward looking center  High expectation that the next utterance in the segment will be about Cp Backward looking center  Single backward looking center, Cb (U)  For each utterance other than the segment-initial one The backward looking center of utterance Un+1 connects with one of the forward looking centers of Un  Cb (U+1) is the most highly ranked element from Cf (Un) that is also realized in U+1  Centering transitions ordering Cb(Un+1)=Cb(Un) Cb(Un+1) != OR Cb(Un) Cb(Un)=[?] Cb(Un+1) = Cp (Un+1) continue Cb(Un+1) != Cp (Un+1) retain smooth-shift rough-shift Centering constraints  There is precisely one backwardlooking center Cb(Un)  Cb(Un+1) is the highest-ranked element of Cf(Un) that is realized in Un+1 Centering rules  If some element of Cf(Un) is realized as a pronoun in Un+1 then so is Cb(Un+1)  Transitions not equal  continue > retain > smooth-shift > rough-shift Centering analysis  Terry really goofs sometimes.  Cf={Terry}, Cb=?, undef  Yesterday was a beautiful day and he was excited about trying out his new sailboat.  Cf={Terry,sailboat}, Cb=Terry, continue  He wanted Tony to join him in a sailing expedition.  Cf={Terry, Tony, expedition}, Cb=Terry, continue  He called him at 6am.  Cf={Terry,Tony}, Cb=Terry, continue  He called him at 6am.  Cf={Terry,Tony}, Cb=Terry, continue  Tony was sick and furious at being woken up so early.  Cf={Tony}, Cb=Tony, smooth shift  He told Terry to get lost and hung up.  Cf={Tony,Terry}, Cb=Tony, continue  Of course, Terry hadn’t intended to upset Tony.  Cf={Terry,Tony}, Cb = Tony, retain Rough shifts in evaluation of writing skills  One of the graders of student essays in standardized tests is an automatic program  ETS researchers have developed a number of applications that use natural language processing technologies to evaluate and score the writing abilities of test takers:  The CriterionSM Online Essay Evaluation Service automatically evaluates essay responses using e-rater and the Critique writing analysis tools.  E-rater® gives holistic scores for essays.  CritiqueTM provides real-time feedback about grammar, usage, mechanics and style, and organization and development.  C-raterTM offers automated analysis of conceptual information in short-answer, free responses. E-rater features  Syntactic variety   Clear transitions     Represented by features that quantify the occurrence of clause types Cue phrases in certain syntactic constructions Existence of main and supporting points Appropriateness of the vocabulary content of the essay What about local coherence? Ranking forward looking centers Subject >  Indirect object >  Object >  Others >  Quantified indefinite subjects (people, everyone) >  Arbitrary plural pronominals  Essay score model Human score available  E-rater prediction available  Percentage of rough-shifts in each essay: analysis done manually   Negative correlation between the human score and the percentage of rough-shifts Karamanis’07  Why are we reading this paper?  Gives quite complete list of references on later work on centering • Centering variants  Reminds that entity coherence is not the only factor in text flow • We’ll be discussing rhetorical structure theory during the next class  Applications---can some aspects of the work be done differently/improved upon? Information ordering task  Given a set of sentences/clauses, what is the best presentation?   Criteria for deciding which of two orderings is better   Take a newspaper article and jumble the sentences---the result will be much more difficult to read than the original Centering would definitely be applicable Summarization, question answering, generation  Linear multi-factor regression   Adding rough shifts significantly improves the model of the score   Approximate the human score as a linear function of the e-rater prediction and the percentage of rough-shifts 0.5 improvement on 1—6 scale How easy/difficult would it be to fully automate the rough-shift variable Centering variations  Continuity (NOCB=lack of continuity)   Coherence    Cb(Un) = Cb(Un+1) Salience   Cf(Un) and Cf(Un+1) share at least one element The Cb(U) = Cp(U) Coherence is more important than salience Cheapness (fulfilled expectations)  Cb (Un+1) = Cp(Un) GNOME corpus  20 descriptions of museum artifacts    Split into finite unites (clauses) Semi-automatic centering annotation Item 144 is a torc. Its present arrangement, twisted into three rings, may be a modern alteration; it should probably be a single ring, worn around the neck. The terminals are in the form of goats’ heads. Rhetorical coherence   Each text can be seen as a hierarchical tree structure Different spans are related by some rhetorical relation      Elaboration (adding more information) Contrast Sequence Purpose Summary etc Local rhetorical coherence  Applies only locally rather than on the text as a whole  Signaled by cue phrases  Contrast: but, however, on the other hand  Continuation: and, then, later  Consequence: because, in order to, so  These local rhetorical relations structure the text  When missing, entity coherence determines the flow  8 out of the 20 texts do not have any explicitly marked rhetorical relations Joint centering and local rhetorical coherence  In clauses directly marked for a rhetorical relation  Merge the Cf lists of the two clauses  Apply centering transitions on the resulting Cf list rather than the original  GNOME-RR contains 1.58 fewer CF lists compared to the original average number (8.35) Metrics of coherence M.NOCB (no continuity)  M.CHEAP (expectations not met)  M.KP sum of the violations of continuity, cheapness, coherence and salience  M. BFP seeks to maximize transitions according to Rule 2  Experimental methodology  Gold-standard ordering    The original order of the text (object description, news article) Assume that other orderings are inferior Classification error rate  Percentage orderings that score better than the gold-standard + 0.5*percentage of the orderings that score the same Results  NOCB gives best results  Significantly better than the other metrics  M.BFP is the second best metric  Adding the local rhetorical relations hurts performance---is this surprising? Reminders  Select topics you would like to present    Should schedule next week now The second time you present one of the goals will be to relate the papers with previous topics we have covered Start thinking about the topic of your literature overview    About 15 papers 5/6 pages Due Nov 12

ppt

Related documents

Products

Support

ppt

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib