INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio LECTURE 6 The Cognitive Perspective: Generative Lexicon Sense Enumeration Lexicon (SEL) • WordNet and similar resources are examples of SENSE ENUMERATION LEXICA • Direct approach to handle polysemy is to allow the lexicon to have multiple listing of words, each annotated with a separate meaning or lexical sense. • Widely accepted in both computational and theoretical linguistics. 2 Sense Enumeration Lexicon (SEL) • Example of Contrastive Senses bank1 CAT= count-noun GENUS= financial-institution bank2 CAT= count-noun GENUS= shore 3 Verbal polysemy in sense enumeration lexica Syntactic polysemy deals with polivalency (I), object deletion (II) and the general properties of argument expression (III). (I) a. Mary began to read the novel. b. Mary began reading the novel. c. Mary began the novel. (II) a. Mary ate (her meal) quickly. b. Mary devoured *(her meal) quickly. (III) a. John carved a doll (out of the wood). b. John carved the wood (into a doll). Nominal polysemy (1)a. Mary doesn’t believe the book. b. John sold his books to Mary. (2)a. Eno the cat is sitting on yesterday’s newspaper. b. Yesterday’s newspaper really got me upset. (3)a. Mary is in Harvard Square looking for the Bach sonatas. b. We won’t get to the concert until after the Bach sonatas. (4)a. I have my lunch in the backpack. b. Your lunch was no longer today than it was yesterday. (5)a. The phone rang during my appointment. b. My next appointment is John. Nominal polysemy in sense enumeration lexica • Newspaper Newspaper1 CAT= count-noun GENUS= artefact Newspaper2 CAT= count-noun GENUS= information 6 GENERATIVE LEXICON THEORY • (Pustejovsky, 1991, 1995) • Claim: the concepts associated with a word in a context are GENERATED by a process starting from lexical entries structured into QUALIA STRUCTUREs and involving GENERATIVE DEVICES such as TYPE COERCION and COCOMPOSITION Generative lexicon theory: lexical entries A lexical entry in the generative lexicon consists of the following elements at least: • Argument Structure True Arguments Default Arguments Shadow Arguments True Adjuncts • Event Structure • Qualia Structure Formal Constitutive Telic Agentive 8 QUALIA ROLES Word meaning is structured on the basis of four generative factors, called qualia roles, that capture how humans understand objects and relations in the world and provide the minimal explanation for the linguistic behavior of lexical items. FORMAL: the basic category that distinguishes an object within a larger domain CONSTITUTIVE: the relation between an object and its constituent parts TELIC: the object’s purpose and function AGENTIVE: factors involved in the object’s origin or ‘coming to being’ Qualia structures and argument polysemy Qualia Structure for novel novel Qualia const = narrative formal = book telic = reading agent = writing 10 A generative device: type coercion • Type Coercion a lexical item or phrase is coerced to a semantic interpretation by a governing item in the phrase, without changing its syntactic type Mary began to read the novel Mary began reading the novel Mary began the novel • • Function Application with Coercion different complement type of the verb different interpretations of the verb that arise for the different complements 11 Other generative devices • Selective Binding a lexical item or a phrase operates specifically on the substructure of a phrase, without changing the overall type in the composition a good knife: a knife that cuts well • Co-composition multiple elements within a phrase behave as functors, generating new nonlexicalized senses for the words in composition John baked the potato John baked the cake 12 NOMINAL POLYSEMY A dot object is a deeper structure relating the apparently contradictory senses of the word. For each sense pair there is a relation that ‘connects’ the senses in a well-defined way. The dot object is characterized as: - a Cartesian type product of n types (the product τ1 x τ2, of types τ1 and τ2, each denoting sets, is the ordered pair <t1 , t2>, where t1 ε τ1 , t2 ε τ2) - with some additional constraints: there exists a relation R between the elements of τ1 and τ2 , namely, R(t1 , t2). This relation must be seen as part of the definition of the semantics for the dot object. Type combinations included in the broad range of complex types encountered in natural language: a. b. c. d. e. phys_objinfo : e.g., book, record eventevent : e.g., construction, examination eventquestion : e.g., exam eventfood : e.g., lunch, dinner eventhuman : e.g., appointment For each of these type products, there is a unique relation, Ri, that structures the types. For example, nouns such as book or record, are structured by a containment relation R (container-like concepts). This containment relation -hold(x,y)- must be encoded directly into the semantics of the concept as the FORMAL quale value. The lexical structure for newspaper as a dot object is represented as follows: newspaper ARGSTR = ARG1 = y:information ARG2 = x:phys_obj QUALIA = informationphys_obj FORM = hold(x,y) TELIC = read(e,w,xy) AGENT = write(e,v,xy) This translates to the following logical expression: λx yev[newspaper (x:physobj y :info) hold(x,y) λwλe [read (e,w,xy) [write(e;v,x y)]] REFERENCES • Pustejovsky, J. (1995). The generative Lexicon. Cambridge, MA: MIT Press.