Lecture 12 - ufal wiki

advertisement
Boris Iomdin
Russian Language Institute,
Russian Academy of Sciences
iomdin@ruslang.ru
Lecture 12. Plan
 Classifications of the lexicon
 Objects
 Predicates
 Fundamental classification
 Main classes
 Subclasses
 Semantic annotation of corpora
 A case study: memory in Russian
Classifying the lexicon
 Objects
 names of animals, birds, fish, fruits, vegetables,
stones, mountains, stars, planets, etc.
 differential explications
 a taxonomical classification
 Predicates
 lexical units that have at least one semantic
valency
 exhaustive explications
 a fundamental classification
Generic and specific features
Both parts of the inventory are further
subdivided into two subgroups:
 generic semantic features (genus
proximum).
Nouns: ‘animal’, ‘vegetable’, ‘state’, ‘action’, …
 specific semantic features (differentia
specifica). Adjectives: ‘domestic’, ‘wild’,
‘natural’, ‘physical’, ‘mental’, …
Fundamental classification
 Developed by Juri Apresjan and differs from
comparable systems of J. Maslov, Z. Vendler,
T. Bulygina, E. Paducheva, Ch. Fillmore and others
 A non-rigid multi-level hierarchy with multiple
"horizontal" and "vertical" intersections of classes
 Unlike that of object descriptors, the list of
predicate descriptors forms a closed set
 The classes of the fundamental classification can
be used in all sorts of rules: morphological,
derivational, syntactic, semantic, pragmatic, …
Main classes
 Action
 Localization
 Activity
 State
 Occupation
 Property
 Behaviour
 Ability
 Impact
 Parameter
 Process
 Existence
 Manifestation
 Relation
 Event
 Interpretation
 Spatial position
Action
 An action is a verb whose explication, if reduced to
the lowest level, includes the semantic primitive ‘to do’,
and the situation takes place within one round of
observation
 Examples: attack, walk, go, chop
 The main and the most numerous class
 Most actants
 Richest co-occurrence
 Highest derivational potential
 Most polysemous
Activity
 An activity is a verb that denotes an aggregate of
different actions united by a single purpose, and the
situation takes place within several rounds of
observation
 Examples: educate, trade, fight (for power)
Behaviour
 A behaviour is a verb that denotes an aggregate of
physical actions viewed from the angle of how the
subject shows himself towards other people or towards
the current situation, with a (generally negative)
assessment
 Examples: frolic, mince, fool around, brawl
Occupation
 An occupation is a verb that denotes an action the
main purpose of which is the action itself
 Examples: read, play, rest, skate
 Action and occupation are close enough, the main
difference being the interpretation of the purpose
Occupation:
 xodit’ po gorodu ‘to be walking in the city’, plavat’ v
bassejne ‘to be swimming in the pool’
Action:
 xodit’ kupat’sja ‘to go for a swim’, plavat’ za
kuvshinkami ‘to swim for water lilies’
Impact
 An impact is a verb whose subject is not a human
being but an inanimate power that can change the
world (a space object, a natural phenomenon, a certain
circumstance, etc.)
 This power could be the cause of
 an action (A rustle in the bushes made him stop)
 a process (The sun heats the air)
 a situation (This declaration harms us)
 a state (Meeting with de Saussure worried him)
Process
 A process is a verb whose subject is an object or a
phenomenon which could change all by itself, without
anyone wishing it
 Examples: grow, decrease, boil, recover
 Actions and processes are close enough, the main
difference being the existence / absence of a purpose
Spatial position
 A spatial position is a verb whose main idea is a
constant contact of an object X (or its part Z) and a
place Y
 Examples: be sitting, be lying, be standing
Manifestation
 A manifestation is a property, a state or a process that
is being perceived by an observer
 Examples: shine, ring, stink, sting
Subclasses
 Inceptives
 arise: ‘beginning’ + ‘existence’
 learn: ‘beginning’ + ‘state’
 Finitives
 die out: ‘ceasing’ + ‘existence’
 forget: ‘ceasing’ + ‘state’
 Causatives
 feed: ‘causation’ + ‘action’
 cultivate: ‘causation’ + ‘process’
 Liquidatives
 awaken: ‘elimination’ + ‘process’
Subclasses of actions
 Physical: break
 Physiological: urinate
 Mental: think
 Volitional: decide
 Emotional: anger
 Speech: request
 Social: marry
…
Subclasses of impacts
 Physical: heat
 Mental: convince
 Volitional: force
 Emotional: surprise
…
Subclasses of states
 Physical: see
 Physiological: be ill
 Mental: know
 Volitional: wish
 Emotional: fear
 Material: need
 Social: be married
…
(no speech states!)
Subclasses of parameters
 Numerical: height, length, distance, width, depth, area,
capacity, volume, age, time, weight, …
 Qualitative: appearance, form, colour, sound, smell, …
 Social, non-hierarchical: nationality, religion,
occupation, income, …
 Social, hierarchical: rank, title, status, course, …
Polysemy: different lexemes
may go into different classes
 vxodit’ v komnatu ‘to enter a room’: action
 vxodit’ v trans ‘to fall into a trance’: process
 vxodit’ v komissiju ‘to be part of a committee’: state
 vxodit’ v vedro ‘to go in a bucket’: parameter (capacity)
 sobaka viljaet xvostom ‘the dog wiggles its tail’: action
 pricep viljaet ‘the trailer side-tracks’: process
 doroga viljaet ‘the road twists’: property
Order of meanings
 Direct meanings
 Metaphorical/metonymical meanings
 Lexically or syntactically bound meanings
 Grammatical meanings
Gradual semantic erosion of verbal lexemes
Semantic tagging of corpora
 Semantic tagging is based on the inventory of semantic
features (descriptors) and a dictionary, with a set of tags
assigned to each lexeme and its argument slots
 The set of descriptors assigned to words is designed in such
a way as to construct a linguistically relevant classification
for the whole vocabulary
 This classification serves for discovering laws according to
which the elements of various lexical and semantic classes
interact in the texts
 A part of the Russian National Corpus has been
syntactically and semantically tagged by Apresjan et al. Cf.
Proposition Bank from the Penn English Tree Bank etc.
Russian memory lexicon
• Mental state: pomnit’ ‘remember’, …
• Inceptives:
▫ Controlled: vspomnit’, pripomnit’ ‘recall’, …
▫ Uncontrolled: vspomnit’sja ‘come to memory’, …
• Liquidatives:
▫ Controlled: vybrosit’ iz golovy ‘think away’, …
▫ Uncontrolled: zabyt’ ‘forget’, upustit’ ‘miss’, …
• Causation:
▫ Internal: zapomnit’ ‘memorize’, vyuchit’ ‘learn’, …
▫ External: napomnit’ ‘remind’, suvenir ‘souvenir’, …
•
•
•
•
Abilities: pamjatlivyj ‘having a good memory’, …
Result: vospominanie ‘reminiscence’, …
Properties of objects: pamjatnyj ‘memorable’, …
Responsible organ(s): pamjat’ ‘memory’, soznanie ‘mind’
Memory and other mental
domains: main differences
 What types of information can be received
 How it is received
 How it is processed
 How it is stored
 How it is retrieved
Receiving information
 uznat’ ‘to learn’
 ponjat’ ‘to understand’
 Rational understanding: osoznat’ ‘to realize’,
soobrazit’ ‘figure out’, etc.
 Irrational understanding: ugadat’ ‘to guess’,
osenit’ ‘to dawn upon’, etc.
Types of information
 znat’ adres <imja, nomer telefona>
‘to know the address <name, phone number’
 pomnit’ adres <imja, nomer telefona>
‘to remember the address <name, phone number’
 *ponimat’ adres <imja, nomer telefona>
‘to understand the address <name, phone
number’
Types of information
 pomnit’ vcherashnij vecher
 ‘to remember yesterday’s evening’
 *znat’ vcherashnij vecher
‘to know yesterday’s evening’
 ?ponimat’ vcherashnij vecher
‘to understand yesterday’s evening’
experiential vs. informational memory
Storing information
 Controlled: vyuchit’ ‘learn’, zauchit’ ‘learn by heart’,
vyzubrit’ ‘swot up’, zatverdit’ ‘drill’, …
 Uncontrolled: zapomnit’sja ‘stay /stick into one’s
memory’, zapast’ v pamjat’ ‘be imprinted in one’s
memory’
 Ja dolgo zapominal ix adres ‘I spent some time
memorizing their address’ VS.
 Ja nazval otel’, adres kotorogo sluchajno zapomnil
‘I mentioned an hotel whose address I chanced to
remember’
Information storages
 golova ‘head’
 um ‘mind, intellect’
 soznanie ‘mind, conscience’
 pamjat’ ‘memory’
Um
processes current information:
 reshat’ v ume zadachi ‘mentally solve
problems ’
 igrat’ v ume v shaxmaty’ ‘play blindfold
chess’
 ??xranit’ v ume ‘store in one’s mind/intellect’
Soznanie
 holds freshly perceived information, is
responsible for the perception of the current
situation and for instant processing (the
human RAM)
 is rather small: aeto ne umeshchaetsja v
soznanii ‘the mind cannot accommodate it’
 soznanie zapolneno odnoj mysl’ju <odnim
voprosom> ‘the mind is full with the only
thought <the only topic>’
English equivalents of soznanie
 conscience
 consciousness
 sense
 mind
Three fundamental “faculties” of mind are
memory, imagination, and reason (Francis
Bacon, Novum Organum) –
Tri sfery soznanija: pamjat’, voobrazhenie i
razum
Pamjat’
 ‘the ability of a person A1 to store and recall
in his/her mind (v soznanii) images of
earlier perceived objects, experienced
feelings, and other earlier received
information of type A2, or an invisible organ
inside A1’s head which implements this
ability’
Pamjat’ as an ability
 xoroshaja pamjat’ na lica <na daty>
‘good memory for faces <dates>’
 zritel’naja <dvigatel’naja> pamjat’
‘visual <motor> memory’
 uxudshenie <poterja> pamjati
‘memory impairment <loss of memory>’
Pamjat’ as an organ
 otlozhit’sja v pamjati (lit. ‘be deposited in one’s
memory’) ‘stay in one’s memory’
 vyzyvat’ v pamjati (lit. ‘call force in one’s
memory’) ‘recall, remind’
 osvezhit’ v pamjati (‘refresh in one’s memory’)
 derzhat’ <xranit’> v pamjati (‘store in one’s
memory’)
 vyvetrit’sja iz pamjati (lit. ‘be weathered out of
one’s memory’) ‘be effaced from one’s memory’
Syncretic usage
Most of the cases:
 xoroshaja <blestjashchaja, cepkaja> pamjat’
‘good <brilliant, tenacious> memory’
 ploxaja <dyrjavaja, korotkaja> pamjat’ ‘bad
<sieve-like, short> memory’ (= kurinaja pamjat’
lit. ‘chicken’s memory’, devich’ja pamjat’
lit. ‘girl’s memory’)
 razvivat’ <uprazhnjat’> pamjat’ ‘to develop
to exercise> one’s memory’
Ability/organ disjunction
Quite regular in Russian:
 um ‘mind/intellect’
 sovest’ ‘conscience/Gewissen’
 voobrazhenie ‘imagination’
 dusha ‘soul’
 serdce ‘heart’
…
Polysemy of pamjat’
 1.1 ‘ability or organ’: good memory
 1.2 ‘memory of a device’: computer memory
 1.3 ‘memory of a substance’: memory effect
 2.1 ‘reminiscence’: memory of a tragedy
 2.2 ‘image of a late person’: dedicate to memory
 3 ‘a reminding object’: serviz kak pamjat’ o dede ‘dinner
set as a memory of the grandfather’
 4 obsolete ‘conscience’: lezhat’ bez pamjati ‘lay
unconscious’
Polysemy of zabyt’
 1 ‘cease to remember’: Ja zabyl tvoj telefon
‘I forgot your phone number’
 2 ‘leave elsewhere’: Ja zabyl zontik doma
‘I forgot my umbrella at home’
 3.1 ‘lose contacts’: Ne zabyvaj staryx druzej
‘Do not forget you old friends’
 3.2 ‘abandon’: On sovsem zabyl skripku
‘He entirely gave up playing violin’
Semantic invariant for zabyt’
 ‘shift from the center towards periphery’
 1: information present in the subject’s memory was not
there in his mind when needed, or it shifted to some
inaccessible domain
 2: information about the need to take or transfer an
object was not there in the subject’s mind when
needed
 3: a person (4: an occupation) shifted from the
subject’s center of interest
A model of forgetting
 Pamjat’ stores things that can be recalled
(vspomnit’) in the mind when needed, and the
forgotten information goes elsewhere in our head
 *Ja xranju ego imja v pamjati, no ne mogu
vspomnit’ ‘I keep his name in my memory but I
cannot remember it’
 vertet’sja v golove ‘run through one’s head’
 Ja vse zabyla… u menja pereputalos’ v golove
‘I forgot everything… it got mixed up in my head’
A model of memory
 pamjat’ ‘memory’: the ability to store and recall
information and the invisible organ in the head,
close to soznanie ‘conscience’ and um ‘mind’
 zapomnit’: intentionally or occasionally perceving
information, put it into pamjat’, where it can be
retrieved and put into soznanie
 pomnit’: zapomnit’ and be able to recall
 zabyt’: zapomnit’ , but later be unable to recall
 vspomnit’: retrieve from pamjat’ and recall
Next lecture
 Case studies. Lexicographic treatment of
artifacts. Everyday life vocabulary.
Download