Models of
Discourse Analysis
Carolyn Penstein Rosé
Language Technologies Institute/
Human-Computer Interaction Institute
What are the computational
implications of the debate
between DA and CA?
Chicken and Egg…
Main issue for this week:
Exploring sequencing and
linking between speech acts in
* Where do the ordering constraints come from? Is it the language? Or is it what is behind the language
(e.g., intentions, task structure)? If the latter, how do we computationalize that?
Reminder from last time RE
Constraint from Ordering
Inform is the most common class
With bigrams, if we look for conditional
probabilities above 25%
Next most frequent is Assess (18.5%)
The only case where the most likely next class is not Inform is ElicitAssessment, which is followed by Assessment 36% of the time
ElicitAssessment is followed by Inform 33% of the time
ElicitAssessment itself only occurs about 1% of the time
Trigrams might be better, but this
makes ordering information look pretty
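To make the bigram check concrete, here is a minimal sketch of how conditional probabilities between adjacent speech act labels could be estimated and filtered at the 25% threshold. The label sequence `toy_dialogue` and the function names are hypothetical illustrations, not the corpus or code behind the figures on this slide, so the numbers it prints will not match the ones reported here.

```python
from collections import Counter, defaultdict


def bigram_conditional_probs(labels):
    """Estimate P(next act | current act) from one sequence of act labels."""
    # Count each adjacent (current, next) pair.
    pair_counts = Counter(zip(labels, labels[1:]))
    # Count how often each label appears as the "current" act,
    # i.e. everywhere except the final position.
    context_counts = Counter(labels[:-1])
    probs = defaultdict(dict)
    for (cur, nxt), n in pair_counts.items():
        probs[cur][nxt] = n / context_counts[cur]
    return probs


def strong_transitions(probs, threshold=0.25):
    """Return (current, next, probability) triples above the threshold."""
    return sorted(
        (cur, nxt, p)
        for cur, nexts in probs.items()
        for nxt, p in nexts.items()
        if p > threshold
    )


# Hypothetical toy sequence, invented purely for illustration.
toy_dialogue = [
    "Inform", "Inform", "ElicitAssessment", "Assess", "Inform",
    "Assess", "Inform", "Inform", "ElicitAssessment", "Inform",
]

probs = bigram_conditional_probs(toy_dialogue)
strong = strong_transitions(probs)

for cur, nxt, p in strong:
    print(f"P({nxt} | {cur}) = {p:.2f}")
```

The same counting extends to trigrams by conditioning on the previous two labels instead of one, at the cost of much sparser counts for rare classes such as ElicitAssessment.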
Interesting Observation!
Responses can address either illocutions or perlocutions
Perlocutions are much less constrained
Accounts for some of the difficulty in imposing ordering
Argues in favor of thinking about conversation as
organized around intentions and tasks rather than
linguistic categories
Wednesday’s readings will argue just the opposite!!
Are illocutions just the wrong categories??
Discourse Analysis vs Conversation Analysis
(according to Levinson)
* Rules, formulas, more typical of linguistics and
* Categories, contingencies, grammars
* Use of a small but strategic amount of data
* Accused of “premature” theory construction
* Martin & Rose, Levinson
* More rigorously empirical and inductive
* Focus on what is found in data, not on what is expected to be found or would sound odd
* Hesitant to make generalizations / Accused of being atheoretical
* Questions about whether the rules “work” on real
* Is it a question about the nature of language (is there a fundamental segmentation
difference between utterances and acts?), or is it a question about research
methodology? Are these linked?
The nature of what we are modeling
What we can know about it and how certain we can be
How we learn what we know
Rules, like speech
anthropology style
