Collaborative Annotation of the AMI Meeting Corpus Jean Carletta University of Edinburgh

advertisement
Collaborative Annotation of the
www.amiproject.org
AMI Meeting Corpus
Jean Carletta
University of Edinburgh
www.amiproject.org
AMI Partners
Carletta
20 June 2007
2
www.amiproject.org
NXT Major Development
Sites
Carletta
20 June 2007
3
www.amiproject.org
AMI's aim
• aim: to develop technologies for
browsing meetings and to assist people
during meetings
• interdisciplinary: signal processing,
language engineering, theoretical
linguistics, human-computer interfaces,
organizational psychology, ...
Carletta
20 June 2007
4
Why annotation?
www.amiproject.org
• For basic scientific understanding - e.g.,
• How do people choose a next speaker?
• What is the relationship between speech
and gesture during deixis?
• For machine learning
• Hand-code e.g. statement vs. question
• Identify features for each like word
sequences and prosody
• Use the data to fit a statistical classifier that
codes new data automatically
Carletta
20 June 2007
5
www.amiproject.org
Carletta
20 June 2007
6
www.amiproject.org
Carletta
20 June 2007
7
www.amiproject.org
AMI Meeting Rooms
4 close- and 2 wide-view cameras, 4 head-set
and 8 array microphones, presentation screen
capture, whiteboard capture, pen devices, plus
extra site-dependent devices
TNO
Edinburgh
Carletta
IDIAP
20 June 2007
8
www.amiproject.org
IS1004d, 3:07 - 4:11
Carletta
20 June 2007
9
www.amiproject.org
Corpus Overview
• 100 hrs of well-recorded meetings
• orthographically transcribed with
word timings by forced alignment
• ASR output
• heavily annotated by hand for
communicative behaviours
• Creative Commons Share-Alike
licensing, with demo DVD
Carletta
20 June 2007
10
www.amiproject.org
Hand Annotations
• transcription with word-level timings
from forced alignment (100%)
• timestamping against signal (10-30%)
• head gestures; hand gestures for
addressing and interactions with objects;
location in room; gaze; emotion?
• discourse structure (70%)
• dialogue acts (some w/ addressing), named
entities, topic segments, linked extractive
and abstractive summaries
Carletta
20 June 2007
11
www.amiproject.org
Costs in person-hrs/hr
transcription
topic segments + abstractive summaries
dialogue acts w/ some relations
addressing
extractive summaries linked to abstract
named entities
hand gestures (rough timings)
head gestures (rough timings)
head gestures (precision timings)
movement around room
Carletta
20 June 2007
30
6-10
20
12
1
2-5
6
6
20
4
12
www.amiproject.org
Core Problems
• How do we represent all of these
kinds of annotation on the same
base data, including both structural
relationships and timing?
• How do we allow for multiple
(human and machine) annotations
of the same property, so that we
can compare them?
Carletta
20 June 2007
13
www.amiproject.org
Carletta
20 June 2007
14
www.amiproject.org
Carletta
20 June 2007
15
NITE XML Toolkit
www.amiproject.org
•
•
Mature toolkit for handling annotations with
temporal ordering and full structural relations
Data storage format designed to support
distributed corpus development
• Libraries for data handling, query, and writing
graphical user interfaces
• End user annotation tools for common tasks
• Command line utilities for analysis, feature
extraction
•
Open source
Carletta
20 June 2007
16
www.amiproject.org
NXT corpus design
• data model is multi-rooted tree with arbitrary
graph structure over the top
• each node has one set of children, multiple parents
• annotations often naturally map to a tree
• corpus design to decide where trees intersect
• NXT can represent arbitrary graphs but the
more the data has this character, the less
useful the query language is
Carletta
20 June 2007
17
Stand-off XML
www.amiproject.org
extract from Bdb001.A.speech-quality.xml
<speechquality nite:id="Bdb001.emphasis.16" type="emphasis">
<nite:child href="Bdb001.A.words.xml#id(Bdb001.w.1,342)..id(Bdb001.w.1,344)" />
</speechquality>
extract from Bdb001.A.words.xml
<w nite:id="Bdb001.w.1,342" starttime="356.39" endtime="" c="W">time</w>
<w nite:id="Bdb001.w.1,343" starttime="" endtime="" c="HYPH">-</w>
<w nite:id="Bdb001.w.1,344" starttime="" endtime="356.59" c="W">line</w>
Carletta
20 June 2007
18
www.amiproject.org
Metadata file
Like set of DTDs for the XML files
plus:
• connections between the files
• list of "observations" (coded
dialogues/group discussions/texts)
• catalog for finding signals and data
on disk
Carletta
20 June 2007
19
www.amiproject.org
Simple example query
($w word)($r reference):
($w@POS = “NN”) && ($r ^ $w)
Return list of 2-tuples of words and
referring expressions where the word’s
part of speech is NN and the word is in
the referring expression.
Carletta
20 June 2007
20
General features of the
language
www.amiproject.org
• Match variable by no type, single type, or
disjunctive type
• Attribute and content tests for existence,
ordering, equality, match to regexp
• The usual boolean combinators
• Quantifiers forall and exists
• Filtering by passing results to another query
to create a result tree (not list)
Carletta
20 June 2007
21
www.amiproject.org
Uses for queries
•
•
•
•
•
Carletta
Exploring the data in a browser
Basic frequency counts
Verifying data quality
Indexing complexes for further use
Finding things for screen rendering
in GUI
20 June 2007
22
www.amiproject.org
Only configuration
needed to:
• search/index data in NXT format
• display data in a standardized
(ugly) way
• Set up annotation tools for some
common tasks
• dialogue act
• named entity
• time-stamped labelling
Carletta
20 June 2007
23
www.amiproject.org
• [named entity demo]
Carletta
20 June 2007
24
www.amiproject.org
Programming
tailored interfaces
• development time is 1.5 days - 2
weeks depending on
• how clear the spec is
• complexity of the interface and
whether our "transcription view"
middleware fits
• familiarity with Swing
Carletta
20 June 2007
25
www.amiproject.org
Named entity coder
Carletta
20 June 2007
26
www.amiproject.org
Carletta
20 June 2007
27
www.amiproject.org
Carletta
20 June 2007
28
www.amiproject.org
Carletta
20 June 2007
29
www.amiproject.org
Carletta
20 June 2007
30
www.amiproject.org
Carletta
20 June 2007
31
www.amiproject.org
Carletta
20 June 2007
32
www.amiproject.org
Carletta
20 June 2007
33
www.amiproject.org
Carletta
20 June 2007
34
www.amiproject.org
Summary
• NXT provides infrastructure for
collaborative annotation that
•
•
•
•
Is distributed
Provides structural relationships
Provides timing w.r.t signals
Works for large-scale projects
• NXT’s best current demonstration
is in the AMI Meeting Corpus
Carletta
20 June 2007
35
Download