Progress in the FAME Project - Computer Vision for Human

advertisement
The FAME Project
• Acronym: Facilitating Agent for Multicultural Exchange
• Partners:
Universität Karlsruhe,
UJF Grenoble,
UPC Bacelona,
ATLAS Barcelona
• Project volume:
• Duration:
• More info:
INPG Grenoble,
ITC-irst Trento,
SONY Europe, Stuttgart
5.5 M Euro
40 months, started October 2001
http://www.fame-project.org
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
1
The FAME Projekt
Facilitating Agent for Multicultural Exchange
• Volume:
• Duration: 40 months (since October 2001)
• Financial Volume:  5,5 Mio. €
• currently approx. 30 scientists
www.fame-project.org
• Partners:
Uni Karlsruhe
UPC Barcelona
, INPG Grenoble
, ITC-irst Trento
Stuttgart,
, UJF Grenoble,
,
Barcelona
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
2
Project Goals
• long term vision: facilitate communication between humans
• reduce the workload on the users of technical equipment
• observe humans and their activities in an intelligent room
and serve as a context-aware information butler
• FAME project goal:
provide and integrate core technologies (video and speech
perception, augmented reality, translation, information retrieval)
to show feasibility of the concept
• demonstrate system at fair
• scenario 1 (lecture scenario):
one person is giving a talk or lecture or presentation
• scenario 2 (meeting scenario):
several people are discussing / working on a common task
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
3
The FAME Showcases
scenario 1 (presentation)
• use A/V equipment
• intelligent cameraman
• presentation tracking
• summarisation + archiving
• translation, crosslingual IR
scenario 2 (meeting)
• augmented reality
• video-based activity tracking
• topic spotting
• information butler
• service: planning of fair visit
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
4
The FAME Demonstrator
(at Barcelona Fair „Forum of Cultures“ 2004)
FAME outside view
meeting inside
people mention topics
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
reception by FAME-guy
room gives information
about spotted topics
5
The FAME Demonstrator
(at Barcelona Fair „Forum of Cultures“ 2004)
at the phicon wall
gestures
multimodal input on table
interactive visit planning
output also on the wall
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
the projection table
borrow a camera for
photographs of the visit
6
The FAME Demonstrator
(at Barcelona Fair „Forum of Cultures“ 2004)
Back from the visit in the
FAME room: dowload ...
record testimony
... and look at photos
select, print, save photos
using phicon interaction
intelligent cameraman,
presentation tracker
take home photos and
information about FAME
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
7
Important Components
• multimodal environment
• context-aware intelligent camera-man
automatically track people and their activities
• augmented reality environment
move physical icons (phicons) on table/wall,
and interact with projection on table/wall
• spontaneous speech recognition (with distant microphones)
• translation and crosslingual information retrieval
in European-English, Catalan, and Spanish
• dialog and context model
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
8
Multimodal Environment at UKA
Smartboard as
Projection Wall
Microphon-Array
(Speaker Lokalization)
Livingroomg
Audio Signals
IR-Remote Control
X-10
Illumination
Loudspeakers
Microphones
Several Beamers
4 Cameras
TV/Video
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
9
Augmented Reality Table
• project virtual reality on real table
• move around physical icons (multiple users)
• interact with projection
• select, move, rotate, resize, delete, change color
• write on table, pass notes to others, point to items
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
10
Intelligent Camera Man
• follow speaker while
talking and moving around
• detect interaction from audience
• zoom on area of interest
e.g. when pointing somewhere or showing something
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
11
Lecture
Supporter
• track lecture or presentation
• operate FAME room equipment
by speech commands
• automatically switch slides during presentation
• automatically create transcript of lecture
• create summary, translate to other languages
• record and store all lectures in searchable database
• retrieve and browse through previously recorded lectures
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
12
estimate
72 classes
P(Cn|wn-1,wn-2)
· P(wn|Cn)
add to classes
Adaptation Overview
most frequent
40k words
least frequent
20k words
20% fewer
errors
60k vocabulary
HUB-4 corpus
trigrammmodel
P(wn|wn-1,wn-2)
tf-idf
wichtige
wichtige
wichtige
important
Wörter
Wörter
Wörter
words
presentationslides
±2 contexts
100
Links
100
Links
100
links
scores
top n
CLASS 32
CLASS 14
CLASS 57
CLASS 6
CLASS 70
perplexity
TO THE RECOGNITION OF THE
CONTINUOUS SPEECH RECOGNITION IN NOISY
AND PATTERN RECOGNITION NOT IN
SECURING DUE RECOGNITION AND RESPECT
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
13
Welcome in Barcelona
in Summer 2004
Seminar „Multimodale Räume“
Uni Karlsruhe, 14.5.2003
14
Download