Targeted Help - Computational Linguistics and Phonetics

advertisement
TARGETED HELP FOR SPOKEN
DIALOGUE SYSTEM
SREEDHAR ELLISETTY
TARGETED HELP FOR SPOKEN DIALOGUE SYSTEM :
INTELLIGENT FEEDBACK IMPROVES NAIVE USERS’ PERFORMANCE
MAIN ISSUES
» What is TARGETED HELP?
» How it provides Feedback?
» How Target help Module Works?
» How SLM Recognizer makes system Robust?
» Experimental Evidence
» How some ITSs providing Feedback?
INTRODUCTION - 1
» Targeted Help: Gives immediate Intelligent Feedback to Naïve
users’ about their Out-of-Coverage spoken Utterances.
» Goal: Producing Intelligent Feedback to the Naive users’ and
guiding users Towards In-Coverage Utterances.
» Key words
- Expert Users
- Naive Users
- In-Coverage Utterance
- Out-of-Coverage Utterance
- Grammar based Recognizer
- SLM Recognizer
INTRODUCTION - 2
» Expert users know the limitations and capabilities of the system
» Grammar-based recognizers tuned to a domain can handle well,
the in-coverage Utterances.
» Naive users don’t know the Grammar of the System and they
produces Out-of-Coverage Utterances.
» Grammar based Recognizers rejects and produces messages like
“Sorry, I didn't Understand”
» Statistical Language Model (SLM) Recognizer handles Out-of-Coverage
Utterance.
INTRODUCTION - 3
» SLM Recognizer produces a recognition hypothesis, which is
used by the Targeted Help agent to give the users feedback
» Feedback message consists of Diagnostics of the user Utterance
and In-Coverage examples.
» In-Coverage example uses In-coverage words in the users’
Utterances and the Same Dialogue move
» Encourages users’ to align the users Utterance with the Language
model of the System
» Makes users expert in quick time, improves performance,
Reduces time.
Grammar – Based Recognizer
» Tuned to a domain performs well for all In-Coverage utterances.
» Example: Grammar of FAIRY TALE READER
» Grammar can be written quickly
Drawbacks:
- Puts constraints to the user on the usage of the system
- Cant’ handle Out-of-Coverage utterance
- Takes lot of time for simple sentences.
» Defining a Grammar is to predict future user utterance as good as possible
by observing the real users in the desired application
» Problem in Defining appropriate Grammar
- It should cover as much as possible
- It shouldn’t contain unnecessary words, which leads to recognition errors
> It should satisfy these issues: Precise , Coverage.
SLM RECOGNIZER
» Also called Secondary, Fall-back, Category-based statistical Language model
Recognizer.
» Handles all the Out-of-Coverage Utterances well
» Produces a recognition hypothesis, which is used by the targeted help agent to
Give Feedback
» Problems with SLM :
- Needs to be trained to perform efficiently
- Collecting a large enough corpus to create an SLM is time consuming
and expensive
- A separate parser must be implemented to extract semantic content.
Advantages
- Don’t put constraints to the users’
System Description
» Targeted Help was developed and tested as part of the WITAS dialogue
system
» WITAS: A command and control and mixed-initiative dialogue system for
interacting with a simulated robotic helicopter or UAV
» Interaction in WITAS involves joint-activities between an autonomous
system and human operator .
» Interactions with such a system are not scriptable in advance ,
and rely on mixed task and dialogue initiatives in conversation.
» So, the system is a good test-bed for Targeted Help
THE TARGETED HELP MODULE
» Is a separate component that can be added to an existing dialogue system
with minimal changes .
» Module design makes it quite portable
» Goal is to handle Utterance that cant’ be processed by the Main Dialogue system
» Aligns the users’ input with the coverage of the system as much as possible
PARTS IN THE MODULE:
- The secondary Recognizer
- The Targeted Help Activator
- The Targeted Help Agent
ARCHITECTURE OF DIALOGUE SYSTEM WITH TARGETED HELP MODULE
Parser
Dialogue Manager
MAIN DIALOGUE SYSTEM
Primary speech
recognizer
Speech Synthesizer
Speech in
Speech
out
Secondary speech
recognizer
Targeted help
activator
Targeted help agent
TARGETED HELP MODULE
Normal response path
Targeted help response path
Targeted help response path (if secondary SR result parses)
THE TARGETED HELP ACTIVATOR
» The Activator’s behavior is as follows for the four possible combinations of
Recognizer outcomes -
1. Both Recognizers get a recognition hypothesis
2. Main recognizer gets a recognition hypothesis and secondary recognizer
rejects
3. Main recognizer rejects, secondary recognizer gets a recognition hypothesis
and it can be Parsed (rare)
4. Main recognizer rejects, secondary recognizer gets a recognition hypothesis
and it cant’ be Parsed
5. Both recognizers reject.
BOTH RECOGNISERS GET A RECOGNITION HYPOTHESIS
Parser
Dialogue Manager
MAIN DIALOGUE SYSTEM
Primary speech
recognizer
Speech Synthesizer
Speech in
Speech
out
Secondary speech
recognizer
Targeted help
activator
Targeted help agent
TARGETED HELP MODULE
Normal response path
Targeted help response path
Targeted help response path (if secondary SR result parses)
MAIN RECOGNISER ACCEPTS, SECONDARY RECOGNISER REJECTS
Parser
Dialogue Manager
MAIN DIALOGUE SYSTEM
Primary speech
recognizer
Speech Synthesizer
Speech in
Speech
out
Secondary speech
recognizer
Targeted help
activator
Targeted help agent
TARGETED HELP MODULE
Normal response path
Targeted help response path
Targeted help response path (if secondary SR result parses)
MAIN RECOGNISER REJECTS, SECONDARY RECOGNISER HYPOTHESIS
CAN BE PARSED
Parser
Dialogue Manager
MAIN DIALOGUE SYSTEM
Primary speech
recognizer
Speech Synthesizer
Speech in
Speech
out
Secondary speech
recognizer
Targeted help
activator
Targeted help agent
TARGETED HELP MODULE
Normal response path
Targeted help response path
Targeted help response path (if secondary SR result parses)
MAIN RECOGNISER REJECTS, SECONDARY RECOGNISER
HYPOTHESIS CANNOT BE PARSED
Parser
Dialogue Manager
MAIN DIALOGUE SYSTEM
Primary speech
recognizer
Speech Synthesizer
Speech in
Speech
out
Secondary speech
recognizer
Targeted help
activator
Targeted help agent
TARGETED HELP MODULE
Normal response path
Targeted help response path
Targeted help response path (if secondary SR result parses)
FEEDBACK MESSAGE
» Target Help Agent constructs a message based on SLM recognizer hypothesis
» Spots the In-Coverage words in the SLM recognizer hypothesis and Dialoguemove type.
» Feedback Message consists:
•What the System Heard:
- A Report of the Backup SLM recognition hypothesis
•What the Problem was
- Problem with the users’ Utterance
•What you might say instead
- A Similar In-Coverage example.
PROBLEM DIAGNOSTIC
» Diagnostics are of three major types which accounts for major failed utterances
1. End pointing Errors:
Eg.
User says
: Search for the red car
System heard
: For the red car
Category
: Push-to-talk button too late
2. Unknown Vocabulary :
Eg: Fly over to the Hospital.
“The system doesn't understand the word over” vocabulary which is not the
Main recognizer”.
3. Sub categorization Mistakes:
Saying “Zoom in on the Red car” but system knows only intransitive “Zoom in”.
Zooming in on a particular object is not part of the functionality of the system
In-Coverage Example
• Helps Naive users learn the coverage of the system.
• Takes in-coverage words produced by the user .
• Helps the user to imitate in-coverage pattern .
• Leads to linguistic alignment between the user and the
system.
• It always matches the utterance dialogue-move type.
• Eg: Wh-Question, Yes/No-Question, Command,etc.
• Users are likely to produce atleast few in-coverage words.
• Eg :
User
: Fly over to the Hospital.
In-Coverage Example : Fly to the Hospital.
Constructing In-Coverage Example
•
In what sense they should be similar?
•
It takes into account information from a wide variety of sources,
including discourse history, user model and pedagogical strategy.
•
Constraints on not only what words should be included in the
example, but also what syntactic structures, semantic
representations, and word order.
•
Definition of generation task is: Given a grammar G, and a set of
target words T , find a word string W=W1,…,Wn, W ε L(G) such that
T {W1,…….,Wn} .
•
Algorithms for generating In-Coverage Example:
1. Iterative-Deepening (ID) Algorithm.
2. A* Algorithm
3. A combined A*-ID Algorithm.
4. Anytime Algorithm.
Example session with targeted help
version of the Home Control System
User
System
User
System
: Hello?
: Hello, How can I help ?
: Um, Can I have some lights on ?
: Sorry, I didn’t quite catch that. To turn a device on or off, you
could try something like “turn on the kitchen light”.
User
: Turn on the kitchen light .
System : The kitchen light is now on.
User
: What other sort of things are there ?
System : I didn’t quite catch that . To find out what devices there are you
could try something like “ what devices are there” , or “What
devices are there in the lounge”.
User
: What devices are there in the lounge ?
System : There are three things in the lounge; the computer, the VCR and
the television.
DESIGN OF EXPERIMENTS
• To assess the effectiveness of the targeted help of this system.
• Compared the performance of 2 groups of 20 users , One that
received help and one that didn’t.
• Provided minimal written instruction on how to use the system.
• Task is to direct a helicopter within a city to complete various tasks.
• Given task is ended when one of the following is met.
- The task was accurately completed.
- The user thinks he completed the task, though it wasn’t.
- The user gave up.
• The system produced feedback to group in help group.
• They concluded from the experiment that , users achieved
-> HIHER TASK COMPLETION .
-> REDUCED TIME.
EXPERIMENTAL RESULTS
•
Found clear evidence that targeted help improves performance in
this environment, by showing statistical analysis of the experiment.
•
Users who received help were less likely to give up than those who didn’t
received help.
- During first task 11% Vs 45%
•
Users who received help took less time to complete tasks than those who
didn’t , 290.4s Vs 440.6s.
•
Users who didn’t receive help took 67.0% longer to complete than those
who received.
•
Finally, 89% users in the help group and 40% of the users in the No Help
group accurately completed the task.
•
Conclusion: It is possible to construct effective Targeted help messages even
from fairly low quality secondary recognizers.
•
Such an approach can improve the speed of the training for Naive users and
may result in lasting improvements in the quality of their understanding.
My Suggestions
• It is always better to provide instructions to users on how to use the
system and some in-coverage examples.
• If possible system should give mock session to the user and evaluate
the users level of expertise.
• This leads to better interaction between the user and the system.
• Giving the system a way to adjust its help strategy to the level of
expertise of the user is to be accomplished.
• We should use both recognizers simultaneously and select the one
which gives higher confidence value.
• Adding multimodal Input/Output and Animated Agents along with
this approach in ITSs is very useful.
feedback of geometry tutor
• The cognitive tutor component determines what feedback to give to
the student , based on the classification of the explanation.
• NLU component uses a knowledge-based approach to recognize
sentences as correct or partially correct explanations.
• It uses a Statistical Text Classifier (STC) when the knowledge-based
method fails which acts as a Backup. This is based on the Naive
Bayes classification method.
• System falls back on the STC to determine whether the student’s
explanation is focusing on the right geometry rule.
• The tutor say “You appear to be focusing on the right geometry rule.
However , the tutor doesn’t understand your explanation, Could you
please state it in a different way”.
• Thus, the STC enables the tutor to provide more informative
feedback in response to unexpected input.
feedback of geometry tutor: Example
• Explanation of the Triangle sum theorem
Student : they are 180.
Tutor
EQUAL-180
: Could you be more specific? Who are “they”? And what tells
you that they are 180.
Student : the angles are 180.
Tutor
ANGLES-180
: You are heading in the right direction, but you need to state
this rule more precisely. First where does the 180 come
from? Are all angles 180?
Student : the angles in a triangle are 180.
ANGLES-OF TRIANGLE-180
Tutor
: You are awfully close . But is each angle measure equal to
180? Precisely what is 180?
Student
: the sum of angles in a triangle are 180.
Tutor
: Correct.
TRIANGLE-SUM
SUMMARY
• Seen experimental evidence that providing Naive users an
immediate help message improves their performance and achieves,
- HIHER TASK COMPLETION .
- REDUCED TIME
• Adding SLM Recognizer to the Dialogue system makes system
robust.
• Targeted help makes Naïve users expertise in the usage of the
system in a quick time.
• In-Coverage example leads to greater linguistic alignment between
the user and the system.
• This approach is easy to add to an existing system with little
modifications.
• Seen how this approach is useful in some ITSs .
references
1.
2.
3.
4.
5.
6.
Generating Canonical examples using candidate words. Dowding,
G.Aist , B.A. Hockey, and E.O. Bratt. 2002.
G.Gorell, I.Lewin, and M.Rayner. 2002. Adding intelligent help to
mixed-initiative spoken dialogue systems.
B.A. Hockey, G.Aist, J.Dowding, and J.Hieronymus. 2002a.
Targeted help and dialogue about plans.
B.A. Hockey, G.Aist, J.Dowding, J.Hieronymus, and O. Lemon .
2002b. Targeted Help: Embedded training and methods for
evaluation.
B.A. Hockey, G.Aist, J.Dowding, J.Hieronymus, O. Lemon, E.
Campana, L. Hiatt, A. Gruenstein. Targeted Help for spoken
Dialogue Systems: Intelligent feedback improves Naïve users
performance.
V. Aleven, O. Popescu, and K.R. Koedinger. Towards Tutorial
Dialog to support Self Explanation : Adding Natural Language
Understanding to a Cognitive Tutor.
THANK YOU !
ANY QUESTIONS?
Download