Attention_awareness_version3

advertisement
Attention, Awareness, and the
Computational Theory of Surprise
Research Qualifying Exam
August 30th, 2006
Outline







Introduction
Background
Case Study
Problem Definition
Approach
Results & Work to Date
Future Work
2
Introduction: Intelligent Machines
Office
Defense
Exploration
Home
3
Introduction: Autonomy

Aspects of Autonomy
•
Sensing
•
•
Processing
•
•
Ability to sense the environment
Ability to make decisions about the
sensed environment
Mobility
•
Ability to move about the sensed
environment
4
Introduction

Where are we now?
•
Sensors
•
•
•
•
cheaper
more reliable
more accurate
Data Association
•
•
engineered solutions for specific problems in a given environment
few solutions for unforeseen problems in potentially changing
environments
5
Introduction

QUESTION: What principles from biological system can we borrow to handle
unforeseen problems in dynamic settings?
6
Background: Attention

Definition


1a : the act or state of attending especially through applying the mind to an
object of sense or thought
1b : a condition of readiness for such attention involving especially a selective
narrowing or focusing of consciousness and receptivity – (Merriam-Webster
Dictionary)
7
Background: Theories of Attention

Feature Integration Theory of Attention
•
Treisman (1980)
• “Features are registered early, automatically, and in parallel across the visual field,
while objects are identified separately and only at a later stage, which requires focused
attention.”
•

tested human subjects, measuring time response of visual attention to
cues on screens
Premotor Attention Theory
•
Rizzolatti (1987)
•
The idea of attention is directly linked with the same circuitry in
humans used in the generation of movements or planned movements
of all types.
8
Background: Theories of Attention
More from Joel …
9
Background: Saliency

Definition


3b: standing out conspicuously : PROMINENT especially : of notable
significance– (Merriam-Webster Dictionary)
“[a measure of] how different a given location is from its surround in
color, orientation, motion, depth, etc.” – (Koch & Ullman, 1985)
10
Background: Attention & Saliency

Koch & Ullman (1985)


first proposed the idea of a
“saliency map” drawing from
research in the neurobiology
field
define the “winner take all”
network approach and the
“inhibition of return”
11
Background: Attention & Saliency

Itti & Koch (2000)
•
Show the concept of a
“saliency map” works to shift
machine attention to most
“salient” area of visual scenes
as compared to human test
subjects
•
•
Apply the FeatureIntegration Theory of
Attention in building “feature
maps” in parallel
Attention is distributed in
decreasing order of saliency
12
Background: Attention & Saliency

Frintrop (2000)
•
show the concept of a
saliency map is not limited to
visual sensors but can also be
applied to range sensors
•
•
Used 3D laser range sensor
to extract a “range” image
and an “intensity” image
Further extracts orientation
and intensity feature maps
from each dimension and
fuses them to form a
saliency map
13
Background: Attention & Saliency

Questions:
•
•
Can we formulate these concepts of attention and awareness into
a mathematical framework?
Can we make some connection between attention and control
theory?
14
Case Study

Question:
Can these concepts of attention
and awareness be incorporated
into autonomous robots to
sense changes in a known
environment in the context of
mapping?
15
Case Study: SLAM

Dynamic Environments
•
Simultanoues localization and mapping (SLAM) in dynamic
environments has been the focus of recent research:
•
Fox et al. (1999)
•
•
•
Approach is to filter moving objects out and only apply SLAM to the static
environment map
Entropy Filter: very closely related to Baldi’s definition of surprise but uses
it only to remove data contributing to positive changes in entropy
Distance Filter: filters those sensor measurements with probability larger
than some threshold of being shorter than expected
16
Case Study: SLAM

Dynamic Environments
•
Wang et al. (2002-2003)
•
•
•
Filters the current map into a stationary object map (SO-map) and a
moving object map (MO-map) assuming that:
•
Measurements can be divided into stationary and moving
•
Measurements of moving objects and their pose carry no
information and can be filtered out
Detection of moving objects done by observing discrepancies between
scans
Derives a Bayesian framework for the SLAM with detection-and-tracking of
moving objects by building on Fox’s work
17
Case Study: SLAM

Static Environments
•
Original problem first posed by (Smith, Self, & Cheeseman, 1990),
to which the solution has been shown to exist:
•
Particle-Filter based approach (Thrun, et al., 1998)
•
•
Kalman-Filter based approach (Dissanayake, et al., 2001)
•
•
Presents a probabilistic framework for the SLAM problem without
assumptions of probability distributions being Gaussian; uses random
samples, weighted appropriately, to represent the desired posterior
density functions
Applies discrete Kalman Filter techniques to estimate landmark locations
and robot pose; shows that all landmark locations become fully correlated
and will converge to a lower bound covariance
Multi-robot Kalman-Filter based approach (Roumeliotis, 2002)
•
Shows that the centralized Kalman Filter estimator can be written in
decentralized form, allowing processing on distributed host machines
18
Case Study: SLAM

Problems with previous approaches
•
Dynamic environment SLAM
•
•
•

assumption that discrepancies in data are due to changes in the
environment
Fox et al. filter dynamic data out and focus only static areas of the
map; inherent assumption that dynamic data is uninformative
Wang et al. assumes that all measurements can be separated into
either a stationary object measurement or a moving object
measurement
Question
•
Is there a better framework for detecting dynamic changes in the
environment?
19
Case Study: Surprise

Pierre Baldi (2002)
•
Definition:
“... a complimentary way of measuring information carried by the data is to measure the
distance between the prior and the posterior. To distinguish it from Shannon's
communication information, we call this notion of information the surprise information or
'surprise'”
20
Case Study: Surprise

Surprise
•
•
Idea of “surprise” to be a measure of the difference
between what is expected of the data and what is actually
said by the data
An alternative to Shannon’s definition of “information”
21
Case Study: Surprise

Itti & Baldi (2005)
•
… more to add here… I haven’t read these papers yet
22
Surprise Example
time = {0,…,tk }
time = tk+1
23
Surprise Example
•
Obvious Question:
•
•
P(D) ? P(D|M) ?
Start by using a line based approach to
approximate the world and make the
assumption that the associated sensor noise
is Gaussian, i.e. :
24
Surprise Example
•
If we treat P(D|M) as the probability of the expected data given our
understanding of the model of the world from t = 0, … ,tk , then
P(D|M) becomes:
P(D|M)
25
Surprise Example
•
If we treat P(D) as the probability of the most recent data
measurement at time tk+1, then P(D) becomes:
P(D)
26
Surprise Example
•
Using Baldi’s equation, surprise yields the following result with the
most “surprising” part of the environment corresponding with what
was expected:
S(D,M)
27
Surprise

Properties
•
•
S(D,M) > 0 : new features in the environment previously not
accounted for in the model
S(D,M) < 0 : modeled features of the environment changed or
possibly no longer existing
28
Problem Definition
•
•
Can we formulate the concepts
of attention and awareness into
a mathematical framework
using Baldi’s definition of
surprise?
•
(e.g. a “surprise-saliency map” )
Can we extend Baldi’s definition
of surprise in such a way to
govern the controls/actions
taken by intelligent,
autonomous robots?
•
(e.g. feedback-control using the
“surprise-saliency map” )
29
Approach: Short Term

Dynamic Mapping
•
•
•
Apply Baldi’s definition of surprise to the problem of robot
localization and mapping in dynamic indoor and outdoor
environments
Develop a general probabilistic approach to calculating “surprise”
without assuming a known form of probability density functions
Formulate results into a “surprise-saliency” map where concepts
of attention and awareness taken from neurobiology can be
applied (e.g. inhibition of return, winner-take-all, top-down
approach, bottom-up approach, etc…)
30
Approach: Long Term
31
Work to Date: Testbed

Setup
•
•
•
•
Currently we have 4 fully functional ER1
Robots (Evolution Robotics), each equipped
with laser range finders and indoor-GPS units
The interface platform used between
hardware and client-codes is Player v1.6.5.
The robot simulator we use is a
complimentary interface to Player, known as
Stage v2.0.0
Peer-to-peer communication is made possible
over a wireless network via the
communication architecture known as
“Spread”
32
Work to Date: Testbed

Localization Methods
Wheel Odometry
Indoor GPS
Scanmatching
33
Work to Date: Testbed

Graphical User Interface
34
Future Work
35
Download