Theories of Vision: a swift overview 16-721: Learning-based Methods in Vision

advertisement
Theories of Vision: a swift overview
Most slides from Steve Palmer
16-721: Learning-based Methods in Vision
A. Efros, CMU, Spring 2007
Tale of Martians with an old PC
Categorization of Theories
“Why do things looks as they do?”
-- Kurt Koffka (1935)
Nativism vs. Empiricism
• “Because we were born (evolved) to see them that way” vs.
• “Because we have learned to see them that way”
Atomisn vs. Holism
• “because of the way each pixel appears” vs.
• “because of the way the entire scene appears”
Organism vs. Environment
• “Because we are the way we are” vs.
• “Because the world is the way it is”
Structuralism
THEORY
NATIVISM
vs.
EMPIRICISM
ATOMISM
vs.
HOLISM
ORGANISM
vs.
ENVIRONMENT
PRINCIPAL
ANALOGY
METHOD
Structuralism
Gestaltism
Ecological
Optics
Constructivism
© Stephen E. Palmer, 2002
Structuralism
Structuralism:
Perception results from the association
of basic sensory atoms in memory via
repeated, prior joint occurrences.
Derived from philosophy of
British Empiricists (e.g., Locke,
Berkeley, Hume, and Mills).
Proposed by Wilhelm Wundt,
the father of modern Psychology.
© Stephen E. Palmer, 2002
Structuralism
Sensory Atoms
Redness
at (x1,y1)
Yellowness
at (x2,y2)
Greenness
at (x3,y3)
Retinal mosaic
© Stephen E. Palmer, 2002
Structuralism
Perceptual Complexes
Retinal mosaic
© Stephen E. Palmer, 2002
Structuralism
Perceptual Complexes
Red apple
at (x0,y0)
Retinal mosaic
© Stephen E. Palmer, 2002
Structuralism
Chemical Analogy
Perceptions are made of basic sensory experiences
just as molecules are made of basic atoms.
© Stephen E. Palmer, 2002
Structuralism
THEORY
NATIVISM
vs.
EMPIRICISM
ATOMISM
vs.
HOLISM
ORGANISM
vs.
ENVIRONMENT
PRINCIPAL
ANALOGY
METHOD
Structuralism
Empiricism
Atomism
Organism
Chemistry
Trained
Introspection
Gestaltism
Ecological
Optics
Constructivism
© Stephen E. Palmer, 2002
Gestaltism
Gestaltism:
Perception results from the interaction
between the intrinsic structure of the stimulus
and the intrinsic structure of the brain.
Max
Wertheimer
Wolfgang
Köhler
Kurt
Koffka
© Stephen E. Palmer, 2002
Gestaltism
Principles of Gestalt Theory
Holism:
The whole is different from the sum of its parts.
Prägnanz:
The percept will be as “good” as the prevailing
conditions allow, I.e. simplest explanation
Nativism:
Not a total rejection of learning, but rejection of
Its primacy.
Gestaltism
Holism: The whole is different from the sum of its parts.
Emergent properties:
Features of a configuration
that are not features of
its components, e.g.:
• length
• orientation
• curvature
• closure
• connectedness
© Stephen E. Palmer, 2002
Gestaltism
Prägnanz: the percept will be as
“good” as the prevailing conditions allow
What is this?
?
square &
circle?
?
square &
pacman?
etc….
?
squigit &
pacman?
© Stephen E. Palmer, 2002
Gestaltism
Field Theoretic Analogy
Field of
a single
positive
charge
Field of
a pair of
positive
charges
Field of a
positive &
a negative
charge
Repulsion
Attraction
© Stephen E. Palmer, 2002
Gestaltism
THEORY
NATIVISM
vs.
EMPIRICISM
ATOMISM
vs.
HOLISM
Structuralism
Empiricism
Atomism
Gestaltism
Nativism
Holism
ORGANISM
vs.
ENVIRONMENT
PRINCIPAL
ANALOGY
METHOD
Organism
Chemistry
Trained
Introspection
Organism
EM Fields
Naive
Introspection
Ecological
Optics
Constructivism
© Stephen E. Palmer, 2002
Ecological Optics
Ecological Optics: Perception is the direct
apprehension of the visible environment by
extracting invariants in the dynamic ambient
optic array and the affordances of objects.
James J. Gibson: Ask not
what’s inside your head, but
what your head’s inside of!
(Ecological optics is a theory
of stimulus structure.)
© Stephen E. Palmer, 2002
Ecological Optics
Direct Perception: Perception is not mediated
by processes like unconscious thinking
Perception of the 3-D world is NOT ambiguous
for an actively exploring organism.
© Stephen E. Palmer, 2002
Ecological Optics
Texture Gradients are gradual changes in
the size & shape of texture elements in an
image when a plane recedes in depth.
Tile
floor
Wheat
field
Geometric
surface
© Stephen E. Palmer, 2002
Ecological Optics
Motion Gradients are gradual changes in
the speed and direction of elements in optic
flow within the dynamic ambient optic array.
Plane landing
Bird flying
© Stephen E. Palmer, 2002
Ecological Optics
THEORY
NATIVISM
vs.
EMPIRICISM
ATOMISM
vs.
HOLISM
Structuralism
Empiricism
Atomism
Gestaltism
Nativism
Ecological
Optics
Nativism
ORGANISM
vs.
ENVIRONMENT
PRINCIPAL
ANALOGY
METHOD
Organism
Chemistry
Trained
Introspection
Holism
Organism
Physical
Field Theory
Naive
Introspection
Holism
Environment
Mechanical
Resonance
Ecological
Analysis
Constructivism
© Stephen E. Palmer, 2002
Constructivism
Constructivism: Perception is the result of
unconscious inferences about the scene most
likely to have caused the retinal image or event.
Hermann von Helmholtz
originated the idea of
unconscious inference and
the likelihood principle.
© Stephen E. Palmer, 2002
Constructivism
Unconscious Inference: the process of recovering
environmental information by logically combining
retinal information with heuristic assumptions.
Tilted room illusion:
If you assume that the
walls and floor of the
room are vertical and
horizontal, then you
must be tilted —and you
feel that way!
© Stephen E. Palmer, 2002
Constructivism
Tilted room illusion
© Stephen E. Palmer, 2002
Constructivism
Prägnanz vs. Likelihood
What governs what we see: goodness or probability?
What is this?
?
square &
circle?
?
square &
pacman?
etc….
?
squigit &
pacman?
© Stephen E. Palmer, 2002
Constructivism
THEORY
NATIVISM
vs.
EMPIRICISM
ATOMISM
vs.
HOLISM
Structuralism
Empiricism
Atomism
Gestaltism
Nativism
(both)
ORGANISM
vs.
ENVIRONMENT
PRINCIPAL
ANALOGY
METHOD
Organism
Chemistry
Trained
Introspection
Holism
Organism
Physical
Field Theory
Naive
Introspection
(both)
(both)
Logical
Inference
Likelihood
Analysis
Ecological
Optics
Constructivism
© Stephen E. Palmer, 2002
Conclusions
So which theory is correct?
Probably none of them!
Or maybe all of them, to some degree!
It’s good to keep them in mind, when
designing your algorithms!
© Stephen E. Palmer, 2002
Four Stages of Visual Perception
Sound
Audition
LTM
Odor (etc.)
Light
Light
ImageBased
Processing
STM
Motor
Vision
SurfaceBased
Processing
ObjectBased
Processing
Movement
CategoryBased
Processing
Ceramic
cup on
a table
David Marr, 1982
© Stephen E. Palmer, 2002
Four Stages of Visual Perception
The Retinal Image
An Image
(blowup)
Receptor Output
© Stephen E. Palmer, 2002
Four Stages of Visual Perception
Retinal
Image
Image-based
Representation
Imagebased
processes
Edges
Lines
Blobs
etc.
An Image
(Line Drawing)
Primal Sketch
(Marr) © Stephen E. Palmer, 2002
Four Stages of Visual Perception
Image-based
Representation
Surface-based
Representation
Surfacebased
processes
Stereo
Shading
Motion
etc.
Primal Sketch
2.5-D Sketch
© Stephen E. Palmer, 2002
Koenderink’s trick
Four Stages of Visual Perception
Object-based
Representation
Surface-based
Representation
Objectbased
processes
Grouping
Parsing
Completion
etc.
2.5-D Sketch
Volumetric Sketch
© Stephen E. Palmer, 2002
Geons
(Biederman '87)
Four Stages of Visual Perception
Category-based
Representation
Object-based
Representation
Categorybased
processes
Category: cup
Color: light-gray
PatternRecognition
Size: 6”
Location: table
Spatialdescription
Volumetric Sketch
Basic-level Category
© Stephen E. Palmer, 2002
We likely throw away a lot
line drawings are universal
Physics
is not
However, things are not so
simple…
Problems with feed-forward model of
processing…
two-tone images
“attached
shadow”
contour
hair (not
shadow!)
“cast shadow”
contour
inferred
external
contours
Cavanagh's argument
A. Finding 3D structure in two-tone images requires
distinguishing cast shadows, attached shadows, and areas of
low reflectivity
B. The images do not contain this information a priori
D. Therefore, our recognition process must involve matching
viewpoint-specific 2D templates
Feedforward vs. feedback models
Marr's model (circa 1980)
object
recognition by
matching 3D
models
Object
3D model
Feedback Model
basic
recognition
with 2D
primitives
2½D sketch
primal sketch
stimulus
memory
3D shape
2D shape
reconstruction of
shape from image
features
stimulus
feedback
Download