Physiology of Vision: a swift overview 16-721: Advanced Machine Perception

advertisement

Physiology of Vision: a swift overview

Some figures from Steve Palmer

16-721: Advanced Machine Perception

A. Efros, CMU, Spring 2006

Class Introductions

• Name:

• Research area / project / advisor

• What you want to learn in this class?

• When I am not working, I ______________

• Favorite fruit:

Class mailing list…

Image Formation

Film

Digital Camera

The Eye

Monocular Visual Field: 160 deg (w) X 135 deg (h)

Binocular Visual Field: 200 deg (w) X 135 deg (h)

What do we see?

3D world 2D image

Point of observation

Figures © Stephen E. Palmer, 2002

What do we see?

3D world 2D image

Point of observation

Painted backdrop

The Plenoptic Function

Figure by Leonard McMillan

Q: What is the set of all things that we can ever see?

A: The Plenoptic Function (Adelson & Bergen)

Let’s start with a stationary person and try to parameterize everything that he can see…

Grayscale snapshot

P( q,f

) is intensity of light

• Seen from a single view point

• At a single time

• Averaged over the wavelengths of the visible spectrum

(can also do P(x,y), but spherical coordinate are nicer)

Color snapshot

P( q,f,l

) is intensity of light

• Seen from a single view point

• At a single time

• As a function of wavelength

Spherical Panorama

See also: 2003 New Years Eve http://www.panoramas.dk/fullscreen3/f1.html

All light rays through a point form a ponorama

Totally captured in a 2D array -P( q,f

)

Where is the geometry???

A movie

P( q,f,l

,t) is intensity of light

• Seen from a single view point

• Over time

• As a function of wavelength

Space-time images

t y x

Holographic movie

P( q,f,l

,t,V

X

,V

Y

,V

Z

) is intensity of light

• Seen from ANY viewpoint

• Over time

• As a function of wavelength

The Plenoptic Function

P( q,f,l

,t,V

X

,V

Y

,V

Z

)

• Can reconstruct every possible view, at every moment, from every position, at every wavelength

• Contains every photograph, every movie, everything that anyone has ever seen! it completely captures our visual reality! Not bad for a function…

The Eye is a camera

The human eye is a camera!

• Iris colored annulus with radial muscles

• Pupil the hole (aperture) whose size is controlled by the iris

• What’s the “film”?

– photoreceptor cells (rods and cones) in the retina

The Retina

Cross-section of eye Cross section of retina

Pigmented epithelium

Ganglion axons

Ganglion cell layer

Bipolar cell layer

Receptor layer

Retina up-close

Light

Two types of light-sensitive receptors

C on es cone-shaped less sensitive operate in high light color vision

Rods rod-shaped highly sensitive operate at night gray-scale vision

© Stephen E. Palmer, 2002

Rod / Cone sensitivity

The famous sockmatching problem…

Distribution of Rods and Cones

Fovea

Blind

Spot

150,000

100,000

Rods

Rods

50,000

Cones Cones

0

80 60 40 20 0 20 40 60 80

Visual Angle (degrees from fovea)

Night Sky: why are there more stars off-center?

© Stephen E. Palmer, 2002

Electromagnetic Spectrum

Human Luminance Sensitivity Function http://www.yorku.ca/eye/photopik.htm

Visible Light

Why do we see light of these wavelengths?

…because that’s where the

Sun radiates EM energy

© Stephen E. Palmer, 2002

The Physics of Light

Any patch of light can be completely described physically by its spectrum: the number of photons

(per time unit) at each wavelength 400 - 700 nm.

# Photons

(per ms.)

400 500 600 700

Wavelength (nm.)

© Stephen E. Palmer, 2002

The Physics of Light

Some examples of the spectra of light sources

A. Ruby Laser B. Gallium Phosphide Crystal

400 500 600 700

Wavelength (nm.)

C. Tungsten Lightbulb

400 500 600 700

Wavelength (nm.)

D. Normal Daylight

400 500 600 700 400 500 600 700

© Stephen E. Palmer, 2002

The Physics of Light

Some examples of the reflectance spectra of surfaces

Red

Yellow Blue Purple

400 700 400 700 400 700 400 700

Wavelength (nm)

© Stephen E. Palmer, 2002

The Psychophysical Correspondence

There is no simple functional description for the perceived color of all lights under all viewing conditions, but …...

A helpful constraint:

Consider only physical spectra with normal distributions mean

# Photons

400 area variance

500 600

Wavelength (nm.)

700

© Stephen E. Palmer, 2002

The Psychophysical Correspondence

Mean Hue blue green yellow

Wavelength

© Stephen E. Palmer, 2002

The Psychophysical Correspondence

Variance Saturation hi.

high med.

medium low low

Wavelength

© Stephen E. Palmer, 2002

The Psychophysical Correspondence

Area Brightness

B. Area Lightness bright dark

Wavelength

© Stephen E. Palmer, 2002

Physiology of Color Vision

Three kinds of cones:

440 530 560 nm.

100

S M L

50

400 450 500 550 600 650

WAVELENGTH (nm.)

• Why are M and L cones so close?

© Stephen E. Palmer, 2002

Retinal Processing

© Stephen E. Palmer, 2002

Single Cell Recording

Microelectrode

Amplifier

Electrical response

(action potentials) mV

Time

© Stephen E. Palmer, 2002

Single Cell Recording

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in ganglion cells :

On-center Off-surround

Response

Time

Stimulus condition Electrical response

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in ganglion cells :

On-center Off-surround

Response

Time

Stimulus condition Electrical response

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in ganglion cells :

On-center Off-surround

Response

Time

Stimulus condition Electrical response

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in ganglion cells :

On-center Off-surround

Response

Time

Stimulus condition Electrical response

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in ganglion cells :

On-center Off-surround

Response

Time

Stimulus condition Electrical response

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in ganglion cells :

On-center Off-surround

Response

Time

Stimulus condition Electrical response

© Stephen E. Palmer, 2002

Retinal Receptive Fields

RF of On-center Off-surround cells

Neural Response

Center

Receptive Field

Firing

Rate

Response Profile on-center

Surround off-surround

Horizontal Position

On Off

© Stephen E. Palmer, 2002

Retinal Receptive Fields

RF of Off-center On-surround cells

Neural Response Receptive Field Response Profile

Firing

Rate on-surround

On Off off-center

Horizontal Position

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Retinal Receptive Fields

Receptive field structure in bipolar cells

Light

© Stephen E. Palmer, 2002

Retinal Receptive Fields

Receptive field structure in bipolar cells

LIGHT

Receptors

Direct excitatory

component (D)

Indirect inhibitory component (I)

Horizontal

Cells

Direct Path

Bipolar Cell

Indirect Path

A. WIRING DIAGRAM

D + I

B. RECEPTIVE FIELD PROFILES

© Stephen E. Palmer, 2002

Visual Cortex

Cortical Area V1 aka:

Primary visual cortex

Striate cortex

Brodman’s area 17

LGN

Thalamus

Parietal visual cortex

Dorsal

Stream Striate cortex

(V1)

Eye Optic nerve Temporal visual cortex

Ventral

Extrastriate cortex

Stream

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Single-cell recording from visual cortex

David Hubel & Thorston Wiesel

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Single-cell recording from visual cortex

Time

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Three classes of cells in V1

Simple cells

Complex cells

Hypercomplex cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Simple Cells: “Line Detectors”

B. Dark Line Detector

Firing

Rate

Horizontal Position

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Simple Cells: “Edge Detectors”

C. Dark-to-light Edge Detector D. Light-to-dark Edge Detector

Firing

Rate

Horizontal Position

Firing

Rate

Horizontal Position

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Constructing a line detector

Retina LGN

Receptive Fields

Center-

Surround

Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Complex Cells

STIMULUS

0

0 o

NEURAL RESPONSE

Time

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Complex Cells

STIMULUS

60 o

NEURAL RESPONSE

Time

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Complex Cells

STIMULUS

90 o

NEURAL RESPONSE

Time

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Complex Cells

STIMULUS

120 o

NEURAL RESPONSE

Time

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Constructing a Complex Cell

Retina Cortical Area V1

Receptive Fields Simple Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Hypercomplex Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Hypercomplex Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Hypercomplex Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Hypercomplex Cells

“End-stopped” Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

“End-stopped” Simple Cells

© Stephen E. Palmer, 2002

Cortical Receptive Fields

Constructing a Hypercomplex Cell

RETINA CORTICAL AREA V1

Receptive Fields Complex Cell End-stopped Cell

© Stephen E. Palmer, 2002

Mapping from Retina to V1

Why edges?

So, why “edge-like” structures in the Plenoptic Function?

Because our world is structured!

Problem: Dynamic Range

1

The real world is

High dynamic range

1500

25,000

400,000

2,000,000,000

Is Camera a photometer?

Image pixel ( 312 , 284 ) = 42

42 photos?

Long Exposure

Real world

10 -6

Picture

10 -6

High dynamic range 10 6

10 6

0 to 255

Short Exposure

Real world

10 -6

Picture

10 -6

High dynamic range 10 6

10 6

0 to 255

Varying Exposure

What does the eye sees?

The eye has a huge dynamic range

Do we see a true radiance map?

Eye is not a photometer!

"Every light is a shade, compared to the higher lights, till you come to the sun; and every shade is a light, compared to the deeper shades, till you come to the night."

— John Ruskin, 1879

Cornsweet Illusion

Sine wave

Campbell-Robson contrast sensitivity curve

Metamers

Eye is sensitive to changes

(more on this later…)

Download