L.

advertisement
UNDERSTANDING SCENES WITH SHADOWS
VISION FLASH 21
David L. Wal t z
Massachusetts Institute of Technology
Artificial Intel 1 igence Laboratory
Vision Group
November 1971
ABSTRACT
The basic problem of this research i s t o find methods which will enable a
program to construct a three dimensional interpretation from the line
drawing of a scene, where the scene may have shadows and various degeneracies. These methods differ from those used in earlier related programs
in that they use region information extensively, and include formal isms
for eye and lighting position. The eventual result of t h i s research will
be a program which should be able t o successfully t r e a t scenes with far
fewer restrictions than present programs wi 11 to1 erate.
Work reported herein was conducted a t the Artificial Intelligence Laboratory, a Massachusetts Institute of Te,chnology research program supported
by the Advanced Research Projects Agency of the Department of Defense,
and was monitored by the Office of NavaJ Research under Contract Number
N00014-70-A-0362-0002.
Understanding scenes w i t h shadows;
Introduction. 2
l NTRODUCT l ON
I n t h e body o f t h i s paper,
I deal p r i m a r i l y w i t h t h e
c o n c r e t e r e s u l t s I have o b t a i n e d so f a r ,
but I t h i n k that the
s i g n i f i c a n c e o f t h e r e s u l t s may not be immediately apparent.
Therefore I be1 l e v e t h a t i t w i 11 be u s e f u l t o say something about
how I approached t h i s research,
s o r t t h e y are,
about why t h e r e s u l t s a r e of t h e
and about what relevance t h i s work has t o human
p e r c e p t ion,
Why does It seem so easy f o r us t o apprehend r a p i d l y t h e
t h r e e - d l m e n s l o n a l n a t u r e o f scenes,
even when t h e y a r e presented
t o us I n t h e f o r m o f b a d l y exposed and b a d l y focussed photographs
of b a d l y 1 I t and u n f a m l l t a r o b j e c t s ? While i t i s p o s s i b l e t o
demonstrate a l a r g e number o f optical
illuslons,
I think that the
u s u a l r e l l a b i l l t y o f my p e r c e p t i o n s I s a f a r more s t r i k t n g
phenomenon, Because we can p e r f o r m as r e 1 1 a b l y as we can,
I began
w 1 t h t h e b a s i c assumption t h a t scenes p r o v i d e many redundant
pieces of l n f o r m a t l o n ~ What k i n d s o f i n f o r m a t i o n and r u l e s c o u l d
s e r v e t o e.xpl a i n t h e observed performance?
Some cues and ru1,es had a1 ready been demonstrated b y
Huffman,
Clowes and others.
T h e i r work has shown f i r s t t h a t t h e
s e t s o f edge types and j u n c t i o n types which can occur i n a scene
made up o f o b j e c t s w l th p l a n a r surfaces and t r i hedral j u n c t i o n s
; J n d e r s t a n d l n g scenes w i t h shadows;
i n t r o d u c t ion, 3
can be e a s i l y enu~neratcfl, and t i i e n t h a t we cat1 o n l y i n t e r r ~ r c t s
l i n e as a p a r t i c u l a r t y p e i f we can l a b e l t h e e n t i r e scene f r o m
t h e s e t o f a l l o w a b l e j u n c t i o n s i n a manner c o n s i s t e n t w i t h t h i s
1 i n e assignment.
C l e a r l y t h i s approach cannot p r o v i d e a
s u f f i c i e n t explanation o f v i s u a l perception,
s i n c e i n g e n e r a l any
scene can be l a b e l e d i n s e v e r a l d i f f e r e n t ways,
and t h e r e i s no
o b v i o u s way t o s e l e c t t h e i n t e r p r e t a t i o n we would c a l l c o r r e c t .
i d e a was t o e x t e n d t h e s e t o f a1 l o w a b l e j u n c t i o n s t o
My f i r s t
i n c l u d e shadow j u n c t i o n s ,
and I f o u n d t h a t I n s i m p l e scenes w i t h
shadows we can sometilnes p a r s e u n i q u e l y because shadows i n some
sense p r o v i d e a d d i t i o n a l i n f o r m a t i o n . However i n more complex
(
scenes,
t h e l a r g e r s e t o f l a b e l s m e r e l y compounded t h e problem o f
m u l t i p l e i n t e r p r e t a t i o n s . T h i s l e d me t o t h i n k more about t h e
n a t u r e o f o u r p e r c e p t i o n o f scenes w i t h shadows.
Why does i t seem easy t o i n t e r p r e t scenes r e g a r d l e s s o f
1i g h t i n g ,
shadows,
and i n p a r t i c u l a r ,
how a r e we a b l e t o " f a c t o r o u t "
even i n cases where t h e y a r e t h e most p r o m i n e n t f e a t u r e s
i n terms o f c o n t r a s t ? My s e a r c h f o r r u l e s began w i t h t h e
o b s e r v a t i o n t h a t i f a scene i s 1 i t w i t h a s i n g l e 1 i g h t source we
can p a r t i t i o n t h e s e t o f j u n c t i o n s i n t o t h o s e which can o c c u r
w i t h r e s p e c t t o a p a r t i c u l a r p l a n e and t h o s e w h i c h cannot o c c u r .
T h i s i n t r o d u c e d a new p r o b l e m because we must know t h e
o r i e n t a t i o n o f a s u r f a c e and l i g h t source w i t h r e s p e c t t o t h e
v l e w e r b e f o r e we can say f o r c e r t a i n whether o r n o t p a r t i c u l a r
k.
Understandl ng scenes w f t h shadows;
i n t roduct I o n . 4
j u n c t i o n s can occur.. Nevertheless I had found an i n t e r e s t i n g k i n d
o f i n t e r r e l a t f o n between these v a r i a b l e s i n t h a t i f we know
o r i e n t a t i o n s we can p a r t i t i o n j u n c t i o n s and i f we know the
j u n c t i o n l a b e l s we can t e l l something about surface,
l i g h t and
eye placement. T h i s suggested t h a t I c o u l d use t h i s type o f f a c t
b o t h t o check f o r l a b e l l n g v a l i d i t y ( a l l l a b e l i n g s must be
c o n s i s t e n t w l t h a s i n g l e ' i n t e r p r e t a t i o n of l i g h t and eye
positions) b u t even more important i t provided a p o t e n t i a l a c t i v e
t o o l i f I c o u l d o b t a l n p a r t i a l f n f o r m a t f o n i n a scene. Perhaps
i t p o i n t e d o u t the importance of f l n d t n g
most i m p o r t a n t o f a l l ,
surface orientations;
j u s t as a l i n e l a b e l c o n s t r a i n s t h e
p o s s i b l e l a b e l i n g o f the j u n c t l o n s a t both ends o f t h e l i n e ,
so a
reglon o r i e n t a t i o n l a b e l could constrain the possible junctions
and 1 i n e s whl ch bound it.
As soon as I began t h i n k i n g i n terms o f regions I came up
w l t h some powerful r e s u l t s . The f i r s t r e g i o n I considered was the
i l l u m i n a t e d p o r t i o n o f t h e s u r f a c e (assumed t o be a plane) which
supports a scene.
Along the boundary of t h i s r e g i o n and the
scene,
t h e o n l y 1 lne assignments which are p o s s i b l e are shadow
edges,
concave edges (when t h e scene o b j e c t s r e s t on the surface)
and o b s c u r i n g edges. Therefore we can (1) p a r t i t i o n the set o f
p o s s f b l e j u n c t i o n s i n t o those which can occur along the boundary
and those which cannot,
( 2 ) we can say how these j u n c t i o n s must
be o r i e n t e d i f t h e y occur on t h e boundary (no convex edges can
\
Understand1ng scenes w i t h s.ha?.ow,s; In t r o d u c t ion. 5
occur a l o n g t h e boundary f o r example),
and ( 3 ) we can f u r t h e r
p a r t i t i o n these j u n c t i o n s according t o l i g h t i n g d i r e c t i o n ,
since
they a1 1 appear i n r e l a t i o n t o t h e same surface.
Once I enbumerated these j u n c t i o n s another f a c t emerged:
c e r t a i n j u n c t i o n s on t h e scene/background boundary rnust always be
p a r t i a l l y l a b e l e d i n a unique manner.
What t h i s means I s t h a t
c e r t a i n l o c a l f e a t u r e s always have a unique meaning I n a
p a r t i c u l a r context.
I have s i n c e been a b l e t o f i n d a number o f
s i m i l a r examples which occur i n o t h e r contexts. These
o b s e r v a t i o n s have a1 lowed me t o f i n d r u l e s which enable us t o
p a r t i a l 1 y l a b e l v i r t u a l l y any scene;
g i v e n a , p a r t i a l l a b e l ing,
can o f t e n f i n d addl t i o n a l " f o r c e d choice" labels,
t o determine l l g h t l n g position,.
we
use a1 1 t h e s e
and.use t h i s i n - t u r n t o .do
further forced labeling.
As I show In t h e paper, we can a l s o f o r m a l i z e l a b e l s f o r
r e g i o n o r i e n t a t i o n and I 1 1umina.t i o n and l o o k .for scene cues t o
l a b e l these j u s t as we do l i n e s . Some o f t h e cues i n c l u d e l i n e
slopes,
j u n c t i o n types,
and placement I n t h e scene (e.g.
scene/background boundary o r i n t h e . i n t e r i o r ) .
on t h e
I have a l s o
developed some o f t h e r u l e s which r e l a t e 1 i.ne l a b e l in^, j u n c t i o n
labeling,
and r e g i o n 1abel.lng t o each o t h e r and t o themselves as
w e l l as t o l i g h t i n g and eye p o s i t i o n s .
To i l l u s t r a t e t h e i n h e r e n t redundancy o f t h e scene features,
t h e types o f i n f o r m a t l o n I have 1tste.d a r e sufficient t o s o l v e a
Understanding scene's w i t h shadows;
number o f shadow scenes,
introduction. 6
y e t I have n o t even mentioned one o f t h e
most obvious f e a t u r e s o f shadows- t h e i r darkness. A t one time I
had f e l t t h a t t h i s was perhaps t h e most i m p o r t a n t shadow cue,
and
t h a t i t was a l s o p o s s i b l e t h a t we were a b l e t o d e t e c t a
p a r t i c u l a r k i n d o f m i c r o s t r u c t u r e i n shadow edges which we c o u l d
use t o distinguish them frorn o t h e r edges. C l e a r l y darkness and
edge t y p e a r e n o t enough t o alone i d e n t i f y shadows,
n o t o f t e n confuse p a i n t e d areas w i t h shadows.
s i n c e we do
I n order t o
r e c o g n i z e shadows we need t o know about t h e cons i s t e n t
r e l a t i o n s h i p s which shadows have t o l i g h t i n g and t o t h e o b j e c t s
whl ch l'causel'
Finally,
t h e shadows.
a l t h o u g h I d i d n o t s e t o u t t o s o l v e t h e problem o f
degeneracies i n scene$ I have been a b l e t o make some progress
toward hand1 i n g them. Roughly,
degeneracies a r e p o i n t s a t whi ch
separated v e r t i c e s and l i n e s appear t o be p a r t o f the same
vertex,
so t h a t t h e r e s u l t a n t v e r t e x i s n o t i n t h e a1 lowable s e t
o f junctions,
o r where 1 i g h t i n g i s such t h a t shadow 1 ines are
p r o j e c t e d on j u n c t i o n s o r o t h e r w i s e l i n e up w i t h edges.
c o u l d move t h e 1 i g h t source and eye s l i g h t l y ,
I f we
these degeneracies
would disappear (though we m i g h t form new ones). However I have
been a b l e t o f i n d some methods o f decomposing o r o t h e r w i s e
u n d e r s t a n d i n g such j u n c t i o n s w i t h o u t resorting t o maklng physlcal
changes,
l a r g e l y as a byproduct o f t h e use o f a l a r g e r number c f
scene cues and b e t t e r general r u l e s .
Understanding scenes w i t h shadows;
Rather than conslder any more d e t a l l s ,
introduction. 7
I would now l i k e t o
r e t u r n t o t h e general problem, What e x . a c t l y does t h i s research
add t o our understandlng o f t h e general probletns o'f v i s i o n
programs and hurnan v i s i o n ?
F i r s t I have shown t h a t we can. f o r m a l i z e and make good use
o f scene i n f o r m a t i o n o t h e r than l i n e s and j u n c t i o n s .
Next,
a l t h o u g h I have c h o s e n . t o use o n l y t h a t i n f o r m a t i o n
which I know how t o e x t r a c t f rorn scenes,
I have provided a f a r
more general f rarnework than . e x i s t e d . before.
For example,
a1 ready
having a f o r m a l i s m f o r r e g i o n o r i e n t . a t i o n a n d . l i g h t l n g
i n f o r m a t i o n a l l o w s us t o add c o l o r s o r textur.es o f r e g i o n s as
properties.
I have a1 so shown t h a t when we a s s i g n values t o scene
v a r i a b l e s we can put e x p l l c i t condi.tions on o t h e r scene
v a r i a b l e s . T h i s leads t o a g r e a t e r understanding o f t h e a c t i v e
n a t u r e o f v i s u a l processing, which we can express i n the form o f
theorems i n a program, T h i s approach has several advantages over
p r e v i o u s l a b e l i n g procedures.
(1) We can check a t each s t e p t o
see whether i t i s ' reasonable t o proceed i n a p a r t i c u l a r manner,
and do n o t need t o d e f e r c o n f i r i n a t i o n o r c o n t r a d i c t i o n u n t i l
l a t e r i n t h e o p e r a t i o n o f t h e program.
conditions which must be s a t i s f i e d ,
p a r s i n g s f o r each scene;
( 2 ) We have more
so t h a t we should get fewer
i n o r d e r t o o b t a i n these,
we a l s o do n o t
have t o s t a r t w i t h a completely u n l a b e l e d 1 i n e drawing f o r every
Understanding scenes w i t h shadows;
Introduction. 8
p a r s i n g , ( 3 ) We can handle degeneracies more e a s i l y s i n c e we do
n o t have. t o depend on t h e s i n g l e o b s e r v a t i o n t h a t a l l t h e l i n e
l a b e l s must match.
I n a1 1 t h e s e senses,
I have made a s t a r t toward
u n d e r s t a n d i n g - v i s i o n i n a way which w i l l a l l o w me t o w r i t e a
program u s i n g procedures as. i n Winograd's program r a t h e r than
b l i n d t r e e search techniques. L i k e Wlnograd I have t r l e d t o make
t h e meaning of v a r i o u s scene f e a t u r e s c l e a r I n terms o f t h e
consequences o f i n t e r p r e t i n g them i n each possi b l e manner.
F l n a l l y I be1 I e v e t h a t my research w i l l p r o v i d e a more s o l i d
bas1 s f o r u n d e r s t a n d i n g va.rious phenomena i n human v i st on, Whll e
I would c e r t a i n l y n o t contend t h a t t h e processes i n human v i s i o n
a r e 1 ike t h e .ones I descr ibe,
we shout-d a t l e a s t 'be a b l e t o g a i n
a b e t t e r I d e a o.f t h e necessary comp1exit.y o f a t h e o r y t o e x p l a i n
human .vi s.lon,
and some perspect l v e on what k.lnds o f l n f o r r n a t l o n
such a t h e o r y must c o n t a i n .
Understandl ng Scenes W i t h Shadows. Page 2
I. Ground r u l e s
I w i l l assume t h a t I am g i v e n o n l y t h e f o l l o w i n g i n f o r m a t i o n
which comprises a l i n e drawing o f a scene:
1. Lines.
Lines a r e j o i n e d c o l l e c t i o n s o f f e a t u r e p o i n t s
d e f i n e d t o be on t h e r e t i n a , where a f e a t u r e p o i n t I s a r e t i n a l
p o i n t a t which the spacial d e r i v a t i v e o f l i g h t i n t e n s i t y i s
I w i l l assume t h a t a l i n e
g r e a t e r than a c e r t a i n threshold.
drawlno has o n l y s t r a ' i g h t l i n e s ,
and t h a t we are g i v e n no
lnformat i o n about 'which 1 i.nes are 1 1 k e l y t o be cracks,
shadows,
o r edges by t h e 1 i n e - f l n d i n g programs.
2. .Junctions. P o i n t s on t h e r e t i n a where two o r more l i n e s
intersect.
3.
I n t e n s i t i e s . R e l a t i v e l i g h t i n c i d e n t on any p o i n t on the
r e t lna.
4. Regions, Areas o f t h e r e t l n a bounded by l i n e s , The set o f
a l l regions I n t h e l i n e .drawing f i l l s t h e r e t i n a .
T h i s i n f o r m a t i o n can be coded i n t o t h e f o l l o w i n g form which
I w i l l assume I s d i r e c t l y a c c e s s i b l e t o the program.
1. For each j u n c t i o n ,
a l l t h e j u n c t i o n s t h a t a r e connected
t o i t by s i n g l e l i n e segments.
2.
For each junction, a type,
such as e l 1,
I w i 11 have more t o say about these l a t e r .
fork,
arrow,
etc.
Understan.dlng Scenes W i t h Shadows. Page 3
3.
For each j u n c t l o n ,
a l l the r e g i o n s surround.ing i t .
4.
For each j u n c t i o n ,
i t s coordinates.
5.
For each region,
6.
For t h e background region,
i t s Intensity.
a s p e c i a l name t o d i s t i . n g u i s h
i t from a l l o t h e r r e g i o n s I n t h e l l n e drawing.
The o b j e c t o f my work i s t o a r r i v e a t a s i n g l e parsing,
or
assignment o f l i n e l a b e l s t o l i n e segments i n t h e l i n e drawing.
There are seven l l n e l a b e l s : concave,
convex,
crack,
I w i l l a l l o w the
o r i e n t a t i o n s each f o r obscure and shadow.
scene (which corresponds t o t h e 1 i n e drawfng)
f r o m a s i n g l e l i g h t source,
t o have shadows
coincidences o f l l n e segments and
j u n c t i o n s f r o m t h e p o l n t o f view of t h e eye,
have faces w i t h a r b i t r a r y r e f l e c t i v i t y .
e s s e n t i a l l y ambiguous;
and o b j e c t s whlch
Some scenes a r e
b y t h i s I mean t h a t humans can a r r i v e a t
more than one i n t e r p r e t a t i o n f o r such a scene.
goal
p l u s two
w i l l be t o f i n d these i n t e r p r e t a t i o n s .
I n these cases
my
These a r e
c o n s i d e r a b l y more ambi t ious goal s than have been a t t e r l p t e d
before,
b u t I b e l i e v e t h a t t h e r e i s s u f f i c i e n t i n f o r v a t i o n i n the
r e p r e s e n t a t i o n s t h a t I have d e s c r i b e d t o come c l o s e t o t h e s e
goal s.
I
Understandlng Scenes Wlth Shadows.
Page 4
I n t h e s e c t l o n s which f o l l o w I w l l l f i r s t discuss t h e
programs which have been suggested t o s o l v e slrnl l a r problems,
t h e i r l i m l t a t l o n s and then new methods f o r overcoming t h e s e
1 iml tations.
Understanding Scenes W f t h Shadows. Page 5
I I. Huffman,
Clowes,
and Dowson's Programs
The programs w r i t t e n by o r suggested by these men a1 1 share
t h e f o l l o w i n g assumptlons which a r e b u i l t i n t o the methods:
1. Every j u n c t i o n which can appear i n a l i n e drawing under
p a r t i c u l a r assumptions (such as o n l y t r i h e d r a l j u n c t i o n s ) i s
listed;
each j u n c t i o n which occurs I n a l i n e drawfng Is assumed
t o be i n t h i s
1is.t.
2. The o n l y c o n d l t l o n s f o r 1;be.llng
a l i n e are a
' t h a t the
j u n c t t o n s which r e s u l t be from the a1 l d w a b l e ' s e t and (b) t h a t a1 1
j u n c t i o n s i n the 1 i n e draw'ing can be laba'led ' I n a manner which i s
c o n s i s t e n t w i t h t h i s l a b e l f o r t h e 1.ine.
3.. Thus these programs r e q u i r e o n l y t'hat 1 l n e l a b e l s agree
and p u t no e x p l i c i t c o n d i t i o n s a t a l l on regions.
I n general each program o f t h i s type w i l l produce several
parsings., and indeed,
as long as a l l j u n c t i o n s i n t h e l i n e
drawing appear i n the s e t o f allowab'le j u n c t i o n s and as l o n g as
t h e r e a r e no degeneracies,
t h e p a r s i n g we would c a l l llcorrectlt
w i l l be among those produced.
However, t h i s s t i l l leaves us w i t h
t h e problem o f f l n d l n g the llcorrectll
parsing
from among those
U.nderstanding Scenes W i t h Shadows. Page 6
produced.
Even more s e r i o u s l y ,
these programs
produce a c o r r e c t
p a r s i n g o n l y i f t h e needed j u n c t i o n s a r e i n t h e a l l o w a h l e . s e t .
There i s no way t o handle degenerate j u n c t i o n s except t o l i s t
them i n t h e a l l o w a b l e s e t ,
I n general d o i n g t h i s w l l l lead t o
many more p a r s i n g s from which t o s e l e c t t h e c o r r e c t one,
t h e r e w i l l be more p o s s i b i l i t i e s f o r e a c h j u n c t i o n type.
FORK j u n c t i o n s i n s t e a d o f 10,)
since
(i.e.
20
An obvious method one might
suggest f o r t h i s problem would then be t o somehow o r d e r t h e
j u n c t i o n types so t h a t t h e ones most 1 i k e l y t o be encountered
would be t r i e d f i r s t .
Then we c o u l d suggest as the c o r r e c t
p a r s i n g Che one which had t h e n no st 1 ikely." s e t of c o n s i s t e n t
junctions.
The problem w i t h t h i s approach i s t h a t t h e r e l a t i v e
l i k e l i h o o d o f v a r i o u s j u n c t i o n s depends on l i g h t i n g and eye
position,
on p o s i t i o n and o r . i e n t a t i o n w i t h i n t h e l i n e drawing,
and o t h e r f a c t o r s ;
i n many i n s t a n c e s a j u n c t i o n l a b e l i n g which I s
e x t r e m e l y l i k e l y I n one c o n t e x t w i l l be impossfble i n anoth-er
context,
I n t h e methods I suggest,
I w i l l t r y a t each p o i n t t o make
t h e c o n t e x t o f each l i n e segment, ,junction,
drawing as e x p l i c l t as possfble,
region,
and l i n e
and l a b e l 1 l n e segments and
r e g i o n s o n t h e b a s i s of t h i s c o n t e x t i n f o r m a t i o n . By c o n t e x t I
mean a l l t h e p a r t i a l l n f o r r n a t i o n e x t r a c t e d up t o t h e p a r t i c u l a r
p o i n t i n t h e program. As w f 11 be shown,
p a r t l a 1 l a b e l fngs o f
Understanding Scenes W i t h Shadows. Page 7
l i n e s o r regions,
eye p o s i t i o n ,
l i g h t position,
t h e types of o b j e c t s i n t h e scene,
knowledge about
o r i e n t a t i o n o f l i n e s ahd
j u n c t i o n types adjacent t o a j u n c t i o n can a l l f u n c t i o n as c o n t e x t
i n f o r m a t i o n . The b u l k o f t h i s paper i s devoted t o showing ways i n
which c o n t e x t a f f e c t s t h e i n t e r p r e t a t i o n of l o c a l f e a t u r e s i n a
1 i n e drawing,
and t o showing ways t o o b t a i n c o n t e x t inforrnat i o n
from l o c a l f e a t u r e s .
I f t h e c o n t e x t I s u n c e r t a i n ( a s a t t h e s t a r t o f t-he
program) I w i l 1 , d e m o n s t r a t e t h a t we can make i n i t i a l hypotheses
and then work on t h e b a s i s o f these.
L a t e r I w i l l show how t o
generate hypotheses as we proceed through t h e program. The
r e s u l t s of t h i s approach a r e methods which a l l o w a l a r g e p o r t i o n
of t h e l a b e l ing t o be done w i t h 1 it t l e o r no back-up,
greater
i n s i g h t i n t o t h e i n t e r a c t i o n s o f d i f f e r e n t l e v e l s o f information,
and programs w h i c h ( I hope t o convince you) w i l l be a b l e t o
s u c c e s s f u l l y f t n d t h e " c o r r e c t parsings'l f o r scenes
under t h e
limitations I have l i s t e d ,
Bot h Huffman and Clowes t r e a t a t some l e n g t h an a l t e r n a t e
scene representatton,
c a l l e d t h e dual-graph o f a scene.
I n dual
space each r e g l o n maps i n t o a p o i n t i n such a way t h a t p a r a l l e l
surfaces map i n t o t h e same p o i n t . For a treatment o f t h i s
representation,
r e f e r t o t h e i r papers. T h e i r f e e l i n g seems t o be
r
Understanding Scenes With Shadows. Page 8
t h a t i t would be u s e f u l t o develop t h e dual r e p r e s e n t a t i o n and
t h e normal r e p r e s e n t a t i o n of. a scene i n para1 l e l ,
whereas ' , I f e e l
t h a t i t i s more n a t u r a l t o c a s t dual-space ideas and r e s u l t s i n t o
a form t h a t can be u.sed i n the normal r e p r e s e n t a t i o n .
However I
am s t i 1 1 open on t h e matt-er and n o t prepared t o defend my view.
F i g u r e 1 1 i s t s a1 1 t h e p o s s i b l e t r i h e d r a l ,
non-degenerate
j u n c t i o n s c l a s s i f i e d by j u n c t i o n t y p e and w l t h l n each t y p e by
l i n e l a b e l s . These j u n c t i o n s were o b t a i n e d b y methods dlscusscd
i n Dowson and Waltz.
Understanding Scenes With Shadows. Page 1 3
I I I. General plan f o r a progratn
Basically,
t h e methods I w i l l d e s c r i b e depend on t h e
f o l l o w i n g observat ions:
1. We can generate a complete s e t o f non-degenerate j u n c t i o n
l a b e l ings under p a r t ic u l a r a-ssumptions.
2. For each j u n c t l o n type,
we can a l s o s p e c i f y a f i n i t e s e t
of r e g i o n l a b e l i n g s which g i v e values f o r o r i e n t a t i o n and
illurninatlon,
3.
one and o n l y one o f which must always be t r u e .
J u s t as two j u n c t i o n s j o i n e d by a l i n e segment must be
l a b e l e d so t h a t t h e l i n e connecting them has a s i n g l e value,
so
must each j u n c t i o n around a r e g i o n be l a b e l e d so t h a t t h e r e g i o n
has a s i n g l e o r l e n t a t i o n and i l l u m i n a t i o n value.
4. Whenever we assign a r e g i o n a p a r t i c u q a r value we a l s o
p l a c e e x p l i c i t c o n s t r a i n t s on the p o s s i b l e p o s i t i o n o f the eye
and l i g h t source,
5. A l l reglons and j u n c t i o n s w i t h i n a scene must agree w i t h
a s i n g l e value f o r l i g h t source p o s i t i o n and a s i n g l e value f o r
eye p o s i t i o n .
6.
Junctions which a r e degenerate because of l i g h t source
~ o s i t i o n l n g( u s u a l l y when t h e l i g h t source l i e s I n t h e plane o f a
surface) can on1 y occur under
very r e s t r i c t i v e c o n d i t ions on t h e
regions surrounding t h e j u n c t l o n and on t h e l i g h t and eye;
Under s t and ing Scenes W 1 t h Shadows. Page 1 5
t h e r e f o r e these j u n c t i o n s can be i n c l u d e d i n t h e complete s e t
w i t h o u t danger o f causing l a r g e numbers o f s p u r i o u s parsings.
F i g u r e 2 shows t h i s s e t o f j u n c t i o n s .
7. We can i d e n t i f y o t h e r degenerate j u n c t f ons as those which
e i t h e r ( a ) do n o t have t h e form o f a known j u n c t i o n o r ( b ) cannot
be i n t e r p r e t e d as members o f the s e t o f a l l o w a b l e j u n c t l o n s i n a
manner c o n s i s t e n t w i t h t h e r e s t o f t h e scene.
8. While i t I s i m p r a c t i c a l t o l i s t a l l these o t h e r types o f
degenerate j u n c t i o n s which can occur,
as l o n g as these j u n c t i o n s
comprise o n l y a small percentage o f t h e t o t a l number o f j u n c t i o n s
i n a scene we w i l l g e n e r a l l y be a b l e t o o b t a i n a t l e a s t a p a r t i a l
l a b e l i n g imposed by j u n c t i o n s and r e g i o n s surrounding them,
On
t h e b a s i s of p a r t i a l l a b e l i n g and knowledge about t h e ways t h a t
degeneracies can occur, we can suggest ways o f s e p a r a t i n g t h e
t r u e j u n c t i o n components.
9. We can use l i n e o r i e n t a t i o n s t o suggest r e g i o n and
j u n c t i o n l a b e l s , Given t h r e e unknown v a r i a b l e s ,
name1 y t h e r e g l o n
typ-es on b o t h s i d e s o f a l i n e segment and t h e l i n e segment l a b e l ,
p l u s one known,
the 1 l n e o r i e n t a t i o n ,
we can t e l l v e r y 1 It . t l e .
However Sf any of t h e t h r e e v a r i a b l e s have known values,
be a b l e t o s o l v e f o r t h e o t h e r two,
we may
and i f two o f t h e t h r e e a r e
known we u s u a l l y w i l l be a b l e t o deduce t h e t h i r d .
I n e i t h e r case
we can a t l e a s t l i m i t t h e p o s s i b i l i t i e s .
10. Region i n t e n s i t y i s n o t always a v e r y good i n d i c a t o r o f
Understanding Scenes With Shadows, Page 1 6
r e g i o n type,
s i n c e o b j e c t s can have a r b i t r a r y r e f l e c t i v i t y . We
can however say t h a t whenever we l a b e l a l i n e as a shadow 1 ine,
t h e r e g i o n on t h e s i d e we have l a b e l e d as a shadow must be darker
t h a n t h e o t h e r s i d e of t h e l i n e . S i m i l a r l y ,
I f two regions
separated b y a 1 tne a r e approxtrnatel y equal i n Intensl t y ,
It i s
u n l i k e l y t h a t t h e l i n e s e p a r a t i n g them i s a shadow l i n e .
In
a d d i t i o n we can say t h a t t h e d a r k e s t r e g i o n s are l i k e l y t o be
shadow regions,
whereas t h e b r i g h t e s t ones a r e extremely u n l i k e l y
t o be shadow regions.
Understanding Scenes With Shadows. Page 1 7
IV.
I V.A.
Region n o t a t i o n
. J u n c t i o n segment n o t a t i o n
As w i l l become obvious l a t e r ,
we w i l l have use f o r a
n o t a t i o n which a l l o w s us t o name r e g i o n s around a j u n c t i o n .
convent i o n s shown i n F i g u r e 3,
By
we can code a r e g i o n i n t h e form
o f a c i r c u l a r l i s t o f sequenti.al j u n c t i o n s ,
l i s t e d by c o n v e n t i o n
i n a c l o c k w i s e d i r e c t i o n when viewed froin i n s i d e t h e r e g i o n .
1V.B.
Illumination
Definition:
Causing j u n c t i o n s a r e ones i n which (1) one o f t h e
l i n e s o f t h e j u n c t i o n i s a shadow l i n e and ( 2 ) t h i s shadow l i n e
i s t h e p r o j e c t i o n o f one o f t h e o t h e r l i n e s i n t h e f u n c t i o n .
A r e g i o n may be one o f t h e f o l l o w i n g typesr
I = illuminated
S = shadowed;
i f ' t y p e S t h e n i t must be e i t h e r
SAP a p r o j e c t e d shadow r e g i o n w i t h one o r more "causing
j u n c t i o n s " an i t s boundary.
SB= a p r o j e c t e d shadow r e g i o n w i t h no I1causfng j u n c t i o n s " cil
i t s boundary whicli cause i t . ( A t y p e S!3 r e g i o n can be bounded b y
causing j u n c t i o n s which obscure i t . See f i g u r e 4 A f o r t h e
d'ist inction.
Understand1 ng Scene-s W i t h Shadows, Page 2 1
SAB = a p r o j e c t e d shadow r e g i o n formed b y an o v e r l a p p i n g of
a t y p e . S A and a t y p e SB shadow r e g i o n .
a r e g i o n which i s shadowed because i t i s o r i e n t e d away
SC
f r o m t h e l i g h t source.
The background r e g i o n i s d e f i n e d as t h a t p o r t i o n o f t h e
s u p p o r t s u r f a c e which i s i 1 l u m i n a t e d and which shares a boundary
w i t h t h e edge o f t h e r e t i n a . The s u p p o r t s u r f a c e I s t h e u n i o n o f
t h e background reglon,
any I11 umlnated r e g i o n s o f t h e s u p p d r t
s u r f a c e n o t connected t o t h e background reg.ion, and shadows
p r o j e c t e d o n t o t h e s u p p o r t plane.
Th'e s'upport 'surface can thus
have r e g i o n s which a r e l a b e l e d i n any manner except as t y p e S C o r
t y p e SAB.
1V.C.
Some f a c t s about r e g i o n s and' i l l ' u m i n a t i o n
Definition:
A shadow causi,ng edge i s one which c a s t s a
shadow.
I w i l l appeal t o your knowledge o f t h e w o r l d t o s t a t e t h e
f o l l o w i n g f a c t s w i t h o u t p r o o f o r w i t h mln.lma1 p r o o f .
1. A shadow c a u s i n g edge must always be c6nvex.
2.
Every convex edge must;always be l a b e l e d as "+" o r
obscure.
3.
Every'
11+"
o r o b s c u r i n g Ii n e segment i s convex.
Understanding Scenes .With Shadows. Page 2 2
4.
Every shadow c a u s i n g edge has a t y p e I r e g i o n on t h e s i d e
o r i e n t e d toward t h e 1 l g h t source and a t y p e S C regFon on t h e
o t h e r side.
Proof: Because an edge I l l u m i n a t e d on b o t h s t d e s cannot
cause a shadow,
nor can an edge shadowed on b o t h s i d e s cause one.
I f t h e shadowed s i d e I s any o t h e r t y p e than t y p e SC,
t h e n the
shadow i s due t o t h e p r o j e c t i o n o f a n o t h e r o b j e c t where t h e
p r o j e c t e d shadow c o i n c i d e s e x a c t l y w i t h t h e edge i t s e l f ;
b u t then
i f t h e o b j e c t c a u s l n g t h e shadow were removed, b o t h s i d e s o f t h e
edge would b e illuminated, so t h e edge cannot c a s t a shadow.
Definition:
A r e g i o n bounded by two edges o f a j u n c t i o n
which obscure i t has no necessary r e l a t i o n t o e i t h e r of i t s
a d j o i n i n g regions;
I w i l l c a l l such a r e g i o n an obscured r e g i o n
w i t h r e s p e c t t o such a j u n c t i o n .
5.(a)
, then
I f a shadow causing edge i s l a b e l e d
It+',,
o f t h e edge i s t y p e I and t h e o t h e r i s t y p e SC.
c a u s i n g edge i s l a b e l e d as an o b s c u r i n g edge,
a shadow c a u s l n g j u n c t i o n (1.e.
one s i d e
I f t h e shadow
and t h e j u n c t i o n i s
t h e shadow I s v i s i b l e ) then ( b )
one s i d e o f t h e edge I s l a b e l e d t y p e S A and t h e o t h e r i s t y p e I,
o r e l s e ( c ) one s i d e I s l a b e l e d t y p e S C and t h e o t h e r i s l a b e l e d
I, except when t h e r e g i o n a d j a c e n t t o t h e t y p e S C r e g i o n I s a
r e g i o n obscured w i t h r e p e c t t o t h e shadow causlng j u n c t i o n .
Proof:
(a) i s t r i v i a l .
I n ( b ) and ( c ) I f we can see t h e edge
t h e n we can see e i t h e r t h e t y p e I r e g i o n ( b ) o r t h e t y p e S C
Understanding Scenes W i t h Shadows. Page 2 3
region (c).
I f we can see t h e t y p e I r e g i o n and we can see t h e
t y p e S A shadow region,
then these two r e g i o n s must a d j o i n one
another a t t h e j u n c t i o n - what c o u l d be between thern? I f we can
see t h e t y p e SC region,
and t h e j u n c t i o n i t s e l f ,
then the
a d j a c e n t r e g i o n must be I o r e l s e t h e shadow causing edge c o u l d
n o t c a s t a shadow a t t h e j u n c t i o n ,
except i n t h e case where t h e
a d j a c e n t r e g i o n i s an obscured region,
w i t h respect t o t h e shadow
causing j u n c t ion.
6,
A t y p e S C r e g i o n can have no shadow edges bounding i t .
7,
I n non-degenerate cases a r e g i o n X separated from a
r e g i o n Y by a c r a c k o r concave edge i s t y p e I i f f r e g i o n Y i s
t y p e I; o t h e r w i s e b o t h a r e t y p e S .
Proof: The o n l y way t h i s could n o t be t r u e i s i f a shadow
edge c o i n c i d e d e x a c t l y w i t h t h e concave o r c r a c k edge.
i f we can i d e n t i f y a j u n c t i o n as a shadow causing
8.
junction,
then we can u n i q u e l y l a b e l a l l o f I t s r e g i o n s f o r
I 1 l u m i n a t t o n except those which a r e obscured w i t h respect t o t h e
junction,
i f any,
Proof: Consider t h e s e t o f a l l shadow c a u s i n g j u n c t i o n s
( f i g u r e 4B).
We can always by d e f i n i t i o n l a b e l the Pegions
bounded b y t h e shadow l i n e as types SA on one s i d e and I on t h e
other.
edge,
I f t h e SA r e g i o n I s bounded on t h e o t h e r s i d e by a concave
l a b e l t h e n e x t r e g i o n S.C, .ofherwise l a b e l i t t y p e I (us1 ng
f a c t 5 above),
u n l e s s t h a t r e g i o n i s obscured w i t h respect t o t h e
Understanding Scenes W l t h Shadows. Page 25
junction.
Using f a c t 7,
we can then l a b e l a l l t h e o t h e r regions.
Exact r e l a t i o n s between eye,
1V.D.
light,
and scene;
v i s i b i l i t y and i l l u m i n a t i o n of surfaces
As a f i r s t approximation,
consider the l i g h t source and eye
t o be i n f i n i t e l y d i s t a n t from t h e scene,
supported by a p l a n e w i t h no holes,
t h e scene t o be
and t h e 1 l g h t source t o be a t
an e l e v a t i o n g r e a t enough .sat h a t a1 1 shadaws c a s t i n the, scene
a r e e n t i r e l y surrounded by i l l u m i n a t e d backgound on t h e r e t i n a l
p r o j e c t i o n of t h e scene.
Def i n e a c o o r d i n a t e systern C w i t h respect t o t h e eye.
a r i g h t hand c o o r d i n a t e system,
scene,
C is
o r j g i n i n t h e center o f the
o r i e n t e d w i t h I t s x and z axes on t h e s u p p o r t p l a n e and y
a x l s normal t o t h e support plane.
The z a x i s coincides w i t h t h e
p r o j e c t i o n of a l i n e from t h e o r i g i n t o t h e eye o n t o t h e support
plane,
o r i n o t h e r words t h e eye 4 s i n t h e y-z p l a n e I n t h e
quadrant between t h e +z and + y axes. The eye e l e v a t i o n ,
& , is
defined as t h e a n g l e between t h e +z a x i s and t h e l i n e from t h e
o r i g i n t o t h e eye. Thus
(PE
i s between 0 and 90 degrees.
Understanding Scenes With Shadows, Page 2 6
The normal t o any surface I n t h e scene can be defined I n
#S
t h i s c o o r d i n a t e system by two angles,
and e d m
9%
i s t h e a n g l e between t h e normal t o a s u r f a c e and t h e p r o j e c t i o n
$
of t h e normal o n t o t h e support plane.
i s t h e angle
between t h e p r o j e c t i o n of t h e normal o n t o t h e support p l a n e and
t h e z axis,
measured i n a counterclockwise d i r e c t i o n when viewed
from the +y axis.
Using these d e f i n i t i o n s ,
the v e c t o r from t h e o r i g i n t o the
eye i n C i s
*
VE=
C
+
COS
9'
The normal v e c t o r t o a s u r f a c e f s
3
vS
=
sin
escos +s
A
x
+ sin
cb, .(: +
cos
$ COs #+s 2.
The c o n d i t i o n which must, then. be s a t i s f i e d I n o r d e r t h a t a
s u r f a c e be v l s l b l e I s
i
Understanding Scenes W i t h Shadows. Page 2 7
we can def i n e a c o o r d i n a t e systern C',
Simi larly,
1 i g h t source l i e s on t h e yl-z'
p l a n e a t an e l e v a t f o n
where t h e
+L
The s u r f a c e o r i e n t a t i o n s a r e g i v e n i n t h l s c o o r d i n a t e system b y
$2
0;.
and
The c o n d i t i o n s f o r v i s i b i l i t y i n C a r e t h e n
analogous t o t h e c o n d i t i o n s f o r i l l u m i n a t i o n i n t h i s c o o r d i n a t e
Therefore l e t
system C ' .
E'
= sin
$:
sin
4'
I
+
cos
cos
4: '6
cos
then
i f E1
>
0,
the surface*i s illuminated
i f E' = 0, t h e n t h e 1 i g h t source 1 i e s i n t h e surface plane,
1.e.
if El
<
t h e s u r f a c e has g l a n c i n g ~ I l u r n I n a t i o n
0,
t h e n t h e s u r f a c e I s shadowed,
and i t s edges c a s t
shadows,
We can t h e n d e f i n e a t r a n s f o r m a t i , o n from C '
i s r e l a t e d t o C b y t h e a n g l e %L.
a x i s o f C t o the z axis o f C',
t o . C,
where C '
e E k l s t h e a n g l e from t h e
r
o r ~ e q u i ~ v a l ' ~ n t l 'tyh, e a n g l e between
t h e p r o j e c t i o n s o f t h e e y e - o r i g i n and l i g h t - o r i g i n l i n e s o n t o t h e
support surface.
I n C,
t h e .Il g h t source normal i s
Understanding Scenes W i t h Shadows. Page 2 8
-3
VL
A
= cos b ' s i n
eL
fx
A
A
+
s.in +L
Y + COS +'COS
Q'Z
so t h e c o n d i t i o n f o r s u r f a c e - i l l u m i . n a t i o n i s
b
OR ( c o s #tL
3
s i n egLsin & c o s
4s
+ sin
$s
sin$L
I n Appendix 1, I show haw we can f i n d a l l these angles under
c e r t a i n assumptions.
1V.E.
Quantized r e g i o n o r i e n t a t i o n
Rather t h a n use t-he f u l l
treatment. developed i n t h e previous
s e c t i o n and App.endix 1, I w - i l l
angles .defined t h e r e .
o f t e n be d i f f i c u l t
I n t r o d u c e a quarvtization o f t h e
One reason f o r doing so i s t h a t i t w i 11
t o f i n d - t h e eye e l e v a t i o n i n cases where t h e r e
a r e o b j e c t s w i t h o u t r i g h t angles,
and another i s t h a t t h e r e i s no
n e c e s s i t y t o b e t h i s prec1s.e f o r t h e t i m e being.
It is
I n t e r e s t i . n g t h a t even a v e r y "rough q u e n t i t a t i o n prov1.des a
u n i f o r m and' e f f e c t i v e .method f o r e l i m i n a t i n g p a r s i n g s whlkh a r e
p h y s i c a l l y tmpossi ble,
r e q u i r e d t o match.
b u t a1 lowabl e when on.1y 1 i n e 1abel.s a r e
Understanding Scenes With Shadows. Page 29
Two r e g i o n s o r i e n t a t i o n s a r e s p e c i a l .
horizontal,
denoted by H,
One o f these Is
which means p a r a l l e l t o t h e support
The o t h e r i s 7 w h i c h i s used t o l a b e l any r e g i o n which
surface.
I s p a r t of t h e v i s i b l e support surface,
A l l o t h e r o r i e n t a t i o n s are q u a n t i z e . l o c ' c o r l i n z t o t h e i r
value,
i.e.
es
according t o t h e a n g l e between t h e p r o j e c t i o n o f
t h e i r normals o n t o t h e support s u r f a c e plane and t h e scene z
axis.
The q u a n t i z a t i o n s are:
F = front,
es : , w i t h i n about
1 0 degrees o f t h e z axis;
the
exact values w i l l v a r y depending on t h e d l s t a n c e o f the eye from
t h e scene,
b u t t h e working d e f i n i t i o n i s t h a t such a surface can
have r i g h t angle v e r t i c e s a t b o t h l e f t and r i g h t extremes such
t h a t f o r each of t h e v e r t i c e s o n l y one j u n c t i o n segment o u t o f a
possible three i s v i s i b l e .
Another way o f p u t t i n g t h l s i s t h a t a
r e g i o n l a b e l e d F can have edges l a b e l e d i'convex"
a t I t s r i g h t and
l e f t extrema o n l y I f t h e o b j e c t does n o t have r e c t a n g u l a r
vertices.
R = rlght,
BR
OS
= back r l g h t ,
B L = back l e f t ,
L = left,
es
between t h e l i m i t of F and 9 0 degrees.
es
ec
between 9 0 and 1 8 0 degrees.
between 1 8 0 and 2 7 0 degrees.
between 2 1 0 degrees and t h e 1 l m l t o f F.
Understanding Scenes With Shadows. Page 3 0
To t h i s w i l l be appended a s u f f i x t o I n d i c a t e the v a l u e o f
, the
a n g l e between the normal t o a surface and t h e
p r o j e c t i o n o f the normal o n t o the s u p p o r t plane.
These s u f f i x e s
are:.
+s
Vsvertical,
D = down.,
=Odegrees.
<
+s
VU = v . e r t i c a l / u p ,
O degrees.
+S
HU = h o r I z o n t a l / u p ,
between 0 and 4 5 degree,^.
between 45 and 9 0 degrees ( 9 0
degrees = hor.1z o n t a l 1.
N o t i c e t h a t I n a scene made up o n l y of o b j e c t s which have
r e c t a n g u l a r corners,
bnother,
LV,
FV,
where none of t h e o b j e c t s l1leansf1on
t h e o n l y p o s s i b l e v f s t b l e s u r f a c e o r l e n t a t l o n s a r e B,
and RV,
r e g a r d l e s s o f eye posi t i o n .
I n addl t i o n ,
s u r f a c e e i t h e r belongs t o t h e s e t o f i l l u m i n a t i o n types
SB,
SAB
o r i s t y p e SC,
l i g h t / e y e angle,
1V.F.
a
I, SA,
and t h i s i s a f u n c t i o n o n l y o f r e l a t i v e
Independent o f l i g h t e l e v a t i o n .
Light posl t i o n -quantization
The r e l a t i v e l i g h t l e y e angle,
a c c o r d i n g t o T a b l e 1.
H,
BL , w i l l
be q u a n t i z e d
There a r e t w e l v e p o s s i b l e values,
which
can be visualized by c o n s i d e r i n g a vantage p o l n t on t h e scene y
Understanding Scenes With Shadows. Page 3 2
axis.
From t h i s p o s i t i o n ,
l o o k i n g down on t h e scene,
F i s a 30
degree wide segment centered on t h e z a x i s ( 6 o ' c l o c k ) .
then centered around 5 o'clock,
FR i s
B around. 1 2 otc:1ock, e t c .
The l i g h t e l e v a t i o n i s quantized i n t o t h r e e values,
a c c o r d i n g t o which angle i s nearest t h e a c t u a l value:
L = low,
3 0 degrees.
M = medium,
H = high,
45 degrees,
6 0 degrees,
The eye e l e v a t l o n angle i s quantized I n f o u r values,
-
three
(
o f which a r e exact1 y t h e same as f o r t h e 1 l g h t source elevatlon,
the
a d d i t i o n a l one being VL
l e s s than 1 5 degrees.
Given t h i s q u a n t f z a t i o n and the methods o f Appendix 1 we can
then generate v a l u e s f o r each combi nat.lon o f 1 i g h t source
posl t i o n ,
s u r f a c e o r i e n t a t ion,
and eye p o s i t i o n which l n d l c a t e
how a g i v e n s u r f a c e 1s i l l u m i n a t e d and whether o r n o t I t i s
v l s l ble.
Understand1ng Scenes W i t h Shadows. Page 35
V.
The background/seene boundary
While I w i l l t a l k e x t e n s i v e l y about t h e scene/background
boundary,
i would l i k e t o p o i n t o u t t h a t a l l o f t h e techniques
developed t o handle t h i s boundary w i l l a l s o be appl i c a b l e t o any
i s o l a t e d s u b p i c t u r e i n t h e scene,
o r f o r most p o r t i o n s of t h e
scene around whlch we can draw a p a t h k h l c h c o n s i s t s o n l y of the
j u n c t i o n s which c h a r a c t e r i z e t h e scene/background boundary,
s p e c i a l treatment o f TEE j u n c t i o n s .
Since I have n o t worked o u t
I w i l l not
t h e d e t a i l s o f i s o l a t i n g such p o r t i o n s o f t h e scene,
make any o v e r o p t i t n i s t l c claims,.
plus
b u t I fntend t o f o l l o w up on t h i s
approach.
You w i l l remember t h a t e a r l i e r I d e f l n e d t h e background
r e g i o n t o be t h a t p o r t i o n o f t h e v l - s i b l e -support s u r f a c e which I s
illuminated.
Every 11-ne whlch bounds t h i s r e g i o n can have o n l y
one o f t h r e e labels,
support s u r f ace),
namely obscure ( a 1 i n e segment o b s c u r i n g t h e
conc'ave ( o r minus),
and shadow,
shadow l i n e can be o r i e n t e d i n o n l y one d i r e c t i o n .
where t h e
We can then
l o o k a t t h e complete l i s t o f p o s s i b l e j u n c t i o n s and p i c k o u t
those whlch have segments bounded by any combination of these
three labels.
F i g u r e 5 A shows t h i s s e t o f j u n c t i o n s ,
where B / I
I n a j u n c t i o n segment i n d i c a t e s t h a t t h i s segment can be p a r t o f
t h e background region.
Understanding Scenes W i t h Shadows. Page 3 6
We can now make t h e r a t h e r remarkable o b s e r v a t i o n t h a t
whenever a j u n c t i o n on t h e scenelbackground boundary has one o f
i t s segments of t y p e T I , T2, S 1 , 52,
M,
o r F as p a r t o f t h e
background region, we can p a r t i a l 1 y l a b e l t h a t j u n c t i o n d i r e c t 1 y.
Frequently,
h a v i n g done t h i s ,
o t h e r j u n c t i o n labelings;
1.e.
t h e r e tnay be f o r c e d choices f o r
i f we check a l l t h e j u n c t i o n s
which a r e p a r t i a l l y labeled, we may f i n d t h a t t h e r e i s o n l y one
p o s s i b i l i t y f o r l a b e l i n g some j u n c t i o n s ,
given that the p a r t i a l
l a b e l i n g must be preserved,
I n any event,
m e r e l y by knowing t h a t one o f t h e segments o f
a j u n c t i o n i s p a r t ~f t h e background region, we can use t h i s
r e s t r i c t e d s e t o f j u n c t i o n s from which t o choose a l a b e l lng.
F i g u r e 5B shows t h e r e s u l t s o f a p p l y i n g the r e s t r i c t e d s e t of
p o s s l b i l i t i e s t o a l l n e drawing ( t a k e n from a photograph).
F i g u r e 5 C shows t h e r e s u l t s o f a p p l y i n g f o r c e d r u l e s t o t h e
p a r t i a l l y l a b e l e d l i n e drawing o f F i g u r e 5B.
The steps I n v o l v e d
i n forced l a b e l i n g are l i s t e d i n order o f t h e i r application
beneath F i g u r e 5 C .
Understanding Scenes With Shadows. Page 3 9
VI.
V1.A.
Scenes w i t h r e c t a n g u l a r v e r t i c e s
Theory
As I mentioned e a r l i e r ,
t h e cases where a scene c o n t a i n s
o n l y r e c t a n g u l a r v e r t i c e s i s a p a r t i c u l a r l y easy one.
In this
s e c t i o n i w i l l show how we can cornplete t h e l a b e l i n g o f t h e l i n e
drawing shown i n F i g u r e s 5B and 5 C assuming r e c t a n g u l a r v e r t i c e s
o n l y . F i r s t r e c a l l t h a t t h e o n l y s u r f a c e o r i e n t a t i o n s which a r e
p o s s i b l e i n t h e r e c t a n g u l a r v e r t i c e s case are B, t i 8 RVO FV,
LV.
I n F i g u r e 6,
1 show a c o m p i l a t i o n o f a l l t h e p o s s i b l e ways
i n which each r e g i o n can be l a b e l e d f o r o r i e n t a t i o n ,
-,
p a r t i c u l a r l i n e labeling f o r a junction.
t h e j u n c t i o n types,
given a
I have n o t r i s t e d a l l
b u t have conf l n e d myself t o those which are
needed t o complete t h i s example: e l l ,
psi,
and
and kay j u n c t i o n s .
arrow,
fork,
tee,
peak,
In p r a c t i c e we need not have t h i s e n t i r e
t a b l e i n lneinory since' we can use theorems t o generate p o s s i b l e
r e g i o n l a b e l i n g s as needed.
1 w i l l g i v e rules f o r generating t h i s
t a b l e i n t h e s e c t i o n on l i n e o r i e n t a t i o n s .
ELL
IPbel'kq
J
e
Allowable wr
m'*&%tiorrs
to
LL
Note: thmykO'Lt t k j s
A w e I htzeomXted
"u" f;.m FV, RV4L.V.
4
AUowdb
labslkq
SUP&
~n'wbtiows$w
Po P l
PP P3
*
KAY
PSI
L&Gau
.fs..
A - b a b l ~su
0 & t & t i ~
$1
s t Sf'
s2'
Understanding Scenes W i t h Shadows. Page 45
I n t h e r e c t a n g u l a r v e r t e x case t h e r e i s a p a r t i c u l a r l y
simple. r e l a t i o n s h i p between o r i e n t a t i o n and p o s s i b l e 1 i g h t i n g .
Table 2 shows t h i s r e l a t i o n s h i p .
and F i g u r e 6,
Using t h e r e s u l t s o f t h i s t a b l e
we can now a s s i g n l i g h t i n g possibilities t o each o f
t h e p o s s i b l e j u n c t i o n s p r e v i o u s l y c l a s s i f l e d by l i n e l a b e l s and
region o r i e n t a t i o n ,
Rather than show t h e complete set,
I have
chosen t o show o n l y t h e p o s s i b i l i t i e s f o r j u n c t i o n s which border
t h e background,
b u t the same methods can be used t o a s s i g n v a l u e s
f o r a1 1 j u n c t i o n s ,
F i g u r e 7,
The r e s u l t s of t h i s process a r e shown I n
Understanding Scenes Wfth Shadows. Page 5 1
Let me review e x a c t l y what we need t o do t o a r r i v e a t these
listings.
F i r s t we must generate a Ful-1 s e t o f j u n c t i o n 1 i n e
l a b e l i n g s f o r each j u n c t i o n type.
and a s s i g n r e g i o n values f o r
w h i c h t h i s l a b e l i n g can occur.
which have cracks,
We can then t a k e each l a b e l fng
a l l the possible orientations i n
( N o t i c e t h a t i n those j u n c t i o n s
I have i n c l u d e d some g r a v i t y information,
on
t h e b a s i s o f t h e o b s e r v a t i o n t h a t c e r t a i n o r i e n t a t i o n s can never
occur i f g r a v l t y i s assumed t o be i n t h e -y d i r e c t i o n ,
normal t o t h e suppport surface and-downward.
say about t h i s l a t e r ,
1.e.
I w i l l have more t o
a l o n g w i t h i n f o r m a t i o n about support which
can be deduced from l o c a l f e a t u r e s . 1
. ,
We haqe. thus d i v i d e d each
p o s s i b l e l a b e l i n g f o r 1 i n e s i n t o se"era1
has t h e same l i n e labels,
l a b e l ings,
each o f which
We can
but d.ifferent region labels.
t h e n t a k e each member o f t h i s extended set and d l v l d e i t i n t o
s e v e r a l p o s s l b l l l t i e s a c c o r d i n g t o 1 i g h t i n g direction.
N o t i c e t h a t i n some cases,
p a r t i c u l a r l y i n those j u n c t i o n s I
have c a l l e d "shadow causing junctions",
t h e r e i s o n l y one
p o s s i b i l i t y f o r region illuminatlon o f the junction.
I n these
cases we can t h e n say t h a t t h e j u n c t i o n as l a b e l e d can o n l y be
. present i n t h e scene i f t h e 11-ghting i s from a p a r t i c u l a r subset
of t h e s e t o f p o s s i b l e p o s i t i o n s .
Thus knowing t h a t t h e l i g h t i n g i s
-from a p a r t i c u l a r p o s i t i o n we can c o n s t r a i n t h e p o s s i b l e
l a b e l i n g s f o r j u n c t l o n s In t h e l i n e dr-awing and 1.n t h e o p p o s i t e
Understandi-ng Scenes W i t h Shadows. Page 53
d i r e c t i o n of Implication,
knowing a p a r t i c u l a r j u n c t i o n labellnq
we' can. c o n s t r a i n t h e possl.bl~e1 l g h t p o s i t i o n s .
V1.B.
An example
To show how these f a c t s h e 1 p . u ~i n l a b e l i n g a scene,
now r e t u r n t h e example s t a r t e d i n t h e p r e v i o u s s e c t i o n ,
t h e p a r t i a l labeling,
Given
o f 5 C and assuming t h a t a l l i t s v e r t i c e s
are r e c t a n g u l a r w i t h . no degeneracies,
l a b e l e d (1) i n F l g u r e 8.
f i g u r e 7,
l e t us
look f i r s t a t t h e j u n c t i o n
Glvbn t h e l i n e . l a b e l s a-nd r e f e r i n g t o -
t h e r e I s o n l y one p o s s i b i l i t y f o r l a b e l i n g t h e regions:
F i g u r e 8A
A t j u n c t i o n ( 2 ) by t h e same reasoning we have:
Understanding Scenes W i t h Shadows.
Page 54
F i g u r e 88
Thus we may deduce t h a t t h e l i g h t i n g must be somewhere I n t h e
i n t e r s e c t i o n o f t h e two i n t e r v a l s s p e c l f l e d by (1) and ( 2 1 ,
namely (RF,
LF)
/\
( F L O BL) = ( F L O L F ) ,
A t ( 3 ) we now have:
Figure 8C
so we must l o o k f o r a j u n c t i o n which agrees by r e g i o n type,
label,
and l i g h t i n g .
l a b e l i n g are:
line
The o n l y two candidates which agree on l i n e
Understanding Scenes W i t h Shadows. Paee 55
..
and c l e a r l y o n l y t h e f i r s t I s p o s s i b l e when r e g i o n l a b e l s are
considered.
Hence. we can l a b e l t h e o t h e r two segments and t h e
o t h e r two l i n e s .
s i n c e (F,
LF)
No f u r t h e r r e s t r i c t i o n on l l g h t i n g I s Implied,
(FL,
LF') = (FL,
LF),
A t (4) we now have:
Figure 8E
and a l l these a r e p o s s i b l e .
1 i n e segments,
yet.
Howevek, we can now l a b e l a l l t h e
even though we a r e n o t a b l e t o l a b e l t h e r e g i o n s
We do know t h a t t h e two regions f o r which we do not yet
have an o r i e n t a t i o n l a b e l must be g i v e n t h e same o r i e n t a t i o n
Understanding Scenes W i t h Shadows.
Page YQ
label :
Figure 8F
At (5) w e can have either:
Figure 8 G
At ( 6 ) w e can thus have, i f @ i s the correct labeling, only:
Figure 8H
U n d e r s t a n d i n g Scenes W i t h Shadows. Page 57
whereas I f
i s c o r r e c t we can have e l t h e r :
F i g u r e 81
However,
i fa w e r e t r u e ,
l i g h t i n g was between (R,
j u n c t i o n ( 6 ) would i m p l y t h a t t h e
F ) and (R,
F ) f ) (FL,
LF) = n i l .
S i m i l a r l y t h e 1 i g h t i n g p o s s l b l l i t i e s g i v e n m a s t h e proper
l a b e l i n g have an empty I n t e r s e c t i o n w i t h (FL,
el-lminate b o t h @ a n d B
LF),
so we can
T h l s now e n a b l e s us t o
leaving only
go back t o (4) and . a s s i g n X = L.
Because (6) must b e l a b e l e d as
ins we can
l a b e l a1 1 on t h e
l l n e s i n j u n c t i o n s ( 7 ) t h r o u g h (11) as shadow l i n e s ,
are the o n l y possible continuations,
s i n c e these
regardless of l i g h t i n g
posi tlon,
J u n c t i o n ( 1 2 ) must be l a b e l e d as:
Understand i n g Scenes W l t h Shadows.
Page
50
F i g u r e 8J
since again t h i s i s the o n l y p o s s i b i l i t y .
J u n c t i o n (13) must be l a b e l e d as:
F l g u r e 8K
s i n c e t h e r e I s no o t h e r l a b e l i n g whlch has one l i n e l a b e l e d as
shown and whlch a l s o i s p o s s i b l e g i v e n t h e l i g h t i n g d i r e c t i o n .
T h i s t h e n a l l o w s us t o l a b e l (14) through (18) as f o r c e d choices
on a l i n e l a b e l i n g
b a s i s alone. N o t i c e t h a t t h i s f o r c e s r e g i o n
R 1 t o b e l a b e l e d llFV1l,
since t h i s i s the o n l y region o r i e n t a t i o n
w h i c h can be bounded by b o t h (14) and ( 1 8 ) .
U n d e r s t a n d i n g Scenes W i t h Shadows.
I w i 1 1 n o t c o n t i n u e t o show each step,
r e s t as an e x e r c i s e ,
but w i l l
Page 5 3
leave the
i f you a r e i n t e r e s t e d enough t o c o m p l e t e I t .
The f i n a l r e s u l t s a r e shown i n F i g u r e 9.
ambiguous 1 i n e segments,
There a r e t h r e e
and p a r t o f my work t o b e done I n t h e
f u t u r e w i l l d e a l w i t h ways o f r e s o l v i n g t h e s e a m b i g u i t i e s .
Understanding Scenes With Shadows. Page 6 1
V I I . L i ne segrnent o r i e n t a t ions
i n general we w i l l n o t be a b l e t o t o t a l l y l a b e l a 1 i n e
preceeding
drawing w i t h o n l y t h e i n f o r m a t i o n we used i n t h e
example. F o r t u n a t e l y ,
t h e r e are ways o f e x t r a c t i n g i n f o r m a t i o n
which we have n o t yet expl.oited,
V I I .A.
Background/scene boundary r e v i s i t e d
We can a s s i g n q u a n t i z e d o r i e n t a t i o n values t o each o f t h e
l i n e segments we encounter w h i l e moving around t h e scene I n o u r
normal ( c l o c k w i s e as seen from t h e background) d i r e c t i o n . We
a1 ready know t h a t each 1 i n e segrnent on t h l s bound"ary must be
l a b e l e d obscure,
concave,
case of b l o c k s only,
exposition,
o r shadow,
and t h a t i n t h e r e s t r i c t e d
which I w i l l c o n t i n u e t o use f o r ease o f
each r e g i o n separated from t h e background by one o f
these 1 i n e segments must be H,
FV,
R.V,
LV, o r B / S A .
I w i 11 assign
1 i n e o r i e n t a t i o n values from a set o f e l g h t compass points,
so
t h a t f o r i n s t a n c e i f we move d i r e c t l y upward along a l i n e
segment,
I w i l l l a b e l t h a t l i n e N,
and so on. N,
E,
S,
and W w i l !
be d e f i n e d t o i n c l u d e l i n e s which f a l l w f t h i n about 5 degrees o f
v e r t i c a l o r horizontal,
denoted by NE,
SE,
SW,
and t h e angles I n between w i l l then b e
and NW. (The w i d t h o f t h e r e g i o n s 1 s a
f u n c t i o n of eye distance;
as the d i s t a n c e goes t o I n f i n i t y ,
the
U n d e r s t a n d i n g Scenes W i t h Shadows.
N,
S,
Page ;2
and W s e c t o r s can becorne a r b i t r a r i l y s m a l l . 1
E,
We can t h e n s t a t e t h a t t h e f o l l o w i n g f a c t s a r e t r u e b y
e x h a u s t i v e l y c o n s i d e r i n g a1 1 p o s s i b i l i t i e s :
1. A shadow l i n e may have any o r i e n t a t i o n .
2.
I f t h e l i n e L i s an o b s c u r i n g l i n e ,
r e g i o n s e p a r a t e d f r o m t h e background by L,
o r i e n t a t i o n o f L,
O(L) = N
+X
= RV, FV, o r H.
O(L) = E ) X = F V
= SE . ) X
O(L) = S
O(L) =
3
and O(L) = t h e
then:
O(L) = NE S X = R V
O(L)
and X denotes the
(*I
(*I
= LV ( * I
X = LV, FV, o r H.
{SW,
W,
NW]
+X
= H.
( * I i n d i c a t e s t h a t t h e o b j e c t o f w h i c h t h i s edge i s a p a r t 1 s
s u p p o r t e d o f f t h e s u r f a c e o r e l s e i s concave.
3.
I f L I s a concave edge then:
O(L) = {N,
S,
SW, W,
edge.
O(L)
= NE .) X = R V / I .
+
N W ~ t h e r e i s no such X w i t h a concave
Understanding Scenes W i t h Shadows.
Using t h e same approach w i t h o'ther l i n e l a b e l s ,
g e n e r a t e t a b l e 3,
which give;
Page 63
we can
the possible region labels f o r
1 i n e s w l t h any l a b e l and o r i e n t a t i o n . T h i s t a b l e can t h e n be used
t o generate f i g u r e 6, which I presented i n s e c t i o n V I . A .
w i t h no
just i f ication,
Interior forks
V I I.B,
.
.
Rather than g i v e a f u l l treatment of l n t e r i , o r j u n c t i o n s ,
w i 11 t r e a t o n l y one p a , r t , i c u l a r l y , u s e f u l ' j u n c t i o n .
I
I..f i n a scene
w i t h o n l y r e c t a n g u l a r v e r t i c e s we .encounter a f o r k i n t h e
i n t e r i o r of a scene ( l.e.
none..o..f I . t s segments i s p a r t o f t h e
i1 lumlnated background) 'then' the' correspon'dinp; v e r t e x I s t y p e I
( a l l l i n e segments convex1 i f and o n l y i f one l i n e segment p o i n t s
"south"
sort,
from t h e j u n c t i o n ;
i f there i s a type I junction. o f t h i s
t h e n we can a l s o l a b e l i t s r e g i o n s :
Figure 17A
Understanding Scenes With Shadows. Page
6s
Rough p r o o f :
The e n t i r e s e t o f p o s s l b l e f o r k s i s :
yyyyyy'y
CI)
(*)
I
(3)
C4 )
.CS)
Figure 168
C6)
(7)
C l e a r l y (1) can be l a b e l e d as shown.
A l l of t h e o t h e r s have a t l e a s t . one concave edge.
concave edge p o f n t s "south";
then t h e r e g i o n t o I t s r i g h t must be
LV and t h e r e g i o n t o I t s l e f t must be RV.
obscured r e g i o n must be H,
Suppose a
Then I n ( 2 ) t h e
b u t we have assumed from t h e b e g i n n i n g
t h a t . t h e eye e l e v a t i o n i s s u f f i c i e n t s.o t h a t any h o r i z o n t a l
surface w i t h normal upward i s v l s i b l e unless obscured.
Likewise
( 3 ) 1 s . Impossible,
but the
s i n c e I t must a l s o have an H region,
normal o f t h i s r e g i o n p o i n t s downward,
so t h e surface c o u l d n o t
be v l s l b l e . S i m l l a r l y ( 4 ) through ( 7 ) a r e impossible w i t h t h e i r
concave edges downward,
eliminate (21,
V I I.C,
(41,
(51,
and s i m i l a r reasoning can be used t o
and ( 7 ) i n t h e i r o t h e r o r i e n t a t i o n s ,
An example w i t h degeneracies
I have p i c k e d t h e scene i n f i g u r e 10A t o i l l u s t r a t e not o n l y
t h e use of l i n e o r i e n t a t i o n s ,
w i t h several degeneracies.
b u t a l s o t h e p a r s i n g of a scene
We can s t a r t by l a b e l i n g the l i n e
Understanding Scenes With Shadows. Page 6 7
segments of j u c t ions w i t h scene/background segments o f T I ,
Sl, 52, M, and F as we d i d i n s e c t i o n V I . 8 ,
r e s u l t i n g "forced"
l a b e l ings,
T2,
and then l a b e l
choices, F i g u r e 1 0 A c o n t a i n s a l l these
p l u s o r i e n t a t ions f o r each 1 i n e segment around t h e
scene/background boundary. Each de.generate j u n c t i o n i s marked
IID
I1
b
i g n o r i n g t h e i n t e r i o r f o r k s I n t h i s example, we can
c o n s t r u c t p o s s i b l e l a b e l i n g s f o r R 1 through R 1 3 .
Notice that f o r
t h e l i n e segments a l r e a d y l a b e l e d ltobscu.ret' we need n o t conslder
shadow and concave edge r e g i o n p o s s i b i l i t i e s . The p o s s i b l e
l a b e l ings f o r each r e g i o n are shown i n t a b l e form i n f i g u r e I O B .
From t h e r e s u l t s of t h e a p p l . i c a t i o n o f t h e l i n e o r i e n t a t i o n r u l e s
we can unambiguously l a b e l t h r e e r e g i o n s and s i x l i n e segments
directly.
We can then proceed w i t h " f o r c e d choices" as we d i d i n t h e
p r e v i o u s example.
10C.
T h i s process i s shown s t e p b y s t e p i n F i g u r e
As t h e r e s u l t o f t h e a p p l i c a t i o n o f these l i n e o r i e n t a t i o n
r u l e s we a r e l e f t w i t h s i x j u n c t i o n s n o t t o t a l l y labeled,
of whlch are degenerate, These a r e shown i n F i g u r e 10D.
three
RI- NW
utqe
RI-s€
G/I
Rl-Sw
K5- S E
RS-S
h e
H
L
B/sA
v(R~)=
8 / 5 ~
Understanding Scenes With Shadows. Page 7 2
About degenerate j u n c t i o n s we can say:
1. C e r t a i n j u n c t l o n s can never be degenerate,
namely e l l and
and p o s s i b l y o t h e r s i n p a r t i c u l a r o r i e n t a t i o n s .
I f ever we
a r e l e f t w l t h such a j u n c t l o n which we cannot l a b e l ,
then t h e r e
tee,
i s a m i s l a b e l i n g somewhere e l s e i n t h e scene.
2. Any j u n c t l o n whlch does n o t f a 1 1 i n t o one o f t h e
a l l o w a b l e j u n c t l o n categories must. be a degenerate j u n c t i o n .
3.
Any j u n c t i o n whlch i s degenerate can be broken up i n t o :
( a ) two real, j u n c t l o n s ,
one o f which must s.eparate a l o n g
1i.nes l a b e l e d e l t h e r -both. o b s c u r i n g o r b o t h concave,
or
.
( b ) a j u n c t i o n w i t h a c o i n c i d e n ' t a l l y a l i g n e d sh.adow edge.
A good h e u r i s t i c t o n o t e I s t h a t any s t r a i g h t l i n e s I n a
degenerate j u n c t i o n whlch appear t o be c o l l l n e a r a r e l i k e l y t o be
t r u l y connected. Some cases which f o l l o w t h i s p a t t e r n a r e showri
i n f i g u r e 1 0 E a l o n g w l t h some cases which do not.
I n any event,
I
have n o t y e t devoted s u f f i c i e n t a t t e n t i o n t o degenerate j u n c t i o n s
t o say an.y more t h a n t h a t I can show methods t o l d e n t i f y whlch
j u n c t I o n s a r e degenerate,
and t h a t I can suggest some promising
c o n s i s t e n c i e s between v a r i o u s p o s s i b l e degenerate j u n c t l o n s .
Understanding Scenes W i t h S h a d o ! ~ ~ ?a:;c
.
Sorne m i s c e l l a n e o u s p o i n t s o f i n t e r e s t
VIII.
V I I I .A.
Support
I f we ever encounter any j u n c t i o n s such t h a t t h e r c q i o n s and
1 i n e s a r e l a b e l e d as i n f i g u r e 11, we can draw c o n c l u s i o n s about
support
listed.
between t h e o b j e c t s which correspond t o t h e r e g i o n s
I have shown c o n f i g u r a t i o n s which i m p l y support,
t h e same token,
b u t by.
c e r t a i n o t h e r c o n f i g u r a t i o n s can be used t o
a s c e r t a i n cases where s u p p o r t between o b j e c t s i s precluded,
Note t h a t i f t h e r e a r e no shadows h t h e scene we w i l l
u s u a l l y n o t be a b l e t o t e l l whether one b l o c k supports a n o t h e t o r
whether i t m e r e l y obscures it, s i n c e shadows a r e the prime means
o f d i s t i n g u i s h i n g between o b s c u r i n g and concave edges,
V1iI.B.
F i n d i n g i s o l a t e d subscenes
I mentioned a t t h e b e g i n n i n g o f s e c t i o n V that we can
i s o l a t e p o r t i o n s o f a scene which i t may be p o s s i b l e t o t r e a t as
though t h e y were surrounded by background. To do t h i s we need
o n l y f i n d p a t h s around a l l s e t s o f r e g i o n s i n t h e l i n e drawing
which c o n s i s t o n l y o f j u n c t i o n s which can occur on t h e
background/scene boundary p l u s TEES which a r e o r i e n t e d so t h a t TO
i s i n t h e i n t e r i o r o f t h e r e g i o n we a r e i s o l a t i n g .
I belleve t h a t
Understanding Scenes W i t h Shadows. Page 7 6
t h i s I s one o f t h e most p r o m i s i n g ' l i n e s o f r e s e a r c h t o b e
f o l l o w e d up.
VII1.C.
Non-rectangular
scenes
I n t h i s a r e a I s t i l l have a great. d e a l o f work t o do.
However,
from p r e l i m i n a r y o b s e r v a t i o n I be1 I e v e t h a t i t may b e
p r o f i t a b l e t o c o n s i d e r e v e r y scene i n i t i a l l y as though i t were a
scene w i t h o n l y r e c t a n g u l a r v e r t i c e s . T h i s w i 11. accomplish a t
l e a s t two ends:
any subport ions. o f t h e scene which c o n s i s t o f
r e c t a n g u l a r j u n c t i o n s o r i n which non-rectangul a r j u n c t i o n s
f o l low t h e same t y p e s o f r u l e s as do r e c t a n g u l a r j u n c t i o n s can be
labeled,
and we can e f f e c t i v e l s y I s o l a t e j u n c t i o n s which do n o t
f o l l o w t h i s pattern,
these junctions w i l l t h e n be e i t h e r non-
r e c t a n g u l a r o r degenerate.
V I I I .D,
Curved surfaces
U s i n g t h e r e s u l t s o f Appendix 1, we can f i n d c o n d i t i o n s on
t h e b o u n d a r i e s of curved surfaces,
s i n c e a t these boundaries t h e
p o i n t s must a l l s a t i s f y t h e c o n d i t i o n s V
t h e o r i g i n - e y e v e c t o r and V
V
= 0, where V
Is
i s t h e s u r f a c e normal. Consequent1 y
we can a l s o c h a r a c t e r i z e t h e q u a n t i z e d r e g i o n v a l u e f o r a curved
s u r f a c e a t each p o i n t a l o n g i t s boundary,
and perhaps can
Understanding Scenes W l t h Shadows. Page 7 7
u s e f u l l y t r e a t such surfaces accordlng t o t h e range of values
through whlch t h e surface normal must pass.
V1II.E.
A grammar o f r e g i o n s
An i n t e r e s t i n g c o r o l l a r y t o t h e o b s e r v a t i o n t h a t each r e g i o n
can have o n l y one l a b e l I s t h a t we can d e s c r i b e a g e n e r a t i v e
grammar f o r surfaces o f particular types. For example,
every
s u r f a c e whlch I s bounded b y a l l t y p e 1 v e r t i c e s ( t y p e 1 v e r t i c e s
a r e ones'whlch occupy o n l y one o c t a n t o f space a t t h e vertex,
or
a l t e r n a t e l y have t h r e e convex edges). can be generated b y t h e
transi t l o n rules:
A1
-+ TO
o r L1 o r A2
L1
+ TO
o r L1 o r A 2 ( l a b e l
(label
-+)
+)
TO 4 TO o r L 1 o r A 2 Clabel
A 2 4 A l o r F
F
--=+A 1
or F
(label
(label
+
+
-+-
1
where any "sentence1' which
c o n t a i n s a t l e a s t t h r e e j u n c t i o n s f rorn the s e t
ELI,
Al,
A2,
FI
represents a r e a l s u r f a c e o f t h i s k l n d . Such s u r f a c e s i n c l u d e the
faces o f a l l convex polyhedra such as prisms,
cubes,
pyramids,
e t c . whlch have o n l y t r i h e d r a l v e r t i c e s .
We can a l s o l i s t t r a n s l t l o n r u l e s f o r a l l t y p e SA shadow
-regions and r e g i o n s bounded on1 y by convex edges,
as we1 1 as each
of t h e preceedlng types as I t appears when p a r t i a l l y obscured by
u n d e r s t a n d i n g Scenes W i t h Shadows. Page 7 8
o t h e r o b j e c t s . We may be a b l e t o use t h e s e o b s e r v a t i o n s t o
r e c o g n i z e some r e g i o n s d i r e c t l y ,
r a t h e r t h a n a r r i v i n g a t labelinx
f o r regions on t h e b a s i s o f the
l i n e s surrounding the region
w h i c h have a l r e a d y been l a b e l e d .
V I I I.F.
A f i n e r c l a s s i f i c a t i o n o f j u n c t i o n types
I n a l l t h e work so f a r ,
I have assumed t h a t we a r e u n a b l e t o
t e l l t h e d i f f e r e n c e between a n g l e s l e s s t h a n o r g r e a t e r t h a n 9 0
degrees. ' A g l a n c e a t t h e ar'row and t e e l a b e l i n g s o f f i g u r e 1
s h o u l d c o n v i n c e y o u t h a t I f we have more preci'se i n f o r m a t I o n about
angles,
we can f u r t h e . r p a r t i t i o n t h e p o s s i b l e j u n c t i o n l a b e l l n g s
and t h u s make t h e p r o b l e m o f a s s i g n i n g 1-abels e a s i e r .
Understanding Scenes l J i t h Shadows. Page 7 7
IX.
Surnmary and plans f o r r e s e a r c h
I have demonstrated t h e use o f a l a r g e number o f r a t h e r
i s o l a t e d f a c t s and methods,
and would now l i k e t o say a 1 i t t l e
about how these can a l l be t i e d t o g e t h e r i n a conceptual .
framework. As a f i r s t a p p r o x i m a t i o n t o a t o t a l p i c t u r e o f t h i s
system,
we can o r d e r t h e v a r i o u s scene v a r i a b l e s a c c o r d i n g t o how
w i d e l y any p a r t i c u l a r p i e c e o f i n f o r m a t i o n can d i r e c t l y I n f l u e n c e
o t h e r scene v a r i a b l e s .
1. L i n e l a b e l s a f f e c t ( d i r e c t l y ) o n l y t h e j u n c t i o n s a t b o t h
ends o f t h e l i n e segment,
2.
L i n e l a b e l s p l u s o r i e n t a t i o n of the l i n e s a f f e c t the
j u n c t i o n s a t b o t h ends p l u s t h e r e g i o n s on b o t h s i d e s of t h e 1 lne
segment.
3. J u n c t i o n l a b e l s a f f e c t a l l l i n e s whlcli are p a r t o f t h e
j u n c t i o n and may i n some cases be i n t e r r e l a t e d w i t h l i g h t and e y e
position,
4,
J u n c t i o n l a b e l s p l u s o r i e n t a t i o n a f f e c t a l l l i n e s whlch
are p a r t o f the junction,
and eye p o s i t i o n ,
f r e q u e n t l y are i n t e r r e l a t e d w l t h l i g h t
and a l s o a f f e c t a l l r e g i o n s which bound t h e
junction,
5.
region,
Region l a b e l s a f f e c t a l l j u n c t i o n s and l i n e s bounding t h e
a r e i n t e r r e l a t e d w i t h eye and 1 i g h t p o s i t i o n ,
a f f e c t a d j a c e n t regions.
and may
Understanding Scenes W i t h S h i ~ d o w s . P a . 7 ~E r j
6.
Region l a b e l s p l u s a l i n e l a b e l d e f i n i t e l y a f f e c t
the
a d j a c e n t r e g i o n s as w e l l as t h e o t h e r v a r i a b l e s frotq (5).
7.
Path l a b e l s a f f e c t a l l r e g i o n s i n s i d e t h e p a t h ( a s i n
f i n d i n g an i s o l a t e d subscene o r rnoving a-round t h e
scene/background boundary,).
8.
L i g h t source and eye p o s i t i o n s can p o t e n t l a l l y a f f e c t a l l
j u n c t i o n s and r e g i o n s ,
T h i s i s a v e r y rough h i e r a r c h y ,
b u t I t h i n k i t g i v e s some
i d e a o f t h e amount and types o f interdependence between t h e
v a r i o u s scene v a r i a b l e s ,
My p l a n s f o r r e s e a r c h I n c l u d e as a t o p p r i o r i t y t h e g a i n i n g
o f g r e a t e r i n s i g h t i n t o j u s t what t y p e o f conceptual framework we
need t o be a b l e t o l u c i d l y d e s c i be t h e n a t u r e of t h e problem and
t h e methods f o r s o l v i n g i t . I a l s o i n t e n d t o f o l l o w up a l l t h e
t o p i c s mentioned i n t h e p r e v i o u s section,
and f i n a l l y I p l a n t o
w r i t e a program embody1 ng t h e r e s u l t s o f my work.
Understand1 ng Scenes W i t h Shadows. Page 8 1
Bib1 iography
(1) Clotves M.B.
On s e e i n g t h i n g s
A l Journal, S p r i n g 1971
( 2 ) Dowson M.
What c o r n e r s l o o k l i k e
MIT/AI V l s i o n F l a s h 13,
June 1 9 7 1
( 3 ) Dowson M. & W a l t z D.L.
Shadows and c r a c k s
MIT/AI V i s i o n F l a s h 14, June 1 9 7 1
(
(4) F i n i n T.
F i n d i n g the skeleton of a b r i c k
MIT/AI V i s i o n F l a s h 19, August 1971
( 5 ) Guzman A,
Computer r e c o g n i t i o n o f t h r e e - d i m e n s l o n a l
o b j e c t s I n a v i s u a l scene
P r o j e c t MAC Techn i c a l r e p o r t MAC-TR-59
December 19 68
( 6 ) Huffman D.A.
I m p o s s i b l e o b j e c t s as nonesense sentences
Machine I n t e l l i g e n c e 6, 1970
Af
Understanding Scenes W i t h Shadows. Page
Appendix 1. Exact r e l a t i o n s between scene f e a t u r e s
and l i g h t - e y e placement
The f i r s t problem i s t o f i n d hays o f o b t a i n i n g t h e l i g h t - e y e
angles w i t h r e s p e c t t o t h e scene. U s i n g t h e c o o r d i n a t e system C
defined i n section IV.C.,
adir
assuming t h a t t h e p r o j e c t i o n s o f
t h e scene x and y axes d e f l n e t h e r e t i n a l x and y axes,
we then
know t h a t any v e c t o r i n t h e scene:
appears i n t h e r e t i n a l p r o j e c t i o n as
A
axp+ ( b
A
where xc,
A
and y,
ys,
A
z,
-
A
c)yp
h
a r e u n i t v e c t o r s . in. t h e scene c o o r d i n a t e s and x,
are u n i t vectors i n the r e t i n a l coordinates.
Appendix 1.A.
Eye e l e v a t i o n
I f t h e r e a r e r e c t a n g u l a r b l o c k s i n t h e scene which r e s t on
t h e support surface,
then we can e x t r a c t t h e eye e l e v a t i o n b y
l o o k i n g a t t h e appparent angles of t h e c o r n e r o f a block.
A 1 shows such a corner.
Figure
Understanding Scenes W i t h Shadows, Page A1.Z
Figure A 1
The t h r e e v i s i b l e edges d e f i n e an orthogonal c o o r d i n a t e
system C",
r o t a t e d b y an a n ~ l e
QEB
w i t h respect t o t h e eye
c o o r d i n a t e system C as d e f i n e d I n t h e p r e v i o u s s e c t i o n .
This i s
p i c t u r e d i n F i g u r e A2, where t h e corner i s drawn as i t would
appear f r o m a p o i n t on t h e scene y-axis.
where
7," and
the retina.
'
We have
a r e def ined w i t h respect t o
C",
not
Understanding Scenes W i t h Shadows. Page
I n t h e scene coordinates
tan
7," t a c t yi'
(11
(
Furthermore we know from elementary geometry t h a t these angles
a r e r e l a t e d t o those on t.he r e t i n a l p r ~ j e c , t l o nby
7,
= t a n 7,''
s i n &E
tan
and
(2)
t a n ~ ~ = t a n ~ ~ ' s i (n
3) ~ E
and plugging i n t o (1) we get
tan
and s i n c e
solve f o r s i n
7,/ s i n
$m
sin
/ tan
'7(,
(10
can o n l y be between 0 and 90 degrees, we can
(Pt:
Al.3
Understanding Scenes W i t h Shadows. Page Ai.4
sin
cpG
= (tan
tan
7' 1.Y
O f course i f t h e r e a r e no r e c t a n g u l a r c o r n e r s i n t h e scene,
w e cannot use t h i s method.
Appendix 1 . B .
L i g h t placement
Whenever t h e r e
Is a v e r t i c a l edge, v i s i b l e I n i t s e n t i r e t y ,
w h i c h c a p t s a shadow,
v i s i b l e t o i t s f l r s t junction,
we can f i n d
t h e l l g h t placement as a f u n c t i o n o f t h e eye e l e v a t i o n .
Such a
j u n c t i o n i s shown i n F i g u r e A3.
F i g u r e A3
The a n g l e
CK
i s r e l a t e d t o t h e angle
((ACB)
BE',
o r a n g u l a r d1fferenc.e between t h e p r o j e c t . l o n s o f t h e l l g h t source
and eye o n t o t h e s u p p o r t plane,
t h e l l g h t source e l e v a t l o n ,
and is independent of
As above,
Understandl ng Scenes W i t h Sharlows. Paee A1.5
eEL=
t a n DC s i n
tan
4s
To f i n d t h e l i g h t e l e v a t i o n I s a l i t t l e more complicated.
F i r s t consider the vector
$in
t h e scene c o o o r d i n a t e s .
p o i n t s toward t h e l i g h t source from t h e o r i g i n .
-+
A'B'
I f the l i ~ h t
#L and i t s angle w i t h r e s p e c t t o t h e eye
source e l e v a t i o n i s
is
Thls
gEL , then
= (cos +Lsin
egL):
+ (sin
I f t h e eye e l e v a t i o n i s g l v a n by
+L)
C
+E
+
(cos 4 L c o s
, then
%L) hz .
t h i s transforms
o n t o t h e r e t i n a as
+ ( s i n &cos
Assuming
oC
(arc t a n b i n
Solving f o r tan
and
-
+E
p.
&-
cos
+pas
sin
€L
A
Y.
a r e b o t h between 0 and 1 8 0 degrees,
f
cos
gives
&OS
$Lsin$E)/cos
# p i n
ea))
Understa.ndlng Scenes W i t h Shadows. Page Al.6
tan
(pL= cos egbtan
Appendix 1.C.
+G+
(sine
/cos & ) t a n
CL
(6+ a < - 9 0
I.
F i n d i n g s u r f a c e normals
Whenever a r e g i o n borders t h e supppart surface and shares an
edge l a b e l e d "concave1' w i t h i t , we can use i t s angle w i t h r e s p e c t
t o the horizontal t o find
, the
@
angle o f t h e p r o j e c t i o n
o f t h e s u r f a c e normal w i t h r e s p e c t t o the z a x l s of t h e scene.
From t h e r e t i n a we can o b t a i p
degrees,
where
,
,
, between
- 1 8 0 and 1 8 0
Is measured f r o m t h e x axis ( h o r i z o n t a l 1
t o t h e edge, w i t h respect t o t h e j u n c t i o n encountered f i r s t when
movlng c l o c k w i s e f r o m i n s i d e t h e background. r e g i o n around t h e
scene/background boundary.
T h i s i s lll u s t r a t e d I n F i g u r e A 4 .
0
Figure A4
Understandl,ng Scenes Mi t h Shadows. Page
3
and
tan
es
A1,7
a r e r e l a t e d b y t h e equet I o n
eS= t a n rb; / s i n
I t i s n o t always posslbla t o f i n d
+E,
&,
t h e angle between
t h e support surface and t h e normal t o another surface,
Whenever
t h e surface i s v e r t i c a l and rect.angu.lar,
.then t t.s edges. w f 1 1
appear as v e r t i c a l i n t h e p r o j e c t i o n on the r e t i n a .
these cases
rectangular,
+s
= 0.
I f we assume t h a t t h e surface i s
then we can o b t a i n t h e a n g l e
+5
and
F i g u r e A5
(Assume t h a t R 1 i s r e c t a n g u l a r )
I n t h e scene
Obvfously i n
as a f u n c t l o n
U n d e r s t a n d l ng Scenes W t t h Shadows. Page
A1,8
On t h e r e t i n a
*
A B =
+
-(sin
( C O S +5
+J
cos
c/)~
+
so
s i n QS
and s o l v i n g f o r t a n
)/(cos
+s
eS I $
- ( s i n Cb5sin
s i n +S cos
tan
+S
We g e t
cos
&'
&
eS s i n
cPE
h
y,
=
+
s i n +=cos
QS
sin
1,
Page A1,9
U n d e r s t a n d i n g Scenes W i t h Shadows.
Appendix 1.D.
An example
Suppose t h a t we encounter t h e scene p o r t i o n shown i n F i g u r e
A6,
and suppose f u r t h e r t h a t we have beeen a b l e t o l a b e l i t a s
shown.
I w i l l show how t o f i n d t h e v a r i o u s scene v a l u e s ,
Eye e l e v a t i o n
sin
(PE =
(tan
(B)
tan%%(^))'/s =
= ((.577)(.577)?~=
( ( t a n 30')(tan
.577
Light-eye angle
tan
eELs( t a n O< ( B ) )
sin
&
= ( t a n ( - 7 ~ ~ ) ) ( . 5 7 7 )= (-3.73)(.577)
= -2.16
6
!&
30))
U n d e r s t a n d i n g Scenes W l t h Shadows.
Page
Al,ld.
Light elevation
tan
= Cos eGLtan
= (.420)(.708)
= .298
+
(bG+( s i n
/ ,816
+ (-.909
(-l.ll)(tan
ecL/cos(bg
+L.=
( p + oc -
t a n (30°+ 75'-
-
= .298
(-15°))
tan
90°)
90°)
= .596
(1.11)(-.268)
30.8~
Assuming t h a t a n g l e FEA i s a r i g h t a n g l e i n t h e scene,
we
can t h e n f i n d t h e s u r f a c e orientations as w e l l ,
Orientation o f R 1
Weassume t h a t
BD i s v e r t i c a l .
$S(~
= 0l, ) i.e.
R 1 i s vertical,
Whether o r n o t t h i s i s so, we can f i n d
tan
and assuming s i n
8(~1)=
(PE(A)
t a n %,(A)
= sin
tan
4,
/ s i n &(A)
( R )we g e t
€I+RI)
=
(tan ( - 3 0 ~ ) ) / ( s i n (35.3Q))
= (-.577)
/ (.577)
= -1.
since
Understanding Scen.es W i t h Shadows. Pafie ALL2
e S ( ~=l )-45'
= 315~.
O r i e n t a t l o n o f R2
+=(~2)
Asain assume t h a t s i n c e BD and GF a r e v e r t i c a l ,
t a n t 3 5 ( ~ 2 ) = tan
a
t a n 30°/
s i n 35.3O
( T h i s I s r e a l l y a double-check.
7,(B)
*
/ sin &(B)
( . 5 7 7 ) ./ (.577)
eS ( R 2 )
=
= 1.
tf we assumed f o r f i n d l n g
t h a t a n g l e ABG was a r i g h t angle i n t h e scene,
-45O )
= 0.
then
es(ql)
=
$so.)
Orientation of R3
.(
Here t h e surface i s not v = r t i c a l , so we. .kill f i n d
first.
= )t a n T l ( ~ ) / . s i n
t a n eS(~3
+E(~).
Q s( R 3 )
Under s t a n d l ng Scenes W i t h Shadows. Page A1.13
+E ( E l =
and a g a i n assuming t h a t
+E
t a n e S ( ~ 3 )= t a n ( - 1 5 0 ~ ) / ( s i n 35.3')
(8)
= 1.
R ( R=~225O.
)
( N o t i c e t h a t h e r e we have t o use some a d d i t i o n a l i n f o r m a t t o n t o
t e l l t h a t t h i s a n g l e i s not 45O.I
tan
= ( ( - s i ' n % ( ~ ) / cos & ( E ) )
= (-(-.707)
=
&(~3
/ (.816)(3.73)
( ,232 +
-
tan-' ~ ( E I cos
.5
- (-.707)(.701)
= 1 / .732
dL(~3)
=
53.8'
eS(~lt a n & E ) )
= 1.37
1
Download