MASSA CHUSETT S INST

advertisement
MASSACHUSETTS INSTI TUTE OF TECHNOLOGY
ARTI FI CI AL I NTELLI GENCE LABORATORY
A.I . Memo No. 1376
September, 1992
Localizati on and Posi ti oni ng usi ng
Combi nati ons of Model Vi ews
Ehud Rivli n and Ronen Basri
Abstract
Amet hod for l ocal i zat i on and posi t i oni ng i n an i ndoor envi ronment i s present ed.
i s t he act of recogni zi ng t he envi ronment , and
i s t he act of comput i ng t he e
coordi nat es of a robot i n t he envi ronment . The met hod i s based on represent i ng t he sce
as a set of 2D vi ews and predi ct i ng t he appearance of novel vi ews by l i near combi nat i on
t he model vi ews. The met hod accurat el y approxi mat es t he appearance of scenes under weak
perspect i ve project i on. Anal ysi s of t hi s proj ect i on as wel l as experi ment al resul t s demo
t hat i n many cases t hi s approxi mat i on i s suci ent t o accurat el y descri be t he scene. W
ort hographi c approxi mat i on i s i nval i d, ei t her a l arger number of model s can be acqui red o
i t erat i ve sol ut i on t o account f or t he perspect i ve di st ort i ons can be empl oyed.
The present ed met hod has several advant ages over exi st i ng met hods. I t uses rel at i vel y
represent at i ons, t he represent at i ons are 2Drat her t han 3D, and l ocal i zat i on can be done
a si ngl e 2Dvi ew onl y. The same pri nci pal met hod i s appl i ed bot h f or t he l ocal i zat i on as
as t he posi t i oni ng probl ems, and a si mpl e al gori t hmf or
, t he t ask of ret urni
a previ ousl y vi si t ed posi t i on dened by a si ngl e vi ew, i s deri ved f romt hi s met hod.
Localizati
posi t i oni ng
repos i t i oni ng
c Massachuset t s I nst i t ut e of Technol ogy (1992)
Thi s report descri bes research done at t he Massachuset t s I nst i t ut e of Technol ogy wi t hi
Art i ci al I nt el l i gence Laborat ory and t he McDonnel l - Pew Cent er f or Cogni t i ve Neurosci e
Support f or t he l aborat ory's art i ci al i nt el l i gence research i s provi ded i n part by t he Ad
Research Proj ect s Agency of t he Depart ment of Def ense under Oce of Naval Research cont ract N00014- 91- J- 4038. Ronen Basri i s support ed by t he McDonnel l - Pew and t he Rot hchi l d
post doct oral f el l owshi ps. Ehud Ri vl i n i s at t he Uni versi ty of Maryl and, Col l ege Park, MD
1 Introduction
Basi c t asks i n aut onomous robot navi gat i on are l ocal i zat i on and posi t i oni ng.
act of recogni zi ng t he envi ronment , t hat i s, assi gni ng consi st ent l abel s t o di erent l ocat
i s t he act of comput i ng t he coordi nat es of t he robot i n t he envi ronment . Posi t i
i s a t ask compl ement ary t o l ocal i zat i on, i n t he sense t hat posi t i on ( e. g. , \1. 5 met ers no
of t abl e T ") i s of t en speci ed i n a pl ace- speci c coordi nat e syst em( \i n room911") . I
paper we suggest a met hod of bot h l ocal i zat i on and posi t i oni ng usi ng vi si on al one. Avar
of t he posi t i oni ng probl em, ref erred t o as
, i nvol vi ng t he ret urn t o a prev
vi si t ed pl ace i s al so di scussed.
Previ ous st udi es have exami ned t he probl ems of l ocal i zat i on and posi t i oni ng under a var
of condi t i ons, dened by t he ki nd of sensor( s) empl oyed, t he nat ure of t he envi ronment ,
t he represent at i ons used. We can di st i ngui sh bet ween act i ve and passi ve sensi ng, i ndoor
out door navi gat i on t asks, and met ri c and t opol ogi cal represent at i ons. The met ri c appr
at t empt s t o ut i l i ze a det ai l ed geomet ri c descri pt i on of t he envi ronment , whi l e t he t opo
approach uses a more qual i t at i ve descri pt i on i ncl udi ng a graph wi t h nodes represent i ng p
and arcs represent i ng sequences of act i ons t hat woul d resul t i n movi ng t he robot f romone n
t o anot her.
I n t he paper we consi der a robot t hat uses a passi ve sensor, vi si on, i n an i ndoor envi ron
The envi ronment cannot be changed by t he robot t o i mprove i t s perf ormance; nei t her beacon
nor oor or wal l marki ngs are empl oyed. The paper addresses bot h t he l ocal i zat i on and t
posi t i oni ng probl ems. Sol ut i ons t o t hese probl ems are present ed based on obj ect recogn
t echni ques. The met hod, based on t he l i near combi nat i ons scheme of [17], represent s sce
by set s of t hei r 2D i mages. Local i zat i on i s achi eved by compari ng t he observed i mage
l i near combi nat i ons of model vi ews, and t he posi t i on of t he robot i s comput ed by anal yz
t he coeci ent s of t he l i near combi nat i on t hat al i gns t he model t o t he i mage. Al so, a si m
\qual i t at i ve" sol ut i on t o t he reposi t i oni ng probl em usi ng t he l i near combi nat i ons sch
present ed.
The rest of t he paper i s organi zed as f ol l ows. The next sect i on descri bes t he l ocal i zat
posi t i oni ng probl ems and surveys previ ous sol ut i ons. The met hod of l ocal i zat i on and posi t
usi ng l i near combi nat i ons of model vi ews i s descri bed i n Sect i on 3. The met hod assumes we
perspect i ve proj ect i on. An i t erat i ve scheme t o account f or perspect i ve di st ort i ons i s p
i n Sect i on 4. An anal ysi s of t he error resul t i ng f romt he proj ect i on assumpt i on i s prese
Sect i on 5. Const rai nt s i mposed on t he mot i on of t he robot as a resul t of speci al propert
i ndoor envi ronment s can be used t o reduce t he compl exi t y of t he met hod present ed here. Th
t opi c i s covered on Sect i on 6. Experi ment al resul t s f ol l ow.
Local i z at i on
pos i t i oni ng
repos i t i oni ng
1
2 The Problem
Local i zat i on and posi t i oni ng f romvi sual i nput are dened i n t he f ol l owi ng way: Gi ven a
mi l i ar envi ronment , i dent i f y t he observed envi ronment , and t hen nd your posi t i on i n t
envi ronment . Local i zat i on resembl es t he t ask of obj ect recogni t i on, wi t h obj ect s repl a
scenes. Once l ocal i zat i on i s accompl i shed, posi t i oni ng can be perf ormed.
One probl ema syst emf or l ocal i zat i on and posi t i oni ng shoul d address i s t he vari abi l
i mages due t o vi ewpoi nt changes. The i nexact ness of pract i cal syst ems makes i t di cul t f
robot t o ret urn t o a speci ed posi t i on on subsequent vi si t s. The vi sual dat a avai l abl e
robot between vi si t s vari es i n accordance wi t h t he vi ewi ng posi t i on of t he robot . Al ocal
syst emshoul d be abl e t o recogni ze scenes f romdi erent posi t i ons and ori ent at i ons.
Anot her probl emi s t hat of changes i n t he scene. At subsequent vi si t s t he same pl ace m
l ook di erent due t o changes i n t he arrangement of t he obj ect s, t he i nt roduct i on of newobj
and t he removal of ot hers. I n general , some obj ect s t end t o be more st at i c t han ot hers. W
chai rs and books are of t en moved, t abl es, cl oset s, and pi ct ures t end t o change t hei r po
much l ess, and wal l s are al most guarant eed t o be st at i c. St at i c cues nat ural l y are more re
t han mobi l e ones. Conni ng t he syst emt o st at i c cues, however, may i n some cases resul t
f ai l ure t o recogni ze t he scene due t o i nsuci ent cues. The syst emshoul d t heref ore at t em
rel y on st at i c cues, but shoul d not i gnore t he dynami c cues.
Sol ut i ons t o t he probl emof l ocal i zat i on f romvi sual dat a requi re a l arge memory and he
comput at i on. Exi st i ng syst ems of t en t ry t o reduce t hi s cost by usi ng sparse representa
and by expl oi t i ng cont ext ual i nf ormat i on. Sparse represent at i ons are i nt roduced i n [ 1
Mat ari c [ 10] represent s scenes as sequences of l andmarks ( such as wal l s, doors, et c. ) ex
by t raci ng t he boundari es of t he scene usi ng a sonar and a compass. Met ri c i nf ormat i on
and bet ween t he l andmarks i s not st ored. Sarachi k [ 14] recogni zes a roomby i t s di mensi
whi ch are measured by i dent i f yi ng and l ocat i ng t he t op corners of t he roomusi ng st ereo d
( obt ai ned f romf our cameras) . I n bot h cases t he represent at i on i s very sparse, and t he sc
t heref ore of t en ambi guous.
Ri cher represent at i ons are used i n [ 3, 5] where hi gher success rat es are report ed. Bra
[ 3] represent s t he scene by an occupancy t abl e, a 2D bi t array whi ch cont ai ns a 1 at ev
l ocat i on occupi ed by some obj ect . The t abl e i s const ruct ed by t aki ng st ereo pi ct ures cov
360 f romt he mi ddl e of t he roomand proj ect i ng t he obt ai ned 3D dat a ont o t he oor. The
met hod suers f roml oss of i nf ormat i on due t o t he proj ect i on ont o t he oor.
Engel son
[ 5] represent t he scene by a set of i nvari ant \si gnat ures". A si gnat
usual l y composed of l ow- resol ut i on gray- l evel or range dat a obt ai ned by bl urri ng an i mag
set of si gnat ures t aken f romdi erent vi ewpoi nt s are st ored. Ascene i s recogni zed i f t he
encount ers a si gnat ure si mi l ar t o one of t he st ored si gnat ures.
Syst ems t hat use t he f ul l i nf ormat i on provi ded by t he i mage ( e. g. , [ 6, 12] ) usual l
on cont ext ual i nf ormat i on t o avoi d scanni ng al l t he model s i n t he memory and t o reduce t
comput at i onal cost of compari ng a model t o t he i mage. The syst emf ol l ows a predet ermi ne
et al .
2
pat h, so t hat t he i dent i ty of each vi si t ed l ocat i on i s known i n advance, and l ocal i zat i on b
a veri cat i on probl em. Pat h cont i nui ty i n many cases i s essent i al , and t he so- cal l ed \dro
probl emi s not addressed. The emphasi s i n t hese syst ems i s on posi t i oni ng, whi ch i s use
keep t he robot on t he pat h. I t i s typi cal f or t hese syst ems ( e. g. , [ 1, 6, 12] ) t o use a
model of t he envi ronment .
Onoguchi
[ 12] , among ot hers, represent t he envi ronment by a set of l andmarks sel e
f rompai rs of st ereo i mages by a human operat or. These l andmarks are t ransf ormed by an i mag
processi ng program whi ch i s desi gned so as t o i dent i f y t he speci c l andmark usi ng spe
ext ract i on i nst ruct i ons ( such as what f eat ures t o l ook f or and at what l ocat i ons) . Local
i s achi eved by appl yi ng t he ext ract i on procedure speci ed f or t he next l andmark. Onc
l andmark i s i dent i ed, t he posi t i on of t he robot rel at i ve t o t hat l andmark i s det ermi n
compari ng t he di mensi ons of t he observed l andmark wi t h t hose of t he st ored model .
The met hod present ed i n t hi s paper represent s t he envi ronment usi ng a set of edge map
Local i zat i on and posi t i oni ng are achi eved by compari ng i mages of t he envi ronment t o l i n
combi nat i ons of t he model vi ews. The met hod uses ri ch vi sual i nf ormat i on t o represent
scene. The syst emi s exi bl e. I n many cases i t i s capabl e of recogni zi ng i t s l ocat i o
coverage i s not requi red) . When one i mage i s not suci ent , addi t i ona
one i mage onl y ( 360
i mages can be acqui red t o sol ve t he l ocal i zat i on probl em. Cont ext can be used t o det erm
t he order of compari son of t he model s t o t he observed i mage and t o i ncrease t he condence
a gi ven mat ch, but cont ext i s not essent i al : t he syst emcan al so, by perf ormi ng more ext en
comput at i ons, sol ve t he \drop- o" probl em.
et al .
3 The Method
The probl ems of l ocal i zat i on and obj ect recogni t i on are si mi l ar i n many ways. Bot h probl
requi re t he mat chi ng of vi sual i mages t o st ored model s, ei t her of t he envi ronment or of
observed obj ect s. Bot h probl ems f ace si mi l ar di cul t i es, such as varyi ng i l l umi nat i on con
and changes i n appearance due t o vi ewpoi nt changes. Si mi l ar met hodol ogi es t heref ore can
used f or sol vi ng bot h probl ems.
A part i cul ar appl i cat i on of an obj ect recogni t i on scheme, t he Li near Combi nat i ons (
scheme [ 17] , t o t he probl ems of l ocal i zat i on and posi t i oni ng i s di scussed bel ow. The envi r
i s represent ed i n t hi s scheme by a smal l set of vi ews obt ai ned f romdi erent vi ewpoi nt s an
t he correspondence bet ween t he vi ews. A novel vi ew i s recogni zed by compari ng i t t o l i n
combi nat i ons of t he st ored vi ews. Posi t i oni ng i s achi eved by recoveri ng t he posi t i on
camera rel at i ve t o i t s posi t i on i n t he model vi ews f romt he coeci ent s of t he al i gni ng
combi nat i on. I n t he rest of t hi s sect i on we revi ewt he l i near combi nat i ons approach and des
i t s appl i cat i on t o bot h l ocal i zat i on and posi t i oni ng. The sect i on concl udes wi t h a sol u
t he probl emof reposi t i oni ng, t hat i s, t he probl emof ret urni ng t o a previ ousl y vi si t ed p
by \l ocki ng" i nt o an i mage acqui red i n t hat posi t i on.
3
3.1
Localizati on
The probl emof l ocal i zat i on i s dened as f ol l ows: gi ven P , a 2Di mage of a pl ace, and M, a s
i. Local i zat i on i s t he recogni t i on
st ored model s, nd a model i M2 M such t hat P mat ches M
of a pl ace. I t can t heref ore pot ent i al l y benet f romusi ng an obj ect recogni t i on met hod
A common approach t o handl i ng t he probl emof recogni t i on f romdi erent vi ewpoi nt s i s b
compari ng t he st ored model s t o t he observed envi ronment af t er t he vi ewpoi nt i s recovered
compensat ed f or. Thi s approach, cal l ed
, i s used i n a number of st udi es of ob
recogni t i on [ 2, 7, 8, 9, 15, 16] . We appl y t he al i gnment approach t o t he probl emof l ocal i z
The syst emdescri bed bel owuses t he \Li near Combi nat i ons" ( LC) scheme, whi ch was suggest ed
by Ul l man and Basri [ 17] .
We begi n wi t h a bri ef revi ewof t he LCscheme. LCi s dened as f ol l ows. Gi ven an i mage, we
const ruct two vi ewvect ors f romt he f eat ure poi nt s i n t he i mage, one cont ai ns t he x- coordi
of t he poi nt s, and t he ot her cont ai ns t he y- coordi nat es of t he poi nt s. An obj ect ( i n our
t he envi ronment ) i s model ed by a set of such vi ews, where t he poi nt s i n t hese vi ews are orde
i n correspondence. The appearance of a novel vi ew of t he obj ect i s predi ct ed by appl y
l i near combi nat i ons t o t he st ored vi ews. The predi ct ed appearance i s t hen compared wi t h
act ual i mage, and t he obj ect i s recogni zed i f t he t wo mat ch. The advant age of t hi s meth
i s t wof ol d. Fi rst , vi ewer- cent ered represent at i ons are used rat her t han obj ect - cent er
namel y, model s are composed of 2Dvi ews of t he observed scene; second, novel appearances a
predi ct ed i n a si mpl e and accurat e way ( under weak perspect i ve proj ect i on) .
Formal l y, gi ven P , a 2Di mage ofk a scene, and M, a set of st ored model s, t he obj ect i ve i
nd a model Mi 2 M such t hat P = j j Mji f or some const ant sj 2 R. I t has been shown
t hat t hi s scheme accurat el y predi ct s t he appearance of ri gi d obj ect s under weak perspe
proj ect i on ( ort hographi c proj ect i on and scal e) . The l i mi t at i ons of t hi s proj ect i on mo
di scussed l at er i n t hi s paper.
More concret el y, l et
i =p( x i; yi; zi) , 10 0 i n, be a set of n obj ect poi nt s. Under weak
perspect i ve proj ect i on, t he posi0i =(
t i xoni; pyi) of t hese poi nt s i n t he i mage are gi ven by
x0i = sr xi + s r yi +s r zi +t x
yi0 = s r xi +s r yi +s r zi +t y
( 1)
where rij are t he component s of a 3 2 3 rot at i on mat ri x, and s i s a scal e f act or. Rewri t i ng t
i n vect or equat i on f ormwe obt ai n
x0 = s r x +s r y +s r z +t x1
y0 = s r x +s r y +s r z +t y 1
( 2)
where x; y; z; 0 ;x y0 2 Rn are t he vect ors ofi, xyi, zi, x0i and yi0 coordi nat es respect i vel y, and
1 =( 1; 1; : : : ; 1) . Consequent l y,
x0; y0 2 fx; y; z; 1g
( 3)
al i gnment
P
=1
11
12
13
21
22
23
11
12
13
21
22
23
s pan
4
n. (ofNotRi ce t hat
or, i n ot her words,0 and
x y0 bel ong t o a f our- di mensi onal l i near subspace
0
z , t he vect or of dept h coordi nat es of t he proj ect ed poi nt s, al so bel ongs t o t hi s subspac
f act i s used i n Sect i on 4 bel ow. ) A f our- di mensi onal space i s spanned by any f our l i ne
i ndependent vect ors of t he space. Two vi ews of t he scene suppl y f our such vect ors [ 13,
Denot e by x , y and x , y t he l ocat i on vect ors of t he n poi nt s i n t he t wo i mages; t hen t her
exi st coeci ent s; aa ; a ; a and b ; b ; b ; b such t hat
x0 = a x +a y +a x +a 1
y0 = b x +b y +b x +b 1
( 4)
( Not e t hat t he vect or aly ready depends on t he ot her f our vect ors. ) Si nce R i s a rot at i o
mat ri x, t he coeci ent s sat i sf y t he f ol l owi ng t wo quadrat i c const rai nt s:
a +a +a 0 b 0 b 0 b =2( b b 0 aa ) r +2( b b 0 aa ) r
a b +a b +a b +( a b +a b ) r +( a b +a b ) r =0
( 5)
To deri ve t hese const rai nt s t he t ransf ormat i on bet ween t he two model vi ews shoul d be recove
Thi s can be done under weak perspect i ve usi ng a t hi rd i mage. Al t ernat i vel y, t he const ra
can be i gnored, i n whi ch case t he syst emwoul d conf use ri gi d t ransf ormat i ons wi t h ane on
Thi s usual l y does not prevent successf ul l ocal i zat i on si nce general l y scenes are f ai rl y
f romone anot her.
A LC scheme f or t he probl emof l ocal i zat i on i s as f ol l ows: The envi ronment i s model
by a set of i mages wi t h correspondence between t he i mages. For exampl e, a spot can b
model ed by t wo of i t s correspondi ng vi ews. The correspondi ng quadrat i c const rai nt s may a
be st ored. Local i zat i on i s achi eved by recoveri ng t he l i near combi nat i on t hat al i gns t he
t o t he observed i mage. The coeci ent s are det ermi ned usi ng f our model poi nt s and t he
correspondi ng i mage poi nt s by sol vi ng a l i near set of equat i ons. Three poi nt s are suci e
det ermi ne t he coeci ent s i f t he quadrat i c const rai nt s are al so consi dered. Addi t i onal
may be used t o reduce t he eect of noi se.
The LC scheme uses vi ewer- cent ered model s, t hat i s, represent at i ons t hat are compos
of i mages. I t has a number of advant ages over met hods t hat bui l d f ul l t hree- di mensi o
model s t o represent t he scene. Fi rst , by usi ng vi ewer- cent ered model s t hat cover rel at i vel
t ransf ormat i ons we avoi d t he need t o handl e occl usi ons i n t he scene. I f f romsome vi ewpo
t he scene appears di erent because of occl usi ons we ut i l i ze a new model f or t hese vi ewpo
Second, vi ewer- cent ered model s are easi er t o bui l d and t o mai nt ai n t han obj ect - cent ered
The model s cont ai n onl y i mages and correspondences. By l i mi t i ng t he t ransf ormat i on betwe
t he model i mages one can nd t he correspondence usi ng mot i on met hods. I f l arge port i ons
t he envi ronment are changed bet ween vi si t s a newmodel can be const ruct ed by si mpl y repl aci
ol d i mages wi t h newones.
One probl emwi t h usi ng t he LCscheme f or l ocal i zat i on i s due t o t he weak perspect i ve a
proxi mat i on. I n cont rast wi t h t he probl emof obj ect recogni t i on, where we can general l y as
1
1
2
1
2
2
3
4
1
2
3
4
1
1
1
1
2
2
1
3
1
3
2
2
4
4
2
2
1
2
2
2
3
1 1
2
1
2 2
2
2
2
3
3 3
1 3
1 3
1 3
3 1
5
11
11
2 3
2 3
2 3
3 2
12
12
t hat obj ect s are smal l rel at i ve t o t hei r di st ance f romt he camera, i n l ocal i zat i on t he
ment surrounds t he robot and perspect i ve di st ort i ons cannot be negl ect ed. The l i mi t at
of weak perspect i ve model i ng are di scussed bot h mat hemat i cal l y and empi ri cal l y i n t he n
two sect i ons. I t i s shown t hat i n many pract i cal cases weak perspect i ve i s suci ent t o e
accurat e l ocal i zat i on. The mai n reason i s t hat t he probl emof l ocal i zat i on does not r
accurat e measurement s i n t he ent i re i mage; i t onl y requi res i dent i f yi ng a suci ent numbe
spot s t o guarant ee accurat e nami ng. I f t hese spot s are rel at i vel y cl ose t o t he cent er
i mage, or i f t he dept h di erences t hey creat e are rel at i vel y smal l ( as i n t he case of l oo
a wal l when t he l i ne of si ght i s nearl y perpendi cul ar t o t he wal l ) , t he perspect i ve di st
are rel at i vel y smal l , and t he syst emcan i dent i f y t he scene wi t h hi gh accuracy. Al so,
rel at ed by a t ransl at i on paral l el t o t he i mage pl ane f orma l i near space even when perspe
di st ort i ons are l arge. Thi s case and ot her si mpl i cat i ons are di scussed i n Sect i on 6.
By usi ng weak perspect i ve we avoi d st abi l i ty probl ems t hat f requent l y occur i n perspec
comput at i ons. We can t heref ore comput e t he al i gnment coeci ent s by l ooki ng at a rel at i v
narrowel d of vi ew. The ent i re scheme can be vi ewed as an accumul at i ve process. Rat her t h
acqui ri ng i mages of t he ent i re scene and compari ng t hemal l t o a f ul l scene model ( as i n
we recogni ze t he scene i mage by i mage, spot by spot , unt i l we accumul at e suci ent convi nci
i nf ormat i on t hat i ndi cat es t he i dent i ty of t he pl ace.
When perspect i ve di st ort i ons are rel at i vel y l arge and weak perspect i ve i s i nsuci e
model t he envi ronment , t wo approaches can be used. One possi bi l i t y i s t o const ruct a l a
number of model s so as t o keep t he possi bl e changes bet ween t he f ami l i ar and t he novel vi e
smal l . Al t ernat i vel y, an i t erat i ve comput at i on can be appl i ed t o compensat e f or t hese d
t i ons. Such an i t erat i ve met hod i s descri bed i n Sect i on 4.
3. 2
Posi t i oni ng
Posi t i oni ng i s t he probl emof recoveri ng t he exact posi t i on of t he robot . Thi s posi t i on
speci ed i n a xed coordi nat e syst emassoci at ed wi t h t he envi ronment ( i . e. , roomcoordi na
or i t can be associ at ed wi t h some model , i n whi ch case l ocat i on i s expressed wi t h respect t
posi t i on f romwhi ch t he model vi ews were acqui red. I n t hi s sect i on we di scuss an appl i ca
of t he LCscheme t o t he posi t i oni ng probl em.
The i dea i s t he f ol l owi ng. We assume a model composed of two i mages,
and PP ; t hei r
rel at i ve posi t i on i s gi ven. Gi ven a novel 0;i mage
we rst
P al i gn t he model wi t h t he i mage
( i . e. , l ocal i zat i on) . By consi deri ng t he coeci ent s of t he l i near combi nat i on t he robot '
rel at i ve t o t he model i mages i s recovered. To recover t he absol ut e posi t i on of t he robot
roomt he absol ut e posi t i ons of t he model vi ews shoul d al so be provi ded.
Assumi ng P i s obt ai ned f romPby a rot at i on R, t ransl at i on xt; =(
yt ) ,t and scal i ng s , t he
0
0
0
coordi nat es of a poi nt i; n(Px; y) , can be wri t t en as l i near combi nat i ons of t he correspondi ng
model poi nt s i n t he f ol l owi ng way:
x0 = a x +a y +a x +a
1
2
1
1 1
2 1
6
3
2
4
2
=
y0
b 1 x1 +b 2y1 +b 3 x2 +b 4
( 6)
Subst i t ut i ng f or wex obt ai n
x0 = a x +a y +a ( s r x +s r y +s r z +t x ) +a
y0 = b x +b y +b ( s r x +s r y +s r z +t x ) +b
( 7)
and rearrangi ng t hese equat i ons we obt ai n
x0 = ( a +a s r ) x +( a +a s r ) y +( a s r ) z +( a tx +a )
y0 = ( b +b s r ) x +( b +b s r ) y +( b s r ) z +( b tx +a )
( 8)
Usi ng t hese equat i ons we can deri ve al l t he paramet ers of t he t ransf ormat i on between t he mo
and t he i mage. Assume t he i mage i s obt ai ned by a rot at i on U , t ransl
at i on
scalt i ngn .s
n , and
Usi ng t he ort honormal i t y const rai nt we can rst deri ve t he scal e f act or
sn = ( a +a s r ) +( a +a s r ) +( a s r )
= a +a +a s +2 a s ( a r +a r )
( 9)
FromEquat i ons ( 8) and ( 9) , by deri vi ng t he component s of t he t ransl at i non, we
vectcan
or, t
obt ai n t he posi t i on of t he robot i n t he i mage rel at i ve t o i t s posi t i on i n t he model vi ews
1x = a tx +a
1y = b ty +b
( 10)
1
1
1z = f ( s 0 s )
2
1
1
2 1
1 1
1
2 1
3
1
3
11
3
11
2
3
1
11
2
1
3
2
2
1
13 1
12 1
12
3
11
12 1
1
3
2
1
2
1
11
12
2
2
2 2
3
3
3
13 1
1
3
1
3
3
4
12
1 11
13
13
4
1
3
1
2
3
4
3
13
4
2
2 12
4
3
4
n
Not e t hat 1z i s deri ved f romt he change i n scal e of t he obj ect . The rot at i on mat ri x U betw
P and P 0 i s gi ven by
a +a s r
a +a s r
a sr
u =
u =
u =
sn
sn
sn
( 11)
b +a s r
b +a s r
b sr
u =
u =
u =
s
s
s
1
11
21
1
3
11
1
3
21
n
12
22
2
3
12
2
3
22
n
13
23
3
13
3
23
n
As was al ready ment i oned, t he posi t i on of t he robot i s comput ed here rel at i ve t o t he posi t i
t he camera when t he rst model i mage,, Pwas acqui red. 1x and 1z represent t he mot i on of
t he robot f romPt o P0; and t he rest of t he paramet ers represent i t s 3Drot at i on and el evat i
To obt ai n t he rel at i ve posi t i on t he t ransf ormat i on paramet ers between t he model
andvi ews, P
P , are requi red.
1
1
1
2
3. 3
Repos i t i oni ng
An i nt erest i ng vari ant of t he posi t i oni ng probl em, ref erred t o as
, i s de
f ol l ows. Gi ven an i mage, cal l ed t he
i mage, posi t i on yoursel f i n t he l ocat i on f rom
r epos i t i oni ng
t ar get
7
t hi s i mage was observed.One way t o sol ve t hi s probl emi s t o ext ract t he exact posi t i on f rom
whi ch t he t arget i mage was obt ai ned and di rect t he robot t o t hat posi t i on. I n t hi s sect i o
are i nt erest ed i n a more qual i t at i ve approach. Under t hi s approach posi t i on i s not compu
I nst ead, t he robot observes t he envi ronment and ext ract s onl y t he di rect i on t o t he t a
l ocat i on. Unl i ke t he exact approach, t he met hod present ed here does not requi re t he reco
of t he t ransf ormat i on bet ween t he model vi ews.
We assume we are gi ven wi t h a model of t he envi ronment t oget her wi t h a t arget i mage.
The robot i s al l owed t o t ake new i mages as i t i s movi ng t owards t he t arget . We assume a
hori zont al l y movi ng pl at f orm. ( I n ot her words, we assume t hree degrees of f reedomrat her t
si x; t he robot i s al l owed t o rot at e around t he vert i cal axi s and t ransl at e hori zont al l
val i di ty of t hi s const rai nt i s di scussed i n Sect i on 6. ) Bel owwe gi ve a si mpl e comput at i o
det ermi nes a pat h whi ch t ermi nat es i n t he t arget l ocat i on. At each t i me st ep t he robot acqu
a new i mage and al i gns i t wi t h t he model . By compari ng t he al i gnment coeci ent s wi t h t he
coeci ent s f or t he t arget i mage t he robot det ermi nes i t s next st ep. The al gori t hmi s di
i nt o t wo st ages. I n t he rst st age t he robot xat es on one i dent i abl e poi nt and moves al
a ci rcul ar pat h around t he xat i on poi nt unt i l t he l i ne of si ght t o t hi s poi nt coi nci de
t he l i ne of si ght t o t he correspondi ng poi nt i n t he t arget i mage. I n t he second st age t he
advances f orward or ret reat s backward unt i l i t reaches t he t arget l ocat i on.
Gi ven a model composed of t wo i mages, and
P P , P i s obt ai ned f rom Pby a rot at i on
about t he Y - axi s by an angl e , hori zont al t ransl
i onscal
t e f act or s . Gi ven a t arget
x, atand
i mage Pt, Pt i s obt ai ned f rom Pby a si mi l ar rot at i on by an angl e , t ranslt, atand
i onscal
t e
st . Usi ng Eq. ( 4) t he posi t i on of a t arget poi
( x be expressed as
t ; yt )ntcan
xt = a x +a x +a
yt = b y
( 12)
( The rest of t he coeci ent s are zero si nce t he pl at f ormmoves hori zont al l y. ) I n f act , t he
ci ent s are gi ven by
s si n( 0 )
a = t
si n st si n a =
( 13)
s si n t s si n a = tt0 x t
s si n b = st
( The deri vat i on i s gi ven i n t he Appendi x. )
At every t i me st ep t he robot acqui res an i mage and al i gns i t wi t h t he above model . Assum
t hat i mage Pp i s obt ai ned as a resul t of a rot at i on by an angl e , t ransl
i on tep.s
p , andat scal
1
1
2
2
1
1
1 1
3 2
4
2 1
1
3
4
2
1
This problem can be c ons i de r e d as a var i ant of t he homi ng pr obl e m. A di s c us s i on of t he ge ne r al homi ng
pr obl e mwi t h a \s i gnat ur e - bas e d" s ol ut i on c an be f ound i n[11].
8
The posi t i on of a poi ntp; (ypx) i s expressed by
xp = c x +c x +c
yp = d y
( 14)
where t he coeci ent s are gi ven by
s si n( 0 )
c = p
si n sp si n ( 15)
c =
s si n t s si n c = tp0 x p
s si n d = sp
The st ep perf ormed by t he robot i s det ermi ned by
c a
= 0
( 16)
c a
That i s,
s si n( 0 ) s si n( 0 )
( 17)
=
si n 0 si n =s si n ( cot 0 cot )
The robot shoul d now move so as t o reduce t he absol ut e val ue of . The di rect i on of mot i
depends on t he si gn of . The robot can deduce t he di rect i on by movi ng sl i ght l y t o t he s
and checki ng i f t hi s mot i on resul t s i n an i ncrease or decrease of . The mot i on i s den
f ol l ows. The robot moves t o t he ri ght ( or t o t he l ef t , dependi ng on whi ch di rect i on reduce
by a st ep 1x .
A new i mage Pn i s now acqui red, and t he xat ed poi nt i s l ocat ed i n t hi s i mage. Denot
i t s newposi t i on byn . xSi nce t he mot i on i s paral l el t o t he i mage pl ane t he dept h val ues of t
poi nt i n t he two vi ews,p and
P Pn, are i dent i cal . We now want t o rot at e t he camera so as t o
ret urn t he xat ed poi nt t o i t s ori gi nal posi t i on. The angl e of rot at i on, , can be deduce
t he equat i on
xp = x n cos +si n ( 18)
Thi s equat i on has two sol ut i ons. We chose t he one t hat count ers t he t ransl at i on ( namel y
t ransl at i on i s t o t he ri ght , t he camera shoul d rot at e t o t he l ef t ) , and t hat keeps t he a
rot at i on smal l . I n t he next t i me st ep t he new pinctrepl
ure aces
P pPand t he procedure i s
repeat ed unt i l vani shes. The resul t i ng pat h i s ci rcul ar around t he poi nt of f ocus.
Once t he robot arri ves at a posi t i on f or whi ch = 0 ( namel y, i t s l i ne of si ght coi n
wi t h t hat of t he t arget i mage, and = ) i t shoul d nowadvance f orward or ret reat backwar
t o adj ust i t s posi t i on al ong t he l i ne of si ght . Several measures can be used t o det ermi
di rect i on of mot i on; one exampl e i s t he t=aermwhic ch sat i ses
sp
c
=
( 19)
a
st
when t he t wo l i nes of si ght coi nci de. The obj ect i ve at t hi s st age i s t o bri ng t hi s measure
1
1
3
2 1
1
3
4
2
1
1
3
3
1
1
1
9
1
2
4
4 Handl i ng Perspecti ve Di storti ons
The l i near combi nat i on scheme present ed above accurat el y handl es changes i n vi ewpoi nt assu
i ng t he i mages are obt ai ned under weak perspect i ve proj ect i on. Error anal ysi s and experi m
resul t s demonst rat e t hat i n many pract i cal cases t hi s assumpt i on i s val i d. I n cases wher
spect i ve di st ort i ons are t oo l arge t o be handl ed by a weak perspect i ve approxi mat i on, mat c
bet ween t he model and t he i mage can be f aci l i t at ed i n t wo ways. One possi bi l i t y i s t o av
cases of l arge perspect i ve di st ort i on by augment i ng t he l i brary of st ored model s wi t h addi
model s. I n a rel at i vel y dense l i brary t here usual l y exi st s a model t hat i s rel at ed t o t h
by a suci ent l y smal l t ransf ormat i on avoi di ng such di st ort i ons. The second al t ernat i ve
i mprove t he mat ch bet ween t he model and t he i mage usi ng an i t erat i ve process. I n t hi s sect
we consi der t he second opt i on.
The suggest ed i t erat i ve process i s based on a Tayl or expansi on of t he perspect i ve co
nat es. As descri bed bel ow, t hi s expansi on resul t s i n a pol ynomi al consi st i ng of t erms
of whi ch can be approxi mat ed by l i near combi nat i ons of vi ews. The rst t ermof t hi s ser
represent s t he ort hographi c approxi mat i on. The process resembl es a met hod of mat chi ng
poi nt s wi t h 2Dpoi nt s descri bed recent l y by DeMent hon and Davi s [ 4] . I n t hi s case, howev
t he met hod i s appl i ed t o 2Dmodel s rat her t han 3Dones. I n our appl i cat i on t he 3Dcoordi nat
of t he model poi nt s are not provi ded; i nst ead t hey are approxi mat ed f romt he model vi ews.
An i mage poi nt ( x; y ) =( f X=Z; f Y =Z) i s t he proj ect i on of some obj ect poi nt , ( X; Y;
t he i mage, where f denot es t he f ocal l engt h. Consi der t he f ol l owi ng Tayl or expansi on of
around some dept h val ue Z:
0
1 =
Z
1 f (k) ( Z )
0
X
k=0
k!
( Z 0 Z)k
0
( 0 1)k ( Z 0 Z)k
( 20)
k
k ( k 0 1) ! Z
1
k
k
= Z1 1 + ((k001)1) ! Z 0Z Z
k
The Tayl or seri es descri bi ng t he posi t i on of a poi nt x i s t heref ore gi ven by
1 ( 0 1)
k Z0 Z k
fX fX
x=
=
1
+
( 21)
Z
Z
k ( k 0 1) ! Z
k t hee by 1
Not i ce t hat t he zero t ermcont ai ns t he ort hographi c approxi mat i on f or x . Denot
k t h t ermof t he seri es:
k
k
( 22)
1 k = fZX ((k001)1) ! Z 0Z Z
Arecursi ve deni t i on of t he above seri es i s gi ven bel ow.
= Z1 +
0
"
1
X
0
+1
0
=1
X
0
#
0
0
=1
"
0
#
X
0
0
=1
( )
0
( )
0
0
10
Initialization:
x( 0)
=1
( 0)
fX
Z0
=
Iterative step:
Z k0
1 k = 0 ( Zk 00 1)
1
Z
x k = x k0 +1 k
where x k represent s t he k t h order approxi mat i on f or x ,k and
represent
1 s t he hi ghest order
k
t ermi n x .
Accordi ng t o t he ort hographi c approxi mat i on bot h X and Z can be expressed as l i near com
bi nat i ons of t he model vi ews ( Eq. ( 4) ) . We t heref ore appl y t he above procedure, approxi mat
X and Z at every st ep usi ng t he l i near combi nat i on t hat best al i gns t he model poi nt s wi t h
i mage poi nt s. The general i dea i s t heref ore t he f ol l owi ng. Fi rst , weand
est 1i matby
ex
sol vi ng t he ort hographi c case. Then at each st ep of t he i t erat i on we i mprove t he est i mat
seeki ng t he l i near combi nat i on t hat best est i mat es t he f act or
0
( )
(
1)
0
( )
(
1)
( )
( )
( )
( )
( 0)
( 0)
Z
x 0 xk0
0 ( Zk 00 1)
Z 1 k0
(
0
(
0
1)
( 23)
1)
Denot e by x 2 Rn t he vect or of i mage poi nt coordi nat es, and denot e by
P =[ x ; y ; x ; 1]
( 24)
an n 2 4 mat ri x cont ai ni ng t he posi t i on of t he poi nt s i n t he t wo model i mages. Denot e
P = ( P T P )0 P T t he pseudo- i nverse of P ( we assume P i s overdet ermi ned) . Al so denote
by a k t he coeci ent s comput ed f or t he k t h st ep.k represent
Pa
s t he l i near combi nat i on
comput ed at t hat st ep t o approxi mat e t he X or t he Z val ues. Si nce at every
, fst, ep
andZ
k and
k are const ant t hey can be merged i nt o t he l i near combi nat i on. Denot
e by1xk t he
vect ors of comput ed val ues of x and 1at t he k t h st ep. An i t erat i ve procedure t o al i gn a mo
t o t he i mage i s descri bed bel ow.
1
+
1
2
1
( )
( )
( )
Initialization:
Sol ve t he ort hographi c approxi mat i on, namel y
a =P x
x = 1 = Pa
( 0)
( 0)
Iterative step:
+
( 0)
( 0)
q k = ( x 0 x k0 ) 4 1 k0
ak = P qk
1 k = ( P a k ) 1 k0
x k = x k0 +1 k
( )
( )
(
+
( )
( )
(
( )
( )
(
1)
1)
11
(
( )
1)
1)
0
( )
where t he vect or operat i ons and 4 are dened as
u v = ( u v ; : : : n;vnu)
u4 v = ( uv ; : : uv: n;)
1 1
1
n
1
5 Projecti on M
odel { Error Anal ysi s
I n t hi s sect i on we est i mat e t he error obt ai ned by usi ng t he l i near combi nat i on met hod.
met hod assumes a weak perspect i ve proj ect i on model . We compare t hi s assumpt i on wi t h t he
more accurat e perspect i ve proj ect i on model .
Apoi nt ( X; Y; Z) i s proj ect ed under t he perspect i ve model t o ( x; y ) =( f X=Z; f Y =Z) i n
i mage, where f denot es t he f ocal l engt h. Under our weak perspect i ve model t he same poi
i s approxi mat ed by ( x^ ; ^y ) =( s X; s Y ) where s i s a scal i ng f act or. The best est i mat e f or
scal i ng f act or, i s gi ven by s =, fwhere
=Z Z i s t he average dept h of t he observed envi ronment .
Denot e t he error by
E = j ^x 0 xj
( 25)
The error i s expressed by
1 1
E = f X( 0 )
( 26)
Z Z
Changi ng t o i mage coordi nat es
1 1
( 27)
E = xZ ( 0 )
Z Z
or
Z
E = jxj
0 1
( 28)
Z
The error i s smal l when t he measured f eat ure i s cl ose t he opt i cal axi s, or when our est i
f or t he dept h, ,Z i s cl ose t o t he real dept h, Z. Thi s support s t he basi c i nt ui t i on t hat
i mages wi t h l owdept h vari ance and f or xat ed regi ons ( regi ons near t he cent er of t he i ma
t he obt ai ned perspect i ve di st ort i ons are rel at i vel y smal l , and t he syst emcan t heref ore
t he scene wi t h hi gh accuracy. Fi gures 1 and 2 showt he dept h ratasi oaZ=Z
f unct i on of x f or
=10 and 20 pi xel s, and Tabl e 5 shows a number of exampl es f or t hi s f unct i on. The al l owe
dept h vari ance, Z=Z
, i s comput ed as a f unct i on of x and t he t ol erat ed error, . For exampl
a 10 pi xel error t ol erat ed i n a el d of vi ew of up t o 650 pi xel s i s equi val ent t o al l owi ng
vari at i ons of 20%. Fromt hi s di scussi on i t i s apparent t hat when a model i s al i gned t o t he i
t he resul t s of t hi s al i gnment shoul d be j udged di erent l y at di erent poi nt s of t he i mage
f art her away a poi nt i s f romt he cent er t he more di screpancy shoul d be t ol erat ed between
predi ct i on and t he act ual i mage. Ave pi xel error at posi t i on x =50 i s equi val ent t o a 10
error at posi t i on x =100.
So f ar we have consi dered t he di screpanci es bet ween t he weak perspect i ve and t he persp
t i ve proj ect i ons of poi nt s. The accuracy of t he LCscheme depends on t he val i di ty of t he w
0
0
0
0
0
0
0
0
12
4.5
10/x + 1
4
3.5
3
2.5
2
1.5
1
0
50
100
150
200
250
300
Fi gure 1:ZZ0 as a f unct i on of x f or =10 pi xel s.
8
20/x + 1
7
6
5
4
3
2
1
0
50
100
150
200
250
300
Fi gure 2:ZZ0 as a f unct i on of x f or =20 pi xel s.
13
5
1. 2
1. 1
1. 07
1. 05
xn
25
50
75
100
10 15 20
1. 4 1. 6 1. 8
1. 2 1. 3 1. 4
1. 13 1. 2 1. 27
1. 1 1. 15 1. 2
, as a f unct i on of x ( hal f t he wi dt h of t he el d consi dered)
Tabl e 1: Al l owed dept h rat iZZos,
and t he error al l owed ( , i n 0pi xel s) .
perspect i ve proj ect i on bot h i n t he model vi ews and f or t he i ncomi ng i mage. I n t he rest of
sect i on we devel op an error t ermf or t he LC scheme assumi ng t hat bot h t he model vi ews and
t he i ncomi ng i mage are obt ai ned by perspect i ve proj ect i on.
The error obt ai ned by usi ng t he LCscheme i s gi ven by
E = jx 0 ax 0 b y0 c x0 dj
( 29)
Si nce t he scheme accurat el y predi ct s t he appearances of poi nt s under weak perspect i ve pr
t i on, i t sat i ses
^x = a ^x 0 b ^y0 c ^x0 d
( 30)
where accent ed l et t ers represent ort hographi c approxi mat i ons. Assume t hat i n t he t wo mo
pi ct ures t he dept h rat i os are roughl y equal :
1
1
Z0M
ZM
1
1
2
2
= ZZ ZZ
01
02
1
2
( 31)
( Thi s condi t i on i s sat i sed, f or exampl e, when bet ween t he two model i mages t he camera on
t ransl at es al ong t he i mage pl ane. ) Usi ng t he f act t hat
x=
we obt ai n
E
f X f X Z0
=Z Z
Z
0
=^x ZZ
( 32)
0
= jx 0 ax0 b y0 c x0 dj
M
M
M
^xZZ 0 a ^xZZ M 0 b ^yZZ M 0 c ^xZZ M 0
=
=
1
0
1
1
2
0
1
0
M
^xZZ0 0 ( a1^x0 b 1^y0 c 2^x) ZZ0M
Z0M
Z0
^x Z 0 ( ^x 0 Zd M) 0 d
14
2
0
d
0 d
( 33)
=
M
M
^x (ZZ0 0 ZZ0M ) 0 dZZ( 0M 0 1)
M
Z0M Z0
Z0
j ^xj Z 0 Z M + jdj Z M 0 1
The error t heref ore depends on t wo t erms. The rst get s smal l er as t he i mage poi nt s get cl
t o t he cent er of t he f rame and as t he di erence bet ween t he dept h rat i os of t he model and t
i mage get s smal l er. The second get s smal l er as t he t ransl at i on component get s smal l er an
t he model get s cl ose t o ort hographi c.
Fol l owi ng t hi s anal ysi s, weak perspct i ve can be used as a proj ect i on model when t he de
vari at i ons i n t he scene are rel at i vel y l owand when t he syst emconcent rat es on t he cent er
of t he i mage. We concl ude t hat , by xat i ng on di st i ngui shed part s of t he envi ronment ,
l i near combi nat i ons scheme can be used f or l ocal i zat i on and posi t i oni ng.
6 Imposi ng Constrai nts
Local i zat i on and posi t i oni ng requi re a l arge memory and a great deal of on- l i ne comput at
A l arge number of model s must be st ored t o enabl e t he robot t o navi gat e and mani pul at e
i n rel at i vel y l arge and compl i cat ed envi ronment s. The comput at i onal cost of model - i m
compari son i s hi gh, and i f cont ext ( such as pat h hi st ory) i s not avai l abl e t he number of req
compari sons may get very l arge. To reduce t hi s comput at i onal cost a number of const rai nt s m
be empl oyed. These const rai nt s t ake advant age of t he st ruct ure of t he robot , t he propert i
i ndoor envi ronment s, and t he nat ural propert i es of t he navi gat i on t ask. Thi s sect i on exa
some of t hese const rai nt s.
One t hi ng a syst emmay at t empt t o do i s t o bui l d t he set of model s so as t o reduce t he
eect of perspect i ve di st ort i ons i n order t o avoi d perf ormi ng i t erat i ve comput at i ons.
of t he envi ronment obt ai ned when t he syst em l ooks rel at i vel y deep i nt o t he scene usua
sat i sf y t hi s condi t i on. When perspect i ve di st ort i ons are l arge t he syst emmay consi der mo
subset s of vi ews rel at ed by a t ransl at i on paral l el t o t he i mage pl ane ( perpendi cul ar t o t
of si ght ) . I n t hi s case t he dept h val ues of t he poi nt s are roughl y equal across al l
consi dered, and i t can be shown t hat novel vi ews can be expressed by l i near combi nat i ons
two model vi ews even i n t he presence of l arge perspect i ve di st ort i ons. Thi s becomes appa
f romt he f ol l owi ng deri vat i on. Let
i; Yi;( ZXi) ; 1 i n be a poi nt proj ect ed i n t he i mage
t o ( xi; yi) = ( f Xi =Zi; f Yi=Zi) , and l et 0i(; xyi0 ) be t he proj ect ed poi nt af t er appl yi ng a ri gi d
t ransf ormat i on. Assumi ng t hat
i0 = Z i we obt ai n
Zi x0i = r Xi +r Yi +r Zi +t x
Zi yi0 = r Xi +r Yi +r Zi +t y
( 34)
Di vi di ng by iZwe obt ai n
1
x0i = r xi +r yi +r +t x
Z
11
12
13
21
22
23
11
12
15
13
i
( 35)
= r xi +r yi +r +t y Z1
i
Rewri t i ng t hi s i n vect or equat i on f ormgi ves
x0 = r x +r y +r 1 +t xz0
y0 = r x +r y +r 1 +t y z0
( 36)
where x, y, x0 , and y0 are t he vect ors ofi , xyi, x0i, and yi0 val ues respect i vel y, 1 i s a vect or
of al l 1s, and0 zi s a vect or of 1i =Z
val ues. Consequent l y, as i n t he weak perspect i ve case,
novel vi ews obt ai ned by a t ransl at i on paral l el t o t he i mage pl ane can be expressed by l i
combi nat i ons of f our vect ors.
An i ndoor envi ronment usual l y provi des t he robot wi t h a at , hori zont al support . Con
quent l y, t he mot i on of t he camera i s of t en const rai ned t o rot at i on about t he vert i cal ( Y
and t o t ransl at i on i n t he XZ- pl ane. Such mot i on has onl y t hree degrees of f reedomi nst ea
t he si x degrees of f reedomi n t he general case. Under t hi s const rai nt f ewer correspondenc
requi red t o al i gn t he model wi t h t he i mage. For exampl e, i n Eq. ( 4) ( above) t he coeci e
a = b = b = b =0. Three poi nt s rat her t han f our are requi red t o det ermi ne t he coeci ent
by sol vi ng a l i near syst em. Two, rat her t han t hree, are requi red i f t he quadrat i c const rai
al so consi dered. Anot her advant age t o consi deri ng onl y hori zont al mot i on i s t he f act t ha
mot i on const rai ns t he possi bl e epi pol ar l i nes between i mages. Thi s f act can be used t o
t he t ask of correspondence seeki ng.
Obj ect s i n i ndoor envi ronment s somet i mes appear i n roughl y pl anar set t i ngs. I n part i c
t he rel at i vel y st at i c obj ect s t end t o be l ocat ed al ong wal l s. Such obj ect s i ncl ude wi
shel ves, pi ct ures, cl oset s and t abl es. When t he assumpt i on of ort hographi c proj ect i on i
( f or exampl e, when t he robot i s rel at i vel y di st ant f romt he wal l , or when t he l i ne of si
roughl y perpendi cul ar t o t he wal l ) t he t ransf ormat i on between any t wo vi ews can be descri
by a 2Dane t ransf ormat i on. The di mensi on of t he space of vi ews of t he scene i s t hen reduc
t o t hree ( rat her t han f our) , and Eq. ( 4) becomes
x0 = a x +a y +a 1
y0 = b x +b y +b 1
( 37)
( a =b =0. ) Onl y one vi ewi s t heref ore suci ent t o model t he scene.
Most oce- l i ke i ndoor envi ronment s are composed of rooms connect ed by corri dors. Navi
gat i ng i n such an envi ronment i nvol ves maneuveri ng t hrough t he corri dors, ent eri ng and exi
t he rooms. Not al l poi nt s i n such an envi ronment are equal l y i mport ant . Junct i ons, pl aces w
t he robot f aces a number of opt i ons f or changi ng i t s di rect i on, are more i mport ant t han o
pl aces f or navi gat i on. I n an i ndoor envi ronment t hese pl aces i ncl ude t he t hreshol ds of
and t he begi nni ngs and ends of corri dors. A navi gat i on syst emwoul d t heref ore t end t o st
more model s f or t hese poi nt s t han f or ot hers.
One i mport ant propert y shared by many j unct i ons i s t hat t hey are conned t o rel at i ve
smal l areas. Consi der f or exampl e t he t hreshol d of a room. I t i s a rel at i vel y narrow
yi0
21
22
23
11
12
13
21
22
23
1
1
1
2
3
1
3
4
1
1
1
1
3
16
2
2
1
1
4
4
t hat separat es t he roomf romt he adj acent corri dor. When a robot i s about t o ent er a roo
a common behavi or i ncl udes st eppi ng t hrough t he door, l ooki ng i nt o t he room, and i dent i f y
i t bef ore a deci si on i s made t o ent er t he roomor t o avoi d i t . The set of i nt erest i ng i mag
t hi s t ask i ncl udes t he set of vi ews of t he roomf romi t s ent rance. Provi ded t hat t hreshol
narrowt hese vi ews are rel at ed t o each ot her al most excl usi vel y by rot at i on around t he ver
axi s. Under perspect i ve proj ect i on, such a rot at i on i s rel at i vel y easy t o recover. The
of poi nt s i n novel vi ews can be recovered f romone model vi ew onl y. Thi s i s apparent f r
t he f ol l owi ng deri vat i on. Consi der a poi nt p =( X; Y; Z) . I t s posi t i on i n a model vi ew i s
by ( x; y ) =( f X=Z; f Y =Z) . Now, consi der anot her vi ew obt ai ned by a rot at i on R around t
camera. The l ocat i on of p i n t he newvi ewi s gi ven by ( assumi ng f =1)
( 38)
( x0 ; y0) = ( rr XX ++rr YY ++rr ZZ ; rr XX ++rr YY ++rr ZZ )
i mpl yi ng t hat
( x0 ; y0 ) = ( rr xx ++rr yy ++rr ; rr xx ++rr yy ++rr )
( 39)
Dept h i s t heref ore not a f act or i n det ermi ni ng t he rel at i on bet ween t he vi ews. Eq. ( 39) bec
even si mpl er i f onl y rot at i ons about t he Y - axi s are consi dered:
+si n y
( x0; y0 ) =( 0x cos
;
)
( 40)
x si n +cos 0 x si n +cos where i s t he angl e of rot at i on. I n t hi s case can be recovered merel y f roma si ngl e c
spondence.
11
12
13
21
22
23
31
32
33
31
32
33
11
12
13
21
22
23
31
32
33
31
32
33
7 Experi ments
The LC met hod was i mpl ement ed and appl i ed t o i mages t aken i n an i ndoor envi ronment .
I mages of t wo oces, Aand B, t hat have si mi l ar st ruct ures were t aken usi ng a Panasoni c camer
wi t h a f ocal l engt h of 700 pi xel s. Semi - st at i c obj ect s, such as heavy f urni t ure and pi ct ur
used t o di st i ngui sh bet ween t he oces. Fi gure 3 shows t wo model vi ews of oce A. The vi ews
were t aken at a di st ance of about 4mf romt he wal l . Correspondences were pi cked manual l
The resul t s of al i gni ng t he model vi ews t o i mages of t he t wo oces are present ed i n Fi gur
The l ef t i mage cont ai ns an overl ay of a predi ct ed i mage ( t he t hi ck whi t e l i nes) , const ruc
l i nearl y combi ni ng t he t wo vi ews, and an act ual i mage of oce A. Agood mat ch between t he
two was achi eved. The ri ght i mage cont ai ns an overl ay of a predi ct ed i mage const ruct ed f r
a model of oce B and an i mage of oce A. Because t he oces share a si mi l ar st ruct ure t he
st at i c cues ( t he wal l corners) were perf ect l y al i gned. The semi - st at i c cues, however,
mat ch any f eat ures i n t he i mage.
Fi gure 5 shows t he mat chi ng of t he model of oce Awi t h an i mage of t he same oce obt ai ned by a rel at i vel y l arge mot i on f orward ( about 2m) and t o t he si de ( about 1. 5m) . Al t ho
17
Fi gure 3: Two model vi ews of oce A.
Fi gure 4: Mat chi ng a model of oce At o an i mage of oce A( l ef t ) , and mat chi ng a model of
oce B t o t he same i mage ( ri ght ) .
Fi gure 5: Mat chi ng a model of oce At o an i mage of t he same oce obt ai ned by a rel at i vel y
l arge mot i on f orward and t o t he ri ght .
18
Fi gure 6: Two model vi ews of a corri dor.
Fi gure 7: Mat chi ng t he corri dor model wi t h t wo i mages of t he corri dor. The ri ght i mage w
obt ai ned by a rel at i vel y l arge mot i on f orward ( about hal f of t he corri dor l engt h) and t
ri ght .
t he di st ances are rel at i vel y short most perspect i ve di st ort i ons are negl i gi bl e, and a goo
bet ween t he model and t he i mage i s obt ai ned.
Anot her set of i mages was t aken i n a corri dor. Here, because of t he deep st ruct ure
t he corri dor, perspect i ve di st ort i ons are not i ceabl e. Nevert hel ess, t he al i gnment res
demonst rat e an accurat e mat ch i n l arge port i ons of t he i mage. Fi gure 6 shows t wo model vi e
of t he corri dor. Fi gure 7 ( l ef t ) shows an overl ay of a l i near combi nat i on of t he model
wi t h an i mage of t he corri dor. I t can be seen t hat t he part s t hat are rel at i vel y di st ant
perf ect l y. Fi gure 7 ( ri ght ) shows t he mat chi ng of t he corri dor model wi t h an i mage obt ai ne
a rel at i vel y l arge mot i on ( about hal f of t he corri dor l engt h) . Because of perspect i ve di s
t he rel at i vel y near f eat ures no l onger al i gn ( e. g. , t he near door edges) . The rel at i vel y f
however, st i l l mat ch.
The next experi ment shows t he appl i cat i on of t he i t erat i ve process present ed i n Sect i
19
i n cases where l arge perspect i ve di st ort i on were not i ceabl e. Fi gure 8 shows two model v
and Fi gure 9 shows t he resul t s of mat chi ng a l i near combi nat i on of t he model vi ews t o
i mage of t he same oce. I n t hi s case, because t he i mage was t aken f roma rel at i vel y cl
di st ance, perspect i ve di st ort i ons cannot be negl ect ed. The eect s of perspect i ve di st ort
be not i ced on t he ri ght corner of t he board, and on t he edges of t he hanger on t he t op ri g
Perspect i ve eect s were reduced by usi ng t he i t erat i ve process. The resul t s of appl yi n
procedure af t er one and t hree i t erat i ons are shown i n Fi gure 10.
The experi ment al resul t s demonst rat e t hat t he LCmet hod achi eves accurat e l ocal i zat i o
many cases, and t hat when t he met hod f ai l s because of l arge perspect i ve di st ort i ons an i t e
comput at i on can be used t o i mprove t he qual i t y of t he mat ch.
8 Concl usi ons
Amet hod of l ocal i zat i on and posi t i oni ng i n an i ndoor envi ronment was present ed. The met h
i s based on represent i ng t he scene as a set of 2Dvi ews and predi ct i ng t he appearance of n
vi ews by l i near combi nat i ons of t he model vi ews. The met hod accurat el y approxi mat es t h
appearances of scenes under weak perspect i ve proj ect i on. Anal ysi s of t hi s proj ect i on a
as experi ment al resul t s demonst rat e t hat i n many cases t hi s approxi mat i on i s suci ent
accurat el y descri be t he scene. When t he weak perspect i ve approxi mat i on i s i nval i d, ei t
l arger number of model s can be acqui red or an i t erat i ve sol ut i on can be empl oyed t o acco
f or t he perspect i ve di st ort i ons.
The met hod present ed i n t hi s paper has several advant ages over exi st i ng met hods. I t u
rel at i vel y ri ch represent at i ons; t he represent at i ons are 2Drat her t han 3D, and l ocal i za
be done f roma si ngl e 2D vi ew onl y. The same basi c met hod i s used i n bot h t he l ocal i zat i
and posi t i oni ng probl ems, and a si mpl e al gori t hmf or reposi t i oni ng i s deri ved f romt hi s me
Fut ure work i ncl udes handl i ng t he probl emof acqui si t i on and mai nt enance of model s, devel
i ng eci ent and robust al gori t hms f or sol vi ng t he correspondence probl em, and bui l di ng m
usi ng vi sual i nput .
Appendi x
I n t hi s appendi x we deri ve t he expl i ci t val ues of t he coeci ent s of t he l i near combi nat i ons
case of hori zont al mot i on. Consi der a poi nt p =( x; y ; z ) t hat i s proj ect ed by weak persp
t o t hree i mages, , PP , and P0; P i s obt ai ned f rom Pby a rot at i on about t he Y - axi s by an
angl e , t ransl at imon, and
t scal e f act or
m, sand P0 i s obt ai ned f rom Pa rot at i on about t he
Y - axi s by an angl e , t ransl at
i on scal
t e ps. The posi t i on of p i n t he t hree i mages i s gi ven
p and
by
( x ; y ) = ( x; y )
1
2
2
1
1
1
1
20
Fi gure 8: Two model vi ews of oce C.
Fi gure 9: Mat chi ng t he model t o an i mage obt ai ned by a rel at i vel y l arge mot i on. Perspect
di st ort i ons can be seen i n t he t abl e, t he board, and t he hanger at t he upper ri ght .
Fi gure 10: The resul t s of appl yi ng t he i t erat i ve process t o reduce perspect i ve di st ort i o
one ( l ef t ) and t hree ( ri ght ) i t erat i ons.
21
( x ; y ) = ( smx cos +smz si n +mt ; sm y )
( x0 ; y0 ) = ( spx cos +spz si n +pt; spy )
The poi nt ( 0x; y0 ) can be expressed by a l i near combi nat i on of t he rst t wo poi nt s:
x0 = a x +a x +a
y0 = b y
Rewri t i ng t hese equat i ons we get
spx cos +sp z si n +pt = a x +a ( sm x cos +sm z si n +m
t ) +a
sp y = b y
Equat i ng t he val ues f or t he coeci ent s i n bot h si des of t hese equat i ons we obt ai n
sp cos = a +a sm cos sp si n = a sm si n tp = a tm +a
sp = b
and t he coeci ent s are t heref ore gi ven by
s si n( 0 )
a = p
si n sp si n a =
sm si n t s si n a = tp 0 m p
sm si n b = sp
2
2
1
1
2
2
3
1
1
2
1
3
2
2
2
3
1
3
4
References
[ 1] N. Ayache and O. D. Faugeras. Mai nt ai ni ng represent at i ons of t he envi ronment of a mobi
robot ,
Vol . 5, pp. 804{819, 1989.
[ 2] R. Basri and S. Ul l man. The al i gnment of obj ect s wi t h smoot h surf aces.
, Tarpon Spri ngs, FL, pp. 482{488, 1988.
[ 3] D. J. Braunegg. Marvel |Asyst emf or recogni zi ng worl d l ocat i ons wi t h st ereo vi si on
, 1990.
[ 4] D. F. DeMent hon and L. S. Davi s. Model - based obj ect pose i n 25 l i nes of code.
, Genova, I t al y, 1992.
IEEE Trans . on Robot i cs and Aut om
at i on,
Pr oc.
2nd I
Conf. on Com
put er Vi s i on
TR- 1229, MI T
Pr oc. 2
Eur opean Conf . on Comput er Vi s i on
22
[ 5] S. P. Engel son and D. V. McDermot t . I mage si gnat ures f or pl ace recogni t i on and ma
const ruct i on.
, Bost on, MA, 1991.
[ 6] C. Fennema, A. Hanson, E. Ri seman, R. J. Beveri dge, and R. Kumar. Model - di rect ed
mobi l e robot navi gat i on.
, Vol . 20, pp. 1352{
1369, 1990.
[ 7] M. A. Fi schl er and R. C. Bol l es. Randomsampl e consensus: a paradi gmf or model t t i n
wi t h appl i cat i on t o i mage anal ysi s and aut omat ed cart ography.
Vol . 24, pp. 381{395, 1981.
[ 8] D. P. Hut t enl ocher and S. Ul l man. Obj ect recogni t i on usi ng al i gnment .
, London, UK, pp. 102{111, 1987.
[ 9] D. G. Lowe. Three- di mensi onal obj ect recogni t i on f rom si ngl e t wo- di mensi onal i ma
, 1985.
[ 10] M. J. Mat ari c. Envi ronment l earni ng usi ng a di st ri but ed represent at i on.
, Ci nci nnat i , OH, 1990.
[ 11] R. N. Nel son. Vi sual homi ng usi ng an associ at i ve memory.
, pp. 245- 262, 1989.
[ 12] K. Onoguchi , M. Wat anabe, Y. Okamot o, Y. Kuno, and H. Asada. A vi sual navi gat i on
syst emusi ng a mul t i i nf ormat i on l ocal map.
,
Ci nci nnat i , OH, pp. 767{774, 1990.
[ 13] T. Poggi o. 3D obj ect recogni t i on: on a resul t by Basri and Ul l man.
, Povo, I t al y, 1990.
[ 14] K. B. Sarachi k. Vi sual navi gat i on: const ruct i ng and ut i l i zi ng si mpl e maps of an i
envi ronment .
, 1989.
[ 15] D. W. Thompson and J. L. Mundy. Three di mensi onal model mat chi ng f roman unconst rai ned vi ewpoi nt .
, Ral ei gh, NC, pp. 208{
220, 1987.
[ 16] S. Ul l man. Al i gni ng pi ct ori al descri pt i ons: an approach t o obj ect recogni t i on.
Vol . 32, pp. 193{254, 1989.
[ 17] S. Ul l man and R. Basri . Recogni t i on by l i near combi nat i ons of model s.
Vol . 13, pp. 992{1006, 1991.
Pr oc. SPI E Sym
pos i umon I nt el l i gent Robot i c Sys t em
s
I EEE Trans . on Sys t em
s, M
an and Cyber net i cs
Comm
uni cat i ons of
t he
ACM
,
Proc. 1s t I nt . C
on Comput er Vi s i on
Robot i cs Res earch Techni cal
Repor t 202,
Cour ant I ns t i t ut e of
M
at h.
Sci ences ,
New Yor k
Uni ver s i t y
Pr oc. I nt .
on Robot i cs and Aut om
at i on
DARPAI m
age Under s t andi ng
Wor ks hop
Pr oc. I nt . Conf . on Robot i cs and Aut om
at i on
Techni cal
Repo
9005- 03, I RST
AI - TR- 1113, M
IT
Pr oc. I nt . Conf . on Robot i cs and Aut omat i on
Cog
I EEE Trans . o
Pat t er n Anal ys i s and M
achi ne I nt el l i gence,
23
Download