From Neuroscience to Mechatronics Presentation by Fabian Diewald JASS April 2006

advertisement
From Neuroscience to Mechatronics
Presentation by Fabian Diewald
JASS April 2006
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
Learning from nature – a justified
strategy
first vertebrates ~500 million years ago
civilization of man since ~10 000 years
Consequently nature has a great time advantage!
The human brain
cerebellum = "little
brain"
responsible for accurate
movement
instructions by the
forebrain insufficient
instructions have to be
translated into accurate
commands by the
cerebellum
cerebellum essential
part in learning motor
skills
Neurons – elementary components of
the central nerve system
dendrites: "input" of a neuron
axon: "output"
axon terminals/boutons contact other neurons or
muscles
Neurons – elementary components of
the central nerve system
transmission works electrically
information through the axon is encoded in
changes of electrical potential
short impulses with fixed intensity
information is contained in
firing frequency
("pulse rate modulation")
Synapses – link between neurons
gap against electrical
transmission
transmission between
neurons is controlled by
chemicals called
neurotransmitters
controlled transmission
is important in regard to
adaptation and learning
The structure of the cerebellar cortex
parallel fibres
Purkinje cells:
~200 000 synapses per cell,
only output for cerebellar cortex
climbing fibre:
axon of inferior olive;
one single climbing fibre
for one Purkinje cell
wraps around dendritic
Inferior
tree of Purkinje cell; Olive
essential for "learning"
parallel fibres
=axons of granule cells,
synapse on Purkinje cells
granule cells,
~20 per mossy fibre,
half of all neurons are
Granule
cells
granule cells
Purkinje
cells
climbing
fibre
-
+
- Deep +
a
a
cerebellar
nuclei
inferior olive
Diagram from Max Westby
mossy fibres
= information input,
synapse on
granule cells
mossy
fibre
The influential theory of Marr-Albus
twice independently proposed:
Marr, David: “A Theory of Cerebellar Cortex”, 1969,
Journal of Physiology 202: 437-470
Albus, James S.: "A theory of cerebellar function", 1971,
Mathematical Biosciences 10: 25-61
major aspects:
climbing fibre carries a teaching signal
signal influences the synapses of Purkinje cells
consequently the intensity of certain inputs to the
Purkinje cells can be controlled
The influential theory of Marr-Albus –
disappointment by Marr himself
Marr 1982:
"In my own case, the cerebellar study (...) disappointed
me, because even if the theory was correct, it did not
enlighten one about the motor system – it did not, for
example, tell one how to go about programming a
mechanical arm."
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
The vestibulo-ocular reflex (VOR)
aim: stable picture of our environment/object we are staring at in spite
of moving head
only use of visual information would be too slow for stabilization (visual
processing delay about 50-100ms)
signals of the vestibular organ (motion of the head) are more or less
directly transmitted to the extra-ocular eye muscles, leading to process
times of 5-10ms
vestibular signals are also influenced by visual information
The vestibulo-ocular reflex (VOR) –
experimentation with monkeys
VOR does an excellent job in monkeys as well
as in other vertebrates under normal
conditions
magnifying or miniaturizing glasses cause
abnormal image motion speed perceived
during head motion
monkeys with glasses show problems with
head motion in the beginning
after several days: considerable improvement
is completed
then: taking the glasses away
monkeys show problems again
after some days: normal behaviour again
unconscious use of VOR, motoric issue,
calibration needed
 typical case concerning the cerebellum
The vestibulo-ocular reflex –
paths of information
eye
(visual information)
vestibular organ
(head motion)
direct path:
very quick
not accurate
(not sensible to
changes in the system,
e.g. changes in eye
adaptation path:
muscle strength)
teaching signal
transferred on
climbing
fibres
"strengthens" or
"weakens" the
synapses
inferior olive
eye muscles
adaptive path
over Purkinje
cells:
transferred
on parallel
fibres
synapses of
Purkinje cells
Purkinje cells
cerebellum
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
The vestibulo-ocular reflex –
possible technical applications
camera systems in general
face-recognition
systems
The vestibulo-ocular reflex –
also an important automotive application
Affordable
driver-assistance systems
wide angle cameras used
to obtain an overview of
the environment
for a closer look (road
signs, number plates etc.)
a telephoto lens is needed,
which is quite sensitive to
motion
Requirements concerning driverassistance systems
camera stabilization only by optical means is too slow
inertial measurement of head angular velocity
needed
affordable hardware
cheap inaccurate sensors must be allowed (multisensor fusion of translation/angular velocity and
visual information)
inaccuracies have to be compensated
automatically
 similar problems/requirements as in biology
the vestibulo-ocular reflex may solve our
technical problem!
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
The VOR – biological and technical
analogons
How can we implement cerebellar structures as computer hardware?
the structure of the cerebellum is quite homogeneous –
although different information appears in different regions,
these regions are similar
idea of "cerebellar chip"
today: normally no need for cerebellar chip as DSPs are
very multifunctional and cerebellar algorithms can be
implemented software-based (e.g. in Matlab)
DSP dSPACE DS1103 PowerPC,
used in camera stabilizing system at
TU Munich
The VOR – biological and technical
analogons
neuroscience/biology:
vestibular organ in the inner ear
direct pathway vestibular to muscles
cerebellum
extra-ocular eye muscles
mechatronics:
6-DOF inertial measurement unit
CAN bus system
DSP with specific algorithm
servo motors
main difficulties:
commercially available inertial measurement units are too big
for driver-assistance systems
the algorithm biology uses in the cerebellum has to be
detected and implemented, which is obviously difficult
(remember Marr's quotation!)
Camera stabilization system at TU
Munich, Institute of Applied Mechanics
serial gimballed configuration
two perpendicular axes
pan actuator driven by:
4.5 Watt motor
inner frame driven by:
11 Watt Maxon motor with
backlash free HarmonicDrive
gearbox
each axis controlled by differential
encoders with 512 steps
camera: CMOS sensor with effective
resolution of 750x400 pixels
in the future: only a mirror is moved leading
to further miniaturization
Coping with difficulty 1:
size of inertial measurement unit (IMU)
smallest available intelligent IMU 50x38x25 mm³
too big for driver-assistance-systems
new IMU was developed within the FORBIAS
project:
edge length:
3 gyroscopes
(rotation), bandwith
up to 100Hz
accelerometer with 3 axes
on one chip
(translation), bandwith up
to 640Hz
15 mm
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
Coping with difficulty 2 (main difficulty):
implementing an adequate algorithm (1)
central ideas/principles:
motor movements are calculated according to angular velocity of
measurement unit ("direct path")
theoretically (perfect sensors/perfect production) this is sufficient for
camera stabilization
but: due to inaccuracies in production and sensors themselves there is
always relative movement of the picture of the environment ("optical
flow")
the relation between angular rates and this relative movement is
calculated
the result dynamically influences a matrix w
Coping with difficulty 2 (main difficulty):
implementing an adequate algorithm (2)
central ideas/principles:
this matrix w calculates a certain output out of angular velocity
this output is used to "clean" the signal of the direct path (velocity)
additively
result: after some time the matrix w is "perfect" (learning completed) and
always knows what to add to the measured angular velocity so that there
is no optical flow any more, i.e. optical flow and angular velocity are
decorrelated
consequently further cost-reduction possible by teaching the
system after assembling and storing the matrix w on a chip
but in this case inaccuracies because of e.g. plastic mechanical
deformation during use cannot be compensated
The vestibulo-ocular reflex –
paths of information
eye
(visual information)
vestibular organ
(head motion)
direct path:
very quick
not accurate
(not sensible to
changes in the system,
e.g. changes in eye
adaptation path:
muscle strength)
teaching signal
transferred on
climbing
fibres
"strengthens" or
"weakens" the
synapses
inferior olive
eye muscles
adaptive path
over Purkinje
cells:
transferred
on parallel
fibres
synapses of
Purkinje cells
Purkinje cells
cerebellum
Coping with difficulty 2 (main difficulty):
implementing an adequate algorithm
fz
w
f pf
fy
flow, f x
x
y
z
x
y
z
d
03x3
E 3 x3
p
E 3 x3
03 x3
central idea: decorrelation of angular rates and optical
decorrelator: contains dynamic state matrix w, dynamically influenced by optical flow,
angular velocity and its derivative
f (optical flow = teaching signal)
Decorrelator with weight matrix w  wp
D
decomposition to
"parallel fibre signals"
w 
pf   
w 
T

wd ,
0
d  J w w 
equivalent to synapses of Purkinje cells
p
 fx 
T
w   f  pf      f y  w x w y
 f z 

 p  E 3 x3
03 x 3 

 d  E 3x3 
w z w x w y w z 
3 x3
 0
E 3x3
p  J wp wd  3x3
0


d
3x3
E
3x3
03x3 
 pf
03x3 
03x3 
 pf
03x3 
parallel fibres
w
S
diagonal
sensitivity
matrix
Decomposition:
splitting physical
input B
signal in
b  JSw
brainstem
adequate inputs
for decorrelator
= direct
path
(decorrelator should be able to
learn by derivatives of velocity
as well, not only by velocity!)
biological equivalent:
parallel fibres
Inferior
Olive
I
+
Purkinje
cells
+
P
-
C
Granule
cells
climbing
fibre
+
- Deep +
a
a
cerebellar
nuclei
mossy
fibre
Diagram from Max Westby
from: Günthner, W.; Glasauer, S.; Wagner, P.; Ulbrich, H.: Biologically inspired multi-sensor fusion for
adaptive camera stabilization in driver-assistance systems, Advanced Microsystems for Automotive
Applications AMAA, Berlin, April 25-27, 2006 (in press)
Coping with difficulty 2 (main difficulty):
implementing an adequate algorithm
fz
w
f pf
fy
flow, f x
x
y
z
x
y
d
03x3
z
E 3 x3
p
E 3 x3
03 x3
central idea: decorrelation of angular rates and optical
decorrelator: contains dynamic state matrix w, dynamically influenced by optical flow,
angular velocity and its derivative
f (optical flow = teaching signal)
D
decomposition to
"parallel fibre signals"
w 
pf   
w 
w
S
diagonal
sensitivity
matrix
T
Decorrelator with weight matrix w  wp

wd ,
0
d  J w w 
equivalent to synapses of Purkinje cells
p
 fx 
T
w   f  pf      f y  w x w y
 f z 

 p  E 3 x3
w z w x w y w z 
3 x3
 0
03 x 3 

 d  E 3x3 
b  JSw
brainstem B
+
output
given
I
= direct
pathby weighted sum:
E 3x3
p  J wp wd  3x3
0

+
• proportional output p and
• derivative path d (added after integrator I)
• are calculated out of the "input" angular velocity (vector pf)
by the dynamic matrix w
• J: Jacobean projecting angular rates on degrees
of freedom of motion device

P
d
3x3
E
3x3
03x3 
 pf
03x3 
03x3 
 pf
03x3 
-
C
from: Günthner, W.; Glasauer, S.; Wagner, P.; Ulbrich, H.: Biologically inspired multi-sensor fusion for
adaptive camera stabilization in driver-assistance systems, Advanced Microsystems for Automotive
Applications AMAA, Berlin, April 25-27, 2006 (in press)
Coping with difficulty 2 (main difficulty):
implementing an adequate algorithm
fz
w
f pf
fy
flow, f x
x
y
z
x
y
z
d
03x3
E 3 x3
p
E 3 x3
03 x3
central idea: decorrelation of angular rates and optical
decorrelator: contains dynamic state matrix w, dynamically influenced by optical flow,
angular velocity and its derivative
f (optical flow = teaching signal)
D
decomposition to
"parallel fibre signals"
w 
pf   
w 
S
T
Decorrelator with weight matrix w  wp

wd ,
0
d  J w w 
equivalent to synapses of Purkinje cells
p
 fx 
T
w   f  pf      f y  w x w y
 f z 

 p  E 3 x3
w z w x w y w z 
learning rule:
3 x3
 0
03 x 3 

 d  E 3x3 
E 3x3
p  J wp wd  3x3
0


d
3x3
E
3x3
03x3 
 pf
03x3 
03x3 
 pf
03x3 
b  JSw
brainstem B
+
•
similar
to
LMS
algorithm
of+adaptive control theory
= direct path
• here: teaching signal = optical flow f;
in biology: teaching signal = climbing fibre signals
• covariance learning rule (Sejnowski 1977):
–If parallel-fibre firing pf (angular velocity and derivative) is positively
correlated with climbing-fibre firing f (optical flow), reduce the weight w,
–if parallel-fibre firing pf (angular velocity and derivative) is negatively
from: Günthner, W.; Glasauer, S.; Wagner,
P.;with
Ulbrich,
H.: Biologically
inspired
multi-sensor
fusion
for w,
correlated
climbing-fibre
firing f (optical
flow),
increase the
weight
adaptive camera stabilization in driver-assistance
Advanced
Microsystems
for Automotive
–if parallel-fibre systems,
firing pf (angular
velocity
and derivative)
is uncorrelated
with climbing-fibre
firing f (optical flow), no change
Applications AMAA, Berlin, April 25-27,
2006 (in press)
w
diagonal
sensitivity
matrix
I
P
C
Possible variants of the basic systems
(1)
fz
w
f pf
fy
flow, f x
x
y
z
x
y
z
d
03x3
E 3 x3
p
E 3 x3
03 x3
central idea: decorrelation of angular rates and optical
delay ~100ms
decorrelator: contains dynamic state matrix w, dynamically
influenced by optical flow,
angular velocity and its derivative
f (optical flow = teaching signal)
delay ~10ms
D
decomposition to
"parallel fibre signals"
w 
pf   
w 
w
S
diagonal
sensitivity
matrix
T
Decorrelator with weight matrix w  wp

wd ,
0
d  J w w 
equivalent to synapses of Purkinje cells
p
 fx 
T
w   f  pf      f y  w x w y
 f z 

 p  E 3 x3
w z w x w y w z 
3 x3
 0
03 x 3 

 d  E 3x3 
E 3x3
p  J wp wd  3x3
0
for frequencies above ~2.5Hz
the phase difference can get quite big
=> instability in the learning process possible
b  JS
w
brainstem
B pf-signal
=> delay of
(~100ms)
+ and I
+
=smoothing
direct pathfilter removing high frequencies
("eligibility trace")


P
d
3x3
E
3x3
03x3 
 pf
03x3 
03x3 
 pf
03x3 
-
C
from: Günthner, W.; Glasauer, S.; Wagner, P.; Ulbrich, H.: Biologically inspired multi-sensor fusion for
adaptive camera stabilization in driver-assistance systems, Advanced Microsystems for Automotive
Applications AMAA, Berlin, April 25-27, 2006 (in press)
Possible variants of the basic systems
(2)
fz
w
f pf
fy
flow, f x
x
y
z
x
y
z
d
03x3
E 3 x3
p
E 3 x3
03 x3
central idea: decorrelation of angular rates and optical
decorrelator: contains dynamic state matrix w, dynamically influenced by optical flow,
angular velocity and its derivative
f (optical flow = teaching signal)
D
decomposition to
"parallel fibre signals"
w 
pf   
w 
w
S
diagonal
sensitivity
matrix
T
Decorrelator with weight matrix w  wp

wd ,
0
d  J w w 
equivalent to synapses of Purkinje cells
p
 sign f x 
  E 3 x3
T
w  wf 
 pf
   f y  w x w y w z w x w y w z  p 3 x 3
...  sign
 0
 sign f z 


03 x 3 

 d  E 3x3 
E 3x3
p  J wp wd  3x3
0


d
3x3
E
3x3
03x3 
 pf
03x3 
03x3 
 pf
03x3 
capacity of climbing fibre pathway (teaching signal)
is limited
b  JSwdata flow can be avoided
brainstem=>
B unimportant
+
+
I the teaching
P
= direct path
by only evaluating the sign of
signal,
i.e. the direction of the optical flow
=> learning was not significantly worse compared
to learning with exact values
C
from: Günthner, W.; Glasauer, S.; Wagner, P.; Ulbrich, H.: Biologically inspired multi-sensor fusion for
adaptive camera stabilization in driver-assistance systems, Advanced Microsystems for Automotive
Applications AMAA, Berlin, April 25-27, 2006 (in press)
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
Other visual technical problems to be
solved for driver assistance
optical flow caused by camera rotation can
successfully be removed by camera stabilization
as just seen, but...
line of sight may have to be kept on the road
rapid changes of viewing direction ("saccades")
have to be implemented
furthermore: closer examination of certain
objects (signs etc.) need visual tracking
Testing of the camera stabilization
system under laboratory conditions
testing in the laboratory by mounting the system
on a hexapod
created sensed angular rates of ~100°/s
optical flow was reduced from 6 pix/frame to
less than 1 pix/frame after 2-3 minutes of
adaptation
the improvement within this time was also
subjectively viewable
Testing of the camera stabilization
system "on the road" (1)
system mounted near the rear-view mirror
additional camera installed as well to detect points of interest for the
camera with telephoto lens
system tested in association with saccade and visual tracking
vehicle velocity and yaw rate added to the system via CAN bus to
improve tracking of space fixed objects
Testing of the camera stabilization
system "on the road" (2)
three modes were tested:
1. lane marker was focused in a distance of 60m, followed up to a
distance of 20m, then the next was focused on
2. lane separation was focused on in a constant distance of 40m
3. nothing was focused on but the camera was stabilized around a
constant line of sight
in all modes: bumps could be compensated
static wide angle
camera
stabilized camera
with telephoto lens
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
Learning from the human brain –
another example
a current example: face recognition
excellent recognition abilities by humans
under different circumstances, e.g.
illumination
viewing angle
pose
facial expression
...
need for technical face recognition systems
(e.g. war against terrorism)
consequently need to continue exploring the
neuroscience of face recognition
Face recognition – the problem with
different angles of view
How can
we/a
computer
tell that
each face
belongs to
the same
person?
Face recognition
– step:
adaption of
testing
unfamiliar view of the
weights and output as
weighted sum
face, but
nevertheless the
highest output is
produced for "Betty"
"hidden units": one for
each angle of view – the
more similar the input is
to the "prototype" face of
one unit, the more
"active" the unit gets
learning step:
changing the weights
within the network so
that the output is "1"
for the viewed person
from: Vatentin, D.; Abdi, H..; Edelman, B.: What represents a face: A Computational Approach for the
Integration of Physiological and Psychiological Data, 1997
Outline
The human brain
The vestibulo-ocular reflex (VOR)
The VOR – technical applications
Camera stabilizing system at TU Munich
A possible algorithm
Testing the camera stabilizing system
Another field of neuroscience and technical
application: face recognition
Conclusion and outlook
Conclusion and outlook
technical use of neuroscience is the key to
giving machines several typical human abilities
optimizing service intervals and
minimizing complexity of installing systems by
self-learning abilities
cost reduction
...
consequently interesting to a large variety of
technical fields
Thank you!
Thank you for your attention!
Fabian Diewald
Fabian.Diewald@mytum.de
Download