ISRA`021 070902

advertisement
Assistive Human-Machine Interfaces via
Artificial Neural Networks
Wei Tech Ang and Cameron N. Riviere
The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
Abstract - This paper reports the study and results of modeling and
online compensating of movement disorder stemming from multiple
sclerosis (MS) via artificial neural networks. We trained and tested a
cascade-correlation neural network with Kalman filtering on data
collected from 11 subjects with MS. The test subjects use headcontrolled mouse emulators to move a cursor to a series of random
targets on screen. Simulated real-time testing of the trained neural
networks shows that the networks successfully make the cursor
trajectories of all the 11 subjects less chaotic, and hence more
controllable. The neural networks also reduce the time needed to
reach the targets by an average of 31.8%. The neural network
approach can be easily applied to other human-machine interfaces
such as computer mice and joysticks, or powered wheelchairs. This
technique is also applicable to movement disorders resulting from
certain geriatric diseases such as Parkinson’s disease and essential
tremor.
I. INTRODUCTION
Multiple sclerosis is a serious progressive disease of the
central nervous system, occurring mainly in young adults and
thought to be caused by a malfunction of the immune system.
It leads to the loss of myelin in the brain or spinal cord and
reduces and even loses bodily control. Movement disorders
associated with MS include tremor, myoclonus and
spasticity. These movement disorders are among the most
difficult of MS symptoms to treat, and many do not respond
well to medication [1]. These symptoms can, in the least
severe form, hamper effective control of the patients’
environment, and in more acute cases obscuring most daily
voluntary activities. For many of these patients, the
interference of these symptoms make the use of interfaces
such as computer mice and joysticks or powered wheelchairs
a challenging and daunting task. This reduction in
independence is detrimental to patients’ quality of life, and is
a significant factor contributing to the enormous lifetime cost
of MS, estimated at $2.2 million per patient [2]. Similar
observations could be made regarding other common
diseases such as essential tremor and Parkinson’s disease.
The most well known type of involuntary motion
affecting patients with MS is pathological tremor. Tremor is
defined as any involuntary, approximately rhythmic, and
roughly sinusoidal movement [3]. Tremor stemming from
MS is typically a cerebellar intention tremor. Most types of
tremor are characterized by having a higher frequency band
than voluntary motion. Many of the previous work in the area
of modeling and suppressing erroneous motion during
human-machine control have taken advantage of this
frequency separation. Riley and Rosen [4] have investigated
lowpass filtering, while Prochazka et al. [5] have shown
some success with the approach of closed-loop functional
electrical stimulation. Gonzalez et al. [6] has also proposed a
signal equalizer technique to suppress pathological tremor.
Pledgie et al. [7, 8] have presented methods that suppress the
power of pathological tremors through force feedback and
impedance control. Riviere et al. [9] have developed adaptive
noise canceling of tremor based on a dynamic sinusoidal
model, offering better performance than low-pass filtering,
but distortion of the voluntary motion still has not been
completely eliminated. Performance improvement beyond
this adaptive noise canceller is likely to require modeling the
dynamics of the specific type of tremor treated. Several
others have looked into application of viscous damping to
suppress tremor [10, 11].
Besides tremor, other types of involuntary motions such
as pathological myoclonus can also cause incapacitation of
human motor control. Myoclonic movements are twitches or
jerks resulting from sudden muscle contractions that can
occur alone or in a sequence [12]. Non-tremulous types of
erroneous movement are aperiodic, erratic, unpredictable at
our current state of understanding, and can overlap in
frequency with voluntary motion. Effective online canceling
is therefore difficult.
Efforts to develop assistive interfaces for persons with
the most severe non-tremulous movement disorders have
generally taken the low signal-to-noise ratio as given, and
then followed one of the two approaches. One approach is to
develop interfaces of sufficiently low bandwidth and input
gain such that the tasks are slowed down to a level that the
users are capable of handling them. One example is
specialized keyboards with larger key pitch (effectively a low
input gain) and “sticky key” operation – a keystroke is
registered only if the finger dwells on the key for a few
seconds. Another approach is to develop interfaces to be
controlled with different body parts which may be less
affected by the movement disorder in question, such as head
motion tracker, chin joysticks, mouthsticks, tongue-touch
keypads, electroencephalogram and electrooculogram-based
systems, and an electrogalvanic intraoral access device [13][17].
Although these approaches have often been successful in
many ways, they have intrinsic drawbacks. The low
bandwidth systems are slow and inefficient, while control via
alternative body parts is often awkward. A technique to
specifically estimate and suppress nonlinear In this paper, we
present a neural network approach to simultaneously
modeling and canceling both tremulous and non-tremulous
types of movement disorder. Section II explains the
motivation of the neural network approach and introduces the
characteristics of a cascade-correlation neural network with
Kalman filtering. Section III describes the experimental
methods involve in the training and testing of the neural
networks. Section IV presents the results of our experiments.
Section V discusses the significance of the results and future
work, and finally concluding remarks in Section VI.
collected from 11 subjects with MS, wearing a HeadMaster
PlusTM computer head control system (Prentke Romich
Company, Wooster, OH). The test subjects had no prior
experience in using head controls to operate computers at the
time of the experiment. Each subject underwent an icon
selection exercise where they were asked to move the cursor
into a series of randomly positioned circular target of 30pixel radius, on a 14” screen of 1024 x 768 resolution. A test
was completed when the cursor dwelled in the circle for more
than 500 ms, or when the 10-second time limit expired. The
sampling rate of the system is 32 Hz [17].
II. ERROR MODELING AND CANCELING VIA ARTIFICIAL NEURAL
NETWORKS
Proper or physically parameterized modeling of nontremulous involuntary motion is problematic due to the
largely unknown nature of the dynamics of the process [18].
Given this situation, along with the fact that biological
signals and involuntary movements are frequently nonlinear
[19, 20], nonlinear black-box modeling via artificial neural
network has been utilized. For the sake of versatility, instead
of the traditional fixed network architecture with
backpropagation, we used a cascade-correlation neural
network with Kalman filtering proposed by Nechyba [19].
Nechyba combines (i) flexible cascade-correlation neural
networks, which dynamically adjust the size of the neural
network as part of the learning process, and (ii) nodedecoupled extended Kalman filtering (NDEKF), a fastconverging alternative to backpropagation. This technique
has been used successfully for modeling of human control
strategies in driving vehicles [21], and modeling and
canceling of physiological tremor and myoclonic jerk in
microsurgery [22].
In this method, when training starts, the network has no
hidden nodes, only linear connections between the input and
the output nodes. During training, each time the error
performance stagnates, a new hidden node is added to the
network from a pool of candidate units (transfer functions),
such as sigmoid, Gaussian, sinusoidal, and Bessel. After a
brief period of being trained independently and in parallel
with different random initial weights using the quickprop
algorithm, the candidate unit with the best performance is
selected. Once a new hidden unit is installed, the hidden-unit
input weights are frozen, while weights to the output units are
retrained. The process is repeated until the algorithm
succeeds in reducing the root mean square error (rmse)
sufficiently for the training set or the number of hidden units
reaches a specified maximum number.
In neural network training, learning is cast as an
identification problem for a nonlinear dynamic system. The
neural network weights represent the state of the nonlinear
system. The NDEKF theory is then used to derive a recursion
for the weight updates [23]. This method improves the
convergence of the network weights over backpropagation
and significantly reduces the computational complexity.
III. EXPERIMENTAL METHODS
The experimental data, made available by Dr. Ed LoPresti of
the University of Pittsburgh, consisted of selected recordings
Raw trajectory
Low-passed trajectory

□
Start position
End position
*
Target
Fig. 1. The dotted line depicts a typical cursor trajectory of a MS patient
using HeadMaster PlusTM computer head control system. The solid line is
the low-passed version of the original trajectory, corrected for phase shift
and extrapolated to end at the same point as the raw trajectory.
To make the training process easier and more effective,
particularly given the relatively small volume of training
data, the training was organized in the following way. The
screen was divided into eight equal bearing sectors, namely
north, northeast, east, southeast, south, southwest, west, and
northwest. For each bearing sector, the two-dimensional
network training problem was split into two one-dimensional
ones, each for the x and y direction respectively. Each
network had 15 input nodes and 1 output node. The input to
each network was a moving window of data in the time
series. To ensure that the networks were trained to be
independent of the starting position on the screen, the
forward difference (velocity) instead of position coordinates
were used as the inputs. The training targets were the forward
difference of the phase-compensated low-passed trajectories.
Each neural network had one output node, representing the
forward difference of the error-compensated motion in that
dimension. For each bearing sector, 6 to 8 icon selection tests
randomly chosen from the 11 subjects were used for training.
The organization of the neural net training is illustrated in
Fig. 2.
N
SE
for more than 500 ms or 16 sample points (32 Hz sampling
rate). The offline simulated real-time testing of the trained
neural networks is shown in Fig. 3.
NE
2D cursor
trajectory
data file
E
W
SW
Cursor movement data read
in sequentially
SE
S
Yes
Split X & Y
trajectories
No
2D cursor
trajectory
No
Forward difference
Read in 15 pairs of
X & Y data samples
Split X & Y trajectories
X
net
Low-pass &
phase correction
Forward difference
X target
Y target
Y
net
Forward difference
X input
X net
In circle with
10-pixel
radius?
Y input
Y net

Trained
Neural
Networks:
N, NE, E,
SE, S, SW,
W, NW
Most likely
bearing
detection
14th X & Y coordinates
Error compensated cursor positions
Training
Fig. 3. Offline simulated real-time testing of the trained neural networks
Fig. 2. Organization of the neural net training
The data presented here represent offline tests using
recorded input data. The cursor recordings used for testing
were again randomly selected from the 11 subjects. These
recordings were not used in the training of the networks. To
simulate real-time testing conditions, the cursor coordinate
data was read in from the data file sequentially. To account
for the synchronization mismatch between the beginning of
data recordings and the test subjects, and to eliminate
equipment noise in the experimental setup, no action was
taken when the cursor was within a radius of 10 pixels from
its starting position. Beyond that, the forward difference of
the first 15 samples was computed as they were read in, and
from these information the most probable bearing was
estimated based on maximum likelihood criterion. The fact
that these cursor trajectories often have a reasonably accurate
onset helped the system to maintain a high success rate in
identifying the bearing. The pair of trained neural networks
that corresponded to the bearing then read in these 15 pairs of
forward difference samples, and put out the error
compensated x and y forward differences. The compensated
cursor position for the 15th point was obtained by summing
the network outputs to the position of the 14th point. There
was no compensation for the first 14 points. The neural
networks performed real-time error compensation of the raw
cursor movement until the cursor dwelled in the target circle
In our experiment, we trained a pair of neural networks
for each of the northeast, south, and west directions. Eight to
ten cursor trajectories randomly selected from the icon
selection exercise of 11 MS subjects were used in training the
neural networks for each bearing direction. A total of 29
cursor trajectories that are not used in the training, randomly
selected from the 11 MS subjects, are chosen to test the
trained neural nets. Of the 29 tests, we have 12, 8, and 9
targets located in each of the northeast-, south- and westbearing sectors respectively.
IV. RESULTS
Our bearing detection algorithm managed to correctly
identifying the bearing direction in 56 of the 69 cases tested,
or a success rate of 81.2%.
Experimental results show that the neural networks
successfully produce smoother trajectories for all the 29 tests.
The neural networks also bring the cursor of all the tests into
the target circle and keep it in the circle for more than 500 ms
or 16 sampling cycles, the criteria for the system to register
the test as complete. On the average, the neural networks
complete the test 31.8% faster than the MS subjects. A
comparison of test completion time is presented in Table II.
Fig. 4(a)-(c) shows typical results for the northeast-, south-,
and west-bearing sectors respectively.


TABLE II
Comparison of test completion time with and without the assistive
interface
NE
12
S
8
W
9
Raw data: Mean
completion time (s)
NN output: Mean
completion time (s)
Mean time saved (s)
4.52
3.34
4.05
2.89
2.39
2.44
1.62
0.95
1.60
Mean time saved (%)
35.8
28.4
39.5
Standard deviation of
% time saved (%)
14.0
11.3
23.8
No. of tests
Target circle


□
*
Target circle
□
*
Raw trajectory
NN output
Start position
End position
Target circle
10-pixel
circle
(c)
Fig. 4. Sample results for various sectors. The dashed line is the raw
cursor trajectory of the MS subjects. The dotted line is the combined
output of trained X and Y neural networks (NN). (a) Northeast-bearing
sector. (b) South-bearing sector. (c) West-bearing sector.
While we are compensating for tremulous and chaotic
cursor trajectories, we also want to make sure that the neural
networks do not overcompensate or worsen those that are
already good. The trained neural networks are tested on 4
tests with relatively smooth trajectories, 1 each from southand west-, and 2 from northest bearing sectors. On the
average, task completion time is 18.4% less with the assistive
interface than without. Fig. 5 shows the results of one of the
four trajectories tested.
Raw trajectory
NN output
Start position
End position
Target circle
10-pixel
circle
Target circle
(a)
10-pixel
circle


□
*
Raw trajectory
NN output
Start position


End position
Target circle
□
10-pixel
circle
*
Raw trajectory
NN output
Start position
End position
Target circle
Fig. 5. Sample result of the effect of neural networks(NN) on less tremulous
cursor trajectory.
Target circle
V. DISCUSSION
(b)
The experimental results indicate the feasibility of the
neural-network approach to modeling and canceling of
movement disorders of MS patients in assistive computer
interfaces.
In many cases encountered in our experiment, the cursor
trajectories become tremorous and sometimes chaotic when
the cursors get close to the target circle. As a result, the
subjects do not have sufficient control of the cursor to let it
dwells in the target circle for more than the required 500 ms.
The neural networks have evidently captured this
characteristic and compensated for the erroneous movements,
allowing sufficient dwelling time in the target circle. The
bearing detection algorithm, based on maximum likelihood
criterion, has worked reasonably well. This shows that the
assumption on the MS subjects’ ability to control cursor to
approach the general target directions is not an audacious
one. We are currently working toward implementation of the
neural network approach for online error compensation
experiment. It would be interesting to see the interactions
between the test subjects and the neural networks in action at
the human-machine interface.
The progress made in our preliminary experiment has
been encouraging, but in some ways the present tests do not
exploit the full extent of the nonlinear capabilities of the
method. MS can exhibit a variety of movement disorder
symptoms, including myoclonic and spastic involuntary
movements that are aperiodic and may not be as easily
separable in frequency from voluntary components of
motion. As this work continues it will involve data from a
larger number of subjects with a wider variety of movement
disorder symptoms, likely including those that are not
amenable to linear modeling techniques. The present results
show the feasibility of the approach; its full advantages in
comparison with, e.g., linear filtering, are expected to
become more evident as the work continues with further
exploration and additional experimental data.
The method presented here, based on the cascadecorrelation learning architecture, is a versatile technique
capable of modeling and canceling a variety of different
types of nonlinear involuntary motion. In the future we plan
to apply it also to other types of input interfaces (e.g., handoperated devices) and different medical disorders, including
severe conditions such as athetoid cerebral palsy that exhibit
extremely low signal-to-noise ratio. We also intend to design
and conduct our own data collection exercise involving hand
movement data (controlling mouse or joystick).
Foundation, and by the National Institutes of Health (grant 1
R21 RR13383) and the National Science Foundation (grant
EEC-9731748).
The authors thank Dr. Edmund LoPresti of AT Sciences
and Koester Performance Research for supplying test data,
and Ms. Ashley Holtgraver for assistance with data analysis.
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
VI. CONCLUSION
The feasibility of a novel artificial neural network
technique for modeling and online correction of erroneous
movement in an assistive computer interface has been
demonstrated using data from subjects with multiple
sclerosis. The trained neural networks successfully make the
cursor trajectories of all the test subjects less chaotic, and
hence more controllable. The neural networks also reduce the
time needed to reach the targets by an average of 31.8%.
[11]
[12]
[13]
ACKNOWLEDGEMENT
This work was supported in part by the R. Green Annan Fund
and the Lettie B. Trognitz Fund of The Pittsburgh
W. J. Weiner and A. E. Lang Tremor., eds., Movement
Disorders: A Comprehensive Survey. New York:
Futura, pp. 22 1-256, 1989.
K. Wbetten-Goldstein, F.A. Sloan, L.B. Goldstein, and
E.D. Kulas, “A comprehensive assessment of the cost
of multiple sclerosis in the United States”, Muit Scier 4
(5): 41 9-25, 1998.
R. J. Elble and W. C. Koller, Tremor. Baltimore: Johns
Hopkins, 1990.
P. O. Riley and M. Rosen, “ Evaluating manual control
devices for those with tremor disability,” J.
Rehabilitation Research Dev., vol24, pp.99-110, 1987
A. Prochazka, J. Elek, and M. Javidan, “Attenuation of
pathological tremors by functional electrical
stimulation 1:method”, Ann Biomed Eng 20:205-24,
1992.
J. G. Gonzalez, E. A. Heredia, T. Rahman, K. E.
Barner, S. K. Basu, and G. R. Arce, “ A new approach
to suppressing abnormal tremor through signal
equalization,” Proc. RESNA Annu. Conf., pp.707-709,
1995.
S. Pledgie, K. E. Barner, S. K. Agrawal, and T.
Rahman, “Tremor Suppression Through Impedance
Control”, IEEE Transactions on Rehabilitation
Engineering, Vol. 8, No. 1, 2000, 53-59.
S. Pledgie, K. E. Barner, S. K. Agrawal, and T.
Rahman, “Tremor Suppression Through Force
Feedback”, Proceedings of the Sixth International
Conference on Rehabilitation Robotics, July 1-2, 1999,
Stanford, CA, pp. 16-26, 1999.
C. N. Riviere, R. S. Rader, N. V. Thakor, “Adaptive
canceling of physiological tremor for improved
precision in microsurgery”, IEEE Trans Biomned Eng
45(7): 839-46, 1998.
Beringhause, M. Rosen, and S. Haung, “Evaluation of
a damped joystick for people disabled by intention
tremor. Proc. of the RESNA 12th Annual Conf, 41-42,
1989.
M. Rosen, A. Arnold, I. Baiges, M. Aisen, and S.
Eglowstein. Design of a controlled energy-dissipationorthosis (CEDO) for functional suppression of
intention tremors. J. Rehabilitation Research and
Development, 32(l): 1 — 16, 1995.
J. Caviness, “Myoclonus”, Mayo Clinic Proc 71:679688, 1996.
J. R. Wolpaw, D. Flotzinger, G. Pfurtscheller and D. J.
McFarland, “Timing of EEG-hased cursor control”, J
Clin Neurophysiol 14 (6): 529-38, 1997.
[14] C. Lau and S. Oteary, “Comparison of computer
interface devices for persons with severe physical
disabilities”, Am J Occup Ther 47(ll): 1022-30, 1993.
[15] C. J. Clayton, “An intra-oral access device”, JMed Eng
Technol l6 (5): 204-9, 1992.
[16] J. R. LaCourse and F.C. Hiudik Jr., “An eye movement
communication-control system for the disabled”, IEEE
Trans BiomedEng 37: 1215-20, 1990.
[17] E. F. LoPresti and D. M. Brienza, “Automatic
Customization Software for Computer Head Controls”,
Proceedings of the RESNA 2002 Annual Conference.
June 2002. (in press).
[18] J. B. Hampshire, “A differential theory of learning for
efficient statistical pattern recognition”, Ph. D.
dissertation, Carnegie Mellon University, Pittsburgh,
Pa., ch. 3, 1993.
[19] C. N. Riviere, and N. Thakor, “Modeling and
canceling tremor in human-machine interfaces”, IEEE
Engineering in Medicine and Biology, 15(3): 29- 36,
1996.
[20] C. Gantert, J. Honerkamp, and J. Timmer, “Analyzing
the dynamics of hand tremor time series”, Biol Cybern
66:479-84, 1992.
[21] M. C. Nechyba and Y. Xu, “Learning and transfer of
human
real-time
control
strategies”,
JAdv
Computational Intell 1(2): 137-54, 1997.
[22] W. T. Ang and C. N. Riviere, “Neural Network
Methods for Error Canceling in Human-Machine
Manipulation”, Proceedings of the 23rd IEEE EMBC
Annual Conference, pp. 7.6.1-5, 2001.
[23] G. V. Puskorius and L. A. Feldkamp, “Decoupled
extended Kalman filter training of feedforward layered
networks. Proc. Intl. Joint Conf. Neural Networks,
1:771-7, 1991.
Download