ANN

advertisement
Neural network architectures
and learning algorithms
Author : Bogdan M. Wilamowski
Source : IEEE INDUSTRIAL ELECTRONICS MAGAZINE
Date : 2011/11/22
Presenter : 林哲緯
1
Outline
•
•
•
•
•
Neural Architectures
Parity-N Problem
Suitable Architectures
Use Minimum Network Size
Conclusion
2
Neural Architectures
Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)
3
Neural Architectures
Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)
4
Neural Architectures
Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)
5
error back propagation(EBP) algorithm
• error back propagation(EBP) algorithm
– multilayer perceptron (MLP)
Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)
6
multilayer perceptron (MLP)
MLP-type architecture 3-3-4-1(without connections across layers)
Neural network architectures and learning algorithms, Wilamowski, B.M.
7
neuron by neuron(NBN) algorithm
• neuron by neuron(NBN) algorithm
– bridged multilayer perceptron (BMLP)
– fully connected cascade (FCC)
arbitrarily connected network
Neural network architectures and learning algorithms, Wilamowski, B.M.
8
neuron by neuron(NBN) algorithm
• Levenberg–Marquardt(LM) algorithm
– Improve nonlinear function of least square
– Forward & Backward Computation
• Jacobian Matrix
– Forward-Only Computation
9
bridged multilayer perceptron (BMLP)
BMLP architecture 3=3=4=1(with connections across layers marked by dotted lines)
Neural network architectures and learning algorithms, Wilamowski, B.M.
10
fully connected cascade (FCC)
Bipolar neural network for parity-8 problem in a FCC architecture
Neural network architectures and learning algorithms, Wilamowski, B.M.
11
Outline
•
•
•
•
•
Neural Architectures
Parity-N Problem
Suitable Architectures
Use Minimum Network Size
Conclusion
12
parity-8 problem
MLP 8*9 + 9 = 81 weights
BMLP 4*9 + 8 + 4 + 1 = 49 weights
Neural network architectures and learning algorithms, Wilamowski, B.M.
13
parity-8 problem
9 + 10 + 11 + 12 = 42 weights
Neural network architectures and learning algorithms, Wilamowski, B.M.
14
parity-17 problem
• MLP architecture needs 18 neurons
• BMLP architecture with connections across
hidden layers needs 9 neurons
• FCC architecture needs only 5 neurons
15
parity-N problem
• MLP architectures
nn = neurons
nw = weights
• BMLP architectures
• FCC architectures
Neural network architectures and learning algorithms, Wilamowski, B.M.
16
Outline
•
•
•
•
•
Neural Architectures
Parity-N Problem
Suitable Architectures
Use Minimum Network Size
Conclusion
17
suitable architectures
• For a limited number of neurons, FCC neural
networks are the most powerful architectures,
but this does not mean that they are the only
suitable architectures
18
suitable architectures
• if the two weights marked by red dotted lines
– signal has to be propagated by fewer layers
Neural network architectures and learning algorithms, Wilamowski, B.M.
19
Outline
•
•
•
•
•
Neural Architectures
Parity-N Problem
Suitable Architectures
Use Minimum Network Size
Conclusion
20
Use Minimum Network Size
• receive a close-to-optimum answer for all
patterns that were never used in training
• generalization abilities
21
Case Study
TSK fuzzy controller:
(a) Required control surface
(b) 8*6 = 48 defuzzification rules
TSK fuzzy controller:
(a) Trapezoidal membership functions
(b) Triangular membership functions
Neural network architectures and learning algorithms, Wilamowski, B.M.
22
Case Study
(a) 3 neurons in cascade (12 weights), training error = 0.21049
(b) 4 neurons in cascade (18 weights), training error = 0.049061
(a) 5 neurons in cascade (25 weights), training error = 0.023973
(b) 8 neurons in cascade (52 weights), training error = 1.118E-005
Neural network architectures and learning algorithms, Wilamowski, B.M.
23
time complexity
NBN algorithm can train neural
networks 1,000 times faster than
the EBP algorithm.
(a) EBP algorithm, average solution time of 4.2s, and average 4188.3 iterations
(b) NBN algorithm, average solution time of 2.4ms , and average 5.73 iterations
Neural network architectures and learning algorithms, Wilamowski, B.M.
24
two-spiral problem
NBN algorithm using FCC architecture
244 iterations and 0.913s
EBP algorithm using FCC architecture
30,8225 iterations and 342.7s
Neural network architectures and learning algorithms, Wilamowski, B.M.
25
Outline
•
•
•
•
•
Neural Architectures
Parity-N Problem
Suitable Architectures
Use Minimum Network Size
Conclusion
26
Conclusions
• FCC or BMLP architectures are not only more
powerful but also easier to train
• use networks with a minimum number of
neurons
• NBN have to invert a nw*nw matrix, but 500
weights are limit now.
27
Download