Neural network architectures and learning algorithms Author : Bogdan M. Wilamowski Source : IEEE INDUSTRIAL ELECTRONICS MAGAZINE Date : 2011/11/22 Presenter : 林哲緯 1 Outline • • • • • Neural Architectures Parity-N Problem Suitable Architectures Use Minimum Network Size Conclusion 2 Neural Architectures Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0) 3 Neural Architectures Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0) 4 Neural Architectures Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0) 5 error back propagation(EBP) algorithm • error back propagation(EBP) algorithm – multilayer perceptron (MLP) Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0) 6 multilayer perceptron (MLP) MLP-type architecture 3-3-4-1(without connections across layers) Neural network architectures and learning algorithms, Wilamowski, B.M. 7 neuron by neuron(NBN) algorithm • neuron by neuron(NBN) algorithm – bridged multilayer perceptron (BMLP) – fully connected cascade (FCC) arbitrarily connected network Neural network architectures and learning algorithms, Wilamowski, B.M. 8 neuron by neuron(NBN) algorithm • Levenberg–Marquardt(LM) algorithm – Improve nonlinear function of least square – Forward & Backward Computation • Jacobian Matrix – Forward-Only Computation 9 bridged multilayer perceptron (BMLP) BMLP architecture 3=3=4=1(with connections across layers marked by dotted lines) Neural network architectures and learning algorithms, Wilamowski, B.M. 10 fully connected cascade (FCC) Bipolar neural network for parity-8 problem in a FCC architecture Neural network architectures and learning algorithms, Wilamowski, B.M. 11 Outline • • • • • Neural Architectures Parity-N Problem Suitable Architectures Use Minimum Network Size Conclusion 12 parity-8 problem MLP 8*9 + 9 = 81 weights BMLP 4*9 + 8 + 4 + 1 = 49 weights Neural network architectures and learning algorithms, Wilamowski, B.M. 13 parity-8 problem 9 + 10 + 11 + 12 = 42 weights Neural network architectures and learning algorithms, Wilamowski, B.M. 14 parity-17 problem • MLP architecture needs 18 neurons • BMLP architecture with connections across hidden layers needs 9 neurons • FCC architecture needs only 5 neurons 15 parity-N problem • MLP architectures nn = neurons nw = weights • BMLP architectures • FCC architectures Neural network architectures and learning algorithms, Wilamowski, B.M. 16 Outline • • • • • Neural Architectures Parity-N Problem Suitable Architectures Use Minimum Network Size Conclusion 17 suitable architectures • For a limited number of neurons, FCC neural networks are the most powerful architectures, but this does not mean that they are the only suitable architectures 18 suitable architectures • if the two weights marked by red dotted lines – signal has to be propagated by fewer layers Neural network architectures and learning algorithms, Wilamowski, B.M. 19 Outline • • • • • Neural Architectures Parity-N Problem Suitable Architectures Use Minimum Network Size Conclusion 20 Use Minimum Network Size • receive a close-to-optimum answer for all patterns that were never used in training • generalization abilities 21 Case Study TSK fuzzy controller: (a) Required control surface (b) 8*6 = 48 defuzzification rules TSK fuzzy controller: (a) Trapezoidal membership functions (b) Triangular membership functions Neural network architectures and learning algorithms, Wilamowski, B.M. 22 Case Study (a) 3 neurons in cascade (12 weights), training error = 0.21049 (b) 4 neurons in cascade (18 weights), training error = 0.049061 (a) 5 neurons in cascade (25 weights), training error = 0.023973 (b) 8 neurons in cascade (52 weights), training error = 1.118E-005 Neural network architectures and learning algorithms, Wilamowski, B.M. 23 time complexity NBN algorithm can train neural networks 1,000 times faster than the EBP algorithm. (a) EBP algorithm, average solution time of 4.2s, and average 4188.3 iterations (b) NBN algorithm, average solution time of 2.4ms , and average 5.73 iterations Neural network architectures and learning algorithms, Wilamowski, B.M. 24 two-spiral problem NBN algorithm using FCC architecture 244 iterations and 0.913s EBP algorithm using FCC architecture 30,8225 iterations and 342.7s Neural network architectures and learning algorithms, Wilamowski, B.M. 25 Outline • • • • • Neural Architectures Parity-N Problem Suitable Architectures Use Minimum Network Size Conclusion 26 Conclusions • FCC or BMLP architectures are not only more powerful but also easier to train • use networks with a minimum number of neurons • NBN have to invert a nw*nw matrix, but 500 weights are limit now. 27