BP Neural Network-based Commercial Loan Risk Early Warning Research H. Zhou College of Management and Economics, Tianjin University, China (zh5553158@163.com) Abstract - To establish a scientific and effective commercial loan risk early-warning model is an important measure to effectively prevent, defuse the risk of commercial banks. This paper analyzes the main factors on commercial loans risk and establishes early warning indicator system, also uses BP neural network to build a bank commercial loan risk early-warning model. The empirical results show that: the neural network model can achieve higher accuracy. Keywords - Risk warning model; commercial loans; BP neural network process any kind of data. By learning continuously, the network can figure out the regular pattern from numerous complicated data of unknown patterns. The neural network overcomes the complexity of traditional processing and difficulties of choosing proper model functions. In other words, it is a natural process of no linear modeling which does not have to distinguish which kind of linear relation exists, thus brings great convenience to modeling and analyzing [3-6]. 2. RISK EARLY WARNING SYSTEM 1. INTRODUCTION 2.1 Principles of index design [7-10] Building a scientific and effective risk early warning system for commercial loans, has important practical significance to identify the risk of loans from commercial banks and to take timely measures to prevent spreading of the risk. Traditional risk early warning system mainly predicts through statistical techniques such as multivariate statistical analysis, and logistic regression, which has significant limitations to solve the increasingly complex problem due to constraints of dealing with highly nonlinear data, over-reliance on historical data, and not having the dynamic early warning capacity. With the development of artificial intelligence, the BP neural network is introduced into the economic forecasting and early warning, which provides a new research idea for the areas of research. Tam and Kiang (1992)[1] use the BP neural network to train the network, provide a set of weights according to some samples of the input to the network ,and after training, the company of any new input can be divided into bankruptcy and non-bankruptcy. According to the prediction accuracy, adaptability and robustness, the empirical results show that the neural network is a better assessment of the bank's risk profile. E. NurOzkan-Gunay, Mehmed Ozkan (2007) [2] use Turkey's failed bank sample to predict the risk of using artificial neural networks. The empirical results show that: artificial neural networks can classify the patterns of financial data usefully. Use this classification, most of the banking crisis can be predicted in advance, and more importantly, it is also able to detect potential crisis signals. To sum up, BP neural network has three significant advantages of dealing economic data: First, the neural network has advanced paralleled processing system as well as high speed self-learning and self-adapting abilities, with a lot of adjustable arguments which enhance the flexibility of the system. Second, BP neural network can First, principle of comprehensiveness. Risks of loans involve various factors and sources, therefore comprehensive estimate should be considered from diverse aspects. Second, principle of science. Science and fairness are the principles of all systems of index. Choosing index, determining weights of index, selecting data, calculating and combining should be based on well-accepted scientific theories (statistical theory, management and decision-making theory etc.). Third, principle of independence. Representative index are better to be chosen when setting up indexes since they may overlap with each other, thus indexes of relative independence should be chosen. Fourth, principle of measurableness. All the contents within the index system should be measured directly according to measuring standards and definite results can be achieved. Fifth, principle of operability. In terms of index selecting and estimate, the evaluate outcome should be comprehensive and processing procedure should be operable. 2.2 Setting of risk recognizing index system a) Financial Risks Corporate financial risk is a microeconomic risk, which is the possibility of the actual future results of corporate financial activities deviating from the expected results. How the performance of the organization and management of the corporate financial activities will be inevitably reflected in the business capital of movement on the status and results of the performance of the financial status and outcome. The selected financial indicators are shown as Table 2-1: Profitabilit y Operation al capacity Solvency Developm ent capacity Cash flow Sales growth, profit growth rate of main business, capital gains rate Cash maturity debt ratio,inflow and outflow of cash ratio Loans due for settlement rate, accounts payable due for settlement rates, inventory loan ratio 3.1 The Three-Layer three-node neural network structure In the above system, we define the early warning as the following three types: corresponding to normal, attention and warning respectively. Therefore, the output model of the network should be as follows: (1,0,0),(0,1,0),(0,0,1). According to the experiences, the hidden nodes should meet 2 m generally, where n is the number of the hidden nodes. The three-layer three-node output neural network structure is shown as follows: n … … x b) Non-financial risk Non-financial factors relative to financial factors, refer to the sum of risks which have great impact on bank credit risk and cannot be reflected on the financial statement analysis. They are mainly reflected in the following areas: industry risk, business risk, management risk and moral hazard .This paper choose the following indicators to measure non-financial risk and these indicator are showed in Table 2-2. Industr y risk Operati onal risk Manag ement risk x (1) p1 x (1) p1 x (1) ph (2) p1 … … x (2) pk Evaluation index to quantify the membership function Commerci al reputation Table 2-1 Sales profit margin, operating margin, pre-tax profit margin, net margin, return on assets Total asset turnover, fixed asset turnover, receivables turnover, inventory turnover Asset-liability ratio, current ratio, quick ratio, interest coverage ratio w11 1 1 … … x (1) ph x 1 Ξn1 (Normal) … … i wij j (2) p1 wj 2 2 ξn2 (Attention) … … … … x (2) pk w11 n1 n2 wn1n2 wn2 3 3 ξn3 (Warning) Fig. 3-1. ˆ (1) ˆ (2) ˆ (2) 1) In the Figure 3-1, xˆ (1) p1 ……x ph , x p1 ……x ph are the Table 2-2 Cost structure, industry maturity, industry subject to periodic impact, legal and policy environment evaluation index attribute values of the p-th sample mode of the domain U {u1 , u2 , un } , denoted as: (1) (1) (2) (2) Xˆ xˆ xˆ , xˆ xˆ . Firm size, product diversity, development stage x ……x , x ……x(2) are the evaluation ph vectors(the membership vectors) after normalizing by the corresponding membership function, denoted as: (2) X p x (1) x (1) x (2) p1 ph , x ph pk . Experience and quality of management layer, management stability and financial management capacity p p1 2) ph (1) p1 3) Wij (i 1, 2, ph (1) ph pk (2) p1 , m; j 1, 2, , n) is the connection weight coefficient from the i-th unit of the input layer to the j-th unit of the hidden unit layer; W j ( j 1, 2, , n) is 3. ESTABLISHING OF THE RISK EARLY WARNING SYSTEM FOR COMMERCIAL BANKS The feed forward Three-Layer BP(Back Propagation) neural network is considered as the most suitable method for simulating the approximate relationship about the input and output. It has a wide range of applications in the economic field and has achieved good results, such as stock price forecasting, middle and long term forecasting of exchange rate and so on. It is the most mature and widely used one in the ANN, however, the research in the early warning area, especially the risk early warning system for commercial bank loans, is still not much. Therefore, putting ANN into this area is a very meaningful exploration and trial. the connection weight coefficient from the j-th unit of the hidden unit layer to the output layer; Y p is the output of the p-th sample pattern. 4) Considering the general situation, we assume that the number of the sample mode is s, and then the evaluation index attribute values matrix and the expected output matrix can be respectively denoted as: Xˆ [ Xˆ 1 , Xˆ 2 , , Xˆ s ]T [ xˆ pi ]sxnl , B [b1 , b2 , , bs ]T [bp ]sxl . 3.2 The algorithm flow With regard to the three-layer three-node output BP neural network structure, Mehmet and Mclean proposed the General Delta Rule, namely the Back Propagation(BP) Algorithm, which is the most effective and practical method. Its algorithm flow as shown in the Figure 3-2. Begin to learn The initialization of the connection weights and thresholds Provide learning samples to the neural network Adjust the connection weight from the input layer to the middle layer and the output threshold of each unit of the middle layer Calculate the input and output of each unit of the middle layer Calculate the input and output of each unit of the output layer Calculate the generalized error of each unit of the output layer N The sample overall error<E Y Stop learning Adjust the number of middle layer's units Adjust the connection weight from the middle layer to the output layer and the output threshold of each unit of the output layer Input new samples, test whether the misjudgment ratio meet the discriminant accuracy requirement N Y End Fig. 3-2 Here we still use the general delta rule, which adopts the Sigmoid Function as the excitation function, i.e. f ( Net pi ) 1/ (1 exp( Net pi )) (1) Net pi W ji o pi j , (2) j Where i 1, 2, , n1 ; j 1, 2, n2 o pi 1 W ji o pi j 1 Net , (3) 1 exp 1 e pi j where j is the threshold of the unit U j . For this kind of excitation function o pj o pj (1 o pj ) , Net pj with regard to the output layer unit, o pj (5) o pj (1 o pj ) , Net pj with regard to the hidden layer unit, pj ( jp o pj )o pj (1 o pj )( j 1, 2,3) . (6) (4) In order to make the learning rate large enough and difficult to produce concussion, it is still necessary to add “trend items” to the Delta Rule, namely W ji (t 1) pj pi W ji (t ) . (7) Where is a constant, which determines the influence degree of the past weight changes on the current weight changes. The following algorithm assumes that the network is the Forward Multi-layer Network, the Excitation Function adopts the Sigmoid Function and threshold makes the same training. 4. EMPIRICAL TEST OF RISK EARLY WARNING MODEL FOR COMMERCIAL BANKS As a test, this paper selects 15 of the financial risk early warning indicators to establish the BP network. 15 financial indicators means that there are 15 input nodes in the BP network and each input nodes corresponds to a financial indicators. The System’s input has defined 3 nodes: (1,0,0), (0,1,0), (0,0,1) corresponds to normal, attention and warning, 3 different warning level. Based on the experience, the hidden nodes need to satisfy r>m. Since the sample of this paper is 30/r, so n=5 and that is there are 5 nodes in the hidden layer. Because the inputs are continuous variable and the outputs are Boolean discrete vector, the inputs need to be normalized. The values of 15 indicators are calculated according to the enterprise and the actual inputs are got from the formula: actual inputs= weighs + (actual value ÷ standard value). This paper used 15×5×3 network topology and neuron function was Sigmoid characteristic function.30 sample data form the risk management of the Bank of Communications were selected as learning samples. Based on BP algorithm training 15×5×3 network topology by error of 0.001 and 0.3 , 0.3 , the weight matrix initial value of 15×5 matrix and 5×3 matrix .Their Element is subject to the normal distribution N (0,1) random number. The initial weight matrix as follows: 0.2427 0.0901 0.5822 0.4744 0.3779 0.2813 0.5744 W ji 0.5807 0.1050 0.2017 0.2799 0.0899 0.3810 0.2628 0.0145 0.2413 0.2081 0.1830 0.1048 0.2811 0.1314 0.3793 0.7520 0.1062 0.6353 0.3661 0.1845 0.1617 0.0515 0.1249 0.2614 0.7770 0.4042 0.6788 0.6617 0.4776 0.2216 0.0054 0.3411 0.3261 0.6540 0.0330 0.2862 0.2691 0.1881 0.3858 0.3980 0.3844 0.5024 0.0665 0.0466 0.4057 0.3596 0.1064 0.6725 0.0637 0.0069 0.1619 0.2600 0.7522 Wkj 0.6554 0.6005 0.5545 0.1098 0.4591 0.0147 0.5624 0.2693 0.5756 0.2876 (8) 0.1909 0.1127 0.4825 0.0116 0.1298 0.7337 0.0082 0.1235 0.2094 0.4392 0.3673 0.0131 0.3582 0.4839 0.0651 0.4563 0.1832 0.3781 0.2828 0.2891 (9) The PC training took about 8 minutes and after 9065 times, it meets the requirement. The final value of weights matrix are Wkj and Wkj . Noticing that different 1.5447 11.2659 5.7659 23.1046 5.2303 7.0783 19.5395 W jh 29.1984 5.9162 17.3507 7.7016 19.1718 4.7370 2.3017 18.5184 1.6211 0.7020 3.7252 3.5588 0.7725 1.3400 3.4426 24.4115 2.2591 4.1776 2.9613 10.2205 1.1361 4.2272 0.7366 13.8896 3.5135 16.7223 3.7378 1.1523 4.7926 8.30098 0.3578 19.5454 0.0494 8.6073 2.4674 16.6633 0.3945 10.7106 2.3729 2.2117 21.3867 18.2992 4.0820 2.0921 19.4779 11.5019 6.2449 4.2933 18.7865 13.5627 9.2150 11.2725 7.1715 0.9966 32.8327 27.8172 23.9371 22.0060 33.5136 30.0967 6.2621 1.3512 0.4380 1.5085 24.4167 19.3385 0.7802 3.2665 14.1739 23.7384 17.9570 4.0861 4.4634 1.7070 Whi 24.6918 23.3329 3.5381 11.2089 26.5356 29.5935 7.3141 17.3059 25.1014 Output layer Output layer --node1 --node2 --node3 1 1 1 1 0 0 0 0 0 0 0 0 (11) In the learning process, for overlearning will introduce much noisy signal into the weights set, prediction results are not necessarily the better when the control errors of learning termination are smaller. We should pay attention to it. From the Table 4-1 we know that, the actual output is very close to our expected output except some specific sample. 5. OUTCOMES AND PROSPECTS initial value can be trained into different final value, but this will not affect the final warning output. The coincide rate of the original input-output data is 95%. Output layer (10) Commercial loan risk early warning index system is built in this paper, on which a BP neural network loan risk early warning model based on multi-output is established. Artificial neural network structure of 3-level, 3-node and algorithm process are introduced. In the end, existing data of the Bank of Communications are used to verify the validity of the system, and the results show that the system meets the requirements of the risk warning. Further researches on refining risk early warning standard of commercial bank loan and expansion and integration of candidate sets are needed, which will attach great significance to clarify the boundary of the pattern space of all loans, and then solve the distribution problem of learning samples. Table 4-1 Test Result Output Output Output Output Output Output layer layer layer layer layer layer Actual output --node1 0.9966 0.9312 0.9621 0.9598 Actual output --node2 0.0016 0.006 0.0006 0.0077 Actual output --node3 0 0 0 0 Error --node1 Error --node2 Error --node3 0.0034 0.0688 0.0379 0.0402 0.0016 0.0060 0.0006 0.0077 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 0.9876 0 0 0 0 0 0.0035 0 0 0 0 0.0014 0.999 1 0.9998 0.9944 0.9991 0.0006 0 0 0 0.0001 REFERENCES [1] Tam, Kar-Yan, Melody Y Kiang. Managerial Applications of Neural Network: The Case of Bank Failure Prediction[J]. Manent Science, 1992, 38: 927~ 938. [2] E. Nur Ozkan-Gunay, Mehmed Ozkan. Prediction of Bank Failures in Emerging Financial Markets: an ANN Approach[J]. The Journal of Risk Finance, 2007, 8(5):465~ 480. [3] E. Philip Davis, Dilruba Karim. Comparing Early Warning Systems for Banking Crises[J]. Journal of Financial Stability, 2008, 4(2):89~120. [4] E. Nur Ozkan~Gunay Meluned Ozkam. Prediction of Bank Failures in Emerging Financial Markets:an ANN Approach[J] , The Journal of Prisk Finance 2007 , 8 (5):465~480. [5] E. Philip Davis, Dilruba Karim. Comparing Early Warning Systems for Bunking Crises[J],Journal of Financial Stability, 2008, 4(2):89~120. [6] Hwang,F.K. A cost and operations based product heterogeneity index[J]. International Journal of Production Economics2002,Vo1 179(9)45~55 [7] Lin Yikuei, Two~commodity reliability evaluation for a stochastic~flow network with node failure[J]. Computers and Operations Research.2002. Vol.29(11).1927~1939 [8] Chen Ke.Hussein Anwar and Wan Haibin. on a class of new and practical performance indexes for approximation of fold bifurcations of nonlinear power flow equations[J]. Journal of Computational and Applied Mathematics. 2002, Vol, 140. (3),119~141 [9] Altman,E.I. Financial Ratios,Discriminant Analysis and Prediction of Corporate Bankruptcy[J]. Journal of Finance. 1968, 9:589~609 [10] McCulloch W S, PITTSW. A logical calculus of the ideas immanent in nervous activity[J]. Bull Math Biophys, 1943 ,(5):115 ~ 133. 0 0.0021 0 0 0.3233 0.0006 0.9841 1 1 1 0.9999 0.0124 0 0 0 0 0 0.0035 0 0 0 0 0.0014 0.0010 0 0.0002 0.0056 0.0009 0.0006 0 0 0 0.0001 0 0.0021 0 0 0.3233 0.0006 0.0159 0 0 0 0.0001