Human Face Recognition using Wavelet Transform based Approach Mr. Jageshvar K. Keche Department of Computer Science, SSESA, Science College, Congress Nagar, Nagpur, (MS) - India jkkeche@gmail.com Mr. Vikas K. Yeotikar Dr. Mahendra P. Dhore Department of Computer Science, SSESA, Science College, Congress Nagar, Nagpur, (MS) - India Associate Professor Department of Electronics & Computer Science, RTM Nagpur University, Nagpur, (MS) - India vkyeotikar@gmail.com mpdhore@rediffmail.com ABSTRACT: Human face recognition is the most important research area of computer vision and pattern recognition. This area consists of several rules like image processing, neural networks, pattern matching and recognition. The performance of face recognition system is depends on its feature extraction by using different techniques with classification. This paper presents face recognition using Wavelet Transform and its sub techniques. Wavelet Transform such as Gabor wavelet, One, Two and Three level Decomposition approach of WT on ORL Face database is implemented. And lastly conclude the best recognition of human face. Keywords: Face Recognition, Pattern Recognition, Gabor Wavelet, Wavelet Transform, ORL. 1. INTRODUCTION Human face recognition has become the important area of research in computer vision, pattern recognition and one of the most successful applications of image analysis and processing. It has lot of attention to the researchers in recent years. Face recognition is considered to be an important part of the biometrics technique, and meaningful in scientific research [1]. It has the potential of being a non-intrusive form of biometric identification. Face recognition task is actively being used at airports, employee entries, criminal detection systems, etc. classifier and they shows very good performance. The challenges of face recognition lie in the inherent variability arising from face characteristics like age and gender, geometry like distance and viewpoint, image quality like resolution, illumination, and signal to noise ratio, and image content like background, occlusion and disguise. It is the ability to establish a subject’s identity based on his facial characteristics. Automatic face recognition has been extensively studied over the past two decades due to its important role in a number of application domains, such as access control, visual surveillance [2]. Face recognition algorithms consists of two parts: a) Face localization & normalization b) Face identification. Dimensional reduction techniques are used in reducing complexity of the recognition process such as Principal Component Analysis (PCA) [3][4] have now been successfully applied to this problem. Neural Network Models (NMM) [5] technique has been applied with a training set of images along with correct classification that system to determine which areas of an image are most important. Discrete Wavelet Transform (DWT) [6], [7] has also been used within the field of face recognition. The popularity lies in its complete theoretical framework, the great flexibility for choosing bases and the low computational complexity. Wavelet Transform is a signal analysis method of the time scale and has a advantage of multi-resolution analysis. It is the timefrequency localization analysis which has the capacity of local features in the time domain and frequency domain. Wavelet decomposition is a multilevel dimension reduction process that makes time–space–frequency analysis. Unlike Fourier trans-form, which provides only frequency analysis of signals, wavelet transforms provide time–frequency analysis, which is particularly useful for pattern recognition. In this paper, we studied and presented face recognition using Wavelet Transform techniques. The rest of this paper is organized as follows. Section II extends the feature mapping, which also introduces and discusses the Wavelet Transform and its sub-techniques in detail. In Section III, shows experiments on ORL face database. Finally, conclusions are drawn in Section IV with some discussions. 2. PATTERN RECOGNITION Pattern recognition is a field within the area of machine learning. Alternatively, it can be defined as “the act of taking in raw data and taking an action based on the category of the data”. It is a collection of methods for supervised learning. Pattern recognition aims to classify data or patterns based on either a prior knowledge or on statistical information extracted from the patterns. The patterns to be classified are usually groups of measurements or observations that define points in an appropriate multidimensional space. The classification is usually based on the availability of a set of patterns that have already been classified or described. This set of patterns is termed the training set and the resulting learning strategy is characterized as supervised learning. From the past 20 to 30 years, the growth of pattern recognition increases day by day. The applications of pattern recognition includes: identification of human faces and fingerprints; automatic recognition; character recognition; remote sensing; medical diagnosis; biomedical signal and image analysis; and many others. Pattern recognition methods are classified into two categories: structural methods and feature space methods. The structural methods are useful in situation where the different classes of entity can be distinguished from each other by structural information. For e.g. the different letters of the alphabet are structurally different from each other in character recognition. The earliestdeveloped structural methods based on formal grammars to describe the structure of an entity. The machine vision method may be structural and based on point distribution models, active contours, etc [8]. In featurespace methods, a set of measurements (typically numerical) is made on each realworld entity (pattern), and from the measurement set there is extracted a set of features which together characterize the class of patterns to which the given pattern belongs. The feature-space pattern recognition is the statistical approach, where the boundaries between the regions representing pattern classes in feature space are found by statistical inference based on a design set of sample patterns of known class members [8]. The traditional goal of feature extraction is to characterize the object to be recognized by measurements whose values are very similar for objects in the same category and very different for objects in different categories. This idea of seeking distinguishing features that are invariant to irrelevant transformations of the input. The task of the classifier component proper of a full system is to use the feature vector provided by the feature extractor to assign the object to a category [9]. Image classification is implemented by computing the similarity score between a target feature vector and a query feature vector [10]. 2.1. Gabor Wavelet Gabor wavelet captures the properties of orientation selectivity, spatial localization and optimally localized in the space and frequency domains. It has been extensively and successfully used in face recognition [12]. The 2D Gabor wavelet representation pioneered by Daugman in computer vision in 1980’s [11]. The characteristics of Gabor wavelets are quite similar to those of human visual system for frequency and orientation representations. This Gabor-wavelet based extraction of features directly from the graylevel images is successful and widely been applied to texture segmentation and fingerprint recognition. The commonly used 2-D Gabor filters in face recognition area [12-13] are defined as in Eq. (1) Where, 𝑓𝑢 and 𝜃𝑣 defines the orientation and scale of the Gabor wavelets, 𝑓𝑚𝑎𝑥 is the maximum central frequency and √2 is the spacing factor between different central frequencies. Fig.1 shows a family of Gabor wavelets. …(1) Where, f is the frequency of the modulating sinusoidal plane wave and 𝜃 is the orientation of the major axis of the elliptical Gaussian. The modified Gabor wavelet is defined as: ..(2) Where, K is an offset parameter dependent on 𝛾 and 𝜂. The Gabor wavelet is used as the discrete wavelet transform with either continuous or discrete input signal. There is an intrinsic disadvantage in the constraints 1-D and 2-D Gabor wavelets. When extracting the features for pattern recognition, retrieval, or computer vision purpose, the transformed coefficients are used for distance measure or compressed representation but not for reconstruction, so the orthogonal constraint could be omitted. There are some other equivalent definitions of the Gabor wavelets and it is defined as: …(3) Where, 𝑘 =2𝜋𝑓𝑒𝑥𝑝(𝑗𝜃), and the scaling functions for the two elliptical axes are the same as 𝜎. For most of the applications, a family of 𝑈×𝑉 Gabor wavelets is usually required to perform the multiresolution and multi-orientation analysis, which is defined as below [14]: ..(4) Fig. 1 Gabor wavelets with 5 scales and 8 orientations 2.2 Wavelet Transform Wavelets are functions that satisfy certain mathematical requirements and are used in presenting data or other functions, similar to sines and cosines in the Fourier transform. However, it represents data at different scales or resolutions, which distinguishes it from the Fourier transform. Wavelet transform is an increasingly popular tool in computer vision and image processing. Many applications, such as compression, detection, recognition, image retrieval have been investigated. Wavelet transform has nice features of space-frequency localization and multi-resolutions. The wavelet transform of a 1-D signal f(x) is defined as: …(5) The mother wavelet Ψ has to satisfy the admissibility criterion to ensure that it is a localized zero-mean function. Equation (5) can be discretized by restraining α and b to a discrete lattice. Typically, some more constraints are imposed on Ψ to ensure that the transform is non-redundant, complete and constitutes a multi-resolution representation of the original signal. 2-D DWT is generally carried out using a separable approach, by first calculating the 1-D DWT on the rows, and then the 1-D DWT on the columns : DWTn[DWTm[x[m,n]]. Two-dimensional Wavelet Transform decomposes an image into 4 “sub-bands” that are localized in frequency and orientation, by LL, HL, LH, HH. LL1 HL1 LH1 HH1 (a) LL2 HL2 HL1 LH2 HH2 LH1 HH1 (b) LL3 HL3 LH3 HH3 frequency component of the image. Second level decomposition can then be conducted on the LL sub band. Fig.2(b) shows a two-level wavelet decomposition of different images of size 112X92 pixels. Earlier studies concluded that information in low spatial frequency bands play a dominant role in face recognition. Nastar et al. has investigated the relationship between variations in facial appearance and their deformation spectrums [15]. They found that facial expressions and small occlusions affect the intensity manifold locally. Under frequency-based representation, only highfrequency spectrum is affected, called highfrequency phenomenon. Moreover, changes in pose or scale of a face affect the intensity manifold globally, in which only their lowfrequency spectrum is affected, called lowfrequency phenomenon. Further decomposition to the LL sub-band (two-level decomposition), leads to lower dimensionalities and a multi resolution image. We performed three level decomposition which as shown in fig. 2(c). The number of levels we choose depends on our work and need. HL2 HL1 LH2 LH1 HH2 HH1 (c) Fig.2 Discrete Wavelet Transform : a) 1-D DWT b) 2 level 2-D DWT c) 3 level 2-D DWT Each of these sub bands can be thought of as a smaller version of the image representing different image properties. The band LL is a closer approximation to the original image. The bands LH and HL record the changes of the image along horizontal and vertical directions, respectively. The HH band shows the high Fig.3 RMS error V.S. Threshold Pt. plot in MATLAB The Root Mean Square (RMS) Error (also called the root mean square deviation, RMSD) is a frequently used measure of the difference between values predicted by a model and the values actually observed from the environment that is being modeled. These individual differences are also called residuals, and the RMSE serves to aggregate them into a single measure of predictive power. The RMSE of a model prediction with respect to the estimated variable Xmodel is defined as the square root of the mean squared error: RMSE n i 1 ( X obs,i X mo del ,i ) 2 n …(6) Where, Xobs is observed values and Xmodel is modeled values at time/place i. 3. ORL FACE DATABASE (400 images of 10 different peoples) AT&T Laboratories Cambridge was founded in 1986 as the Olivetti Research Laboratory, better known as ORL. This directory contains a set of faces taken between April 1992 and April 1994 at the Olivetti Research Laboratory in Cambridge, UK. There are 10 different images of 40 distinct subjects. For some of the subjects, the images were taken at different times, varying lighting slightly, facial expressions (open/closed eyes, smiling/non-smiling) and facial details (glasses/no-glasses). All the images are taken against a dark homogeneous background and the subjects are in up-right, frontal position (with tolerance for some side movement). The ORL Face Database was manually altering the file format into Portable Grey Map (PGM) using suitable image processing software. The PGM format is chosen because it is a lowest common denominator grayscale image file format. The files are in PGM format and can be converted into other image format like .png or .tif. The size of each image is 92x112, 8-bit grey levels. The images are organized in 40 directories (one for each subject). Fig. 4 Some face images from ORL database 4. EXPERIMENTAL RESULTS We used MATLAB 7.6.0(R2008a) to implement all the experiments of Gabor wavelets and Wavelet decompositions on different face images of ORL database. The original images and its features of some face images as shown below. Fig.5 Original image from ORL database (a) (b) (c) (d) Fig.8 Two level 2-D Wavelet Decomposition (e) (g) (f) (h) Fig.6 Gabor Wavelets (a),(c),(e),(g) and its filters(b),(d),(f),(h) on different values of theta (𝜃=0, pi/4, pi/2, 3pi/4) from fig.5 We define the five parameters to extract the features using Gabor Wavelet (filter) in Eq. 1. The parameter values are theta=0; lambda=3.5; gamma=0.3; sigma=2.8; psi=0. By changing the values of 𝜃 from 𝜃 =0, pi/4, pi/2 and 3pi/4, the extracted features of Gabor wavelets and its filters are shown in figure 6. To find this result, we used ORL face database. Fig.7 1-D Wavelet Decomposition Fig.9 Three level 2-D Wavelet Decomposition Again ORL face database is used to find the Wavelet decomposition results. Figure 7 shows the result of 1-D Wavelet decomposition. One, Two and Three level 2Dimensional Wavelet Transform decomposes an image into 4 “sub-bands” that are localized in frequency and orientation, by LL, HL, LH, HH. The band LL is a very closer approximation to the original image. The bands LH and HL record the changes of the image along horizontal and vertical directions, respectively. The HH band shows the high frequency component of the image. Figure 8 shows two-level 2-D Wavelet decomposition and Figure 9 shows a three-level 2-D Wavelet decomposition of two different images of size 112X92 pixels. Figure 10 shows the 3D plotted graph of Multi-resolution Discrete Wavelet analysis in MATLAB. [3] M. Turk and A. Pentland “Eigenfaces for Recognition”, J. Cognitive NeuroScience, vol. 3, pp. 71-86, 1991. [4] B. Moghaddan, W. Wahid and A. Pentland, “Beyond Eigenfaces: Probabilistic Matching for Face Recognition”, Proceedings of Face and Gesture Recognition, pp 30-35, 1998. [5] M. Bicego, U. Castellani and V. Murino “Using Hidden Markov Models and Wavelets for face recognition” ICIAP’03, 12th International Conference on Image Analysis and Processing,September 17-19, 2003. [6] E. J. Stollnitz, T. D. DeRose and D. H. Salesin" Wavelets for computer graphics: a primer, part I,"IEEE Computer Graphics and Applications, vol.15, No. 3, pp. 76-84, May 1995. [7] M. Harandi, M. Ahmadabadi and B. Araabi “FaceRecognition Using Reinforcement Learning”Proceedings of ICIP’04, International Conference on Image Processing. [8] Daisheng Luo, Pattern Recognition and Image Processing (Horwood Publishing Limited 1998), pp 2-3. [9] Richard O. Duda, Peter E. Hart, David G. Stork, Pattern Classification (John Wiley 2001), pp 1112. Fig.10 Multi-resolution Discrete Wavelet 3D Plot in MATLAB 5. CONCLUSION AND FUTURE WORK In this research paper we studied the Gabor, Wavelet (filter) and Wavelet Transform with respect to filtering the face feature and decomposing the original face image respectively. The 2-D Wavelet Transform is to capture the variations in faces. Experimental results on an extensive set of ORL face database demonstrate the MATLAB implementation for identification. Experiments conducted on various face conditions, including different angles, expressions etc. Future work includes more improvement in identification of number of face images. Further implementing the new method with more multiresolution transforms and finding the transform which will give maximum recognition rate. The further research-orientation in the future is to design the algorithm which gives more accurate face recognition. REFERENCES [1] [2] Jiarui Zhou, Zhen Ji, Linlin Shen, Zexuan Zhu and Siping Chen, "PSO Based Memetic Algorithm for Face Recognition Gabor Filters Selection", IEEE Conference Memetic Computing (MC), 2011, pp.1- 6. Anil K. Jain, Brendan Klare and Unsang Park, "Face Recognition: Some Challenges in Forensics", in IEEE International Conference Automatic Face & Gesture Recognition and Workshops (FG 2011), 2011, pp. 726-733. [10] Chengjun Liu, “Learning the Uncorrelated, Independent, and Discriminating Color Spaces for Face Recognition”, IEEE Transactions On Information Forensics And Security, Vol. 3, No. 2, June 2008, pp 213-222 [11] Zhao Lihong, Yang Caikun, Pan Fen, Wang Jiahe, "Face Recognition Based on Gabor with 2DPCA and PCA", in Control and Decision Conference (CCDC), 2012 24th Chinese, pp. 2632-2635. [12] Jian Wang, Jian Cheng, "Face Recognition Based on Fusion of Gabor and 2DPCA Features", in International Symposium on Intelligent Signal Processing and Communication Systems, December 2010, pp. 1-4. [13] C.MageshKumar, R.Thiyagarajan, S.P.Natarajan, S.Arulselvi, "Gabor features and LDA based Face Recognition with ANN classifier", In International Conference Emerging Trends in Electrical and Computer Technology (ICETECT), 2011, pp. 831-836. [14] L. Shen and L. Bai, “A review of Gabor wavelets for face recognition,” Patt. Anal. Appl. 9: 273-292, 2006 [15] C. Nastar and N. Ayach, “Frequency-Based Nonrigid Motion Analysis”,IEEE Trans. Pattern Anal. Mach. Intell., vol 18: 1067–1079, Nov. 1996.