ADAPTIVE MATCHING WAVELET NETWORKS FOR FACE RECOGNITION Yang Zhi Gu Ming School of Software, Tsinghua University, Bei Jing , P.R.China 100084 yangz02@mails.tsinghua.edu.cn guming@mail.tsinghua.edu.cn ABSTRACT This paper describes a novel Adaptive Matching method for face recognition. We employ an Face Bunch Graph(FBG), which has reconstructed Gabor feature on each FBG nodes. The reconstructed Gabor feature comes from the orthogonal analysis of a family of Gabor wavelet coefficients, which reduces the redundant information. After selecting fiducial points roughly by elastic bunch graph matching(EBGM) algorithm, the reconstructed feature was used for the relocation. The Wavelet Networks(WN) and best matching fiducial points’ location contain the discriminative information of faces. We proposed a new approach to compute the similarity between two faces on both the global means and topological means. The experimental results show that our algorithm is an effective method compared with EBGM, PCA, HMM face recognition approaches. 1. INTRODUCTION Face Recognition had been researched for nearly two decades and many effective methods had been developed. Today, successful methods can be mainly divided into two classes: one focuses on the global senses (such as eigenface and fisherface),the other prefers to analyze the constitutive local feature of faces( eyes, nose or mouth) . It is no doubt that among those local approaches, the EBGM algorithm[1] is the most successful one by applying Gabor Wavelet analysis to local feature points. This method compute the related value among feature points to form an elastic grid graph. The similarity can be computed by comparing the difference between two elastic grid graphs. Some smart improvement of EBGM algorithm were achieved. María José Escobar and Javier Ruiz-del-Solar applied EBGM on a Log Polar Transformation to locate faces’ landmarks[2]; Anastasios Tefas,Constantine Kotropoulos, and Ioannis Pitas utilized Support Vector Machine(SVM) to enhance performance of EBGM[3]; based on EBGM, Stefano Arca, Paola Campadelli, Raffaella Lanzarotti proposed an completely This project is sponsored by National 973 Plan of China ( Project Number: 2004CB719400). 0-7803-9134-9/05/$20.00 ©2005 IEEE automatic face-recognition system and they reported good performance of this system[4]. However, to our knowledge, our work that using Adaptive Matching Wavelet Networks, which is also based on EBGM, is a new Face Recognition approach. We utilize the wavelet analysis and energy function as supervision to direct the matching process. Besides, we also utilize Wavelet Networks, which is a kind of universal approximator and achieves faster convergence over radial basis function networks (RBFN). In WN, the radial basis functions of RBF-networks are replaced by wavelets. WN inherit the properties of wavelet decomposition and mention especially their universal approximation property, the availability of convergence rates and the explicit link between the network coefficients and the wavelet transform, and during the training phase, the network weights as well as the degrees of freedom (position, scale, orientation) of the wavelet functions are optimized [5]. The outline of this paper is as follows: Section 2 will provide some background of the Elastic Bunch Graph Matching. An improved method on matching facial landmarks will be presented in Section3. In Section4, a Wavelet Network will be applied to Face Recognition. In the final Section, the experiment results will be shown. 2 . RELATED BACKGROUND In order to express a face’s important feature, we can extract a group of wavelet coefficients from the convolution of a family of Gabor Wavelet Kernels with input image. The 2-Dimension Gabor Wavelet Kernel can be defined as follows: r ψ j ( x) = k 2j σ exp(− 2 k 2j x 2 rr σ2 )[exp(ik j x ) − exp(− )] 2σ 2 (1) 2 v+2 − π r k jx k cos φu π kv = 2 2 φu = u k j = = v k 8 k cos φ u jy v σ is the standard deviation of the Gaussian, with ν the frequency parameter. The latter term in the second bracket makes the function ψ j ( xr )d 2 xr. converge. ∫ At one selected point(fiducial point) of face image ,we extract a series of Gabor Wavelet coefficients as the local feature, namely a Jet. For example, 5 frequencies and 8 orientations means a Jet with 40 coefficients. After having chosen all the Jets, the fiducial points have constructed the nodes of an FBG with its edges. Because all the fiducial points in FBG have the identical structure, different distance value of one elastic gragh represent different appearance of the same human’s face. So on a fiducial point, there will be a group of value, each indicating an appearance. The group is called a bunch. All the edges are labeled with average value between two fiducial points: ∆xre = ∑ ∆xrem / M . m 3. ADAPTIVE WAVELET FEATUER MATCHING In [1], a heuristic algorithm is used to find the image graph which maximizes the graph similarity function. First, the FBG is resized and aspect ratio to adapt to the right format of the face. Then all nodes are moved locally and relative to each other to optimize the graph similarity(see (2)) further[1]. r r ( ∆ x eJ − ∆ xeB ) 2 1 λ S B (G j , B ) = ∑ max( S φ ( J nj , J nBm )) − ∑ rB 2 N where E n Sφ ( J , J ' ) = e rr ∑ a j a ' j cos(φ j − φ ' j − d k ) ( ∆ xe ) (2) j ∑ a ∑ a' 2 j 2 j (3) j j Their results could be better convinced if they have considered the redundant information, so some fiduicial points may be misplaced although they get the maximal similarity. Here we proposed an adaptive remedial method to find more reasonable matching fiducial points. 3.1 Reconstruction Of Jets As mentioned above, an input face image I can be decomposed into a family of Gabor Wavelet coefficients. On the contrary , the coefficients can not easily reconstruct the image Iˆ by a linear combination ,because the Gabor wavelet functions are not orthogonal. If we really want to do so and let wi be the weight of corresponding Gabor wavelet ψ i , we have N r r (4) Iˆ ( x ) = ∑ w iψ i ( x ) r r r r r Jˆ j ( x ) = ∫ Iˆ( x )ψ j ( x − x ' )d 2 x ' r r r r = ∑ wi ∫ψ i ( x )ψ j ( x − x ' )d 2 x ' i =1 In order to reconstruct Ĵ j ,we must compute wi first, but it is not possible to calculate it by a simple projection of the Gabor wavelet ψ j onto the image[5] (as it is being done for orthogonal wavelets). Instead we have to ~ consider the family of dual wavelets Ψ = {ψ~1 ,...ψ~ N }. The ~ wavelet ψ j is the dual wavelet to the wavelet ψ j if : if i = j if i ≠ k (6) ψ~i = ∑ (Ψi , j ) −1ψ j where Ψi , j = 〈ψ i ,ψ j 〉 (7) 1 r r r 〈ψi ,ψ~j 〉 = ∫ψi (x)ψ~k (x)d2x =δi, j = 0 ψ~i can be computed as follows: j This function’s proof can be found in [5].Then we can (8) calculate wi directly: wi = 〈 I ,ψ~i 〉 3.2 Adjusting The Weight Of Wavelet The reconstructed image is not completely identical to the original image, thus there is an difference between them[6]. We use an energy function to express the r r 1 difference: (9) E = ∑ [ I ( x ) − Iˆ( x )]2 2 Applying conjugate gradient method can minimize E and adjust wk . 3.3 Adaptive Matching The FBG is constructed as follows: firstly, compute all fiducial points based on EBGM; secondly, compute all the dual wavelets and get their corresponding weights wi according to (8); thirdly, minimize the difference (9) by applying conjugate gradient method; fourthly, compute the reconstructed Jet according to (5); finally, add reconstructed Jet into bunch. The matching process includes three stages. In the first stage, matching all the fiducial points based on EBGM; in the second stage, computing the reconstructed Jet; in the third stage, all nodes are moved locally and relative to each other by computing among reconstructed Jets to optimize the graph similarity by (2) . i =1 where Iˆ is the reconstructed image and i=1,….,N means a set of Gabor Wavelet(in our experiment N=40). Therefore a Jet can be approximated as: (5) N 4. FACE RECOGNITION 4.1 An Introduction To Wavelet Networks Recently, wavelet networks(WN) was applied in many research fields, such as Pose Estimation [6], Facial Feature Detection [7], Face Alignment [8],etc. A Wavelet Networks can be defined as: Letψ ni i = 1,....,N be a set of wavelets, f a DC-free image and wi and ni chosen according to the energy functional (10). The two vectors Ψ = (ψ n1 ,..., ψ nN )T , and W = ( w1 ,..., wN )T , define then the wavelet network (Ψ,W) for image f. Where parameter vectors n = (c x , c y ,θ , s x , s y ) .The c x , c y defines the translation of the wavelet, s x , s y defines the dilation and θ defines the orientation of the wavelet. The parameters vector n (translation, orientation and dilation) of the wavelets maybe chosen arbitrarily at this point[5]. According to wavelet theory, any function f ∈ L2 ( R 2 ) can be losslessly represented by their continuous wavelet transform and thus, with arbitrary precision, by a wavelet network. We therefore interpret the image f to be a function of the space f ∈ L2 ( R 2 ) and assume further, without loss of generality that f is DC-free. In order to find the WN for image f , we minimize the energy function: N (10) E = min f − wψ ∑ ni , wi for i =1 all i ni N i i =1 N ni − ∑ wiψ ni (11) i =1 algebraic transformations lead to: v−w := ∑ (vi − wi )(v j − w j )〈ψ i ,ψ j 〉 Ψ i, j = (v − w)T ( Ψ )(v − w) 12 (12) Where • computes the Euclidean distance between the Ψ two appropriate points in < Ψ > and thus considers the different parameters of the wavelets. For orthogonal wavelets, the matrix (Ψ )i , j = 〈ψ ni ,ψ nj 〉, is the unity matrix and no weighting is needed[5]. 4.2 Ψm . We have many methods to compute the difference between two faces, among them the simplest method is to calculate the sum of all Dm . As mentioned in[1], we can calculate the similarity between two faces on an average over the difference between pairs of corresponding face appearance. For an image graph G I , the similarity between it and a stored FBG B with edges e = 1, ..., E is defined as : (13) S (G I , B ) = r r ( ∆xeI − ∆xeB ) 2 λ 1 − ∑ Dm (V , Wm ) − ∑ r M m E e (∆xeB ) 2 Where m means the m-th appearance’s (Ψm ,Wm ) stored in the FBG B. λ determines the relative importance of difference and the topography term. 5. EXPERIMENTS i If we want to compute the distance between two vectors of wavelet coefficients v and w , for example , we can compute the Euclidean distance in the (image) subspace < Ψ > between the v and w : ∑vψ and then optimize them based on function(9) (this two steps have been done by the adaptive matching). After getting Vn we will compare it with Wm of each person , the difference is defined as: Dm (V , W m ) = V − W m . To check the validity of our algorithm, experimental studies are carried out on the ORL face image database of Cambridge University. 400 face images from 40 individuals in different states from the ORL have been used to evaluate the performance of the proposed method. The sample images vary in position, rotation, scale and expression. In this database each person has changed his face expression in each of 10 samples (open/close eye, smiling/not smiling). For some individuals, the images were taken at different times, varying facial details (glasses/no glasses). Fig.1 shows some matching results on ORL database. The first experiment was tested for comparing the recognition rate between EBGM and our proposed method, and that maybe indicates the matching rate of two algorithms. We used 25 individuals totally 250 images for the test and extracted face feature from 13 fiducial points Face Recognition Based On WN In order to apply face recognition algorithm, we first extract Wm = ( w1 ,..., wN )T m , and Ψ m = (ψ 1 ,..., ψ N ) T m . of the m-th appearance of each person’s face. The wavelet network (Ψ , W ) = ((Ψ1 ,..., ΨM ), (W1 ,..., WM )), of each person is regarded as his or her prototype sample and stored in this person’s FBG. When a new face image is input, we extract its coefficient vectors V = (v1 ,..., vN )T , according to (8), Figure1. The matching results on ORL Face Database: the top row shows the matching results of our proposed method, the results of EBGM on the bottom. The matching points on top row was adjusted from the position of matching points of bottom row’s images. Figure2. The Recognition rate curves of Adaptive Matching and EBGM. to 23 fiducial points. Each time after having added another fiducial point we randomly chose 20 faces to test Recognition Rate. The comparison results is shown in Fig.2. We can see that our method gain a higher recognition rate when number of ficucial points increase. Another experiment is to test the robustness of classification capacity of our algorithm. The other three effective methods were tested in our experiment--- PCA, HMM and EBGM, because they were proved to be valid in the Face Recognition applications[9]. We increased the trainning gallery from 25 to 40 individuals in the experiment and chose randomly among them to test recognition rate, regardless the expression or pose, each new individual with 10 images. After being added another new individual, the test was carried on by randomly choosing 40 images from the current face database. The HMM algorithm utilized EHMM model and selected 8 DCT coefficients as its input feature. EBGM and AM used 23 fiducial points. The Fig.3 show that HMM method and our AM method achieved higher performance than the other two approaches. proposed an approach to resolve it. This method computes the dual wavelets to get weight of each wavelet, then optimizes the weights to reconstruct the Gabor Jets . Through the adjusted Jets we can find the fiducial points whose locations are more accurate. The Wavelet Networks was used for the face-feature’s classification. For doing so we proposed a new approach for computing similarity between two faces, and this method not only consider the global representation of face but also take topological information into account. The algorithm was tested on the ORL database and proved to be useful for face-feature’s location and face recognition. Future work will be carried on estimating similarity based on a statistical view over wavelet difference. REFERENCES [1] Laurenz Wiskott, Jean-Marc Fellous, Norbert Krüger, Christoph von der Malsburg , “Face recognition by elastic bunch graph matching,” IEEE trans on pattern analysis and machine intelligence, Vol.19, NO.7,pp.775-779, July 1997. [2] Escobar, M.J., and Ruiz-del-Solar, “Biologically –based face recognition using Gabor Filters and Log-Polar Images,” IJCNN 2002, Honolulu, USA, pp.1143 – 1147, 2002. [3] Anastasios Tefas,Constantine Kotropoulos, and Ioannis Pitas, “Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication,” IEEE trans on pattern analysis and machine intelligence, Vol. 23, NO. 7, pp.735-745, 2001. [4] Stefano Arca, Paola Campadelli, Raffaella Lanzarotti, “An automatic feature-based face recognition system,” WIAMIS 2004, Portugal ,May 2004. [5] Volker Krüger, Sommer G, “Wavelet networks for face processing.” JOSA-A(19),No. 6, pp. 1112-1119,June 2002. [6] Volker Krüger, Gerald Sommer, “Gabor wavelet networks for efficient head pose estimation,” Image Vision Compute. 20(9-10), pp. 665-672,2002. [7] R. S. Feris, J. Gemmell, K. Toyama and V. Krueger, “Facial feature detection using a hierarchical wavelet face database,” Microsoft Research Technical Report MSR-TR-2002-05, 2002. Figure3. Face recognition performance of PCA, EGBM, HMM and our proposed method on the ORL database as number of individuals increase. 6.CONCLUSION In this paper, we present a novel method for Face Recognition. Based on EBGM, we analysis the problem of redundant information of elastic matching and [8] Volker Krüger, “Gabor wavelet networks for object representation,” Technical Report CS-TR-4245, University of Maryland, CFAR, May 2001. [9] W. Y. Zhao, R. Chellappa, A. Rosenfeld, and P. J. Phillips. “Face recognition: a literature survey”. UMD CfAR Technical Report CAR-TR-948, 2000.