Recent Researches in Artificial Intelligence, Knowledge Engineering and Data Bases On Possibilistic Correlation Coefficient and Ratio for Fuzzy Numbers PÉTER VÁRLAKI JÓZSEF MEZEI ISTVÁN Á. HARMATI ROBERT FULLÉR Bud. Univ. of Tech. Econ. Åbo Akademi Széchenyi István University Åbo Akademi Széchenyi István University TUCS Dep. of Math. Comp. Sci. IAMSR Egyetem tér 1, Győr Joukahaisenkatu 3-5, Åbo Egyetem tér 1, Győr Joukahaisenkatu 3-5, Åbo HUNGARY FINLAND HUNGARY FINLAND varlaki@kme.bme.hu jmezei@abo.fi harmati@sze.hu rfuller@abo.fi Abstract: In this paper we show some properties of possibilistic correlation coefficient and correlation ratio for marginal possibility distributions. We will show some examples for their computation from some joint possibility distributions. Key–Words: Fuzzy number, Possibility distribution, Correlation coefficient, Correlation ratio 1 Introduction Definition 1 A function g : [0, 1] → R is said to be a weighting function if g is non-negative, monotone increasingR and satisfies the following normalization 1 condition 0 g(γ)dγ = 1. In probability theory the expected value of functions of random variables plays a fundamental role in defining the basic characteristic measures of probability distributions. For example, the variance, covariance and correlation of random variables can be computed as the expected value of their appropriately chosen real-valued functions. In possibility theory we can use the principle of expected value of functions on fuzzy sets to define variance, covariance and correlation of possibility distributions. Marginal probability distributions are determined from the joint one by the principle of ’falling integrals’ and marginal possibility distributions are determined from the joint possibility distribution by the principle of ’falling shadows’. Probability distributions can be interpreted as carriers of incomplete information [12], and possibility distributions can be interpreted as carriers of imprecise information. In 1987 Dubois and Prade [3] defined an interval-valued expectation of fuzzy numbers, viewing them as consonant random sets. They also showed that this expectation remains additive in the sense of addition of fuzzy numbers. In possibility theory we can use the principle of average value of appropriately chosen real-valued functions to define mean value, variance, covariance and correlation of possibility distributions. Namely, we can equip each level set of a possibility distribution (represented by a fuzzy number) with a uniform probability distribution, then apply their standard probabilistic calculation, and then define measures on possibility distributions by integrating these weighted probabilistic notions over the set of all membership grades [1, 4]. These weights (or importances) can be given by weighting functions. ISBN: 978-960-474-273-8 Different weighting functions can give different (casedependent) importances to level-sets of possibility distributions. We should note here that the choice of uniform probability distribution on the level sets of possibility distributions is not without reason. We suppose that each point of a given level set is equally possible and then we apply Laplace’s principle of Insufficient Reason: if elementary events are equally possible, they should be equally probable (for more details and generalization of principle of Insufficient Reason see [2], page 59). In this paper we will discuss the concepts of possibilistic correlation coefficient and ratio for marginal possibility distributions of joint possibility distributions. Definition 2 A fuzzy number A is a fuzzy set R with a normal, fuzzy convex and continuous membership function of bounded support. The family of fuzzy numbers is denoted by F. Fuzzy numbers can be considered as possibility distributions. A fuzzy set C in R2 is said to be a joint possibility distribution of fuzzy numbers A, B ∈ F, if it satisfies the relationships max{x | C(x, y)} = B(y), and max{y | C(x, y)} = A(x), for all x, y ∈ R. Furthermore, A and B are called the marginal possibility distributions of C. A γ-level 263 Recent Researches in Artificial Intelligence, Knowledge Engineering and Data Bases set (or γ-cut) of a fuzzy number A is a non-fuzzy set denoted by [A]γ and defined by In other words, the g-weighted possibilistic correlation coefficient is nothing else, but the g-weighted average of the probabilistic correlation coefficients ρ(Xγ , Yγ ) for all γ ∈ [0, 1]. In 2011 Harmati [10] proved that the correlation coefficient depends only on the joint possibility distribution, but not directly on its marginal distributions. In other words exact knowledge of the marginal possibility distributions does not give any restrictions for the possibilistic correlation coefficient and also for the g-weighted possibilistic correlation coefficient. If A and B are non-interactive fuzzy numbers then their joint possibility distribution is defined by C = A × B. Since all [C]γ are rectangular and the probability distribution on [C]γ is defined to be uniform we get cov(Xγ , Yγ ) = 0, for all γ ∈ [0, 1]. So Covg (A, B) = 0 and ρg (A, B) = 0 for any weighting function g. That is, non-interactivity entails zero correlation. Fuzzy numbers A and B are said to be in perfect correlation, if there exist q, r ∈ R, q 6= 0 such that their joint possibility distribution is defined by [1] [A]γ = {t ∈ X | A(t) ≥ γ}, if γ > 0 and cl(suppA) if γ = 0, where cl(suppA) denotes the closure of the support of A. Let A ∈ F be fuzzy number with a γ-level set denoted by [A]γ = [a1 (γ), a2 (γ)], γ ∈ [0, 1] and let Uγ denote a uniform probability distribution on [A]γ , γ ∈ [0, 1]. 2 A Correlation Coefficient for Marginal Possibility Distributions In 2004 Fullér and Majlender [4] introduced the notion of covariance between marginal distributions of a joint possibility distribution C as the expected value of their interactivity function on C. That is, the gweighted measure of interactivity between A ∈ F and B ∈ F (with respect to their joint distribution C) is defined by their measure of possibilistic covariance [4], as C(x1 , x2 ) = A(x1 ) · χ{qx1 +r=x2 } (x1 , x2 ), 1 Z Covg (A, B) = cov(Xγ , Yγ )g(γ)dγ, 0 where χ{qx1 +r=x2 } , stands for the characteristic function of the line where Xγ and Yγ are random variables whose joint distribution is uniform on [C]γ for all γ ∈ [0, 1], and cov(Xγ , Yγ ) denotes their probabilistic covariance. It is easy to see that the possibilistic covariance is an absolute measure in the sense that it can take any value from the real line. To have a relative measure of interactivity between marginal distributions Fullér, Mezei and Várlaki introduced the principle of correlation (normalized covariance) in 2011 (see [9]) that improves the earlier definition introduced by Carlsson, Fullér and Majlender in 2005 (see [1]). {(x1 , x2 ) ∈ R2 |qx1 + r = x2 }. If A and B have a perfect positive (negative) correlation then from ρ(Xγ , Yγ ) = 1 (ρ(Xγ , Yγ ) = −1) (see [1] for details), for all γ ∈ [0, 1], we get ρg (A, B) = 1 (ρg (A, B) = −1) for any weighting function g. We emphasize here that zero correlation does not always imply non-interactivity. Let A, B ∈ F be fuzzy numbers, let C be their joint possibility distribution, and let γ ∈ [0, 1]. Suppose that [C]γ is symmetrical, i.e. there exists a ∈ R such that Definition 3 The g-weighted possibilistic correlation coefficient of A, B ∈ F (with respect to their joint distribution C) is defined by Z C(x, y) = C(2a − x, y), 1 ρg (A, B) = for all x, y ∈ [C]γ (the line defined by {(a, t)|t ∈ R} is the axis of symmetry of [C]γ ). In this case cov(Xγ , Yγ ) = 0 and ρg (A, B) = 0 for any weighting function g. (see [5]). We should note here that there exist several other ways to define correlation coefficient for fuzzy numbers, e.g. Liu and Kao [15] used fuzzy measures to define a fuzzy correlation coefficient of fuzzy numbers and they formulated a pair of nonlinear programs to find the α-cut of this fuzzy correlation coefficient, then, in a special case, Hong [11] showed an exact calculation formula for this fuzzy correlation coefficient. ρ(Xγ , Yγ )g(γ)dγ, 0 where ρ(Xγ , Yγ ) = p cov(Xγ , Yγ ) p , var(Xγ ) var(Yγ ) and, where Xγ and Yγ are random variables whose joint distribution is uniform on [C]γ for all γ ∈ [0, 1], and cov(Xγ , Yγ ) denotes their probabilistic covariance. ISBN: 978-960-474-273-8 264 Recent Researches in Artificial Intelligence, Knowledge Engineering and Data Bases 3 A Correlation Ratio for Marginal Possibility Distributions 4 In this Section we will compute the correlation coefficient and the correlation ratio for A and B when their joint possibility distribution, C, is defined by C(x, y) = T (A(x), B(y)), where T is a t-norm [20]. We will consider the Mamdani t-norm [17], Łukasiewicz t-norm [16] and Larsen t-norm [14]. For simplicity we will consider fuzzy numbers of same shape, x if 0 ≤ x ≤ 1 A(x) = , 0 otherwise y if 0 ≤ y ≤ 1 B(y) = . 0 otherwise In statistics, the correlation ratio is a measure of the relationship between the statistical dispersion within individual categories and the dispersion across the whole population or sample. The correlation ratio was originally introduced by Karl Pearson [18] as part of analysis of variance and it was extended to random variables by Andrei Nikolaevich Kolmogorov [13] as, η 2 (X|Y ) = Examples D2 [E(X|Y )] , D2 (X) where X and Y are random variables. If X and Y have a joint probability density function, denoted by f (x, y), then we can compute η 2 (X|Y ) using the following formulas Z ∞ E(X|Y = y) = xf (x|y)dx Recall that X and Y are random variables then their correlation coefficient is computed from, ρ(X, Y ) = p −∞ and E(XY ) − E(X)E(Y ) p − E 2 (X) E(Y 2 ) − E 2 (Y ) E(X 2 ) and their correlation ratio is defined by, D2 [E(X|Y )] = E(E(X|y) − E(X))2 , and where, η 2 (X, Y ) = f (x, y) . f (y) It measures a functional dependence between random variables X and Y . It takes on values between 0 (no functional dependence) and 1 (purely deterministic dependence). In probability theory the joint distribution function is always connected with its marginals by a copula (Sklar theorem [19]), but it is not always true for joint possibility distributions [6]. In 2010 Fullér, Mezei and Várlaki introduced the definition of possibilistic correlation coefficient for marginal possibility distributions (see [6]). E(E 2 (X|Y )) − E 2 (X) . E(X 2 ) − E 2 (X) f (x|y) = 4.1 In this case the joint possibility distribution C(x, y) is defined by the min operator, C(x, y) = min{A(x), B(y)}. Then a γ-level set of C is, [C]γ = (x, y) ∈ R2 | γ ≤ x, y ≤ 1 . The joint density function of a uniform distribution on [C]γ is 1 if (x, y) ∈ [C]γ , f (x, y) = Tγ 0 otherwise , Definition 4 Let us denote A and B the marginal possibility distributions of a given joint possibility distribution C. Then the g-weighted possibilistic correlation ratio of marginal possibility distribution A with respect to marginal possibility distribution B is defined by Z 1 2 ηg (A|B) = η 2 (Xγ |Yγ )g(γ)dγ where Tγ = (1 − γ)2 denotes the area of the γ-level set. The marginal density functions are obtained from f (x, y) by Z f1 (x) = f (x, y) dy 0 where Xγ and Yγ are random variables whose joint distribution is uniform on [C]γ for all γ ∈ [0, 1], and η 2 (Xγ |Yγ ) denotes their probabilistic correlation ratio. that is, So the g-weighted possibilistic correlation ratio of the fuzzy number A on B is nothing else, but the g-weighted average of the probabilistic correlation ratios η 2 (Xγ |Yγ ) for all γ ∈ [0, 1]. ISBN: 978-960-474-273-8 Mamdani t-norm 1 (1 − γ) if γ ≤ x ≤ 1 , f1 (x) = T γ 0 otherwise . 265 Recent Researches in Artificial Intelligence, Knowledge Engineering and Data Bases The expected values are, 1 Z E(Xγ ) = E(Yγ ) = The joint density function of a uniform distribution on [C]γ is 1 if (x, y) ∈ [C]γ , f (x, y) = Tγ 0 otherwise , 1+γ . 2 xf1 (x) dx = γ and, where E(Xγ2 ) = E(Yγ2 ) = 1 Z x2 f1 (x) dx = γ 1+γ+ 3 γ2 (1 − γ)2 Tγ = 2 denotes the area of the γ-level set. The marginal density function 1 (x − γ) if γ ≤ x ≤ 1 , f1 (x) = Tγ 0 otherwise . . The expected value of the product is E(Xγ Yγ ) = Z γ 1Z γ 1 (1 + γ)2 xyf (x, y) dy dx = . 4 The expected values are: Z 1 1 2+γ E(Xγ ) = E(Yγ ) = x (x − γ) dx = . Tγ 3 γ The conditional expected value is 1 Z E(X|Y ) = xf (x|y) dx γ 1 Z = γ 1 Z = γ E(Xγ2 ) f (x, y) dx x f2 (y) 1 x dx 1−γ = E (X|Y )f2 (y) dy Z 1 = γ 1+γ 2 !2 1 (1 − γ) dy Tγ (1 + γ)2 . 4 (γ + 5)(γ + 1) . 12 The conditional expected value: Z 1 2+γ−y 1 E(X|Y ) = x dx = . 2 1+γ−y y − γ Łukasiewicz t-norm In this example we define the joint possibility distribution by the Łukasiewitz t-norm, that is, C(x, y) = max{A(x) + B(y) − 1, 0}. Z 2 E(E (X|Y )) = γ Then a γ-level set of C is [C]γ = (x, y) ∈ R2 | γ ≤ x, y ≤ 1, x + y ≥ 1 + γ . ISBN: 978-960-474-273-8 1 (x − γ) dx Tγ From these we can compute the correlation coefficient: 1 ρ(Xγ , Yγ ) = − 2 The conditional density function is: f (x, y) 1 if γ ≤ y ≤ 1 , f (x|y) = = y − γ f2 (y) 0 otherwise . From these results it follows the correlation ratio is also equal to zero. It is not surprising, since in this case A and B are non-interactive fuzzy numbers. 4.2 x2 + 2γ + 3 . 6 = 2 γ = γ2 1 E(E (X|Y )) = 1 = The expected value of the product is, Z 1Z 1 1 E(Xγ Yγ ) = xy dy dx T γ γ 1+γ−x and, Z Z γ 1+γ . = 2 2 = E(Yγ2 ) 266 = 1 2+γ−y 2 !2 3γ 2 + 10γ + 11 . 24 1 (y − γ) dy Tγ Recent Researches in Artificial Intelligence, Knowledge Engineering and Data Bases With this result we can compute the correlation ratio and we get η 2 (Xγ , Yγ ) = 1 4 ⇒ η(Xγ , Yγ ) = 0.5 0.4 1 2 0.3 In this example η 2 = ρ2 , because of the linear relationship between A and B. 0.2 4.3 Larsen t-norm In this case we define the joint possibility distribution C by the product t-norm, C(x, y) = A(x)B(y). Then a γ-level set of C is, [C]γ = (x, y) ∈ R2 | 0 ≤ x, y ≤ 1, xy ≥ γ . 0 0 E(E 2 (X|Y )) = = 1 (1 − Tγ 2 x γ 2 γ) γ 0.6 0.8 1 11 3 (γ + 2γ 2 − 5γ + 2 − 2γ ln γ) . 8 Tγ From these expression we could determine the correlation coefficient and the correlation ratio for any γ. The final formulas are very long and difficult, so they are omitted here. The expected values are, 1 0.4 and, furthermore, where Tγ = 1 − γ + γ ln γ denotes the area of the γ-level set. The marginal density function 1 x−γ if γ ≤ x ≤ 1 , f1 (x) = Tγ x 0 otherwise . Z 0.2 Figure 1: The absolute value of the correlation coefficient (ρ) and the correlation ratio (η) as the function of γ. The joint density function of a uniform distribution on [C]γ is 1 if (x, y) ∈ [C]γ , f (x, y) = Tγ 0 otherwise , E(Xγ ) = E(Yγ ) = |ρ| η 0.1 Acknowledgements: This work was supported in part by the project TAMOP 421B at the Széchenyi István University, Győr. 1 x−γ dx Tγ x . References: E(Xγ2 ) = E(Yγ2 ) = Z 1 x2 γ = [1] C. Carlsson, R. Fullér and P. Majlender, On possibilistic correlation, Fuzzy Sets and Systems, 155(2005)425-445. doi: 10.1016/j.fss.2005.04.014 1 x−γ dx Tγ x 11 (1 − γ)2 (γ + 2) . 6 Tγ [2] D. Dubois, Possibility theory and statistical reasoning, Computational Statistics & Data Analysis, 5(2006), pp. 47-69. doi: 10.1016/j.csda.2006.04.015 The expected value of the product is, Z 1Z 1 1 E(Xγ Yγ ) = xy dy dx Tγ γ γ/x = [3] D. Dubois and H. Prade, The mean value of a fuzzy number, Fuzzy Sets and Systems, 24(1987), pp. 279-300. doi: 10.1016/01650114(87)90028-5 11 (1 − γ 2 + 2γ 2 ln γ) . 4 Tγ The conditional expected value is, E(X|Y ) = ISBN: 978-960-474-273-8 [4] R. Fullér and P. Majlender, On interactive fuzzy numbers, Fuzzy Sets and Systems, 143(2004), pp. 355-369. doi: 10.1016/S01650114(03)00180-5 1y + γ . 2 y 267 Recent Researches in Artificial Intelligence, Knowledge Engineering and Data Bases [14] P. M. Larsen, Industrial applications of fuzzy logic control, International Journal of Man–Machine Studies, 12(1980), 3–10. doi: 10.1016/S0020-7373(80)80050-2 [15] S.T. Liu, C. Kao, Fuzzy measures for correlation coefficient of fuzzy numbers, Fuzzy Sets and Systems, 128(2002), pp. 267-275. [16] J. Łukasiewicz, O logice trójwartościowej (in Polish). Ruch filozoficzny 5(1920), pp. 170171. English translation: On three-valued logic, in L. Borkowski (ed.), Selected works by Jan Łukasiewicz, NorthHolland, Amsterdam, 1970, pp. 87-88. [17] E.H. Mamdani, Advances in the linguistic synthesis of fuzzy controllers, International Journal of Man–Machine Studies, 8(1976), issue 6, 669– 678. doi: 10.1016/S0020-7373(76)80028-4 [18] K. Pearson, On a New Method of Determining Correlation, when One Variable is Given by Alternative and the Other by Multiple Categories, Biometrika, Vol. 7, No. 3 (Apr., 1910), pp. 248257. [19] A. Sklar, Fonctions de Répartition àn Dimensions et Leurs Marges. Paris, France: Publ. Inst. Statist. Univ. Paris, 1959(8), pp. 229-231. [20] B. Schweizer and A. Sklar, Associative functions and abstract semigroups, Publ. Math. Debrecen, 10(1963), pp. 69-81. [5] R. Fullér and P. Majlender, On interactive possibility distributions, in: V.A. Niskanen and J. Kortelainen eds., On the Edge of Fuzziness, Studies in Honor of Jorma K. Mattila on His Sixtieth Birthday, Acta universitas Lappeenrantaensis, No. 179, 2004 61-69. [6] R. Fullér, J. Mezei and P. Várlaki, A Correlation Ratio for Possibility Distributions, in: E. Hüllermeier, R. Kruse, and F. Hoffmann (Eds.): Computational Intelligence for Knowledge-Based Systems Design, Proceedings of the International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2010), June 28 - July 2, 2010, Dortmund, Germany, Lecture Notes in Artificial Intelligence, vol. 6178(2010), Springer-Verlag, Berlin Heidelberg, pp. 178187. doi: 10.1007/978-3-642-14049-5 19 [7] R. Fullér, J. Mezei and P. Várlaki, Some Examples of Computing the Possibilistic Correlation Coefficient from Joint Possibility Distributions, in: Imre J. Rudas, János Fodor, Janusz Kacprzyk eds., Computational Intelligence in Engineering, Studies in Computational Intelligence Series, vol. 313/2010, Springer Verlag, [ISBN 9783-642-15219-1], pp. 153-169. doi: 10.1007/9783-642-15220-7 13 [8] R. Fullér, I. Á. Harmati and P. Várlaki, On Possibilistic Correlation Coefficient and Ratio for Triangular Fuzzy Numbers with Multiplicative Joint Distribution, in: Proceedings of the Eleventh IEEE International Symposium on Computational Intelligence and Informatics (CINTI 2010), November 18-20, 2010, Budapest, Hungary, [ISBN 978-1-4244-9278-7], pp. 103-108 doi: 10.1109/CINTI.2010.5672266 [9] R. Fullér, J. Mezei and P. Várlaki, An improved index of interactivity for fuzzy numbers, Fuzzy Sets and Systems, 165(2011), pp. 56-66. doi:10.1016/j.fss.2010.06.001 [10] I. Á. Harmati, A note on f-weighted possibilistic correlation for identical marginal possibility distributions, Fuzzy Sets and Systems, 165(2011), pp. 106-110. doi: 10.1016/j.fss.2010.11.005 [11] D.H. Hong, Fuzzy measures for a correlation coefficient of fuzzy numbers under TW (the weakest t-norm)-based fuzzy arithmetic operations, Information Sciences, 176(2006), pp. 150-160. [12] E.T. Jaynes, Probability Theory : The Logic of Science, Cambridge University Press, 2003. [13] A.N. Kolmogorov, Grundbegriffe der Wahrscheinlichkeitsrechnung, Julius Springer, Berlin, 1933, 62 pp. ISBN: 978-960-474-273-8 268