Numerical Characterizations of Covering Rough Sets Based on

advertisement
Numerical characterizations
of covering rough sets
based on evidence theory
Chen Degang, Zhang Xiao
Department of Mathematics and Physics, North China Electric Power
University, Beijing, 102206, P. R. China
Outline:
1. Introduction
2. Basic notions related to covering rough sets
3. Belief function and plausibility function of
covering rough sets
4. Numerical characterizations of attribute reduction
of covering information system
5. Numerical characterizations of attribute reduction
of covering decision systems
6. Conclusions
1. Introduction
What is covering rough sets?
Covering rough sets are improvements of traditional rough
sets by considering cover of universe instead of partition.
Why do we need covering rough sets?
Partition and equivalence relation are too restrictive to many
applications. One response to this argument is to extend an
equivalence relation to more general relations such as
similarity relation [21], tolerance relation [4,20] or others
[22,25,26]. Another response is to consider a cover instead
of partition and obtain covering rough sets [1-3,5,9,16,28,2932].
1. Introduction
The existing study on covering rough sets
Zakowski employed coverings of universe to establish the generalized
rough sets [28]. Bonikowski et al. [1] studied the structures of covers.
Mordeson [9] examined the relationship between the approximations of
sets defined with respect to covers and some axioms satisfied by
traditional rough sets. Chen et al. [5] discussed the covering rough set
under the framework of a complete completely distributive lattice. Zhu and
Wang [29-32] compared three kinds of generalized rough sets to deal with
vagueness and granularity in information system.
Chen et al.[6] began to develop definition and methods of attribute
reduction with covering rough sets. In [6] the intersection of coverings was
defined and the discernibility matrix was employed to compute all reducts.
Their study established a theoretical foundation for attribute reduction of
covering decision systems.
1. Introduction
Two facts motivate our idea
Among these work on covering rough sets, less effort has been
concentrated on developing measures for covering rough sets up to
now. As well known, in traditional rough set theory different kinds of
measures are proposed to reveal numerical characterizations of
rough sets and applied to develop algorithms of finding reducts.
This fact motivates our idea in this paper to develop measures to
characterize covering rough sets numerically.
As pointed in [23,24], there is closed connection between rough set
theory and evidence theory. This connection further motivates us to
set up connection between covering rough sets and evidence theory,
i.e., to characterize approximations and attribute reductions in
covering rough sets by employing measures in evidence theory.
2. Basic notions related to covering rough sets
We recall the basic concepts related to covering rough sets [6].
Definition 2.1. Let U be a universe, and C a family of subsets of U .
C is called a covering of U if none elements in C is empty and C  U .
Definition 2.2. Let C  {C1 , C2 , , Cn }be a covering of U. For every
x U , let Cx  {C j : C j  C, x  C j } , then Cov(C)  {Cx : x U }
U
U
is also a covering of . We call it the induced covering of C .
Definition 2.3. Let   {Ci : i  1, , m} be a family of coverings ofU
. For every x U , let  x  {Cix : Cix  Cov(Ci )}, thenCov( )  { x : x U }
is also a covering of U. We. call it the induced covering of  .
For every X  U , the lower and the upper approximations of X
with respect to  are defined as follows
 ( X )  { x :  x  X }  ( X )  { x :  x X  }
The positive region Pos ( X )   ( X ) .
3. Belief function and plausibility function of covering
rough sets
In this section, we first discuss the property of lower approximation
and propose a new upper approximation for covering rough sets.
Lemma 3.1. Let(U ,  ) be a covering information system and
Δ  {Ci : i  1,..., m} be a family of coverings of U . For x U and X  U
, we have
 ( X )  { x :  x  X }  {x :  x  X }
In this paper, we define a new upper approximation of X with respect
*
to the induced cover of  as  * ( X )  {x :  x X  }. Here  is
developed in terms of an induced cover, it certainly can be defined for
arbitrary covering. Furthermore, we have the following conclusions.
3. Belief function and plausibility function of covering
rough sets
Theorem 3.2. Suppose Δ  {Ci : i  1,..., m} is a family of coverings of
U , the covering lower approximation  and upper approximation  *
have the following properties:
(Contraction)
( X )  X
(2)  (~ X ) ~ * ( X ) (Duality)
(3L)  ( )  
(Normality)
(1L)
(4L)  (U )  U
(1U)
X  * ( X )
(Extension)
* (~ X ) ~  ( X ) (Duality)
(3U)  * ( )  
(Co-normality) (4U) * (U )  U
(Normality)
(Co-normality)
(5L)( X Y )  ( X ) (Y ) (Multiplication)(5U) * ( X Y )  * ( X ) * (Y )(Addition)
(6L)X  Y   ( X )  (Y )(Monotone) (6U) X  Y  * ( X )  * (Y ) (Monotone)
(7L)  ( ( X ))   ( X ) (Idempotency)(7U) * (* ( X ))  * ( X ) (Idempotency)
3. Belief function and plausibility function of covering
rough sets
Theorem 3.3. Let (U ,  ) be a covering information system,U  {x1 , x2 , , x,n }
for any X  U , denote
*
Bel ( X )   ( X ) n , Pl ( X )   ( X ) n .
Then Bel and Pl are belief and plausibility functions on U respectively,
and the corresponding mass distribution is
 f 1 ( i )

, X   i , here
mΔ ( X )  
n
f : U  Cov( Δ) defined as f ( xi )  i ,

0, otherwise
and f 1 (i )  {xk : f ( xk )  i } .
4. Numerical characterizations of attribute reduction of
covering information system
The reduct of covering information systems is the minimal subset of 
that preserves the induced coveringCov( ) .
Theorem 4.1. Let S  (U ,  ) be a covering information system and  be
a family of coverings, U  {x , x ,
1
2
, xn } , Cov( )  {1 ,  2 ,
,  n }, P   ,
then P is a reduct of S iff
n
n
1
1
BelP (i )  1, and for any nonempty subset P   P , BelP (i )  1.

i 1  i
i 1  i
4. Numerical characterizations of attribute reduction of
covering information system
Theorem 4.2. Let S  (U ,  ) be a covering information system and  be
a family of coverings, U  {x1 , x2 , , xn },Cov( )  {1 ,  2 ,
n
1
Pl Δ (i )  M , then P is a reduct of S iff

i 1  i
n

i 1
,  n }, P   ,
n
1
1
,
and
for
any
nonempty
subset
,
PlP (i )  M
P   P  PlP' (i )  M.
i
i 1  i
4. Numerical characterizations of attribute reduction of
covering information system
From Theorem 4.1 and 4.2 we conclude that the purpose of attribute
reduction in covering information
systems is to find a minimal subset
n
n
1
Bel
(

)


1
or 
PlP (i )  M . In
i
P   which preserves  P i
i 1
i 1  i

Theorem 4.3 M  1may not hold since Δ (i )  i may not always hold.
Generally we always have Δ (i )  i even i is a basic granule, and
this is one difference between covering rough sets and traditional
rough sets since every basic granule equals to its lower and upper
approximations in traditional rough sets.
Now we define the significance of a covering in  in a covering
information system.
4. Numerical characterizations of attribute reduction of
covering information system
Definition 4.3. Let S  (U ,  )be a covering information system.
U  {x1 , x2 , , xn } , we define the significance of the covering C  Δ by
n
n
i 1
i 1
Sig Δ (C)   Bel Δ (i ) i   Bel Δ{C} (i ) i .
Theorem4.4. Let S  (U ,  ) be a covering information system. For
every C  Δ , C is indispensable in D in  iff Sig Δ (C)  0 .
4. Numerical characterizations of attribute reduction of
covering information system
Theorem 4.5. Core( Δ)  {C  Δ : Sig Δ (C)  0}.
Definition 4.6. LetS  (U ,  ) be a covering information system.
U  {x1 , x2 , , xn } , for every covering C  Core( Δ) , we define the
significance of the covering C relative to Core( )by
n
SigCore ( Δ) (C)   BelCore ( Δ)
i 1
n
{C}
(i ) i   BelCore ( Δ) (i ) i
i 1
.
Algorithm 1. Acquire the core and the reduct for a covering
information system.
(1) let Core( )   ;
n
n
i 1
i 1
(2) for each C  Δ, calculate Sig Δ (C)   BelΔ (i ) i   Bel Δ{C} (i ) i
(3) if for every C  Δ , Sig Δ (C)  0, then Core( )   , go to step (6);
(4) If Sig Δ (C)  0 , then let Core( Δ)  Core( Δ) {C};
n
(5) if
 Bel
i 1
Core (  )
(i ) i  1 then return Core( ) , else go to step (6);
(6) let P  Core( ) ;
(7) for each C  { Δ  P}, calculate Sig P (C) ;
(8) if Sig P (C)  max C { Δ-P } Sig P (Ci ) , then P  P {C};
i
n
(9) if
 Bel
i 1
P
(i ) i  1 then stop and output as a reduct P , else
go back to step (7).
2
2
Let   m and U  n , the time complexity of Algorithm 1 is O(m  n ).
By Algorithm 1, we can acquire not only the core but also a proper
reduct.
5. Numerical characterizations of attribute reduction of
covering decision systems
Similar to attribute reduction of decision systems in traditional rough sets,
attribute reduction of covering decision systems aims to find the minimal set
of conditional attributes to preserve the positive region of decision attribute
[6].
Lemma 5.1. LetS  (U ,  , D  {d }) be a covering decision system,
U / D  {D1 , D2 , , Dr }, then we have
r
 Bel ( D )  Pos ( D)
j 1
j
U.
Theorem 5.2. Let S  (U ,  , D  {d }) be a covering decision system,
U  {x1 , x2 , , xn },U / D  {D1 , D2 , , Dr } , P   , then P is a relative reduct
r
of S iff
r
 Bel
j 1
 Bel
j 1
P
r
P
( D j )   Bel ( D j ), and for any nonempty subset P   P,
r
j 1
( D j )   BelP ( D j )
j 1
.
5. Numerical characterizations of attribute reduction of
covering decision systems
The covering decision systems can be divided into consistent covering
decision systems and inconsistent covering decision systems[6]. Generally
speaking, for covering decision systems, we only consider to find a minimal
subset of to preserve the sum of belief functions of all decision classes.
Definition 5.3. Let S  (U ,  , D  {d }) be a covering decision system.
U / D  {D1 , D2 , , Dr }, for everyC  Δ , we define the significance of the
r
r
covering C relative to D in  by Sig Δ D (C)   Bel Δ ( D j )   Bel Δ{C} ( D j )
j 1
i 1
Theorem 5.4. Let S  (U ,  , D  {d }) be a covering decision system.
For every C  Δ , C is indispensable relative to D in  iff Sig Δ D (C)  0 .
5. Numerical characterizations of attribute reduction of
covering decision systems
By Theorem 5.4 and the definition of CoreD ( ), we have the following
result.
Theorem 5.5. CoreD ( Δ)  {C  Δ : Sig Δ D (C)  0} .
Definition 5.6. Let S  (U ,  , D  {d }) be a covering decision system,
U / D  {D1 , D2 ,
, Dr }. For everyCi  CoreD ( ) but C  Δ , we define the
relative significance of the covering C to CoreD ( )by
r
SigCoreD ( Δ) (C)   BelCoreD ( Δ)
j 1
r
{C}
( D j )   BelCoreD ( Δ) ( D j )
i 1
.
Algorithm 2. Acquire the core and the reduction in a covering
decision system.
(1) let Core( )   ;
r
r
j 1
i 1
(2) for each C  Δ , calculate Sig Δ D (C)   Bel Δ ( D j )   Bel Δ{C} ( D j ) ;
(3) if for every C  Δ, Sig Δ D (C)  0 , then CoreD ( )   , go to step (6);
(4) If Sig Δ
r
D
(C)  0, then CoreD ( Δ)  CoreD ( Δ) {C};
r
(5) if  BelCore (  ) ( D j )   Bel ( D j ) then return CoreD ( ) , else go to
j 1
step (6);
j 1
(6) let P  CoreD ( ) ;
r
(7) for each C  { Δ  P}, calculate Sig P (C)   BelP
j 1
r
{C}
(8) if Sig P (C)  max Ci{ Δ-P } Sig P (Ci ) , then P  P {C};
(9) if
r
 Bel
j 1
( D j )   BelP ( D j ) ;
i 1
r
P
( D j )  Bel ( D j ) then stop and output as a reduct P ,
j 1
else go back to step (7).
Let   m and U  n , the time complexity of Algorithm 2 is O(m2  n2 ).
By Algorithm 2, we can acquire not only the core but also a proper
reduct.
6. Conclusions
The covering rough set theory is a generalization of traditional rough
set theory characterized by covers instead of partitions. Since there is
closed connection between rough set theory and evidence theory, we
try to propose belief and plausibility functions to characterize covering
rough sets. The relationships between attribute reduction and belief
(plausibility) function are analyzed in covering information and
decision systems respectively. We give the concepts of significance
and relative significance of coverings to find reducts in covering
information and decision systems. In addition, the relevant algorithms
are designed. In a word, we develop a numerical method to find
reducts by employing the belief function in covering information
(decision) systems. Our future work will concentrate on developing
other measures for covering rough sets based on belief and
plausibility functions in this paper.
References
[1] Z. Bonikowski, E. Bryniarski, U. Wybraniec, Extensions and intentions in the rough set theory, Information
Sciences 107 (1998) 149–167.
[2] Z. Bonikowski, Algebraic structures of rough sets, in: W. Ziarko (Ed.), Rough Sets, Fuzzy Sets and
Knowledge Discovery, Springer-Verlag, London, 1994, pp. 243–247.
[3] E. Bryniarski, A calculus of rough sets of the first order, Bulletin of the Polish Academy of Sciences 16
(1989) 71–77.
[4] G. Cattaneo, Abstract approximate spaces for rough theories, in: Polkowski, Skowron (Eds.), Rough Sets
in Knowledge Discovery1: Methodology and Applications, Physicaverlag, Heidelberg, 1998, pp. 59–98.
[4] G. Cattaneo, Abstract approximation spaces for rough theories, in: L. Polkowski, A. Skowron (Eds.), Rough
Sets in Knowledge Discovery 1: Methodology and Applications, Physica-Verlag, Heidelberg, 1998, pp. 59–98.
[5] D.G. Chen, W.X. Zhang, S. Yeung, C.C. Tsang, Rough approximations on a complete completely
distributive lattice with applications to generalized rough sets, Information Sciences 176 (2006) 1829–1848.
[6] D.G. Chen, C.Z. Wang, Q.H. Hu, A new approach to attribute reduction of consistent and inconsistent
covering decision systems with covering rough sets, Information Sciences 177 (2007) 3500–3518.
[7] A.P. Dempster, Upper and lower probabilities induced by a multi valued mapping, Annals of
Mathematical Statistics 38 (1967) 325–339.
[8] F. Li, Y.Q. Yin, Approaches to knowledge reduction of covering decision systems based on information
theory, Information Sciences 179 (2009) 1694–1704.
[9] J.N. Mordeson, Rough set theory applied to (fuzzy) ideal theory, Fuzzy Sets and Systems 121 (2001) 315–
324.
[10] H.S. Nguyen, D. Slezak, Approximation reducts and association rules correspondence and complexity
results, in: N. Zhong, A. Skowron, S. Oshuga (Eds.), Proceedings of RSFDGrC’99, Yamaguchi, Japan.
LNAI1711, 1999, pp. 137–145.
[11] Z. Pawlak, Rough sets, International Journal of Computer and Information Sciences 11 (1982) 341–356.
[12] Z. Pawlak, Rough Sets: Theoretical Aspects of Reasoning About Data, Kluwer Academic Publishing,
Dordrecht, 1991.
[13] Z. Pawlak, Andrzej Skowron, Rudiments of rough sets, Information Sciences 177 (2006) 3–27.
[14] Z. Pawlak, Andrzej Skowron, Rough sets: Some extensions, Information Sciences 177 (2006) 28–40.
[15] Z. Pawlak, Andrzej Skowron, Rough sets and Boolean reasoning, Information Sciences 177 (2006) 41–73.
[16] J.A. Pomykala, Approximation operations in approximation space, Bulletin of the Polish Academy of
Sciences 9–10 (1987) 653–662.
[17] G. Shafer, A Mathematical Theory of Evidence, Princeton University Press, Princeton, 1976.
[18] A. Skowron, J. Grzymala-Busse, From rough set theory to evidence theory, in: R.R. Yager, M. Fedrizzi, J.
Kacprzyk (Eds.), Advance in the Dempster–Shafer Theory of Evidence, Wiley, New York, 1994, pp. 193–236.
[19] A. Skowron, C. Rauszer, The discernibility matrices and functions in information systems, in: R. Slowinsk
(Ed.), Intelligent Decision support, Handbook of Applications and Advances of the Rough Sets Theory, Kluwer
Academic Publishers, 1992.
[20] A. Skowron, J. Stepaniuk, Tolerance approximation spaces, Fund. Inform. 27 (1996)245–253.
[21] R. Slowinski, D. Vanderpooten, A generalized definition of rough approximations based on similarity, IEEE
Trans. Knowledge Data Eng. 12 (2000) 331–336.
[22] A. Wasilewska, Topological rough algebras, in: T.Y. Lin, N. Cercone (Eds.), Rough Sets & Data Mining,
Kluwer Academic Publishers, Boston, 1997, pp. 425–441.
[23] W. Z. Wu, Y. Leung, W.X. Zhang, Connections between rough set theory and Dempster– Shafer theory of
evidence, International Journal of General Systems 31 (2002) 405–430.
[24] W. Z. Wu, M. Zhang, H.Z. Li, J.S. Mi, Knowledge reduction in random information systems via Dempster–
Shafer theory of evidence, Information Sciences 174 (2005) 143–164.
[25] Y. Y. Yao, Constructive and algebraic methods of theory of rough sets, Inform. Sci. 109 (1998) 21–47.
[26] Y. Y.Yao, Relational interpretations of neighborhood operators and rough set approximation operators,
Inform. Sci. 111 (1998) 239–259.
[27] Y. Y. Yao, P. J. Lingras, Interpretations of belief functions in the theory of rough sets, Information Sciences
104 (1998) 81–106.
[28] W. Zakowski, Approximations in the space , Demonstratio Math. 16(1983) 761–769.
[29] W. Zhu, F.Y. Wang, Some results on the covering generalized rough sets, Pattern Recognition and Artificial
Intelligence 5 (2002) 6–13.
[30] W. Zhu, F.Y. Wang, Reduction and axiomization of covering generalized rough sets, Information Sciences
152 (2003) 217–230.
[31] W. Zhu, Topological approaches to covering rough sets, Information Sciences 177 (2007) 1892–1915.
[32] W. Zhu, F.Y. Wang, On three types of covering-based rough sets, IEEE Trans on Knowledge and Data
Engineering, 19(2007)1131-1144
[33] W. Ziarko, Variable precision rough set model, Journal of Computer and System Sciences 46 (1993) 39–59.
Download