Clustering of max. spacing algorithm for the clustering problem

advertisement
Probably Approximately Correct
Learning
Yongsub Lim
Applied Algorithm Laboratory
KAIST
Definition
• A class C is PAC learnable by a
hypothesis class  if
 there is an algorithm  such that
   0,   0, c  C, D over X
 m  poly(1 /  ,1 /  ) , # of i.i.d. training examples
sampled from D , such that Pr[errD ( x)   ]  1  
where h   is an output of 
Probably Approximately Correct Learning
2
Example
• Consider the class space C which is the
set of all positive half-lines
• An example is any real number
• Eg)
1 if x  kc
c( x)  
0 if x  kc
1
0
kc
Probably Approximately Correct Learning
3
Proof.
is PAC learnable
• C is PAC learnable by C
D([kc , k c ))  
R
kc
kc
kc
Probably Approximately Correct Learning
4
Proof.
is PAC learnable
• Our algorithm  outputs a hypothesis h
such that kh  maxx : c( x)  0, minx : c( x)  1
• Suppose kh  k c
 errD ( x)   for a positive example x
 errD ( x)  0 for a negative example x

kc
kc
kh
Probably Approximately Correct Learning
kc
5
Proof.
is PAC learnable
• Suppose kh  k c , and it called b





b only occurs if no training example x  R
Prx  R   1  
Prx1  R  xm  R   1    , xi ' s are i.i.d.
m
Prb   1     em
m
Prb  b   2em
kc

R
kc
kc
Probably Approximately Correct Learning
kh
6
Proof.
is PAC learnable
• Prb  b   2e
• PrerrD ( x)     1  2em
m


Pr
err
(
x
)



1

2
e
 1 
•
D
m

R
kc
kc
kc
Probably Approximately Correct Learning
kh
7
Proof.
is PAC learnable
m
1

ln
2

1 2
  ln
m 
Probably Approximately Correct Learning
8
Proof.
is PAC learnable
• The class C is PAC learnable by itself
1 2
with at least  ln  training examples
Probably Approximately Correct Learning
9
A More General Theorem
• If h  H , H   such thath is consistentwith
m independent randomlabeled trainingexamples,
thenfor any  ,   0 we can assert with confidence
1   that therrorof h is less than providedthat:
m
ln   ln1 

Probably Approximately Correct Learning
10
Thanks
Probably Approximately Correct
Learning
11
Download