Power Laws, Highly Optimized Tolerance and generalized source

advertisement
Power Laws, Highly Optimized
Tolerance and generalized
source coding
John Doyle, J.M. Carlson
Presented by Sumitra Ganesh
Overview


Propose an optimization problem that is
a generalization of the Data
Compression (Source coding) problem
Problem is formulated and analyzed for
3 examples : Data compression (DC),
Website layout (WWW), Forest Fires
(FF)
The PLR optimization problem
J  { pi li | li  f ( ri ),  ri  R}
1 i  N
pi
Probability of event
li
Loss/size associated with event
ri
Resource allocated to suppress the size
f  ( ri )   log( ri ),   0
f  (ri )  c(ri    1) /  ,   0
Data Compression
 0
Each source symbol – codeword of
k
p

2
Bits. i
i
2
li
1
ri  pi
li   log( pi )  ki
J   pi log( pi )
li
Web Layout




 1
The sizes are the lengths of the files
The probabilities are determined by
user interest in the files
The cost is the average delay that a
user experiences in downloading files
Resource limit is on the total number of
files
WWW contd
Crucial : The resource-loss tradeoff is determined by
li  ri
1
The resource allocation (separation if document
into files) may change influence the hit probability
for a given file.
Split the one-dimensional file into N regions of
equal size and with uniform access probability

Forest Fires





 1
Two-dimensional problem
The losses are the burned areas
The probabilities are determined by the
probability of sparks
The cost is the average timber lost in
fires
Resource is the density of firebreaks
etc.
Heavy tailed CDFs
Comments



Generalized framework : not clear about
its applicability
The average statistics seem to obey
power laws even though the individual
websites were not optimally designed
(Why?)
The three examples are explained well
Download