PPT - webdev.fit.cvut.cz

advertisement
Using Interactive Evolution
for Exploratory Data Analysis
Tomáš Řehořek
Czech Technical University in Prague
CIG Research Group

Czech Technical University in Prague

Faculty of Electrical Engineering (FEL)
 Faculty of Information Technology (FIT)
CIG Research Group

Data Mining


Algorithms, Visualization, Automation
Biologically inspired algorithms

Evolutionary computation
 Artificial neural networks

Artificial Intelligence

Machine learning, Optimization
Optimization in Data Mining

Main objective of the CIG research group
Evolutionary
computation
Data
Mining
Artificial
Neural Networks
Machine
learning
Artificial
Intelligence
Optimization
Dimensionality Reduction and
Visualization in Data Mining

Linear projections

Principal Component Analysis (PCA)
 Linear Discriminant Analysis (LDA)

Non-linear projections

Multidimensional Scaling (MDS)
 Sammon Projection
 Kernel PCA
Interactive Evolutionary
Computation (IEC)
Evolutionary Computation using human
evaluation as the fitness function
 Currently used almost exclusively
for artistic purposes



Images, Sounds, Animations…
Inspiration: http://picbreeder.org
Interactive
Evolution
PicBreeder
by
Jimmy
Secretan
Kenneth
Stanley
Next
generation
…
and so on
…
And after 75 generations ...
... you eventually get something interesting
The technology hidden behind
x
z
grayscale
z
Neural net draws the image
x
Neuroevolution
x
grayscale
z
By clicking, you increase fitness of nets
Next generations inherit fit building patterns
Gallery of discovered images
Using Interactive Evolution
in Exploratory Data Analysis

Experiment with evolving
projections
f:
n

Examples in
n-dimensional
space
2D
2
Interactive Evolution of Projections
Machine
Candidate
projections
   



Feedback
Feedback
Human
Interactive Evolution of Projections
Machine
Candidate
projections





Feedback
Feedback
Human
Data Projection Experiments

Linear transformation
 Evolve
coefficient matrix
a1, a2 ,
b , b ,
 1
2
 Do
, an 
, bn 
the transformation using formula:
f  x   i=1ai  xi,

n

… resulting a point in 2D-space
n
i=1
bi  xi 

Data Projection Experiments

Sigmoidal transformation
 Evolve
 a1,1,
a ,
 2,1
coefficient matrix
a1,2 ,
,
a1,n ,
b1,1,
b1,2 ,
,
b1,n ,
c1,1,
c1,2 ,
,
a2,2 ,
,
a2,n ,
b2,1,
b2,2 ,
,
b2,n ,
c2,1,
c2,2 ,
,
 Do
the transformation using formula:
a1,i
a2,i
 n

n
f  x   i=1
, i=1
b1,i  xi c1,i 
b2,i  xi c 2,i  
1+ e
1+ e


b
a
c
c1,n 
c2,n 
Experiments with Wine Dataset
PCA
SOM
Separation of Different Classes
using Linear Projection
Separation of Different Classes
using Sigmoidal Projection
There are many possible goals!
„Blue points down“ – 5 generations, sigmoid
projection
Outlier Detection – 8 generations, linear projection
Conclusion
Interactive Evolution can be used in
Exploratory Data Analysis
 Our experiments show that complex
projections can be easily evolved
 In future, we plan to investigate such
evolution in fields of Data Mining other
than EDA

Thank you for your attention!
Tomáš Řehořek
tomas.rehorek@fit.cvut.cz
Computational Intelligence Group (CIG)
Faculty of Information Technology (FIT)
Czech Technical University (CTU) in Prague
Download