Using Interactive Evolution for Exploratory Data Analysis Tomáš Řehořek Czech Technical University in Prague
CIG Research Group Czech Technical University in Prague Faculty of Electrical Engineering (FEL) Faculty of Information Technology (FIT)
CIG Research Group Data Mining Algorithms, Visualization, Automation Biologically inspired algorithms Evolutionary computation Artificial neural networks Artificial Intelligence Machine learning, Optimization
Optimization in Data Mining Main objective of the CIG research group Data Mining Evolutionary computation Artificial Intelligence Optimization Machine learning Artificial Neural Networks
Dimensionality Reduction and Visualization in Data Mining Linear projections Principal Component Analysis (PCA) Linear Discriminant Analysis (LDA) Non-linear projections Multidimensional Scaling (MDS) Sammon Projection Kernel PCA
Interactive Evolutionary Computation (IEC) Evolutionary Computation using human evaluation as the fitness function Currently used almost exclusively for artistic purposes Images, Sounds, Animations… Inspiration:
PicBreeder Jimmy Secretan Kenneth Stanley Interactive Evolution by
Next generation … and so on …
And after 75 generations you eventually get something interesting
The technology hidden behind x z grayscale x z Neural net draws the image
Neuroevolution grayscale By clicking, you increase fitness of nets Next generations inherit fit building patterns x z
Gallery of discovered images
Using Interactive Evolution in Exploratory Data Analysis Experiment with evolving projections Examples in n-dimensional space 2D
Interactive Evolution of Projections Machine Human Candidate projections Feedback
Interactive Evolution of Projections Machine Human Candidate projections Feedback
Data Projection Experiments Linear transformation Evolve coefficient matrix Do the transformation using formula: … resulting a point in 2D-space
Data Projection Experiments Sigmoidal transformation Evolve coefficient matrix Do the transformation using formula: a b c
Experiments with Wine Dataset PCASOM
Separation of Different Classes using Linear Projection
Separation of Different Classes using Sigmoidal Projection
There are many possible goals! „Blue points down“ – 5 generations, sigmoid projection Outlier Detection – 8 generations, linear projection
Conclusion Interactive Evolution can be used in Exploratory Data Analysis Our experiments show that complex projections can be easily evolved In future, we plan to investigate such evolution in fields of Data Mining other than EDA
Thank you for your attention! Tomáš Řehořek Computational Intelligence Group (CIG) Faculty of Information Technology (FIT) Czech Technical University (CTU) in Prague