Transformation of Input Space using Statistical Moments: EA-Based Approach Ahmed Kattan: Um Al Qura University, Saudi Arabia Michael Kampouridis: University.

Slides:



Advertisements
Similar presentations
Random Forest Predrag Radenković 3237/10
Advertisements

Yuri R. Tsoy, Vladimir G. Spitsyn, Department of Computer Engineering
CHAPTER 2: Supervised Learning. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Learning a Class from Examples.
Forecasting Models With Linear Trend. Linear Trend Model If a modeled is hypothesized that has only linear trend and random effects, it will be of the.
Genetic Algorithms By: Anna Scheuler and Aaron Smittle.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
« هو اللطیف » By : Atefe Malek. khatabi Spring 90.
The General Linear Model. The Simple Linear Model Linear Regression.
Biologically Inspired AI (mostly GAs). Some Examples of Biologically Inspired Computation Neural networks Evolutionary computation (e.g., genetic algorithms)
Institute of Intelligent Power Electronics – IPE Page1 Introduction to Basics of Genetic Algorithms Docent Xiao-Zhi Gao Department of Electrical Engineering.
The loss function, the normal equation,
TEMPLATE DESIGN © Genetic Algorithm and Poker Rule Induction Wendy Wenjie Xu Supervised by Professor David Aldous, UC.
Object Recognition Using Genetic Algorithms CS773C Advanced Machine Intelligence Applications Spring 2008: Object Recognition.
Greg GrudicIntro AI1 Introduction to Artificial Intelligence CSCI 3202 Fall 2007 Introduction to Classification Greg Grudic.
A new crossover technique in Genetic Programming Janet Clegg Intelligent Systems Group Electronics Department.
Support Vector Machines Kernel Machines
Sparse Kernels Methods Steve Gunn.
Selecting Informative Genes with Parallel Genetic Algorithms Deodatta Bhoite Prashant Jain.
Neural Optimization of Evolutionary Algorithm Strategy Parameters Hiral Patel.
Genetic Algorithm What is a genetic algorithm? “Genetic Algorithms are defined as global optimization procedures that use an analogy of genetic evolution.
Genetic Programming. Agenda What is Genetic Programming? Background/History. Why Genetic Programming? How Genetic Principles are Applied. Examples of.
Classification and Prediction: Regression Analysis
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.
Computer Implementation of Genetic Algorithm
Efficient Model Selection for Support Vector Machines
Using Genetic Programming to Learn Probability Distributions as Mutation Operators with Evolutionary Programming Libin Hong, John Woodward, Ender Ozcan,
Soft Computing Lecture 18 Foundations of genetic algorithms (GA). Using of GA.
Improved Gene Expression Programming to Solve the Inverse Problem for Ordinary Differential Equations Kangshun Li Professor, Ph.D Professor, Ph.D College.
Machine Learning CSE 681 CH2 - Supervised Learning.
Machine Learning Seminar: Support Vector Regression Presented by: Heng Ji 10/08/03.
Ch.12 Machine Learning Genetic Algorithm Dr. Bernard Chen Ph.D. University of Central Arkansas Spring 2011.
GATree: Genetically Evolved Decision Trees 전자전기컴퓨터공학과 데이터베이스 연구실 G 김태종.
An Iterative Heuristic for State Justification in Sequential Automatic Test Pattern Generation Aiman H. El-MalehSadiq M. SaitSyed Z. Shazli Department.
What is Genetic Programming? Genetic programming is a model of programming which uses the ideas (and some of the terminology) of biological evolution to.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Design of an Evolutionary Algorithm M&F, ch. 7 why I like this textbook and what I don’t like about it!
Introduction to Evolutionary Algorithms Session 4 Jim Smith University of the West of England, UK May/June 2012.
SVM Support Vector Machines Presented by: Anas Assiri Supervisor Prof. Dr. Mohamed Batouche.
Neural and Evolutionary Computing - Lecture 9 1 Evolutionary Neural Networks Design  Motivation  Evolutionary training  Evolutionary design of the architecture.
How to apply Genetic Algorithms Successfully Prabhas Chongstitvatana Chulalongkorn University 4 February 2013.
1 5. Application Examples 5.1. Programmable compensation for analog circuits (Optimal tuning) 5.2. Programmable delays in high-speed digital circuits (Clock.
Learning by Simulating Evolution Artificial Intelligence CSMC February 21, 2002.
D. M. J. Tax and R. P. W. Duin. Presented by Mihajlo Grbovic Support Vector Data Description.
Concept learning, Regression Adapted from slides from Alpaydin’s book and slides by Professor Doina Precup, Mcgill University.
Correlation & Regression Analysis
Ensemble Methods in Machine Learning
Automated discovery in math Machine learning techniques (GP, ILP, etc.) have been successfully applied in science Machine learning techniques (GP, ILP,
Alice E. Smith and Mehmet Gulsen Department of Industrial Engineering
Solving Function Optimization Problems with Genetic Algorithms September 26, 2001 Cho, Dong-Yeon , Tel:
Improving Support Vector Machine through Parameter Optimized Rujiang Bai, Junhua Liao Shandong University of Technology Library Zibo , China { brj,
EE459 I ntroduction to Artificial I ntelligence Genetic Algorithms Practical Issues: Selection.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Evolving RBF Networks via GP for Estimating Fitness Values using Surrogate Models Ahmed Kattan Edgar Galvan.
A Kernel Approach for Learning From Almost Orthogonal Pattern * CIS 525 Class Presentation Professor: Slobodan Vucetic Presenter: Yilian Qin * B. Scholkopf.
1 Autonomic Computer Systems Evolutionary Computation Pascal Paysan.
Agenda  INTRODUCTION  GENETIC ALGORITHMS  GENETIC ALGORITHMS FOR EXPLORING QUERY SPACE  SYSTEM ARCHITECTURE  THE EFFECT OF DIFFERENT MUTATION RATES.
Metaheuristics for the New Millennium Bruce L. Golden RH Smith School of Business University of Maryland by Presented at the University of Iowa, March.
Pattern recognition – basic concepts. Sample input attribute, attribute, feature, input variable, independent variable (atribut, rys, příznak, vstupní.
Evolutionary Programming A.E. Eiben and J.E. Smith, Introduction to Evolutionary Computing Chapter 5.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Genetic Algorithm(GA)
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
Evolutionary Computation Evolving Neural Network Topologies.
Data Transformation: Normalization
Evolutionary Algorithms Jim Whitehead
Who cares about implementation and precision?
EE368 Soft Computing Genetic Algorithms.
Product moment correlation
Presentation transcript:

Transformation of Input Space using Statistical Moments: EA-Based Approach Ahmed Kattan: Um Al Qura University, Saudi Arabia Michael Kampouridis: University of Kent, UK Yew-Soon Ong: Nanyang Technological University, Singapore Khalid Mehamdi: Um Al Qura University, Saudi Arabia

The problem Standard Regression models are presented with –Observational data of the form (x i, y i ) i=1…n –Each x i denotes a k-dimensional input vector of design variables and y is the response. When k ≫ n, high variance and over-fitting become a major concern.

The problem High dimensional regression problem Regression Model Poor approximation

Solutions Curse of dimensionality is solved by: –R–Reduce number of dimensions by selecting important features (e.g., PCA, FDA,..etc.) –T–Transformation of input space (e.g., GP, FFX,..etc.) Majority of work in this topic has been done for classification problems. The idea of transforming input space to reduce the number of design variables in the regression problems to improve generalisation is relatively little explored thus far.

Contributions of this work Contributions A novel evolutionary approach to transform the high-dimensional input space of regression models using only statistical moments. analysis to understand the impact of different statistical moments on the evolved transformation procedure dramatically improve LR’s generalisation and make it competitive to other state-of-the-art regression models.

The proposed transformation (x i, y i ) (z i, y i ) Transformation x1x1,,, xkxk x0x0 z1z1 znzn z0z0 We transform the input vector x into and vector called z. The z is smaller than x and easier to be approximated by standard regression models.

The proposed transformation We used standard Genetic Algorithm

Genetic Algorithm Population representation

Genetic Algorithm – Search operators Crossover in which two individuals exchange statistical moments and their parameters, randomly. op 0 op 1 op 2 op g a0a2a3a7a5a8a0a2a3a7a5a8 a 2 a 3 a 4 a 2 a 7... a0a2a7…a0a2a7… a0a5a6a7a9…a0a5a6a7a9… …. op 0 op 1 op 2 op g a0a2a3a7a5a8a0a2a3a7a5a8 a 2 a 3 a 4 a 2 a 7... a0a2a7…a0a2a7… a0a5a6a7a9…a0a5a6a7a9… ….

Genetic Algorithm – Search operators Aggressive mutation operator that replaces a statistical moment and its parameters, randomly selected, with another randomly selected moments from the pool of statistical moments. op 1 op 2 op g a0a2a3a7a5a8a0a2a3a7a5a8 a 2 a 3 a 4 a 2 a 7... a0a2a7…a0a2a7… a0a5a6a7a9…a0a5a6a7a9… …. a4a3a9…a4a3a9… op 0 New op 0

Genetic Algorithm – Search operators Smooth mutation operator where a parameter of a randomly selected statistical moment is mutated into a new parameter. op 0 op 1 op 2 op g a0a2a3a7a5a8a0a2a3a7a5a8 a 2 a 3 a 4 a 2 a 7... a0a2a7…a0a2a7… a0a5a6a7a9…a0a5a6a7a9… …. a4a4

Genetic Algorithm – Fitness measure We used average prediction errors of Linear Regression (LR) as a fitness measure for GA. LR is a very simple algorithm where it considers the family of linear hypotheses:

Genetic Algorithm – Fitness measure Why LR ? –Hence, given these features LR can push the GA’s evolutionary process to linearly align the transformed inputs with their outputs and minimise the dimensionality of the new space.

Genetic Algorithm – Fitness measure The GA aims to minimise the following fitness function:

Genetic Algorithm – Training Two disjoint sets: training and validation. LR: two-folds cross-validation approach. The best individual in each generation is further tested with the validation set. We select the individual that yields the best performance on the validation set across the run.

Empirical tests We tested the effects of the transformation procedure on LR and compared the results against five regression models, namely: 1.RBFN 2.RBFN + PCA 3.Kriging 4.Kriging + PCA 5.LR 6.LR + PCA 7. piecewise LR 8.Genetic Programming 9.Genetic Programming + PCA

Empirical tests F1 = Rastrigin functionF2 = Schwefel function We tested 5 benchmark functions

Empirical tests F5 = Dixon & Price function F3 = Michalewicz function F4 = Sphere function

Empirical tests For each test function, we trained all regression models to approximate the given function when the number of variables is –100 variables. –500 variables. –1000 variables.

Empirical tests

Approximation Quality Sphere function for 2 variables

Empirical tests LR approximate the Sphere function after input transformation

Learn from evolution

It is clear from the heat maps that each problem has its unique characteristics. Interestingly, there is a consensus among all maps that the following operators do not contribute to the construction of good transformation procedures. – copy –copy × intercept.

Learn from evolution Also, all maps agree that the following are important across all problems. –Average Deviation –Geometric Mean –Min –Max We still do not have a full understanding of the effect of these moments on the transformed space. In future research we will focus on this aspect.

Conclusions In this work we presented: –A novel evolutionary approach to transform the high-dimensional input space of regression models using only statistical moments. –analysis to understand the impact of different statistical moments on the evolved transformation procedure. –dramatically improve LR’s generalisation and make it competitive to other state-of-the-art regression models. We hope our results will inspire other researchers to build a deeper understanding to discover relations between straight statistical momnets on making good transformation

Thank you for paying attention!