Chapter 6: Techniques for Predictive Modeling

Slides:

Advertisements

Similar presentations

Artificial Neural Networks

Advertisements

Introduction to Support Vector Machines (SVM)

Data Mining Classification: Alternative Techniques

Data Mining Classification: Alternative Techniques

Support Vector Machines

CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.

Kostas Kontogiannis E&CE

Machine Learning: Connectionist McCulloch-Pitts Neuron Perceptrons Multilayer Networks Support Vector Machines Feedback Networks Hopfield Networks.

Classification and Decision Boundaries

Machine Learning Neural Networks

Lecture 14 – Neural Networks

Decision Support Systems

RBF Neural Networks x x1 Examples inside circles 1 and 2 are of class +, examples outside both circles are of class – What NN does.

Prénom Nom Document Analysis: Artificial Neural Networks Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.

Artificial Intelligence Statistical learning methods Chapter 20, AIMA (only ANNs & SVMs)

CES 514 – Data Mining Lecture 8 classification (contd…)

Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 15: Introduction to Artificial Neural Networks Martin Russell.

Neural Networks. R & G Chapter Feed-Forward Neural Networks otherwise known as The Multi-layer Perceptron or The Back-Propagation Neural Network.

Introduction to Neural Networks Simon Durrant Quantitative Methods December 15th.

Neural Networks Chapter Feed-Forward Neural Networks.

Artificial Neural Networks (ANNs)

Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.

Radial-Basis Function Networks

CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.

July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.

Soft Computing Colloquium 2 Selection of neural network, Hybrid neural networks.

Artificial Intelligence Lecture No. 28 Dr. Asad Ali Safi Assistant Professor, Department of Computer Science, COMSATS Institute of Information Technology.

CS 8751 ML & KDDSupport Vector Machines1 Support Vector Machines (SVMs) Learning mechanism based on linear programming Chooses a separating plane based.

Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.

Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.

Chapter 9 Neural Network.

Chapter 3 Neural Network Xiu-jun GONG (Ph. D) School of Computer Science and Technology, Tianjin University

Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 14/15 – TP19 Neural Networks & SVMs Miguel Tavares.

1 Machine Learning The Perceptron. 2 Heuristic Search Knowledge Based Systems (KBS) Genetic Algorithms (GAs)

NEURAL NETWORKS FOR DATA MINING

Chapter 7 Neural Networks in Data Mining Automatic Model Building (Machine Learning) Artificial Intelligence.

Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.

1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.

Artificial Neural Networks. The Brain How do brains work? How do human brains differ from that of other animals? Can we base models of artificial intelligence.

Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.

1 Pattern Classification X. 2 Content General Method K Nearest Neighbors Decision Trees Nerual Networks.

Artificial Neural Networks An Introduction. What is a Neural Network? A human Brain A porpoise brain The brain in a living creature A computer program.

1 Introduction to Neural Networks And Their Applications.

Chapter 6: Artificial Neural Networks for Data Mining

© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 12-1 Chapter 12 Advanced Intelligent Systems.

NEURAL NETWORKS FOR DATA MINING

Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.

Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.

Supervised Machine Learning: Classification Techniques Chaleece Sandberg Chris Bradley Kyle Walsh.

Artificial Neural Networks (ANN). Artificial Neural Networks First proposed in 1940s as an attempt to simulate the human brain’s cognitive learning processes.

Artificial Intelligence CIS 342 The College of Saint Rose David Goldschmidt, Ph.D.

Chapter 6 Neural Network.

Artificial Neural Networks for Data Mining. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the.

Artificial Neural Networks for Data Mining. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the.

Neural Networks for EMC Modeling of Airplanes Vlastimil Koudelka Department of Radio Electronics FEKT BUT Metz,

1 Traffic accident analysis using machine learning paradigms Miao Chong, Ajith Abraham, Mercin Paprzycki Informatica 29, P89, 2005 Report: Hsin-Chan Tsai.

Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.

Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.

Machine Learning Artificial Neural Networks MPλ ∀ Stergiou Theodoros 1.

Chapter 11 – Neural Nets © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.

Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.

Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.

Artificial neural networks

Learning in Neural Networks

Chapter 12 Advanced Intelligent Systems

Artificial Intelligence Methods

Neuro-Computing Lecture 4 Radial Basis Function Network

Artificial Intelligence Lecture No. 28

Presentation transcript:

Chapter 6: Techniques for Predictive Modeling

Learning Objectives Understand the concept and definitions of artificial neural networks (ANN) Learn the different types of ANN architectures Know how learning happens in ANN Become familiar with ANN applications Understand the sensitivity analysis in ANN Understand the concept and structure of support vector machines (SVM) (Continued…)

Learning Objectives Learn the advantages and disadvantages of SVM compared to ANN Understand the concept and formulation of k-nearest neighbor algorithm (kNN) Learn the process of applying kNN Learn the advantages and disadvantages of kNN compared to ANN and SVM

Opening Vignette… Predictive Modeling Helps Better Understand and Manage Complex Medical Procedures Situation Problem Solution Results Answer & discuss the case questions.

Questions for the Opening Vignette Why is it important to study medical procedures? What is the value in predicting outcomes? What factors do you think are the most important in better understanding and managing healthcare? What would be the impact of predictive modeling on healthcare and medicine? Can predictive modeling replace medical or managerial personnel? What were the outcomes of the study? Who can use these results? How can they be implemented? Search the Internet to locate two additional cases in managing complex medical procedures.

Opening Vignette – A Process Map for Training and Testing Four Predictive Models

Opening Vignette The Comparison of Four Models

Neural Network Concepts Neural networks (NN): a brain metaphor for information processing Neural computing Artificial neural network (ANN) Many uses for ANN for pattern recognition, forecasting, prediction, and classification Many application areas finance, marketing, manufacturing, operations, information systems, and so on

Biological Neural Networks Two interconnected brain cells (neurons)

Processing Information in ANN A single neuron (processing element – PE) with inputs and outputs

Biology Analogy

Application Case 6.1 Neural Networks Are Helping to Save Lives in the Mining Industry Questions for Discussion How did neural networks help save lives in the mining industry? What were the challenges, the proposed solution, and the obtained results?

Elements of ANN Processing element (PE) Network architecture Hidden layers Parallel processing Network information processing Inputs Outputs Connection weights Summation function

Elements of ANN Neural Network with One Hidden Layer

Elements of ANN Summation Function for a Single Neuron (a), and Several Neurons (b)

Elements of ANN Transformation (Transfer) Function Threshold value? Linear function Sigmoid (logical activation) function [0 1] Tangent Hyperbolic function [-1 1] Threshold value?

Neural Network Architectures Architecture of a neural network is driven by the task it is intended to address Classification, regression, clustering, general optimization, association, …. Most popular architecture: Feedforward, multi-layered perceptron with backpropagation learning algorithm Used for both classification and regression type problems Others – Recurrent, self-organizing feature maps, Hopfield networks, …

Neural Network Architectures Feed-Forward Neural Networks Feed-forward MLP with 1 Hidden Layer

Neural Network Architectures Recurrent Neural Networks

Other Popular ANN Paradigms Self-Organizing Maps (SOM) First introduced by the Finnish Professor Teuvo Kohonen Applies to clustering type problems

Other Popular ANN Paradigms Hopfield Networks First introduced by John Hopfield Highly interconnected neurons Applies to solving complex computational problems (e.g., optimization problems)

Application Case 6.2 Predictive Modeling is Powering the Power Generators Questions for Discussion What are the key environmental concerns in the electric power industry? What are the main application areas for predictive modeling in the electric power industry? How was predictive modeling used to address a variety of problems in the electric power industry?

Development Process of an ANN

An MLP ANN Structure for the Box-Office Prediction Problem

Testing a Trained ANN Model Data is split into three parts Training (~60%) Validation (~20%) Testing (~20%) k-fold cross validation Less bias Time consuming

AN Learning Process A Supervised Learning Process Three-step process: 1. Compute temporary outputs. 2. Compare outputs with desired targets. 3. Adjust the weights and repeat the process.

Backpropagation Learning Backpropagation of Error for a Single Neuron

Backpropagation Learning The learning algorithm procedure Initialize weights with random values and set other network parameters Read in the inputs and the desired outputs Compute the actual output (by working forward through the layers) Compute the error (difference between the actual and desired output) Change the weights by working backward through the hidden layers Repeat steps 2-5 until weights stabilize

Illuminating The Black Box Sensitivity Analysis on ANN A common criticism for ANN: The lack of transparency/explainability The black-box syndrome! Answer: sensitivity analysis Conducted on a trained ANN The inputs are perturbed while the relative change on the output is measured/recorded Results illustrate the relative importance of input variables

Sensitivity Analysis on ANN Models For a good example, see Application Case 6.3 Sensitivity analysis reveals the most important injury severity factors in traffic accidents

Application Case 6.3 Sensitivity Analysis Reveals Injury Severity Factors in Traffic Accidents Questions for Discussion How does sensitivity analysis shed light on the black box (i.e., neural networks)? Why would someone choose to use a blackbox tool like neural networks over theoretically sound, mostly transparent statistical tools like logistic regression? In this case, how did NNs and sensitivity analysis help identify injury-severity factors in traffic accidents?

Support Vector Machines (SVM) SVM are among the most popular machine-learning techniques. SVM belong to the family of generalized linear models… (capable of representing non-linear relationships in a linear fashion). SVM achieve a classification or regression decision based on the value of the linear combination of input features. Because of their architectural similarities, SVM are also closely associated with ANN.

Support Vector Machines (SVM) Goal of SVM: to generate mathematical functions that map input variables to desired outputs for classification or regression type prediction problems. First, SVM uses nonlinear kernel functions to transform non-linear relationships among the variables into linearly separable feature spaces. Then, the maximum-margin hyperplanes are constructed to optimally separate different classes from each other based on the training dataset. SVM has solid mathematical foundation!

Support Vector Machines (SVM) A hyperplane is a geometric concept used to describe the separation surface between different classes of things. In SVM, two parallel hyperplanes are constructed on each side of the separation space with the aim of maximizing the distance between them. A kernel function in SVM uses the kernel trick (a method for using a linear classifier algorithm to solve a nonlinear problem) The most commonly used kernel function is the radial basis function (RBF).

Support Vector Machines (SVM) Many linear classifiers (hyperplanes) may separate the data

Application Case 6.4 Managing Student Retention with Predictive Modeling Questions for Discussion Why is attrition one of the most important issues in higher education? How can predictive analytics (ANN, SVM, and so forth) be used to better manage student retention? What are the main challenges and potential solutions to the use of analytics in retention management?

Application Case 6.4 Managing Student Retention with Predictive Modeling

How Does an SVM Work? Following a machine-learning process, an SVM learns from the historic cases. The Process of Building SVM 1. Preprocess the data Scrub and transform the data. 2. Develop the model. Select the kernel type (RBF is often a natural choice). Determine the kernel parameters for the selected kernel type. If the results are satisfactory, finalize the model; otherwise change the kernel type and/or kernel parameters to achieve the desired accuracy level. 3. Extract and deploy the model.

The Process of Building an SVM

SVM Applications SVMs are the most widely used kernel-learning algorithms for wide range of classification and regression problems SVMs represent the state-of-the-art by virtue of their excellent generalization performance, superior prediction power, ease of use, and rigorous theoretical foundation Most comparative studies show its superiority in both regression and classification type prediction problems. SVM versus ANN?

k-Nearest Neighbor Method (k-NN) ANNs and SVMs  time-demanding, computationally intensive iterative derivations k-NN is a simplistic and logical prediction method, that produces very competitive results k-NN is a prediction method for classification as well as regression types (similar to ANN & SVM) k-NN is a type of instance-based learning (or lazy learning) – most of the work takes place at the time of prediction (not at modeling) k : the number of neighbors used

k-Nearest Neighbor Method (k-NN) The answer depends on the value of k

The Process of k-NN Method

k-NN Model Parameter Similarity Measure: The Distance Metric Numeric versus nominal values?

k-NN Model Parameter Number of Neighbors (the value of k) The best value depends on the data Larger values reduce the effect of noise but also make boundaries between classes less distinct An “optimal” value can be found heuristically Cross Validation is often used to determine the best value for k and the distance measure

Application Case 6.5 Efficient Image Recognition and Categorization with kNN Questions for Discussion Why is image recognition/classification a worthy but difficult problem? How can k-NN be effectively used for image recognition/classification applications?

End of the Chapter Questions, comments

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America.