Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 12 Discovering New Knowledge – Data Mining.

Slides:



Advertisements
Similar presentations
Decision Tree Approach in Data Mining
Advertisements

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 7 Technologies to Manage Knowledge: Artificial Intelligence.
Machine Learning Neural Networks
Data Mining: A Closer Look Chapter Data Mining Strategies.
Decision Support Systems
Connectionist models. Connectionist Models Motivated by Brain rather than Mind –A large number of very simple processing elements –A large number of weighted.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
An overview of The IBM Intelligent Miner for Data By: Neeraja Rudrabhatla 11/04/1999.
Neural Networks. R & G Chapter Feed-Forward Neural Networks otherwise known as The Multi-layer Perceptron or The Back-Propagation Neural Network.
Basic Data Mining Techniques
© 2002 by Prentice Hall 1 SI 654 Database Application Design Winter 2003 Dragomir R. Radev.
Neural Networks Chapter Feed-Forward Neural Networks.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
Classification and Prediction by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Gini Index (IBM IntelligentMiner)
Neural Networks in Data Mining “An Overview”
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Chapter 5 Data mining : A Closer Look.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Chapter 13 Genetic Algorithms. 2 Data Mining Techniques So Far… Chapter 5 – Statistics Chapter 6 – Decision Trees Chapter 7 – Neural Networks Chapter.
INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.
Inductive learning Simplest form: learn a function from examples
Introduction To Data Mining. What Is Data Mining? A toolA tool Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful)
Artificial Neural Networks (ANN). Output Y is 1 if at least two of the three inputs are equal to 1.
Appendix: The WEKA Data Mining Software
© N. Kasabov Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering, MIT Press, 1996 INFO331 Machine learning. Neural networks. Supervised.
Chapter 9 Neural Network.
Chapter 3 Neural Network Xiu-jun GONG (Ph. D) School of Computer Science and Technology, Tianjin University
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 9 Using Past History Explicitly as Knowledge: Case-based Reasoning.
Data Mining Overview. Lecture Objectives After this lecture, you should be able to: 1.Explain key data mining tasks in your own words. 2.Draw an overview.
Appendix B: An Example of Back-propagation algorithm
Chapter 7 Neural Networks in Data Mining Automatic Model Building (Machine Learning) Artificial Intelligence.
 Diagram of a Neuron  The Simple Perceptron  Multilayer Neural Network  What is Hidden Layer?  Why do we Need a Hidden Layer?  How do Multilayer.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 2 Data Mining: A Closer Look Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
L6. Learning Systems in Java. Necessity of Learning No Prior Knowledge about all of the situations. Being able to adapt to changes in the environment.
What is Data Mining? process of finding correlations or patterns among dozens of fields in large relational databases process of finding correlations or.
CSE 5331/7331 F'07© Prentice Hall1 CSE 5331/7331 Fall 2007 Machine Learning Margaret H. Dunham Department of Computer Science and Engineering Southern.
Predictive Modeling Spring 2005 CAMAR meeting Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.
MIS2502: Data Analytics Advanced Analytics - Introduction.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
Chapter 8: Adaptive Networks
Data Mining and Decision Support
Each neuron has a threshold value Each neuron has weighted inputs from other neurons The input signals form a weighted sum If the activation level exceeds.
Data Mining Copyright KEYSOFT Solutions.
Artificial Neural Networks for Data Mining. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Knowledge Elicitation – Converting Tacit Knowledge to Explicit
MIS2502: Data Analytics Advanced Analytics - Introduction
Artificial neural networks
DATA MINING © Prentice Hall.
Prepared by: Mahmoud Rafeek Al-Farra
Preserving and Applying Human Expertise: Knowledge-Based Systems
CSE P573 Applications of Artificial Intelligence Neural Networks
CSE 573 Introduction to Artificial Intelligence Neural Networks
The Computer as a Medium for Sharing Knowledge
Data Mining Overview.
What is Artificial Intelligence?
Presentation transcript:

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 12 Discovering New Knowledge – Data Mining

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter Objectives Introduce the student to the concept of Data Mining.  How it is different from knowledge elicitation from experts  How it is different from extracting existing knowledge from databases. The objectives of data mining  Explanation of past events (descriptive DM)  Prediction of future events (predictive DM) (continued)

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter Objectives (cont.) Introduce the student to the different classes of methods available for DM  Symbolic (induction)  Connectionist (neural networks)  Statistical Introduce the student to the details of some of the methods described in the chapter.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Introduction of chapter contents

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Defines the concept of data mining and the reasons for performing DM studies Defines the objectives of data mining  Descriptive DM  Predictive DM Introduces the three basic approaches to DM  Symbolic (induction)  Connectionist (neural networks)  Statistical (curve-fitting, others)

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Present a detailed description of the symbolic approach to data mining - rule induction Present the main algorithm for rule induction - C5.0 and its ancestors, ID3 and CLS Present several example applications of rule induction Present other alternate algorithms for rule induction  CART  CHAID

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Provide a detailed description of the connectionist approach to data mining - neural networks Present the basic neural network architecture - the multi-layer feed forward neural network Present the main supervised learning algorithm - backpropagation Present the main unsupervised neural network architecture - the Kohonen network

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Provide a detailed description of the most important statistical methods for data mining  Curve fitting with least squares method  Multi-variate correlation  K-Means clustering  Market Basket analysis  Discriminant analysis  Logistic regression

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Provide useful guidelines for determining what technique to use for specific problems

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Discuss the importance of errors in data mining studies Define the types of errors possible in data mining studies

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Section Objectives Summarize the chapter Provide Key terms Provide Review Questions Provide Review Exercises

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.1 Is the stock’s price/earning s ratio > 5? Has the company’s quarterly profit increased over the last year by 10% or more? Don’ t buy No Yes Root node Is the company’s management stable? Leaf node Yes Don’ t buy No Buy Don’ t Buy Yes No

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.2 DS3 - Not Enjoyable DS2 - Not Enjoyable = Sunny = Cloudy = Rainy Rain Outlook {DS1, DS2, DS3, DS4} DS1 – Enjoyable DS4 - Not Enjoyable

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.3 None DS3 - Not Enjoyable DS2 - Not Enjoyable = Sunny = Cloudy= Rain Rain Outlook Temperature DS1 – Enjoyable DS4 – Not Enjoyable = Cold = Hot = Mild DS1 - Enjoyable DS4 – Not Enjoyable {DS1, DS2, DS3, DS4}

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.4 MilliExpert ThoughtGen OffSite Language Genie XS SilverWorks XS Lis p C++ Java

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.5 Language MilliExpert ThoughtGen OffSite Backward Forward Backward C++ Java MilliExpert ThoughtGen OffSite Genie XS SilverWorks XS Lis p OffSiteXSGinie XS SilverWorks Forward Backwards

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.6 MilliExpert ThoughtGen OffSite Backward Forward Backward C++ Java MilliExpert ThoughtGen OffSite Language Genie XS SilverWorks XS Lis p OffSite XSGenie XSSilverWorks Forward Backward MilliExpert ThoughtGen OffSite SpreadsheetXL ASCII Devices dBase

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.7 = Dry= Humid DS2 – Not Enjoyable DS3 – Not Enjoyable DS4 – Not Enjoyable {DS1, DS2, DS3, DS4} Humidity DS1 - Enjoyable

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.8 x2x2 x1x1 xkxk Activation function f() Inputs y  W1W1 W2W2 WnWn

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.9 Threshold function Piece-wise Linear function Sigmoid function 1.0

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure Inputs Outputs

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure 12.11

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure Variable A Variable B Cluster #1 Cluster #2

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure Inputs WiWi

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure x y

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Figure x y Best fitting equation x

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.1

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.2

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.3

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.3 (cont.)

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.4

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.5

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.5 (cont.)

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.6

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Table 12.7

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Conclusions The student should be able to use:  The C5.0 algorithm to capture rules from examples.  Basic feedforward neural networks with supervised learning.  Unsupervised learning, clustering techniques and the Kohonen networks.  Curve-fitting algorithms.  Statistical methods for clustering.  Other statistical techniques.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 12 Discovering New Knowledge – Data Mining