Data Mining Tools Overview Business Intelligence for Managers.

Slides:



Advertisements
Similar presentations
CS 478 – Tools for Machine Learning and Data Mining Clustering: Distance-based Approaches.
Advertisements

Naïve-Bayes Classifiers Business Intelligence for Managers.
1 Machine Learning: Lecture 10 Unsupervised Learning (Based on Chapter 9 of Nilsson, N., Introduction to Machine Learning, 1996)
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Applications of Data Mining in Microarray Data Analysis Yen-Jen Oyang Dept. of Computer Science and Information Engineering.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Enterprise systems infrastructure and architecture DT211 4
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Evaluating Performance for Data Mining Techniques
Supervised Learning and k Nearest Neighbors Business Intelligence for Managers.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
Chapter 9 Business Intelligence and Information Systems for Decision Making.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Basic Data Mining Technique
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Chapter 8 The k-Means Algorithm and Genetic Algorithm.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 2 Data Mining: A Closer Look Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
Data Mining By Dave Maung.
Pattern Recognition April 19, 2007 Suggested Reading: Horn Chapter 14.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Part II Tools for Knowledge Discovery Ch 5. Knowledge Discovery in Databases Ch 6. The Data Warehouse Ch 7. Formal Evaluation Technique.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
An Investigation of Commercial Data Mining Presented by Emily Davis Supervisor: John Ebden.
Data Science and Big Data Analytics Chap 4: Advanced Analytical Theory and Methods: Clustering Charles Tappert Seidenberg School of CSIS, Pace University.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 4 An Excel-based Data Mining Tool (iData Analyzer) Jason C. H. Chen, Ph.D. Professor of MIS.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
1 Unsupervised Learning and Clustering Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
Classification as data mining tool Classification as data mining tool Done by William Hellela William Hellela Rauf Gadar Alex Prewett.
Chapter 3 Data Mining: Classification & Association Chapter 4 in the text box Section: 4.3 (4.3.1),
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Department of Computer Science Sir Syed University of Engineering & Technology, Karachi-Pakistan. Presentation Title: DATA MINING Submitted By.
Data Mining Concept Submitted TO: Mrs. MONIKA SUBMITTED BY: SHALU 4717.
Big data classification using neural network
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning with Spark MLlib
Fuzzy Logic in Pattern Recognition
Semi-Supervised Clustering
By Arijit Chatterjee Dr
Chapter 6 Classification and Prediction
What is Pattern Recognition?
Machine Learning Week 1.
Data Analysis.
Prepared by: Mahmoud Rafeek Al-Farra
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
MIS2502: Data Analytics Clustering and Segmentation
MIS2502: Data Analytics Clustering and Segmentation
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
©Jiawei Han and Micheline Kamber
Unsupervised Learning: Clustering
Pattern Recognition and Training
Presentation transcript:

Data Mining Tools Overview Business Intelligence for Managers

Data Mining Definition Revisited Analysis of large quantities of data Knowledge discovery in databases Extracting implicit, previously unknown information from large volumes of raw data

Instances and Features Typically, the database will be a collection of instances Each instance will have values for a given set of features From database theory: instances are rows, features are columns

Classification Supervised learning Suppose instances have been categorized into classes and the database includes this categorization Goal: using the “knowledge” in the database, classify a given instance

Classifiers Classifier… feature values category X1 X2 X3 Xn Y DB collection of instances with known categories

Classifier intelligence A classifier’s intelligence will be based on a dataset consisting of instances with known categories Typical goal of a classifier: predict the category of a new instance that is rationally consistent with the dataset

BI Examples A loans officer in a bank uses a system that automatically approves or disapproves a loan application based on previous loan applications and decisions An admissions officer in a university uses a system that automatically makes an admission decision (accept, reject, wait-list), based on previous applicants’ data and decisions made on them

Data mining method example: k - nearest neighbors For a given instance T, get the top k database instances that are “nearest” to T Select a reasonable distance measure Inspect the category of these k instances, choose the category C that represent the most instances Conclude that T belongs to category C

Clustering (Chapter 5 of text) Unsupervised learning Classes/categories are not known, but unexpected groupings (clusters) are discovered Clustering provides insight into the population segments

Clustering Feature 1 Feature 2

Goal of Clustering Input: the database of instances, and possibly some predetermined number of clusters Output: the same database of instances partitioned into clusters

BI Examples After clustering the current university student population, it was discovered that there is a large group of female marketing majors coming from a particular exclusive school who tend to get high grades business response: focus recruitment on that school; push the university’s marketing program Customer segment characteristics and spending patterns can direct business strategies

Data mining method example: k-means Guess the number of clusters (k) Guess cluster centers from the samples (these will be called centroids) Determine cluster membership based on the distance from the centroids Repeatedly refine the centroids by getting the average (mean) of the members of each cluster

Summary Two sub-areas of data mining: supervised (classification) and unsupervised (clustering) learning methods For both types of methods, intelligent systems can be created to support business decision making