Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.

Slides:



Advertisements
Similar presentations
Data Mining: What? WHY? HOW?
Advertisements

Data Mining in Computer Games By Adib Adam Hussain & Mohammed Sarfraz.
1 Chapter 34 Data Mining Transparencies © Pearson Education Limited 1995, 2005.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Data Mining Concepts 1.1 COT5230 Data Mining Week 1 Data Mining Concepts M O N A S H A U S T R A L I A ’ S I N T E R N A T I O N A L U N I V E R S I T.
Data Mining: A Closer Look
Chapter 35 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Warehousing by Industry Chapter 4 e-Data. Retail Data warehousing’s early adopters Capturing data from their POS systems  POS = point-of-sale Industry.
Dr. Awad Khalil Computer Science Department AUC
Chapter 5: Data Mining for Business Intelligence
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Succeeding with Technology Database Systems Basic Data Management Concepts Organizing Data in a Database Database Management Systems Using Database Systems.
AI Week 14 Machine Learning: Introduction to Data Mining Lee McCluskey, room 3/10
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Data Mining By : Tung, Sze Ming ( Leo ) CS 157B. Definition A class of database application that analyze data in a database using tools which look for.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Methodology Qiang Yang, MTM521 Material. A High-level Process View for Data Mining 1. Develop an understanding of application, set goals, lay down all.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Part I Data Mining Fundamentals Chapter 1 Data Mining: A First View Jason C. H. Chen, Ph.D. Professor.
Database Principles. Basics A database is a collection of data, along with the relationships between the data The data has to be entered into a structure,
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
MIS2502: Data Analytics Advanced Analytics - Introduction.
Data Mining and Decision Support
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining Copyright KEYSOFT Solutions.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Knowledge Discovery and Data Mining 19 th Meeting Course Name: Business Intelligence Year: 2009.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Data Mining Transparencies
Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Data and Applications Security Introduction to Data Mining
Sangeeta Devadiga CS 157B, Spring 2007
Week 11 Knowledge Discovery Systems & Data Mining :
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Warehousing Data Mining Privacy
Data Mining The process of extracting valid, previously unknown, comprehensible, and actionable information from large databases and using it to make.
CSE591: Data Mining by H. Liu
Presentation transcript:

Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information so derived to make crucial business and strategic decision. To discover meaningful patterns and rules.

Data Warehouse to Data Mining D.W.H, being a subject-oriented, integrated, time variant and non-volatile collection of data aims at supporting business decision, whereas data mining process is a natural and logical continuation of D.W.H D.M needs accurate, consistent and good quality data as the model mined are to be consistent and accurate. D.W.H is collection of all aspects D.M can give more relations, uncovered patterns and rules it mined from different and more data sources D.W.H Query directions are made easy in D.W.H

Steps of Data Mining 1.Identifying the Data – Data could be all over the world not just in the enterprise. data could distributed (paper, people heads etc..) 2.Getting the data ready – (Getting the data ready) – It is to put in data in right format in DB are to be built in the system 3.Mining the data – After right data is known, its cleaned, scrubbed and remove unnecessary items and get only data essential for mining. 4.Getting useful results – After mining, What outcomes do we want? Do we want the tools to find interesting patterns? Are these tools available or do we build the tools? 5.Identifying action – After having determined the data and tools, we start the tools to operate the data. It can produce lots of data so we know what to do with patterns?

6.Implementing the actions – After getting useful results? Examine results and identify actions that can be taken eg analyzing pattern items go together and put together 7.Evaluating the benefits – After actions implemented, we wait to save results. These results may immediately or may take longtime. Once we are in position to determine the benefits and costs, re- evaluate the procedure. By that the data may have changed. New tools may be recommended. So plan the next mining cycle and find how to go about it 8.Determining what to do next 9.Carrying out the next cycle

Outcomes of Data Mining There are six activities in data mining which are known as data mining tasks or types 1.Classification 2.Estimation 3.Prediction 4.Affinity grouping or Association rules 5.Clustering 6.Description and Visualization

Classification Classification consists of examining the features of a newly presented object and assigning to it a predefined class the common features are extracted. Classification is carried out by developing training sets with pre-classified examples and then building a model that fits the description of the classes. In classification, a group of entities is partitioned based on a predefined value of some attributes. Classification deals with discrete outcomes yes or no, debit card or car loan.

Estimation Based on the spending patterns of a person and his age one can estimate his salary or the number of children he has. Estimation deals with continuously valued outcome. Classification and estimation are used together.

Prediction This task predicts the future behaviour of some values. For example based on the education of a person, his current job and trends in the industry one can predict that his salary will be a certain amount be year Predictive task feel different because the records are classifed according to some predicted future behaviour or estimate future value. With prediction the only way to check the accuracy of the classification is to wait and see. Historical data is used to build a model that current observed behavior, when applied to current input it can predict future behavior.

Affinity grouping or Association rules The task of affinity grouping is to determine is to determine which things go together. This determines the items that go together eg who are the people that travel together? What are the items that are purchased together? Affinity grouping can also be used to identify cross selling opportunity and to design attractive packages or grouping of product and services.

Clustering Clustering is a DM task that is often confused with classification. Clustering are formed by analyzing data.For eg group X prefer zen, Y prefer ford icon, z prefer nano. Once cluster are obtained then each cluster can be examined and mined future for other outcomes such as estimation and classification. Clustering ids often done as a prelude to some other form of data mining or modeling. Clustering might be the first step in a market segment efforts

Description and Visualization For eg john usually goes shopping after he goes to the bank, but last week he went to church after shopping Anomaly detection is a form of deviation detection and is used for applications such as fraud detection and medical illness detection.