Ghada H. El-Khawaga Marwa M. El-Sadeeq 2007.  What is data mining ?  Why data mining?  Data mining types  Data mining tasks  Knowledge discovery.

Slides:



Advertisements
Similar presentations
Supporting End-User Access
Advertisements

SAS solutions SAS ottawa platform user society nov 20th 2014.
Databases and Warehouses
Developing A Strategy For The Internet Age The Five Forces Model
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining By Archana Ketkar.
Building Knowledge-Driven DSS and Mining Data
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Data mining By Aung Oo.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Data Mining: A Closer Look
Data Mining.
Data Mining: An Introduction Wing Kee Ho Xiaohua Luan.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Data Mining Techniques
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Chapter 1 Introduction to Data Mining
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Mining: Software Helping Business Run
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
10-1 Identify the changes taking place in the form and use of decision support in business Identify the role and reporting alternatives of management information.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Big Data – Big Opportunity Mohammad Khansari ITRC President Jan 2015 ITRC, Tehran, Iran.
Why BI….? Most companies collect a large amount of data from their business operations. To keep track of that information, a business and would need to.
CS507 Information Systems. Lesson # 11 Online Analytical Processing.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Intro to MIS – MGS351 Databases and Data Warehouses
Data Mining.
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Marketing Research Introduction Overview.
Data and Applications Security Introduction to Data Mining
Databases and Data Warehouses Chapter 3
MIS5101: Data Analytics Advanced Analytics - Introduction
Data Warehousing and Data Mining
Supporting End-User Access
Data Mining: Introduction
MIS2502: Data Analytics Introduction to Advanced Analytics
Welcome! Knowledge Discovery and Data Mining
MIS2502: Data Analytics Introduction to Advanced Analytics and R
CSE591: Data Mining by H. Liu
Presentation transcript:

Ghada H. El-Khawaga Marwa M. El-Sadeeq 2007

 What is data mining ?  Why data mining?  Data mining types  Data mining tasks  Knowledge discovery in databases (KDD) processes  Data mining processes  Data mining techniques  Data mining and Data warehousing  Data Mining System Components  Data Mining Applications  Data Mining Tools

 Non-trivial extraction of implicit, previously unknown and potentially useful information from data.  A step in the knowledge discovery process consisting of particular algorithms (methods) that under some acceptable objective, produces a particular enumeration of patterns (models) over the data.

 Data volumes are too large for classical analysis approaches: Large number of records High dimensional data  Leverage organization’s data assets Only a small portion of the collected data is ever analyzed Data that may never be analyzed continues to be collected, at a great expense, out of fear that something which may prove important in the future is missing.  As databases grow, the ability to support the decision support process using traditional query languages becomes infeasible Query formulation problem

 Predictive data mining: which produces the model of the system described by the given data. It uses some variables or fields in the data set to predict unknown or future values of other variables of interest.  Descriptive data mining: which produces new, nontrivial information based on the available data set. It focuses on finding patterns describing the data that can be interpreted by humans.

 Data processing [descriptive]  Prediction [predictive]  Regression [predictive]  Clustering [descriptive]  Classification [predictive]  Link analysis/ associations [descriptive]  Evolution and deviation analysis [predictive]

 Statistical methods  Case-based reasoning  Neural networks  Decision trees

Data warehousing + data mining = increased performance of decision making process + knowledgeable decision makers  SQL Vs. Data mining Vs. OLAP

 Data Mining For Financial Data Analysis  Data Mining For Telecommunications Industry  Data Mining For The Retail Industry  Data Mining In Healthcare and Biomedical Research  Data Mining In Science and Engineering

The Function of the data mining system is to assign scores to various profiles.  Data Mart  Data Mining System(Processing)  Operational Data Store  Scoring Software  Reporting System

 Data Mining For Financial Data Analysis In Banking Industry data mining is used : 1- in the predicting credit fraud 2- in evaluation risk 3- in performing trend analysis 4- in analyzing profitability 5- in helping with direct marketing campaigns In financial markets and neural networks data mining is used : 1- forecasting stock prices 2- forecasting commodity-price prediction 3- forecasting financial disasters

 Data Mining For Telecommunications Industry - Answering some strategic questions through data-mining applications such as: 1-How does one retain customers and keep them loyal as competitors offer special offers and reduced rates? 2-When is a high-risk investment, such as new fiber optic lines, acceptable? 3-How does one predict whether customers will buy additional products like cellular services, call waiting, or basic services? 4-What characteristics differentiate our products from those of our competitors?

 Data Mining For The Retail Industry - The retail industry is a major application area for data mining since it collects huge amounts of data on sales, customer-shopping history, goods transportation, consumption patterns, and service records. -Retailers are interested in creating data-mining models to answer questions such as: 1- What are the best types of advertisements to reach certain segments of customers? 2- What is the optimal timing at which to send mailers? 3- What types of products can be sold together? 4- How does one retain profitable customers? 5- What are the significant customer segments that buy products?

 Data Mining In Healthcare and Biomedical Research - Storing patients' records in electronic format and the development in medical-information systems cause a large amount of clinical data to be available online. Regularities, and surprising events extracted from these data by data-mining methods are important in assisting clinicians to make informed decisions, thereby improving health services. - data mining has been used in many successful medical applications, including data validation in intensive care, the monitoring of children's growth, analysis of diabetic patient's data, the monitoring of heart-transplant patients.

 Data Mining In Science and Engineering - a few important cases of data-mine applications in engineering problems. Pavilion Technologies' Process Insights, an application-development tool that combines neural networks, fuzzy logic, and statistical methods was used to develop chemical manufacturing and control applications to reduce waste, improve product quality, and increase plant throughput.

 Data Mind  Agent Base/Marketer  DB Miner  Decision Series  IBM Intelligent Miner  Data Mining Suite  Darwin (now part of Oracle)  Business Miner  Data Engine

 Agent Base/Marketer It is based on emerging intelligent-agent technology. It can access data from all major sources, and it runs on Windows95, Windows NT, and the Solaris operating system.  Business Miner It is a single-strategy, easy-to-use tool based on decision trees. It can access data from multiple sources including Oracle, Sybase, SQL Server, and Teradata. It runs on all Windows platforms  Data Engine It is a multiple-strategy data-mining tool for data modeling, combining conventional data-analysis methods with fuzzy technology, neural networks, and advanced statistical techniques. It works on the Windows platform.

 Difficult to use  Needs Expert to run the tool  Difficult to add new functionality  Difficult to interface  Short lifetime  Limited Number of algorithms  Need lot of resources

 Data Mining: Concepts, Models, Methods, and Algorithms, Mehmed Kantardzic, ISBN: , John Wiley & Sons ©  Privacy data mining report, DHS privacy office,2005.  Building Data Mining Solutions with OLE DB for DM and XML for Analysis, Zhaohui Tang, Jamie Maclennan, Peter Pyungchul Kim, SIGMOD Record, Vol. 34, No. 2, June 2005