DATA MINING.

Slides:



Advertisements
Similar presentations
Renata Benda Prokeinova Department of Statistics and Operation Research FEM SUA in Nitra.
Advertisements

DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Chapter 6 Database Design
DATA WAREHOUSING.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Operational Data Tools Chapter Eight. Copyright © Houghton Mifflin Company. All rights reserved.8–28–2 Chapter Eight Learning Objectives To learn database.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
Data Warehouse & Data Mining
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Database : collection of information. data management tool. huge volumes. like a filing system. providing answers.
Week 4 Lecture Part 3 of 3 Database Design Samuel ConnSamuel Conn, Faculty Suggestions for using the Lecture Slides.
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
1 1 Slide Introduction to Data Mining and Business Intelligence.
ASSOCIATION RULE DISCOVERY (MARKET BASKET-ANALYSIS) MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Fall CIS 764 Database Systems Design L18.3 Business Intelligence Aspects (aka Decision support systems) (Slides support.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Data Mining By Dave Maung.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
EXAM REVIEW MIS2502 Data Analytics. Exam What Tool to Use? Evaluating Decision Trees Association Rules Clustering.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Dr. Hammad Majeed Areas of Interest o Artificial Intelligence and Machine Learning Evolutionary Algorithms Data Mining Image Processing.
CISB113 Fundamentals of Information Systems Data Management.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Data Mining Basics. “Copyright and Terms of Service Copyright © Texas Education Agency. The materials found on this website are copyrighted © and trademarked.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
Preparing for the Future with Decision Support Systems Copyright © 2001 by Harcourt, Inc. All rights reserved.
 Data Mining Mickey Schaefer, CFO Sutton County Hospital District.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
IST 210 Database Design Process IST 210, Section 1 Todd S. Bacastow January 2004.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
A Decision Tree Approach to Cube Construction Patrick Kelly.
Department of Computer Science Sir Syed University of Engineering & Technology, Karachi-Pakistan. Presentation Title: DATA MINING Submitted By.
KNOWLEDGE DISCOVERY & DATA MINING Abhishek M. Mehta ROLL NO:24.
Supplemental Chapter: Business Intelligence Information Systems Development.
Pengantar Sistem Informasi
Introduction BIM Data Mining.
Transaction Processing System (TPS)
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
MIS2502: Data Analytics Advanced Analytics - Introduction
Chapter 6 Database Design
Data Warehousing, Access, Analysis, Mining, and Visualization
MIS5101: Extract, Transform, Load (ETL)
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
ACS1803 Lecture Outline 2   DATA MANAGEMENT CONCEPTS Text, Ch. 3
Data Mining Basics Image Source:
المحاضرة 4 : مستودعات البيانات (Data warehouse)
MIS5101: Extract, Transform, Load (ETL)
Transaction Processing System (TPS)
Data Mining Association Rules Assoc.Prof.Songül Varlı Albayrak
MIS5101: Extract, Transform, Load (ETL)
Information Technology for Management
Transaction Processing System (TPS)
MIS2502: Data Analytics Association Rule Mining
Data Warehousing Concepts
Data Mining.
Data Warehouse and OLAP Technology
Presentation transcript:

DATA MINING

What is Data Mining? process that uses various tools to discover patterns and relationships in data that may be used to make valid predictions.

DATA MINING v/s DATA WAREHOUSING main repository of an organizations historical data. Raw material for DSS. Where as Data Mining Process which is done on the data from data warehouse.

Basic Structure

Example Pattern discovery of Midwest Grocery revealed. Men bought diapers on Thursdays and Saturdays, they also tended to buy beer. Weekly grocery shopping on Saturdays. On Thursdays they only bought a few items. Conclusion - they purchased the beer to have it available for the upcoming weekend.

Elements of Data Mining Extract, transform, and load transaction data onto the data warehouse system. Store and manage the data in a database system. Provide data access to business analysts and information technology professionals. Analyze the data by application software. Present the data in a useful format, such as a graph or table.

The Basic Steps of Data Mining 1. Define business problem 2. Build data mining database 3. Explore data 4. Prepare data for modeling 5. Build model 6. Evaluate model 7. Deploy model and results

Step 1 - Define business problem Understand data. Identify the problem. Define objective.

Step 2 - Build data mining database Maximum time requiring step. Data to be mined on database. Don’t use corporate database.

Step 3 - Explore data Identify Use good interface and fast computer responses.

Step 4 - Prepare data for modeling Final data preparation step a. Select variables b. Select rows c. Construct new variables d. Transform variables

Step 5 - Build model Iterative process Explore various alternatives to come up with the one that suits the business.

Step 6 - Evaluate model Feasibility analysis of the model is done.

Step 7 - Deploy model and results Two ways to use the model Recommend actions based on simply viewing the results. Apply the model to different data sets.

What it cannot do? Its not a magic wand. It only tells the pattern but not the value. Pattern may not have cause and effect.

THANK YOU