Data Mining and Data Warehousing – a connected view.

Slides:



Advertisements
Similar presentations
Supporting End-User Access
Advertisements

By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Data warehouse example
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
DATA WAREHOUSING.
Chapter 15 Data Warehousing, OLAP, and Data Mining
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Introduction to Data Warehousing Enrico Franconi CS 636.
Chapter 13 The Data Warehouse
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Data Mining: A Closer Look
Business Intelligence
Chapter 13 – Data Warehousing. Databases  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information,
Data Warehousing Alex Ostrovsky CS157B Spring 2007.
Defining Data Warehouse Concepts and Terminology.
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
XP Information Information is everywhere in an organization Employees must be able to obtain and analyze the many different levels, formats, and granularities.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Data Management for Decision Support Session-2 Prof. Bharat Bhasker.
Enhancing Management Decision Making Week-10 Prof. Bharat Bhasker.
Data Warehouse & Data Mining
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Fox MIS Spring 2011 Data Warehouse Week 8 Introduction of Data Warehouse Multidimensional Analysis: OLAP.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Advanced Database Concepts
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Advanced Applied IT for Business 2
MIS2502: Data Analytics Advanced Analytics - Introduction
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
Chapter 13 – Data Warehousing
Adrian Tuhtan CS157A Section1
MANAGING DATA RESOURCES
Data Warehouse and OLAP
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Supporting End-User Access
Introduction of Week 9 Return assignment 5-2
The Database Environment
Data Warehousing Data Mining Privacy
Data Warehousing Concepts
Data Warehouse and OLAP
Data Warehouse and OLAP Technology
Presentation transcript:

Data Mining and Data Warehousing – a connected view

Introduction Data mining describes a collection of techniques that aim to find useful but undiscovered patterns in collected data The goal of data mining is to create models for decision-making that predict future behavior based on analysis of past activity

Introduction Data warehousing is a blend of technologies aimed at the effective integration of operational databases into an environment that enables the strategic use of data. These technologies include relational and multidimensional database management systems, client/server architecture, metadata modeling and repositories, graphical user interfaces, and much more.

Operational vs Informational Databases

Table 2-1 Operational Versus informational Databases

Operational vs Informational Databases

Table 2-2 Comparison of Data Stores, and Data Warehouses

Definition and characteristics of a data warehouse It’s a database designed for analytical tasks It supports a relatively small number of users Its usage is read-intensive Its content is periodically updated (mostly additions) It contains current and historical dta It contains a few large tables Each query frequently results in a large result set and involves frequent full table scan and multi-table joins A formal definition of the data warehouse is offered by W.H. Inmon –A data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management decisions

Data warehouse architecture

Figure 2-1 Data Warehouse Environment

Data warehouse architecture

Figure 2-1 Data Warehouse and Data Operational Data Store

Data warehouse architecture

Figure 2-3 Two-tiered Data WarehouseArchitecture

Data warehouse architecture

Figure 2-4 Multi-tiered Data WarehouseArchitecture

Data mining defined Data mining as the process of discovering meaningful new correlations, patterns, and trends by digging into (mining) large amounts of data stored in warehouse. The major attraction of data mining is its capability to build predictive rather than retrospective models

Predictive versus Retrospective Models

Table 2-3 Predictive Versus Retrospective Models

Data Mining application Domain Customer retention Sales and customer service Marketing Risk Assessment and Fraud Detection

Data Mining Categories and Research Focus Data mining techniques deal with discovery and learning, and as such fall into three major learning modes: supervised, unsupervised, and reinforcement learning Data mining techniques can be categorized: –Representation of models and results –The type of data the techniques operates on –Application type –Pattern attributes

Data Mining Categories and Research Focus Data mining categorized by business problems –Retrospective Analysis –Predictive Analysis These two classes of business problems can be further classified by –Classification –Clustering/Segmentation –Associations –Sequencing

Data Mining Categories and Research Focus Approaches that underlie the most contemporary research in data mining: –The induction approach –The database querying approach –The compression approach –The approach of approximation and searching