Data Warehousing/Mining 1 Data Warehousing/Mining Introduction.

Slides:



Advertisements
Similar presentations
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Advertisements

CPSC-608 Database Systems Fall 2011 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes #15.
Data Management for Decision Support Session - 1 Prof. Bharat Bhasker.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
CSE6011 Data Warehouse and OLAP  Why data warehouse  What’s data warehouse  What’s multi-dimensional data model  What’s difference between OLAP and.
DATA WAREHOUSE CONCEPTS. A Definition · A Data Warehouse: Is a repository for collecting, standardizing, and summarizing snapshots of transactional data.
COMP 578 Data Warehouse and Data Warehousing: An Introduction
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
Data Mining and Data Warehousing – a connected view.
10/30/2001Database Management -- R. Larson Data Warehousing University of California, Berkeley School of Information Management and Systems SIMS 257: Database.
Introduction to Data Warehousing Enrico Franconi CS 636.
11/2/2000Database Management -- R. Larson Data Warehouses, Decision Support and Data Mining University of California, Berkeley School of Information Management.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
A Comparsion of Databases and Data Warehouses Name: Liliana Livorová Subject: Distributed Data Processing.
Defining Data Warehouse Concepts and Terminology.
Joachim Hammer 1 Data Warehousing Overview, Terminology, and Research Issues Joachim Hammer.
1 Database Administration (CG168) – Lecture 10a: Introduction to Data Warehousing Data Warehousing “An Introduction” Dr. Akhtar Ali School of Computing,
Data Management for Decision Support Session-2 Prof. Bharat Bhasker.
D ATABASE S YSTEMS D ATA W AREHOUSING I Asma Ahmad 29 th April, 2011.
Enhancing Management Decision Making Week-10 Prof. Bharat Bhasker.
Business Intelligence. Unit 1 Important Concepts.
Database Design - Lecture 1
Data Warehouse Concepts Transparencies
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
© 2007 by Prentice Hall 1 Introduction to databases.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Data Warehouses BUAD/American University Data Warehouses.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing Lecture-2 Introduction and Background 1.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-2 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Operational vs. Informational System. Operational System Operational systems maintain records of daily business transactions whereas a Data Warehouse.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Sachin Goel (68) Manav Mudgal (69) Piyush Samsukha (76) Rachit Singhal (82) Richa Somvanshi (85) Sahar ( )
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data resource management
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
MAIN BOOKS 1. DATA WAREHOUSING IN THE REAL WORLD : Sam Anshory & Dennis Murray, Pearson 2. DATA MINING CONCEPTS AND TECHNIQUES : Jiawei Han & Micheline.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse.
Data Warehousing MEC 623 – Data Warehousing and Data Mining.
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Data Warehouse and OLAP
Advanced Applied IT for Business 2
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Data Warehouse.
Instructor: Dan Hebert
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
An Introduction to Data Warehousing
Introduction to Data Warehousing
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
Data Warehouse and OLAP
Data and Interoperability:
Data Warehouse.
Introduction to Data Warehousing
Data Warehouse and OLAP Technology
Data Warehousing & DATA MINING (SE-409) Lecture-1 Introduction and Background Huma Ayub Software Engineering department University of Engineering and Technology,
Presentation transcript:

Data Warehousing/Mining 1 Data Warehousing/Mining Introduction

Data Warehousing/Mining 2 Outline of Lecture  Brief History of Data Warehousing  What is a Data Warehouse?  Need For Strategic Information  Information Crisis  Operational and Decision Support System  Difference B/W standard DB and Data warehouse

Data Warehousing/Mining 3 Data Warehouse Evolution TIME Information- Based Management Data Revolution “Middle Ages” “Prehistoric Times” Relational Databases PC’s and Spreadsheets End-user Interfaces 1st DW Article DW Confs. Vendor DW Frameworks Company DWs “Building the DW” Inmon (1992) Data Replication Tools

Data Warehousing/Mining 4 Escalating Need For Strategic Information Organizations need information to formulate the business strategies,establish Goals,set Objectives e.g.  Increase the customer by 10% over the next 5 years  Gain market share by 15% in the next 2 years  Increase product quality levels in the top five product groups

Data Warehousing/Mining 5 The Information Crisis  Information is said to be doubled every 18 months  Organizations have tons of data available Then why information Crisis? Why cant organizations convert the data into useful information for strategic decision making?

Data Warehousing/Mining 6 Problem: Heterogeneous Information Sources “Heterogeneities are everywhere”  Different interfaces  Different data representations  Diverse structure of databases  Duplicate and inconsistent information Personal Databases Digital Libraries Scientific Databases World Wide Web

Data Warehousing/Mining 7 About Some Definitions  What is data?  What is information?  What is Warehouse?

Data Warehousing/Mining 8 What is a Data Warehouse? A Practitioners Viewpoint “A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context.” -- Barry Devlin, IBM Consultant

Data Warehousing/Mining 9 A Data Warehouse is...  Stored collection of diverse data –A solution to data integration problem –Single repository of information  Subject-oriented –Organized by subject, not by application –Used for analysis, data mining, etc.  Large volume of data (Gb, Tb)  Non-volatile –Historical –Time attributes are important

Data Warehousing/Mining 10 A Data Warehouse is... (continued)  Updates infrequent  Examples –All transactions EVER at WalMart –Complete client histories at insurance firm –Stockbroker financial information and portfolios

Data Warehousing/Mining 11 Summary Operational Systems Data Warehouse Population Data Warehouse Business Information Interface

Data Warehousing/Mining 12 What is Operational and Decision Support System Operational Systems  Making the wheels of Business Turn –Take an order –Process a claim –Make shipment –Generate an invoice –Receive cash –Reserve an airline seat

Data Warehousing/Mining 13 Decision Support System  Watching the wheels of business turn –Show the top selling products –Show the problem regions –Tell me why (drill down) –Let me see other data (drill across) –Alert me when a district sells below target What is Operational and Decision Support System (Contd…)

Data Warehousing/Mining 14 Difference Operational Current Values Optimized for transaction High Read, update, delete Predictable, repetitive Sub seconds Large Number Informational Archived, derived, optimized Optimized for complex queries Medium to Low Read Ad hoc, random, Heuristic Several Seconds to Minutes Relatively Small number Data Content Data Structure Access Frequency Access Type Usage Response Time Users

Data Warehousing/Mining 15 Warehouse is a Specialized DB Standard DB  Mostly updates  Many small transactions  Mb - Gb of data  Current snapshot  Index/hash on p.k.  Raw data  Thousands of users (e.g., clerical users) Warehouse  Mostly reads  Queries are long and complex  Gb - Tb of data  History  Lots of scans  Summarized, reconciled data  Hundreds of users (e.g., decision-makers, analysts)

Data Warehousing/Mining 16 Warehousing and Industry  Warehousing is big business –$2 billion in 1995 –$3.5 billion in early 1997 –About $8 billion in 1998 [Metagroup]  WalMart has largest warehouse –900-CPU, 2,700 disk, 23 TB Teradata system –~7TB in warehouse –40-50GB per day

Data Warehousing/Mining 17 Data Warehousing: Two Distinct Issues (1) How to get information into warehouse “Data warehousing” (2) What to do with data once it’s in warehouse “Warehouse DBMS”  Both rich research areas  Industry has focused on (2)

Data Warehousing/Mining 18 Thank You Very Much