Data Warehousing and Data Mining, Prof. Sin-Min Lee


DATA WAREHOUSE, OLAP, and DATA MINING
Concepts
– Data warehousing
– OLAP (On-Line Analytical Processing)
– Data mining
Case Studies
– WebTarget (USN)
– TFDW (USMC)

DATA WAREHOUSE
- DATABASE MANAGEMENT IN THE INTERNET ERA
- CLIENT/SERVER-BASED
- ANALYTICAL vs OPERATIONAL (OLAP vs OLTP)
- MULTI-DIMENSIONAL ANALYSIS
- DATA WAREHOUSE (ENTERPRISE-WIDE) vs DATA MART (FUNCTIONAL AREA)

MULTIDIMENSIONAL NATURE OF DATA WAREHOUSES
BORING QUERY: “How many Sailors/Marines chose not to stay in the Navy/Marine Corps this year?”
USEFUL QUERY: “What was our retention (separation) rate this year by community, by paygrade, by years of service, by gender, and by rating; how did it compare to last year; and what can we expect next year?”
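The difference between the two queries can be sketched in a few lines of Python. This is only an illustration: the records, the dimension names, and the `separation_rate` helper are hypothetical and not part of the original slides. The same data answers the "boring" question (no grouping) and the "useful" one (grouping by several dimensions at once).

```python
from collections import defaultdict

# Hypothetical personnel records:
# (community, paygrade, gender, separated_this_year)
RECORDS = [
    ("Aviation", "E-4", "F", True),
    ("Aviation", "E-4", "M", False),
    ("Aviation", "E-5", "M", False),
    ("Surface", "E-4", "F", False),
    ("Surface", "E-4", "M", True),
    ("Surface", "E-4", "M", True),
]

# Position of each dimension inside a record (assumed layout).
DIMENSIONS = ("community", "paygrade", "gender")


def separation_rate(records, by):
    """Return {group key: fraction separated} for the chosen dimensions."""
    idx = [DIMENSIONS.index(d) for d in by]
    totals = defaultdict(int)
    separated = defaultdict(int)
    for rec in records:
        key = tuple(rec[i] for i in idx)
        totals[key] += 1
        separated[key] += rec[3]  # True counts as 1, False as 0
    return {k: separated[k] / totals[k] for k in totals}


# "Boring" query: one number for the whole force.
overall = separation_rate(RECORDS, by=())

# "Useful" queries: the same measure sliced along one or more dimensions.
by_community = separation_rate(RECORDS, by=("community",))
by_community_paygrade = separation_rate(RECORDS, by=("community", "paygrade"))
```

Adding another dimension to `by` is all it takes to refine the question, which is the property a multidimensional warehouse is built to provide at scale.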

DW ARCHITECTURE

DW 3-TIER ARCHITECTURE

1. DATA QUALITY & DATA CLEANSING
#1 REASON FOR DW PROJECT FAILURE
PROBLEMS
- Database heterogeneity
- Data heterogeneity
FUNCTIONALITY OF TOOLS
- Removing unwanted data from operational databases
- Converting to common data names and definitions
- Calculating summaries and derived data
- Establishing defaults for missing data
- Accommodating source data definition changes
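A minimal sketch of several of the tool functions listed above (dropping unwanted fields, converting to common names, filling defaults, and computing a derived summary). All field names, mappings, and default values here are hypothetical, chosen only to show two heterogeneous sources being brought to one definition.

```python
# Hypothetical mapping from source-specific field names to common names.
COMMON_NAMES = {
    "svc_yrs": "years_of_service",
    "YOS": "years_of_service",
    "grade": "paygrade",
    "PAYGRD": "paygrade",
}

# Hypothetical defaults applied when a value is missing.
DEFAULTS = {"years_of_service": 0, "paygrade": "UNKNOWN"}

# Hypothetical operational-only fields that should not reach the warehouse.
UNWANTED = {"internal_note"}


def cleanse(row):
    """Rename fields to common names, drop unwanted ones, fill defaults."""
    clean = {}
    for name, value in row.items():
        if name in UNWANTED:
            continue
        clean[COMMON_NAMES.get(name, name)] = value
    for name, default in DEFAULTS.items():
        if clean.get(name) is None:
            clean[name] = default
    return clean


def average_years_of_service(rows):
    """Derived data: mean years of service across cleansed rows."""
    return sum(r["years_of_service"] for r in rows) / len(rows)


# Two sources describing the same kind of record with different names.
source_a = {"id": 1, "svc_yrs": 4, "grade": "E-4", "internal_note": "x"}
source_b = {"id": 2, "YOS": None, "PAYGRD": "E-5"}

cleaned = [cleanse(source_a), cleanse(source_b)]
summary = average_years_of_service(cleaned)
```

Keeping the name mapping and defaults in data rather than in code is one way to accommodate source definition changes: a renamed source column needs only a new entry in the mapping.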

APPROACHES TO DATA CLEANSING
AUTOMATIC CODE GENERATION
- Creates code to convert from source to target data
DATA REPLICATION TOOLS
- Capture changes to the source database from recovery logs and database triggers and propagate those changes to copies of the data
DYNAMIC TRANSFORMATION ENGINES
- Rule-driven systems that capture data from source databases at user-defined intervals, transform it, and export it to a data warehouse/mart target
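The rule-driven idea behind a dynamic transformation engine can be sketched as follows. This is an assumption-laden toy, not any vendor's design: the `run_engine` function, the rule format, and the sample rows are all hypothetical, and the scheduling at user-defined intervals is left out so that only the capture, transform, and export steps are shown.

```python
def run_engine(source_rows, rules, target):
    """Apply each rule in order to every captured row, then load the result."""
    for row in source_rows:
        out = dict(row)  # copy so the source row is left untouched
        for field, transform in rules:
            if field in out:
                out[field] = transform(out[field])
        target.append(out)
    return target


# Hypothetical rules: (field name, function applied to that field's value).
RULES = [
    ("name", str.strip),
    ("name", str.title),
    ("gender", lambda g: {"m": "M", "f": "F"}.get(g.lower(), "U")),
]

# Rows as they might be captured from a source database (made-up values).
captured = [
    {"name": "  smith, j ", "gender": "f"},
    {"name": "DOE, A", "gender": "x"},
]

warehouse = run_engine(captured, RULES, target=[])
```

Because the rules are data, they can be changed without regenerating or redeploying conversion code, which is the main contrast with the automatic code generation approach.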

2. METADATA (What does the data mean?)
- Logical Structure of DW Including End User Views
- Identification of Authoritative Data Sources
- Transformation Rules for Populating DW
- Transformation Rules to Deliver Data to End-User Analytical Tools
- Subscription Information for Information Delivery
- DW Operational Information
- DW Usage Metrics
- Security Authorizations, Access Control Lists, etc.
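One way to picture a metadata entry is as a small record attached to each warehouse column. The sketch below is illustrative only: the `ColumnMetadata` class, its fields, and the sample values are hypothetical and cover just a few of the items listed above (meaning, authoritative source, transformation rule, and access control).

```python
from dataclasses import dataclass, field


@dataclass
class ColumnMetadata:
    """Hypothetical metadata record for one warehouse column."""
    name: str
    meaning: str                 # what the data means
    authoritative_source: str    # where the value officially comes from
    transformation_rule: str     # how the warehouse value is populated
    authorized_roles: set = field(default_factory=set)

    def can_read(self, role):
        """Simple access-control check against the authorized roles."""
        return role in self.authorized_roles


retention_rate = ColumnMetadata(
    name="retention_rate",
    meaning="Share of eligible members who stayed in service this year",
    authoritative_source="personnel_db.separations",  # made-up source name
    transformation_rule="1 - separated / eligible",    # assumed definition
    authorized_roles={"analyst", "dba"},
)
```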

3. DATA WAREHOUSE DATABASE
PARALLEL COMPUTING PLATFORMS
- Exs: Symmetric (Shared) Multiprocessors (SMPs); Massively Parallel Processors (MPPs)
ROLAP
- Relational DBMS with “Heavy Duty” Indexing Capabilities
MOLAP
- Multidimensional Databases (MDDBs)
3rd Party Tools that Augment Relational Model

4. DATA MARTS
- A Data Warehouse Focused on a Specific Subject Area
- Subsidiary to a Data Warehouse of Integrated Data
- More Rapidly Deployable than a Data Warehouse
- Subject-based vice Enterprise-based

5. ACCESS TOOLS
QUERY AND REPORTING TOOLS
- Managed query tools: Layer between user and SQL (e.g., BrioQuery)
- Configurable report generators (e.g., Brio’s BrioReport)
APPLICATIONS
- Application development platforms (e.g., PowerSoft’s PowerBuilder; Microsoft’s Visual Basic)

ACCESS (cont’d)
OLAP
- Support of multidimensional analysis
- Ability to drilldown and rollup along any of the predefined dimensions
- Major vendors: Cognos, Business Objects, Brio
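Rollup and drilldown can be illustrated with a tiny in-memory cube. The sketch below assumes a hypothetical sales cube keyed by (year, quarter, product type); the `rollup` helper and all figures are made up. Rolling up drops dimensions and sums over them; drilling down is the reverse step of keeping one more dimension.

```python
from collections import Counter

# Hypothetical cube: (year, quarter, product_type) -> units sold.
SALES = {
    (2001, "Q1", "Books"): 10,
    (2001, "Q2", "Books"): 15,
    (2001, "Q1", "Music"): 5,
    (2002, "Q1", "Books"): 20,
}


def rollup(cube, keep):
    """Sum the measure over every dimension whose position is not in `keep`."""
    out = Counter()
    for key, units in cube.items():
        out[tuple(key[i] for i in keep)] += units
    return dict(out)


# Roll up to yearly totals (quarter and product type are aggregated away).
by_year = rollup(SALES, keep=(0,))

# Drill down from year to quarter by keeping one more dimension.
by_year_quarter = rollup(SALES, keep=(0, 1))
```

A real OLAP server would typically precompute or index such aggregates rather than rescanning the facts on every request.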

MULTIDIMENSIONAL DATA MODEL: STAR SCHEMA
FACTS: Core data element being analyzed, e.g., Units_of_Items_Sold
DIMENSIONS: Attributes about FACTS, e.g., Product_Type, Purchase_Date
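A star schema for the example fact and dimensions above might look like the following sketch, which uses Python's standard `sqlite3` module with an in-memory database. The table names, keys, and sample rows are hypothetical; only `Units_of_Items_Sold`, `Product_Type`, and `Purchase_Date` come from the slide.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: descriptive attributes about the facts.
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, product_type TEXT);
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, purchase_date TEXT);

-- Fact table: the measure plus one foreign key per dimension.
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id INTEGER REFERENCES dim_date(date_id),
    units_of_items_sold INTEGER
);

-- Made-up sample data.
INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Music');
INSERT INTO dim_date VALUES (1, '2001-01-15'), (2, '2001-02-15');
INSERT INTO fact_sales VALUES (1, 1, 10), (1, 2, 15), (2, 1, 5);
""")

# Analyze the fact along the product dimension.
rows = conn.execute("""
    SELECT p.product_type, SUM(f.units_of_items_sold)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_id = f.product_id
    GROUP BY p.product_type
    ORDER BY p.product_type
""").fetchall()
```

The fact table sits at the center and each dimension joins to it directly, which is what gives the schema its star shape.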

ROLE OF METRICS
Facts should be defined as Measures of Effectiveness (sometimes called Key Performance Indicators, or KPIs)
Exs:
- NEC Reutilization Rate
- Retention Rate
- Attrition Rate
- Readiness (Personnel)

COGNOS DEMO ysis_launch.html

ACCESS: Data Mining
“Searching for meaningful patterns in large data sets”
Knowledge acquisition
Motivated and facilitated by:
– Availability of large data sets
– Advances in storage technology
– Data warehouse technology
– E-commerce and the Internet
Exploratory vs. confirmatory analysis

6. DW ADMINISTRATION AND MANAGEMENT
“Normal” DBA Responsibilities plus:
- Source Data Quality Checks
- Keeping track of what all the source data means
- Managing Very Large Databases (gigabytes or terabytes in size)

7. INFORMATION DELIVERY SYSTEM
How to get information from the data warehouse to users?
- Users subscribe to the data warehouse. Specifically, they subscribe to specific reports to be delivered on a periodic basis.
- Reports are delivered to the user’s Web browser at the prescribed frequency.
- Powerful tool for delivering information to the people who need it in an extremely timely fashion.
- True MIS; true DSS.
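The subscription model can be sketched as a list of subscriptions checked against the current date. Everything here is hypothetical (the subscription fields, user and report names, and the `due_reports` helper); actual delivery to a Web browser is out of scope, so the sketch only decides which reports are due.

```python
from datetime import date, timedelta

# Hypothetical subscriptions: who gets which report, and how often.
SUBSCRIPTIONS = [
    {"user": "analyst1", "report": "retention_by_paygrade",
     "every_days": 7, "last_sent": date(2024, 1, 1)},
    {"user": "manager2", "report": "attrition_summary",
     "every_days": 30, "last_sent": date(2024, 1, 1)},
]


def due_reports(subscriptions, today):
    """Return (user, report) pairs whose delivery interval has elapsed."""
    return [
        (s["user"], s["report"])
        for s in subscriptions
        if today - s["last_sent"] >= timedelta(days=s["every_days"])
    ]


# One week after the last delivery, only the weekly report is due.
due = due_reports(SUBSCRIPTIONS, today=date(2024, 1, 8))
```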

BENEFITS OF DATA WAREHOUSE
- Freedom from restrictions of operational databases
- Decision-oriented
- Extremely efficient presentation of management information
- Widespread access to critical information for those who need it when they need it
- Knowledge discovery
- Improves business intelligence
- Relatively inexpensive to implement
- Does not require re-engineering of legacy systems

GIS: GEOGRAPHIC INFORMATION SYSTEMS
Ability to visualize data spatially
Maps on top of a relational DBMS
Data is viewed on maps vice from tables
Features:
- Thematic maps
- Spatial queries
- Geocoding of data
Vendors: MapInfo; ESRI (ArcInfo)