MIS 451 Building Business Intelligence Systems Data Staging.

Slides:



Advertisements
Similar presentations
Introduction to OWB(Oracle Warehouse Builder)
Advertisements

Business Intelligence Simon Pease. Experience with BI Developing end-to-end BI prototype for Plan International Developing end-to-end BI prototype for.
Data Warehousing – An Introductory Perspective
James Serra – Data Warehouse/BI/MDM Architect
Data transfers into a database First time system implementation –From a manual system Data warehousing projects Database version upgrade ERP projects Migration.
Data Warehousing M R BRAHMAM.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Designing the data warehouse / data marts Part 2.
Designing the Data Warehouse and Data Mart Methodologies and Techniques.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Data Staging Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of.
Business Intelligence. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) The Data Warehouse Lifecycle Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Designing a Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Lecture-8/ T. Nouf Almujally
DATA RESOURCE MANAGEMENT.
Business Intelligence Instructor: Bajuna Salehe Web:
MIS 451 Building Business Intelligence Systems
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
ETL By Dr. Gabriel.
Designing a Data Warehouse Issues in DW design. Three Fundamental Processes Data Acquisition Data Storage Data a Access.
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Database Systems – Data Warehousing
Data Warehouse Chapter 11. Multiple Files Problem Added complexity of multiple source files Start simple Multiple Source files Extracted data Logic to.
Data Warehousing Seminar Chapter 5. Data Warehouse Design Methodology Data Warehousing Lab. HyeYoung Cho.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Loading Ola Ekdahl IT Mentors 9/12/08.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
1 Data Warehouses BUAD/American University Data Warehouses.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
CISB594 – Business Intelligence
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Prepared By Aakanksha Agrawal & Richa Pandey Mtech CSE 3 rd SEM.
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
MIS 451 Building Business Intelligence Systems Data Analysis.
Advanced Accounting Information Systems Day 10 answers Organizing and Manipulating Data September 16, 2009.
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Foundations of Business Intelligence: Databases and Information Management.
7 Strategies for Extracting, Transforming, and Loading.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Houston Petroleum Valve Company Data-Mining Project Data Modeling Phase Fouad Alibrahim Mohammad H. Monakes University of Houston Clear Lake University.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
C Copyright © 2007, Oracle. All rights reserved. Introduction to Data Warehousing Fundamentals.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
INTRODUCTION TO INFORMATION SYSTEMS LECTURE 9: DATABASE FEATURES, FUNCTIONS AND ARCHITECTURES PART (2) أ/ غدير عاشور 1.
Copyright  Oracle Corporation, All rights reserved Building the Warehouse.
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Data CLEANSING Getting Data Ready.
Plan for Populating a DW
Data Staging Data Staging Legacy System Data Warehouse SQL Server
Defining Data Warehouse Concepts and Terminology
Overview of MDM Site Hub
Data warehouse and OLAP
ETL TESTING ONLINE TRAINING
Informix Red Brick Warehouse 5.1
Data Warehouse.
Defining Data Warehouse Concepts and Terminology
Building an Observation Data Layer
CMPE 226 Database Systems April 11 Class Meeting
Typically data is extracted from multiple sources
Data warehouse.
Data Warehousing Concepts
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Best Practices in Higher Education Student Data Warehousing Forum
JTLS 6.0 View Data Files In Excel
Implementing ETL solution for Incremental Data Load in Microsoft SQL Server Ganesh Lohani SR. Data Analyst Lockheed Martin
Presentation transcript:

MIS 451 Building Business Intelligence Systems Data Staging

2 Project Planning Requirements Analysis Physical Design Logical Design Data Staging Data Analysis (OLAP)

3 What has been done? Project plan User requirements Dimentional Model (Star Schema) Index plan

4 Data Staging Data Warehouse (Oracle) DB2 Access Excel Legacy System Data Staging

5 Extraction Data Cleansing Data Integration Transformation Transportation (Loading) Maintenance

6 Extraction During extraction step, source data are extracted from legacy systems and placed in a staging area To keep the performance of legacy systems, source data are only extracted without any cleansing, integration and transformation operations.

7 Extraction A variety formats of data exist in legacy systems Relational database: DB2, Oracle, SQL Server, Informix, Access … Flat file: Excel file, text file Commercial data extraction tools are very helpful in data extraction. Ex: Oracle Data Mart Builder

8 Data Cleansing Data cleansing is to clean errors in source data. Data entry error Missing value

9 Data Cleansing Data Entry Error Reason: typing error, wrong deletion How to correct: Pre-extraction checking: domain constraints, referential integrity constraints, value dependency checking Post-extraction checking

10 Data Cleansing Missing value Reason: information not available, typing error How to correct: Replace with the attribute mean value Replace with a constant value Replace with the most probable value

11 Data Integration Data from different data sources with different formats need to be integrated into one data warehouse Ex: 3 customer table in sales department, marketing department and an acquired company Customer (cid, cname, city …) Customer (customerid, customername,city…) Customer (custid, custname, cname,…)

12 Data Integration Same attribute with different name: cid, customerid, custid Different attribute with same name: cname -> customer name cname -> city name Same attribute with different format

13 Data Integration How to integrate Get the schemas of all data sources Get the schema of the data warehouse Integrate source schemas with the help of commercial tools

14 Transformation Prepare data ready to load into the data warehouse Change the format of data Create derived attributes and tables Aggregate Create warehouse keys

15 Transportation Using bulk load tools, such as Oracle SQL Loader, instead of SQL command Create indexes

16 Maintenance Maintenance frequency: daily, weekly, monthly Identify change records and new records in legacy systems Create timestamp for change and new records in legacy systems Compare data between legacy systems and DW Load change and new records into DW