Data Warehousing/Loading the DW—Topics

Slides:



Advertisements
Similar presentations
Business Intelligence Simon Pease. Experience with BI Developing end-to-end BI prototype for Plan International Developing end-to-end BI prototype for.
Advertisements

Introduction to ETL Using Microsoft Tools By Dr. Gabriel.
SQL Server Accelerator for Business Intelligence (SSABI)
Technical BI Project Lifecycle
Chapter 18 - Data sources and datasets 1 Outline How to create a data source How to use a data source How to use Query Builder to build a simple query.
Integrating Access with the Web and with Other Programs.
Building a Data Warehouse with SQL Server Presented by John Sterrett.
5 Copyright © 2009, Oracle. All rights reserved. Defining ETL Mappings for Staging Data.
DB Audit Expert v1.1 for Oracle Copyright © SoftTree Technologies, Inc. This presentation is for DB Audit Expert for Oracle version 1.1 which.
ETL Design and Development Michael A. Fudge, Jr.
Microsoft Visual Basic 2005 ENRICHMENT CHAPTER Visual Studio Tools for Office.
SSIS Over DTS Sagayaraj Putti (139460). 5 September What is DTS?  Data Transformation Services (DTS)  DTS is a set of objects and utilities that.
Databases and LINQ Visual Basic 2010 How to Program 1.
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Automating Database Processing Chapter 6. Chapter Introduction Design and implement user-friendly menu – Called navigation form Macros – Automate repetitive.
Session 4: The HANA Curriculum and Demos Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University.
More ETL. ETL in a nutshell ETL is an abbreviation of the three words Extract, Transform and Load. It is an ETL process to –extract data, mostly from.
The DM Process – MS’s view (DMX). The Basics  You select an algorithm, show the algorithm some examples called training example and, from these examples,
Data Interoperability Basics Bruce Harold & Dale Lutz.
IT 456 Seminar 5 Dr Jeffrey A Robinson. Overview of Course Week 1 – Introduction Week 2 – Installation of SQL and management Tools Week 3 - Creating and.
Data Warehousing.
CSCI 6962: Server-side Design and Programming Database Manipulation in ASP.
Carey Probst Technical Director Technology Business Unit - OLAP Oracle Corporation.
Introduction to Enterprise Guide Jennifer Schmidt Rhonda Ellis Cassandra Hall.
Databases with LINQ. LINQ to SQL LINQ to SQL uses LINQ syntax to query databases. LINQ to SQL classes are automatically generated by the IDE’s LINQ to.
1 Chapter 20 – Data sources and datasets Outline How to create a data source How to use a data source How to use Query Builder to build a simple query.
UNIT-II Principles of dimensional modeling
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
1 Database Design and Development: A Visual Approach © 2006 Prentice Hall Chapter 12 DATABASE DESIGN AND DEVELOPMENT: A VISUAL APPROACH Chapter 12 Accessing.
02 | Data Flow – Extract Data Richard Currey | Senior Technical Trainer–New Horizons United George Squillace | Senior Technical Trainer–New Horizons Great.
NSF DUE ; Wen M. Andrews J. Sargeant Reynolds Community College Richmond, Virginia.
SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 8 1 Microsoft Office Access 2003 Tutorial 8 – Integrating Access with the.
1 CA201 Word Application Presenting Information in Tables and Columns Week # 5 By Tariq Ibn Aziz Dammam Community college.
Pulling Data into the Model. Agenda Overview BI Development Studio Integration Services Solutions Integration Services Packages DTS to SSIS.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
SSIS 2012: The Quiet Revolution Presented by Bryan Cafferky Business Intelligence Consultant BPC Global Solutions LLC
John Lloyd, Edgewater Fullscope
Doron Orbach UCMDB Product Manager
Data Virtualization Tutorial: Custom Functions
Presented By: Jessica M. Moss
Visual Basic 2010 How to Program
Creating Oracle Business Intelligence Interactive Dashboards
VB 2010 Pertemuan 10.
Data Warehousing/Loading the DW—Topics
DATABASE CONCEPTS A database is a collection of logically related data designed to meet the information needs of one or more users Data bases are store-houses.
Data Virtualization Tutorial: XSLT and Streaming Transformations
SQL MODELER - OPEN There are Three Ways to open the SQL Modeler
Presented by: Warren Sifre
Implementing Data Models & Reports with Microsoft SQL Server
MODULE 7 Microsoft Access 2010
University of Houston-Clear Lake Kaiser Permanente San Jose
بسم الله الرحمان الرحیم
Using JDeveloper.
Microsoft Office Access 2003
Microsoft Office Access 2003
Database Applications
Adding Multiple Logical Table Sources
Topic 11 Lesson 1 - Analyzing Data in Access
Microsoft Excel Basics: Pivot Tables
Microsoft Office Illustrated Fundamentals
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Unit J: Creating a Database
Database Performance Part 1—Topics
eSeries Entities By Julie Ladner
Assignment resource Working with Excel Tables, PivotTables, and Pivot Charts Fairhurst pp The commands on these slides work with the Week 2 Excel.
Data Base.
Visual Data Flows – Azure Data Factory v2
Implementing ETL solution for Incremental Data Load in Microsoft SQL Server Ganesh Lohani SR. Data Analyst Lockheed Martin
Presentation transcript:

Data Warehousing/Loading the DW—Topics Review of Steps & Components SQL Server Integration Services Define Data Sources Create Packages & Process Flows Data Flows for Dimension Tables Data Flows for Fact Tables Lookup Translations Importing Excel Data

Steps and Components Source Systems—provide raw data to the DW Integration Services—Provide transformation and loading services from source data to DW Data Warehouse—Customized data store for Business Intelligence Analysis Services—Tools for data mining and reporting Reporting Services—Out old friend acting on an enhanced data store

Star Schema & Dimension Tables Dimension Tables represent concepts (entities) used to group data in the fact tables Also contain descriptive attributes of the entity represented by the dimension table Simplest way for nontechnical users to picture the data Relate to FKs in the fact tables

Design Steps It is impractical to design a one-source DW as the first deliverable Identify initial scope of DW Problem Statement Business Requirements Build DW Data Model Business Processes to address requirements Level of Detail Fact Tables (what we are measuring) Dimension Tables (how we look at the data)

SQL Server Integration Services SSIS creates projects containing packages to select, transform, and load data From a data source To a data destination (DW) Start projects through Business Intelligence Studio (Same as Reporting Services designers) Packages can be narrowly or broadly focused Recommend narrowly focused packages to do one specific task

Create New Integration Services Project

Create Data Sources Right-Click Data Sources in Solution Explorer Create data sources for source and destination DB

Define Process Flow Process flow defines steps needed to perform package tasks Process flow tasks are found in the toolbar Add a data flow task to the process flow Set name property in properties Define Data flow task by double-clicking Process flows can be simple or incredibly complex

Define Data Flow Middle of Design Area Add source and destination data sources to Connection Managers area—Right-click Select New connection from data source… Select all sources needed for current transformation Source DB Destination (DW) DB

Define Data Flow (cont.) Data flows start with sources that provide columns of data End with an output destination Along the way… Columns can be added Other sources Calculations Columns can be transformed Columns can be ignored Flow columns map to columns in destination table

Adding Data Source & Destination Add Data Source & Destination controls from toolbox OLEDB Data Source & Destination for SQL Server Several other source and destination types available Set Name properties

Configure Data Source Control Double-click Choose avail. Connection Manager Select Data access mode Build queries if needed Use view or SP Use table

Configure Data Destination Control Drag green arrow from source control to destination control Set connection manager to DW Set name of table or view to DW destination table Select Mappings

Configure Data Destination Control (cont.) Confirm column mappings from source to dest. Drag source col onto dest. col if necessary

Modify Process Flow New data is appended to destination table If we want to empty the destination table we can add an Execute SQL Task control Set SQL statement to a DELETE statement

Loading Fact Tables Lookup Translations Fact table loading similar to dimension table loading More likely to be based on a query instead of a table Needs Lookup translations to map from available FKs in source DB to Dimension keys in destination table Lookup Translations

Lookup Translations Add Lookup control to data flow from toolbox Sequence is (usually) not important Set table on which to perform lookup action Use Columns tab to set lookup

Lookup Translations (cont.) Confirm column mappings for lookup values Select value to be returned Select Replace PK if available Otherwise select <add as new column>

Importing Excel Data See government statistical data in excel format http://www.gpoaccess.gov/eop/tables06.html Create new package as before Add Excel Source Set name property

Importing Excel Data (cont.) Double-click Excel source to configure New connection

Importing Excel Data (cont.) Find Excel file Set version Decide whether layout supports providing column names

Importing Excel Data (cont.) Excel data is likely to need some transformations Unusual data typing in Excel Unpivoting pivot tables Extracting data I recommend getting rid of messy data in Excel source file if possible before importing Derived Column Control Data Type Conversion

Importing Excel Data (cont.) As with other data flows the results are a set of columns Final step is to map the developed set of columns to the destination data source columns

Importing Data--Summary There are hundreds (or thousands) of combinations of data sources and transformations to result in data for DW Topics not covered Validating data Many different transformations Using alternative data sources Easy loads are easy Difficult transformations are more challenging