From Operational Systems Vincent Rainardi 21st November 2009 Celtic Manor, Newport.

Slides:



Advertisements
Similar presentations
Microsoft Confidential. We look at the world... with our own eyes...
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Data Warehouse Overview (Financial Analysis) May 02, 2002.
SQLBits 8, 9 th April 2011, Brighton Vincent Rainardi
Introduction to ETL Using Microsoft Tools By Dr. Gabriel.
Big Data Working with Terabytes in SQL Server Andrew Novick
James Serra – Data Warehouse/BI/MDM Architect
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Technical BI Project Lifecycle
Dos and don’ts of Columnstore indexes The basis of xVelocity in-memory technology What’s it all about The compression methods (RLE / Dictionary encoding)
SQLBits 8, 9 th April 2011, Brighton Vincent Rainardi Blog: dwbi1.wordpress.com.
Data Warehousing M R BRAHMAM.
DATA WAREHOUSE DATA MODELLING
Dimensional Modeling Business Intelligence Solutions.
Data Manager Best Practices Business Intelligence Solutions.
M.S. Access Module CAS 133 Russ Erdman. M.S. Access Module Assignment Overview Two options for the unit: All students complete Units A, B and C In class.
© Ron McFadyen1 Many-to-one-to-many We need information that can only be obtained by accessing two fact tables through a common dimension … drilling across.
Project REAL and Tips for Migrating from SQL Server 2000 Analysis Services to SQL Server 2005 & Deploying, Securing and Managing AS in SQL 2005 Dan Weinstein.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Introduction Paul Turley SqlServerBiBlog.com Mentor, SQL Server MVP
ETL Design and Development Michael A. Fudge, Jr.
ETL By Dr. Gabriel.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Loading Ola Ekdahl IT Mentors 9/12/08.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
CS 345: Topics in Data Warehousing Tuesday, October 19, 2004.
Vidas Matelis, Toronto SQL Server User Group November 13, 2008.
Session 4: The HANA Curriculum and Demos Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University.
Optimizing Time-Series Calculations in SSAS
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
BI Terminologies.
6 MDS – all in a simple UI Data Editing and Errors Handling Hierarchy Modification Transactions Annotation (Auditing) Versioning Business Rules.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
UNIT-II Principles of dimensional modeling
Sales Dim Date Dim Customers Dim Products Dim Categories Dim Geography The data warehouse is a simple and standard one, after all we.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Columnstore Indexes.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Cube Measure Group Measure Partition Cube Dimension Dimension Attribute Attribute Relationship Hierarchy Level Cube Attribute Cube Hierarchy.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
Or How I Learned to Love the Cube…. Alexander P. Nykolaiszyn BLOG:
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
INTELLIGENT DATA SOLUTIONS OM.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
Summarized from various resources Modern Database Management
Applying Data Warehouse Techniques
Dimensional Model January 14, 2003
Inventory is used to illustrate:
SSIS Demo Michael A. Fudge, Jr.
Retail Sales is used to illustrate a first dimensional model
20 Questions with Azure SQL Data Warehouse
Unidad II Data Warehousing Interview Questions
Applying Data Warehouse Techniques
Applying Data Warehouse Techniques
Data warehouse architecture CIF, DM Bus Matrix Star schema
Retail Sales is used to illustrate a first dimensional model
Data Warehousing Concepts
Applying Data Warehouse Techniques
Examines blended and separate transaction schemas
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
ETL Processing Mechanics of ETL.
Applying Data Warehouse Techniques
Processing Tabular Models
Dmytro Polishchuk BI Developer DB Best Technologies
Implementing ETL solution for Incremental Data Load in Microsoft SQL Server Ganesh Lohani SR. Data Analyst Lockheed Martin
Presentation transcript:

From Operational Systems Vincent Rainardi 21st November 2009 Celtic Manor, Newport

Cubes

Create Cubes from Ops System ETL DW Source Systems cube Can we create cubes straight from the transaction / operational systems? Saves us creating a data mart/warehouse.  And the ETL – 75 % effort?

Create Cubes from Ops System No date table  in DW we have a date dimension Attributes are buried in big transaction tables  in DW they are in small dimension tables Performance concern  it’s going to be slow, right? (which one, the ERP or the cube?)

Why would we want to do that? To save time (and money) To look at data – DQ / DM projects For reconciliation (data migration) For testing (ETL / DW load) Source ETL Target cube

Demo No surrogate keys – join on natural keys Decode tables: join in DSV, e.g. account type Date dimension: server dimension, from columns Related table: separate dimension or combined, e.g. customer put into account  Hierarchies, e.g. between account & customer attributes  Other MG needs dim_customer on its own

Demo Select distinct to form a dimension, e.g. trans type Date dim: complete members vs incomplete Time dimension: (half) hourly slot Numbers in dim: put in separate MG, e.g. shares Building cubes from ops sys requires some mart design, e.g. do we build a single fact table from both order & trade tables? Depends on grain

Demo Dates in dimension, e.g. order & account:  server date dim must link direct to fact  server date dim can’t be a reference dim Large dimension, e.g. dim order:  break into small dimensions  concat the attributes to form dim key, e.g. select attr1|attr2 as dim key

Large Dim DO and DON’T Do: Use combined attributes for dim key (to create a dimension from large table), e.g order: select col1|col2 as dimkey, col1, col2 from table1 Don’t: Use PK of the table (brings lots of rows) Don’t: lookup the real dim key in fact’s DSV, e.g. select m1,m2, T2.pk from T1 join T2 on T1.col1=T2.col1 and T1.col2=T2.col2 Do: add isnull/coalesce

Other points No unknown row – use SSAS unknown member Snapshot fact:  ops systems has no snapshot, just trans tables  query each month, then union across months  partitions, process last partition only Querying transaction systems  use ‘with (nolock)’  process outside trans hours  source from secondary copy

Thanks Comments/Corrections/Questions Feedback sheets