Designing Aggregations. Performance Fundamentals - Aggregations Pre-calculated summaries of data Intersections of levels from each dimension Tradeoff.

Slides:



Advertisements
Similar presentations
Adam Jorgensen Pragmatic Works Performance Optimization in SQL Server Analysis Services 2008.
Advertisements

An Array-Based Algorithm for Simultaneous Multidimensional Aggregates By Yihong Zhao, Prasad M. Desphande and Jeffrey F. Naughton Presented by Kia Hall.
Materialization and Cubing Algorithms. Cube Materialization Each cell of the data cube is a view consisting of an aggregation of interest. The values.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Online Analytical Processing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) The Data Warehouse Lifecycle Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Chapter 13 The Data Warehouse
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates
Designing a Data Warehouse
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Designing a Data Warehouse Issues in DW design. Three Fundamental Processes Data Acquisition Data Storage Data a Access.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
SQL Analysis Services Microsoft® SQL Server 2005 Analysis Services provides unified, fully integrated views of your business data to support online.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Ahsan Abdullah 1 Data Warehousing Lecture-12 Relational OLAP (ROLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
1.
Data Warehouse & Data Mining
Systems analysis and design, 6th edition Dennis, wixom, and roth
Analysis Services 101 Dave Fackler, MCDBA, MCSE, MCT Director, Business Intelligence Practice Intellinet Corporation.
DAT353 Analysis Service: Server Internals Tom Conlon Program Manager SQL Server Business Intelligence Unit Microsoft Corporation.
Scaling SQL Server 2000 Analysis Services to the MAX Dave Wickert Program Manager SQL Server BI Practices Team Microsoft Corporation.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Loading Ola Ekdahl IT Mentors 9/12/08.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Introduction to OLAP / Microsoft Analysis Services
MSBI online training. MSBI Online Training Course Content : What Is Microsoft BI? Core concept – BI is the cube or UDM Example cube as seen using Excel.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
OnLine Analytical Processing (OLAP)
Copyright © 2002, SAS Institute Inc. All rights reserved. SAS is a registered trademark or trademark of SAS Institute Inc. in the USA and other countries.
Oracle Advanced Compression – Reduce Storage, Reduce Costs, Increase Performance Session: S Gregg Christman -- Senior Product Manager Vineet Marwah.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
BI Terminologies.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Analytics & Reporting Tool.  Outline how to access SAS OLAP Cubes through SAS AMO  Review SAS OLAP Cube creation and how it relates to integration with.
Centre of Competence on data warehouse Workshop Helsinki Database Cube and Browsing the Cube Mark Rantala.
What is OLAP?.
Indexing OLAP Data Sunita Sarawagi Monowar Hossain York University.
Performance. Performance Performance is a critical issue especially in a multi-user environment. Benchmarking is one way of testing this.
Oracle OLAP Option Bud Endress Director of Product Management, OLAP.
Copyright © 2008, Oracle. All rights reserved Essbase New Features Summary Varying attributes Text and date measures Backup, transaction.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
3 Copyright © 2006, Oracle. All rights reserved. Building an Analytic Workspace.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
9 Copyright © 2006, Oracle. All rights reserved. Summary Management.
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Introduction to SQL Server Analysis Services
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Using Partitions and Fragments
Chapter 13 The Data Warehouse
Data Warehouse.
Prepare Question Answers Exam Dumps - Dumps4Download.in
Buy Valid Microsoft Exam Study Guide Dumps Questions Answers Realexamdumps.com
Implementing Data Models & Reports with Microsoft SQL Server
Types of OLAP Servers.
Module 7: Working with Measures and Measure Groups
DataMart (Data Warehouse) Tool:
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Presentation transcript:

Designing Aggregations

Performance Fundamentals - Aggregations Pre-calculated summaries of data Intersections of levels from each dimension Tradeoff between processing (disk space) and query times Number of Aggregations Time Processing Time Query Time

Aggregation Generalities Are created when a cube is processed Include all measures Cannot generally be designed for specific members Are built and maintained automatically

Contents of an Aggregation

Time – 33 Members All (1) Year (2) Quarter (8) Month (22) Total Number of Aggregations = 4 Total Aggregated Values = 33

Contents of an Aggregation Time – 33 Total Members All (1) Year (2) Quarter (8) Month (22) State - 14 Total Members All (1) Country (3) Region (4) State (6) Total Number of Aggregations = 16 Total Aggregated Values = 462 (Theoretical)

Data Storage – Minimal Data Explosion Data Explosion Historical weakness of OLAP 100% Dense No heaters sold in Phoenix in July; no storage allocated Intelligent Aggregation Design Pre-aggregate only a subset of the data Compression Algorithm

Fact Table Show all sales for all products for all... Most detailed aggregations Highest level of aggregation Partial Aggregation

Fact Table MonthProducts Quarter Pro. Family QuarterProduct Month Partial Aggregation

Analysis Server Cube Storage

MOLAP Storage Mode Details and Aggregations Stored in Multidimensional Format Fastest Storage Option for Queries Often the Most Efficient in Terms of Disk Storage, Due to Compression

ROLAP Storage Mode Details and Aggregations Stored in RDBMS Slowest Query Performance Most Often the Slowest to Process Analysis Server Can Create Indexed Views Useful for Large Data Sources Provides Real-Time OLAP Solution

HOLAP Storage Mode Details Maintained in RDBMS Aggregations Created in Multidimensional Format Good Option where Disk Consumption Is a Concern Good Compromise if Details Are Accessed Infrequently

Cube Aggregations Full Aggregation Not Necessary Effects on Cube Size and Processing Time Cube size and processing times increase as aggregations are added to a cube Tools for Implementing Aggregations Storage Design Wizard Usage-Based Optimization Wizard

Why Usage-Based Optimization? Limitations of Storage Design Wizard Driven by structural factors Does not take Into account user behavior Benefits of Usage Based Optimization Collects data about user queries Allows aggregations to be built based on usage patterns

Using the Query Log Populating the Usage Log The server logs one out of ten queries by default You can change the query sampling rate Accessing the Usage Log You can view reports in Analysis Manager You can open the file msmdqlog.mdb Modifying the Cube Structure If you update the cube structure, query log entries may be invalid You can clear or filter the contents of the query log in Analysis Manager

Partitioning Subsets of the data within a cube May come from different tables or data sources Role in data management Varying storage modes and aggregations Adding new and removing obsolete data

Current Year History Prior Year MOLAP 15% Agg ROLAP 25% Agg HOLAP 20% Agg History Prior Year Current Year Partitioning Strategy

Month 1 Partitioning Data Management Example Month 61 Month 24 Month 2 Month 25 Month 60 Month 26 Month 25 Most recent 3 years of data in MOLAP storage Prior 2 years of data in ROLAP storage New partition for new month data

Partitioning Guidelines Benefits for processing Parallelism Particularly in an initial load Increased selectivity when querying Must set slice information for the system! Beware! Changing the aggregations for a partition will not alter other partitions