Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:

Slides:



Advertisements
Similar presentations
Business Intelligence Simon Pease. Experience with BI Developing end-to-end BI prototype for Plan International Developing end-to-end BI prototype for.
Advertisements

BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Data Warehousing M R BRAHMAM.
Jennifer Widom On-Line Analytical Processing (OLAP) Introduction.
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
DATA WAREHOUSE (Muscat, Oman).
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
1 Basic concepts of On-Line Analytical processing DT211 /4.
MIST Reporting Standard Reports Report Designer Report Builder Report Manager.
Chetan Bhirud Raza Mohammad Abinash Sahoo Online Marketing Giant.
© 2014 Zvi M. Kedem 1 Unit 11 Online Analytical Processing (OLAP) Basic Concepts.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
DATA WAREHOUSING IN SQL SERVER 2005/2008 BUSINESS INTELLIGENCE.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Data Warehouse & Data Mining
On-Line Analytic Processing Chetan Meshram Class Id:221.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
OnLine Analytical Processing (OLAP)
Presented By: Muhammad Rizvi Raghuram Vempali Surekha Vemuri.
CS 157B: Database Management Systems II March 20 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
BI Terminologies.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Data Warehousing. Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
MIS2502: Data Analytics Dimensional Data Modeling
Winter 2006Winter 2002 Keller, Ullman, CushingJudy Cushing 19–1 Warehousing The most common form of information integration: copy sources into a single.
Fox MIS Spring 2011 Data Warehouse Week 8 Introduction of Data Warehouse Multidimensional Analysis: OLAP.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
1 On-Line Analytic Processing Warehousing Data Cubes.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Data Warehousing Multidimensional Analysis
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Or How I Learned to Love the Cube…. Alexander P. Nykolaiszyn BLOG:
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
On-Line Analytic Processing
Chapter 13 The Data Warehouse
Data Warehouse.
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
On-Line Analytical Processing (OLAP)
CMPE 226 Database Systems April 11 Class Meeting
Data Warehouse and OLAP
University of Houston-Clear Lake Kaiser Permanente San Jose
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Introduction of Week 9 Return assignment 5-2
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Data Warehouse and OLAP
Data Warehousing.
Presentation transcript:

Cube Intro

Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Needs supports Needs supports Goals – needs to be specific Not to hit an iceberg – for Titanic or other boats Not to hit an iceberg – for Titanic or other boats Measures – needs to be concrete Away from any visible iceberg Away from any visible iceberg Feedback – needs to be timely and accurate We hit an iceberg, rather than saw one – not timely enough We hit an iceberg, rather than saw one – not timely enough The boat can sink, actually, again too late to find out The boat can sink, actually, again too late to find out

BI – Business Intelligence Is the delivery of accurate, useful information to the appropriate decision makers within necessary timeframe to support effective decision making It is a superset of what can be provided by data mining SQL Server 2014 is a decent tool for this and support both Cubes/Data warehouse and Data mining

OLAP/Cube Term coined in mid-1990’s when database people needed a new research direction Main goal: support ad-hoc but complex querying performed by business analysts Interactive process of creating, managing, analyzing and reporting on data Extends spreadsheet-like analysis to work with huge amounts of data in a data warehouse

Known vs. Unknown Questions When using SQL statements or even Cubes, we know the question and can formulate the SQL statements (most of the time) to dig out answers to the questions Layout let answers – SQL Layout let answers – SQL Data-led answers – the drill down in a cube dimension (if NW sales figure is low, which state is the main contributor?) Data-led answers – the drill down in a cube dimension (if NW sales figure is low, which state is the main contributor?) When we do not know what to ask, we use data mining – book’s definition is less accurate

What should be the sources OLTP is the source On line transaction Process On line transaction Process These are the things we have learned in database classes These are the things we have learned in database classes But OLTP, the direct data store is not suitable for BI because Not to fight for computing resources of OLTP systems Not to fight for computing resources of OLTP systems OLTP may not have all the history readily available OLTP may not have all the history readily available BI may have to use data from several OLTP systems BI may have to use data from several OLTP systems The solution is generally Data Mart

Data marts Data marts are related sets of data that are grouped together and separated out from the main body of data. It can be a repository of OLTP data or portion of data from a data warehouse Data in a Data Mart are not meant for supporting real time operations Data in a Data mart are generally have gone through the data clearing process

More on OLAP Place key performance indicators (measures) into context (dimensions) measures are pre-aggregated data retrieval is significantly faster modeled in a DW The processed “cube” is made available to business analysts who can browse the data using a variety of tools, making ad hoc interactive and analytical processing

OLAP versus OLTP

Measures and dimensions Measures: key performance indicators, such as sales amount, that you would like to evaluate typically numerical, including volume, sales, and costs a rule of thumb: if a number makes (business/social) sense when aggregated, then it is a measure examples: aggregate daily volume to month, quarter and year aggregating telephone numbers would not make sense therefore, telephone numbers are not measures postcode: not a measure, but can be a dimension (?) Totals in all the transactions would be a measure Number of high school students by school, county, state should be a measure

Measures and dimensions Dimensions: categories of data analysis typical dimensions include product, time, region a rule of thumb: when a report is requested to be ordered "by“ something, that something is usually a dimension e.g. in sales report: view sales by month, by region, so the two dimensions needed are time and region

Star Schema Dimensions and measures are physically represented by a star schema (typically) arrange the dimension tables around a central fact table that contains the measures a fact table contains a column for each measure as well as a column for each dimension

Example Dimension tables Market (Store_ID, City, Region) Product (Product_ID, Name, Category, Price) Time (Time_ID, Week, Month, Quarter, Year) Fact table Sales (Store_ID, Product_ID, Time_ID, Amount)Notes Amount should be the measure Amount should be the measure Time_ID is preprocessed to show at week or month level Time_ID is preprocessed to show at week or month level This can be modified to show Snowflakes This can be modified to show Snowflakes

Example supermarket sales roll-up: summaries over all cities, regions slice the cube to select sales only in day 3 dice the cube to select sales only in week 2 (days 8-14), and group by regions

The TWO Multi-dimensional model What we meant mostly when referring a cube or a data warehouse What we meant mostly when referring a cube or a data warehouse Tabular model Complicated in SQL Server Complicated in SQL Server May be accomplished easily with other tools such as vertica May be accomplished easily with other tools such as vertica

SSMS and SSDT SSDT (SQL Server Data Tool), used to be the BI studio, is really the Visual Studio 2013 Give a demo