A multi-dimensional data model

Slides:



Advertisements
Similar presentations
Chapter 4 Tutorial.
Advertisements

Data Warehousing.
Introduction to Data Warehousing CPS Notes 6.
ICS 421 Spring 2010 Data Warehousing 2 Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/30/20101Lipyeow.
Data Warehousing Xintao Wu. Evolution of Database Technology (See Fig. 1.1) 1960s: Data collection, database creation, IMS and network DBMS 1970s: Relational.
1 Lecture 10: More OLAP - Dimensional modeling
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
1 Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously.  A decision support database that is maintained.
Tanvi Madgavkar CSE 7330 FALL Ralph Kimball states that : A data warehouse is a copy of transaction data specifically structured for query and analysis.
Business Intelligence Instructor: Bajuna Salehe Web:
Dr. Bernard Chen Ph.D. University of Central Arkansas
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Override the title Chris Harrington
Data Warehousing Xintao Wu. Can You Easily Answer These Questions? What are Personnel Services costs across all departments for all funding sources? What.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Shilpa Seth.  Multidimensional Data Model Concepts Multidimensional Data Model Concepts  Data Cube Data Cube  Data warehouse Schemas Data warehouse.
Data Warehousing Multidimensional Analysis
Data Mining Data Warehouses.
Managing Data for DSS II. Managing Data for DS Data Warehouse Common characteristics : –Database designed to meet analytical tasks comprising of data.
January 21, 2016Data Mining: Concepts and Techniques 1 Chapter 3: Data Warehousing and OLAP Technology: An Overview What is a data warehouse? A multi-dimensional.
مثال ‌ هایی از شماهای پایگاه داده تحلیلی سید حسن فیروزآبادی.
DATA WAREHOUSING – DIMENSIONAL MODELLING AND SCHEMAS With MIKE –AARONE ATUHE Handout 5.
Datawarehousing and OLAP C.Eng 714 Spring
1 Chapter 4: Data Warehousing and On-line Analytical Processing Data Warehouse: Basic Concepts Data Warehouse Modeling: Data Cube and OLAP Data Warehouse.
Data Mining and Data Warehousing: Concepts and Techniques Conceptual Modeling of Data Warehouses Defining a Snowflake Schema in Data Mining Query Language.
1 Meng Jiang  Postdoctoral Researcher in UIUC since Aug 2015 (Office 2130)  Ph. D. from Tsinghua University, Beijing ( )  Research topic: Data-driven.
Chapter 4. Data Warehousing and On-line Analytical Processing (OLAP)
Advanced Database Systems: DBS CB, 2nd Edition
Introduction to Data Warehousing
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 4 —
Data Mining: Concepts and Techniques — Chapter 3 —
Data Mining: Concepts and Techniques
Data Warehousing and OLAP
Data Mining: Concepts and Techniques — Chapter 3 —
Information Management course
Data warehouse and OLAP
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 4 —
Information Management course
Three tier Architecture of Data Warehousing
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 4 —
OLAP Concepts and Techniques
Data Mining Data Warehousing
Data Mining: Concepts and Techniques — Chapter 3 —
Data Warehousing and OLAP Technology for Data Mining
Introduction to Data Mining
Data Warehousing and OLAP Technology for Data Mining
Data Warehousing modified by Donghui Zhang
Data Warehouses and OLAP — Slides for Textbook — — Chapter 2 —
©Jiawei Han and Micheline Kamber
Data Mining: Concepts and Techniques
Chapter 2: Data Warehousing and OLAP Technology for Data Mining
Data Mining: Concepts and Techniques — Chapter 3 —
Overview of Data Warehousing and OLAP
©Jiawei Han and Micheline Kamber
What is Data Warehouse? Defined in many different ways.
Data Warehousing and Decision Support Chapter 25
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 4 —
What is Data Warehouse? Defined in many different ways.
©Jiawei Han and Micheline Kamber
Resource: J. Han and other books
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
©Jiawei Han and Micheline Kamber
Data Mining: Concepts and Techniques — Chapter 3 —
Dedicated to the memory of
Data Mining: Concepts and Techniques
UNIT-1 Introduction Lecture-1 Motivation: Why data mining?
Presented by: Tek Narayan Adhikari
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 4 —
CIS671-Knowledge Discovery and Data Mining
Presentation transcript:

A multi-dimensional data model Location Time Item The word cube brings to mind a 3-D object, and we can think of a 3-D data cube as being a set of similarly structured 2-D tables stacked on top of one another. We can think of a 4-D data cube as consisting of a series of 3-D cubes. If we continue in this way, we may display any n-D data as a series of (n - 1)-D “cubes". The data cube is a metaphor for multidimensional data storage. The actual physical storage of such data may differ from its logical representation.

Multi-dimensional Data Model A multidimensional data model views data in the form of a data cube. A data cube allows data to be modeled and viewed in multiple dimensions. It is defined by dimensions and facts. Dimensions are the perspectives or entities with respect to which an organization wants to keep records. It represents data by means of a multidimensional N grid Q which is Cartesian product of certain discrete sets Di called dimension. Elements of a dimensions are called members. The set of dimension & members forms metadata.

Example Loc= Chicago Loc= New York Loc=Toronto Loc= Vancouver Item home ent. Comp. phone Sec. Time Q1 Q2 Q3 Q4

Multi-dimensional Data Model Dimensions: Product, Region, Time Hierarchical summarization paths Product Region Time Industry Country Year Category Region Quarter Product City Month week Office Day Region W S N Product Toothpaste Juice Cola Milk Cream Soap Month 1 2 3 4 7 6 5

Conceptual Modeling of Data Warehouses The most popular data model for data warehouses is a multidimensional model. This model can exist in the form of a star schema, a snowflake schema, or a fact constellation schema. Star schema: The most common modeling paradigm is the star schema in which the data warehouse contains A large central table (fact table) containing the bulk of the data, with no redundancy, and A set of smaller attendant tables (dimension tables), one for each dimension. The schema graph resembles a starburst, with the dimension tables displayed in a radial pattern around the central fact table.

Example of Star Schema item branch time Sales Fact Table time_key day day_of_the_week month quarter year time item_key item_name brand type supplier_type item Sales Fact Table time_key item_key branch_key branch_key branch_name branch_type branch location_key street city state_or_province country location location_key units_sold dollars_sold avg_sales Measures

The Snowflake schema: The snowflake schema is a variant of the star schema model, where some dimension tables are normalized, thereby further splitting the data into additional tables. The resulting schema graph forms a shape similar to a snowflake. The major difference between the snowflake and star schema models is that the dimension tables of the snowflake model may be kept in normalized form. Such a table is easy to maintain and saves storage space. The snowflake structure can reduce the effectiveness of browsing since more joins will be needed to execute a query. Consequently, the system performance may be adversely impacted.

Example of Snowflake Schema time_key day day_of_the_week month quarter year time item_key item_name brand type supplier_key item supplier_key supplier_type supplier Sales Fact Table time_key item_key branch_key location_key street city_key location branch_key branch_name branch_type branch location_key units_sold city_key city state_or_province country dollars_sold avg_sales Measures

Fact schema: Sophisticated applications may require multiple fact tables to share dimension tables. This kind of schema can be viewed as a collection of stars, and hence is called a galaxy schema or a fact constellation.

Example of Fact Constellation time_key day day_of_the_week month quarter year time item_key item_name brand type supplier_type item Shipping Fact Table Sales Fact Table time_key item_key time_key shipper_key item_key from_location branch_key branch_key branch_name branch_type branch location_key to_location location_key street city province_or_state country location dollars_cost units_sold units_shipped dollars_sold avg_sales shipper_key shipper_name location_key shipper_type shipper Measures