Chapter 4 Tutorial.

Slides:



Advertisements
Similar presentations
ER Modeling Case Studies
Advertisements

Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Chapter 4 Tutorial.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
ER Modeling Case Studies
Step By Step Lei Yang Computer Science Department.
Data Warehousing and Decision Support, part 2
Alternative Database topology: The star schema
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
ENTITY RELATIONSHIP MODELLING
5.1Database System Concepts - 6 th Edition Chapter 5: Advanced SQL Advanced Aggregation Features OLAP.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Introduction to Data Warehousing CPS Notes 6.
--The Entity Relationship Model(1)--1 The Entity Relationship Model.
Exercises.
1 Lecture 10: More OLAP - Dimensional modeling
Lab3 CPIT 440 Data Mining and Warehouse.
By N.Gopinath AP/CSE. Two common multi-dimensional schemas are 1. Star schema: Consists of a fact table with a single table for each dimension 2. Snowflake.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Data Warehousing ISYS 650. What is a data warehouse? A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data.
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
1 Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously.  A decision support database that is maintained.
Chapter 4 Tutorial.
CS346: Advanced Databases
Business Intelligence Instructor: Bajuna Salehe Web:
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Data Modeling Using the Entity-Relationship Model
On-Line Analytic Processing Chetan Meshram Class Id:221.
Introduction to the Orion Star Data
Instructor :Huda Al-Omair
Module Title? Data Base Design 30/6/2007 Entity Relationship Diagrams (ERDs)
More on relational databases, including 1 to 1, 1 to many and many to many relationships Please use speaker notes for additional information!
CS 157B: Database Management Systems II March 20 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
BI Terminologies.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
Slide Chapter 3 Data Modeling Using the Entity- Relationship (ER) Model.
Winter 2006Winter 2002 Keller, Ullman, CushingJudy Cushing 19–1 Warehousing The most common form of information integration: copy sources into a single.
Shilpa Seth.  Multidimensional Data Model Concepts Multidimensional Data Model Concepts  Data Cube Data Cube  Data warehouse Schemas Data warehouse.
1 On-Line Analytic Processing Warehousing Data Cubes.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
The Data Warehouse Chapter Operational Databases = transactional database  designed to process individual transaction quickly and efficiently.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 6 The Data Warehouse Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
GSK FMCG Data Warehouse Business definition GSK FMCG industry 10 October 2014 Pavan Kumar Mantha Vinod Tati Shourya Konda 1.
Introduction to Database Systems
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
ITEC 3220M Using and Designing Database Systems Instructor: Prof. Z.Yang Course Website: c3220m.htm Office: TEL.
Houston Petroleum Valve Company Data-Mining Project Data Modeling Phase Fouad Alibrahim Mohammad H. Monakes University of Houston Clear Lake University.
DATA WAREHOUSING – DIMENSIONAL MODELLING AND SCHEMAS With MIKE –AARONE ATUHE Handout 5.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
DATA MODELING AND ENTITY-RELATIONSHIP MODEL II IST 210: Organization of Data IST210 1.
Information Management course
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
On-Line Analytic Processing
Data warehouse and OLAP
A multi-dimensional data model
Databases & Data Warehouses
Competing on Analytics II
CMPE 226 Database Systems April 11 Class Meeting
Data Warehouse and OLAP
ERD (Entity Relationship Diagrams)
ER Modeling Case Studies
Dimensional Model January 16, 2003
Fundamentals of Data Cube & OLAP Operations
Data Warehouse and OLAP
Presentation transcript:

Chapter 4 Tutorial

Q3 Suppose that a data warehouse consists of the three dimensions time, doctor, and patient, and the two measures count and charge, where charge is the fee that a doctor charges a patient for a visit. (a) Enumerate three classes of schemas that are popularly used for modeling data warehouses. Star schema Snowflake schema Fact constellation schema

Q3 cont. (b) Draw a schema diagram for the above data warehouse using one of the schema classes listed in (a). Using a star schema.

Q3 cont. Star Schema time Fact Table Measures time_key doctor_id day day_of_the_week month quarter year time doctor_id doctor_name phone # address gender doctor Fact Table time_key doctor_id patient_id Charge Count patient_id patient_name phone # address gender patient Measures

Q3 cont. (c) Starting with the base cuboid [day, doctor, patient], what specific OLAP operations should be performed in order to list the total fee collected by each doctor in 2010? The operations to be performed are: Roll-up on time from day to year. Slice for time = 2010. Roll-up on patient from individual patient to all.

Q4 Suppose that a data warehouse for Big-University consists of the following four dimensions: student, course, semester, and instructor, and two measures count and avg. grade. When at the lowest conceptual level (e.g., for a given student, course, semester, and instructor combination), the avg grade measure stores the actual course grade of the student. At higher conceptual levels, avg grade stores the average grade for the given combination.

Q4 cont. Snowflake Schema course_id Course_name department Course student_id student_name area_id Major status university Student area_id City State country Area Sales Fact Table course_id student_id instructor_id semester_id Count Avg. grade semester_id semester year Semester instructor_id depatment rank Instructor

Q4. cont. Starting with the base cuboid [student, course, semester, instructor], what specific OLAP operations (e.g., roll-up from semester to year) should one perform in order to list the average grade of CS courses for each Big University student. Roll-up on course from course id to department. Roll-up on semester from semester id to all. Slice for course=“CS” .

Q5 Suppose that a data warehouse consists of the four dimensions, date, spectator, location, and game, and the two measures, count and charge, where charge is the fare that a spectator pays when watching a game on a given date. Spectators may be students, adults, or seniors, with each category having its own charge rate. Draw a star schema diagram for the data warehouse.

Q5 cont. Star Schema date Sales Fact Table date_id spectator_id day month quarter year date spectator_id spectator_name phone # address Status Charge rate spectator Sales Fact Table date_id spectator_id location_id game_id Charge Count location_id phone # Street city province country location game_id game_name description producer game

Q5. cont. Starting with the base cuboid [date, spectator, location, game], what specific OLAP operations should one perform in order to list the total charge paid by student spectators at GM Place in 2010? The specific OLAP operations to be performed are: Roll-up on date from date id to year. Roll-up on game from game id to all. Roll-up on location from location id to location name. Roll-up on spectator from spectator id to status. Dice with status=“students”, location name=“GM Place”, and year = 2010.