Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.



Brief Review of Business Needs  “Value You’ll See Consulting” provides decision support services to clientele from various industries. City of Houston – Staffing and Resource Planning Realtors – Neighborhood Crime Statistics School Districts – Land Purchases Business Owners – Location Decisions

Brief Review of Business Needs  Additionally, the use of a data warehouse allows our firm to compile and re-assemble raw publicly available crime data into specific decision supporting material tailored to our clients’ information needs.

Tonight’s Discussion Overview

• Tonight’s discussions will address the structure of the Data Warehousing system for our consulting firm.
  Database & Table Structure Discussion
    ○ Facts: HPD Crime Data, June 01 – December 31, 2009
    ○ Dimensions: [Date, Type of Crime, Police Beat, Premises]
  Dimensional Modeling Discussion
    ○ Star Schema
    ○ Snowflake: Dimensional Hierarchies
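The star schema outlined above can be sketched with Python's built-in sqlite3 module: one fact table whose foreign keys point at the four dimensions. This is only an illustration under stated assumptions. The dimension table names DimDates, DimOffenseTypes, and DimPremisesCodes come from the slides, but FactCrime, DimPoliceBeats, every column name, and all sample rows are hypothetical and are not HPD data.

```python
import sqlite3

# Star schema sketch: one fact table keyed to four dimension tables.
# FactCrime, DimPoliceBeats, and all column names are hypothetical;
# the rows are made-up illustrations, not real HPD records.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE DimDates         (date_key    INTEGER PRIMARY KEY, full_date    TEXT);
CREATE TABLE DimOffenseTypes  (offense_key INTEGER PRIMARY KEY, offense_type TEXT);
CREATE TABLE DimPoliceBeats   (beat_key    INTEGER PRIMARY KEY, beat         TEXT);
CREATE TABLE DimPremisesCodes (premise_key INTEGER PRIMARY KEY, premise      TEXT);

CREATE TABLE FactCrime (
    date_key      INTEGER REFERENCES DimDates(date_key),
    offense_key   INTEGER REFERENCES DimOffenseTypes(offense_key),
    beat_key      INTEGER REFERENCES DimPoliceBeats(beat_key),
    premise_key   INTEGER REFERENCES DimPremisesCodes(premise_key),
    offense_count INTEGER NOT NULL
);

INSERT INTO DimDates         VALUES (1, '2009-06-01'), (2, '2009-12-31');
INSERT INTO DimOffenseTypes  VALUES (1, 'Burglary'), (2, 'Theft');
INSERT INTO DimPoliceBeats   VALUES (1, 'BEAT-A');
INSERT INTO DimPremisesCodes VALUES (1, 'Residence'), (2, 'Parking lot');
INSERT INTO FactCrime        VALUES (1, 1, 1, 1, 2), (1, 2, 1, 2, 1), (2, 2, 1, 2, 3);
""")

# A typical star-schema query: join the fact table to one dimension
# and aggregate the measure by a dimension attribute.
offense_totals = conn.execute("""
    SELECT o.offense_type, SUM(f.offense_count)
    FROM FactCrime AS f
    JOIN DimOffenseTypes AS o ON o.offense_key = f.offense_key
    GROUP BY o.offense_type
    ORDER BY o.offense_type
""").fetchall()
print(offense_totals)
```

Each dimension joins directly to the fact table, which is what distinguishes the star layout from the snowflake layout discussed later in the deck.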

Database & Table Structure Takeaways

• The fact table consists of publicly available data from the City of Houston website.
• Dimension tables are a joint effort by the City of Houston and our consulting firm.
  City of Houston data revealed natural dimensions based on hierarchies found in the HPD organizational chart.
  Future dimensions in the works:
    ○ DimTimeOfDay: Morning, Mid-Day, Evening, Overnight
    ○ DimSceneOfCrime: Based on the Premises node; a pattern is emerging in that table that supports rolling up or drilling down.
• Part of the “Data Cleansing Process” involved simple tasks such as changing field names and removing five orphaned records.
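As a rough illustration of the orphaned-record cleanup mentioned above, the sketch below treats a fact row as orphaned when its dimension value is missing or has no match in the dimension table. That definition, the table and column names, and the sample rows are all assumptions; the slides do not say which column the five orphaned records failed on.

```python
import sqlite3

# Orphan cleanup sketch. Table names, column names, and rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE DimPoliceBeats (beat TEXT PRIMARY KEY);
CREATE TABLE FactCrime (crime_id INTEGER PRIMARY KEY, beat TEXT);

INSERT INTO DimPoliceBeats VALUES ('B1'), ('B2');
INSERT INTO FactCrime VALUES (1, 'B1'), (2, 'B2'), (3, 'UNK'), (4, NULL);
""")

# Assumed definition: a fact row is orphaned if its beat is NULL
# or does not appear in the dimension table.
orphan_filter = "beat IS NULL OR beat NOT IN (SELECT beat FROM DimPoliceBeats)"

# List the orphans first so they can be reviewed before deletion.
orphans = conn.execute(
    f"SELECT crime_id FROM FactCrime WHERE {orphan_filter} ORDER BY crime_id"
).fetchall()

conn.execute(f"DELETE FROM FactCrime WHERE {orphan_filter}")
remaining = conn.execute("SELECT COUNT(*) FROM FactCrime").fetchone()[0]
print(orphans, remaining)
```

Listing the orphans before deleting them keeps a record of what was removed, which is useful when the count (five, in the presenters' case) needs to be reported.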

Fact Table: HPD Crime Data Jun – Dec 2009 *The data originates from the Houston Police Department’s OLTP systems.


Dimension Table: Police Beats

Dimension Table: DimOffenseTypes

Dimension Table: DimDates

Dimension Table: DimPremisesCodes

Snowflake Schema

Police Beat Hierarchy

• Structure is similar to the HPD organizational chart.
  ○ Divisions: Treat these as police station locations; this pattern emerged from the fact table and was later found on the HPD website (see map).
  ○ Districts: A Division can have authority over multiple Districts (e.g., the Airport Division covers the Hobby and Bush Airport districts).
  ○ Police Beats: A District has jurisdictional authority over many police patrol beats.
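The Division, District, and Beat hierarchy above maps naturally onto a snowflaked dimension, where the fact table joins to beats and the higher levels are reached through further joins. The sketch below is one possible layout, not the presenters' actual design. Only the Airport Division with its Hobby and Bush Airport districts comes from the slides; the Central division, the Downtown district, the beat codes, all table and column names, and the fact rows are hypothetical.

```python
import sqlite3

# Snowflake sketch: FactCrime -> DimPoliceBeats -> DimDistricts -> DimDivisions.
# Only 'Airport', 'Hobby', and 'Bush Airport' come from the slides;
# everything else here is hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE DimDivisions (division_key INTEGER PRIMARY KEY, division_name TEXT);
CREATE TABLE DimDistricts (
    district_key  INTEGER PRIMARY KEY,
    district_name TEXT,
    division_key  INTEGER REFERENCES DimDivisions(division_key)
);
CREATE TABLE DimPoliceBeats (
    beat_key     INTEGER PRIMARY KEY,
    beat         TEXT,
    district_key INTEGER REFERENCES DimDistricts(district_key)
);
CREATE TABLE FactCrime (
    crime_id INTEGER PRIMARY KEY,
    beat_key INTEGER REFERENCES DimPoliceBeats(beat_key)
);

INSERT INTO DimDivisions   VALUES (1, 'Airport'), (2, 'Central');
INSERT INTO DimDistricts   VALUES (1, 'Hobby', 1), (2, 'Bush Airport', 1), (3, 'Downtown', 2);
INSERT INTO DimPoliceBeats VALUES (1, 'H1', 1), (2, 'H2', 1), (3, 'B1', 2), (4, 'C1', 3);
INSERT INTO FactCrime (beat_key) VALUES (1), (1), (2), (3), (3), (3), (4);
""")

# Roll up from beats to districts: one extra join beyond the fact table.
by_district = conn.execute("""
    SELECT d.district_name, COUNT(*)
    FROM FactCrime AS f
    JOIN DimPoliceBeats AS b ON b.beat_key = f.beat_key
    JOIN DimDistricts   AS d ON d.district_key = b.district_key
    GROUP BY d.district_name
    ORDER BY d.district_name
""").fetchall()

# Roll up one level further, from districts to divisions.
by_division = conn.execute("""
    SELECT v.division_name, COUNT(*)
    FROM FactCrime AS f
    JOIN DimPoliceBeats AS b ON b.beat_key = f.beat_key
    JOIN DimDistricts   AS d ON d.district_key = b.district_key
    JOIN DimDivisions   AS v ON v.division_key = d.division_key
    GROUP BY v.division_name
    ORDER BY v.division_name
""").fetchall()
print(by_district, by_division)
```

Drilling down is the same query run in the other direction: group by district or beat instead of division. The trade-off of the snowflake layout is less redundancy in the dimension at the cost of extra joins per query.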