THE INFORMATION ARCHITECTURE OF THE ORGANIZATION MIS2502 Data Analytics.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Chapter 3 Database Management
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
DATA WAREHOUSE (Muscat, Oman).
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
CS 345: Topics in Data Warehousing Tuesday, September 28, 2004.
Data Warehousing Alex Ostrovsky CS157B Spring 2007.
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Review Session What is data, what is information, and give a real world example to differentiate these two concepts.
DATA WAREHOUSING IN SQL SERVER 2005/2008 BUSINESS INTELLIGENCE.
File and Database Design SYS364. Today’s Agenda WHTSA DBMS, RDBMS, SQL A place for everything and everything in its place. Entity Relationship Diagrams.
Database Systems – Data Warehousing
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
MIS2502: Data Analytics Coverting ERD into a DB Schema David Schuff
AN OVERVIEW OF DATA WAREHOUSING
OnLine Analytical Processing (OLAP)
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
Business Intelligence Zamaneh Jahed. What is Business Intelligence? Business Intelligence (BI) is a broad category of applications and technologies for.
Data Warehousing.
Technology In Action Chapter 11 1 Databases and… Databases and their uses Database components Types of databases Database management systems Relational.
MIS2502: Data Analytics The Information Architecture of an Organization.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
MIS2502: Data Analytics Dimensional Data Modeling
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
MIS2502: Data Analytics The Things You Can Do With Data
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Foundations of Business Intelligence: Databases and Information Management.
Data Warehousing Multidimensional Analysis
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
MIS2502: Data Analytics The Things You Can Do With Data.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
What is OLAP?.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
MIS2502: Data Analytics The Things You Can Do With Data Amy Lavin
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
What Do You Do With Data? Gather Store Retrieve Interpret.
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Jaclyn Hansberry MIS2502: Data Analytics The Things You Can Do With Data The Information Architecture of an Organization Jaclyn.
Intro to MIS – MGS351 Databases and Data Warehouses
MIS2502: Data Analytics The Things You Can Do With Data
Data warehouse and OLAP
Chapter 13 The Data Warehouse
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
MIS5101: Data Analytics The Things You Can Do With Data
MIS2502: Data Analytics Dimensional Data Modeling
Unidad II Data Warehousing Interview Questions
An Introduction to Data Warehousing
MIS2502: Data Analytics The Information Architecture of an Organization David Schuff
MIS2502: Data Analytics The Information Architecture of an Organization Acknowledgement: David Schuff.
MIS2502: Data Analytics The Information Architecture of an Organization Aaron Zhi Cheng Acknowledgement:
MIS2502: Data Analytics The Things You Can Do With Data
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Data Warehousing Concepts
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Data Warehousing.
Presentation transcript:

THE INFORMATION ARCHITECTURE OF THE ORGANIZATION MIS2502 Data Analytics

A Brief Review Transactional Database Supports management of an organization’s data For everyday transactions Analytical Data Store Supports managerial decision-making For periodic analysis This is what is commonly thought of as “database management” This is the foundation for business intelligence

The Information Architecture of an Organization Transactional Database Analytical Data Store Stores real-time transactional data Stores historical transactional and summary data Data entry Data extraction Data analysis Called OLTP: Online transaction processing Called OLAP: Online analytical processing

The Transactional Database In business, a transaction is the exchange of information, goods, or services. For databases, a transaction is an action performed in a database management system. Operational databases deal with both: they store information about business transactions using database transactions Examples of transactions Purchase a product Enroll in a course Hire an employee Data is in real-time Reflects current state How things are “now” Stores real-time, transactional data

The Relational Paradigm How transactional data is collected and stored Primary Goal: Minimize redundancy Reduce errors Less space required Most database management systems are based on the relational paradigm Oracle, DB2, Access, SQL Server Which of these do you think is more important today ?

What Else is There? “NoSQL” (Not Only SQL): a catch-all phrase for other stuff Better geared toward “Big Data” storage More scalable Faster storage and retrieval Cheaper hardware requirements Less oversight/management (in theory) Out of necessity, rather than preference Data is unstructured, must be parsed Does not lend itself well to analysis Further reading:

Who is Using NoSQL?

The Relational Database Airline Reservation Example A series of tables with logical associations between them The associations (relationships) allow the data to be combined Flight FlightID AircraftID FlightNumber DepartureCity ArrivalCity DepartureTime ArrivalTime Reservation ReservationID FlightID PassengerID SeatID Name DatePurchased Price Passenger PassengerID Name Street City State ZipCode Aircraft AircraftID Type Capacity Aircraft Seat SeatID AircraftID RowNumber SeatNumber Class n n n n

Why more than one table? Every reservation has a passenger and a flight Passengers and flights have an ID number Split the details off into separate tables This is good because: Information is entered and stored once Minimizes redundancy This is good because: Information is entered and stored once Minimizes redundancy Flight FlightID AircraftID FlightNumber DepartureCity ArrivalCity DepartureTime ArrivalTime Reservation ReservationID FlightID PassengerID SeatID Name DatePurchased Price Passenger PassengerID Name Street City State ZipCode Aircraft AircraftID Type Capacity Aircraft Seat SeatID AircraftID RowNumber SeatNumber Class n n n n

Analyzing transactional data Can be difficult to do from a relational database Having multiple tables is good for storage and data integrity, but bad for analysis All those tables must be “joined” together before analysis can be done The solution is the Analytical Data Store Operational databases are optimized for storage efficiency, not retrieval Analytical databases are optimized for retrieval and analysis, not storage efficiency and data integrity

The Analytical Data Store Stores historical and summarized data “Historical” means we keep everything Data is extracted from the operational database and reformatted for the analytical database Most analytical databases use a dimensional paradigm Operational Database Analytical Database Query Data conversion Extract Transform Load Query We’ll discuss this in much more detail later in the course!!

The Dimensional Paradigm Sales SalesID ProductID StoreID TimeID QuantitySold TotalPrice Store StoreID StoreAddress StoreCity StoreState StoreType Product ProductID ProductName ProductPrice ProductWeight Time TimeID Day Month Year Fact Dimension Data is stored like this around a business event… …and can be summarized like this for analysis…

Dimensional Data and the Data Cube Sales ID Qty. Sold Total Price Prod. ID Prod. Name Prod. Price Prod. Weight Store ID Store Address Store City Store State Store Type Time ID DayMonthYear Product Dimension Store Dimension Time Dimension Sales Fact …or it can be expanded in detail like this so that data mining (complex statistical analysis) can be done.

Comparing Operational and Analytical Data Stores Operational Data StoreAnalytical Data Store Based on Relational paradigm Based on Dimensional paradigm Storage of real-time transactional data Storage of historical transactional data Optimized for storage efficiency and data integrity Optimized for data retrieval and summarization Supports day-to-day operations Supports periodic and on-demand analysis

The agenda for the course Weeks 1 through 5 Weeks 6 through 8 Weeks 9 through 13 Weeks 14 and 15 Data Visualization Transactional Database Analytical Data Store Stores real-time transactional data Stores historical transactional and summary data Data entry Data extraction Data analysis