INFS 3220 Systems Analysis & Design

Slides:



Advertisements
Similar presentations
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Advertisements

Chapter 10: Designing Databases
Dimensional Modeling Business Intelligence Solutions.
PowerPoint Presentation for Dennis & Haley Wixom, Systems Analysis and Design Copyright 2000 © John Wiley & Sons, Inc. All rights reserved. Slide 1 Key.
An Introduction to Dimensional Data Warehouse Design Presented by Joseph J. Sarna Jr. JJS Systems, LLC.
Database Design Conceptual –identify important entities and relationships –determine attribute domains and candidate keys –draw the E-R diagram Logical.
1 5 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 5 Database Design: Normalization.
Accounting Databases Chapter 2 The Crossroads of Accounting & IT
Chapter 11 Data Management Layer Design
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
PowerPoint Presentation for Dennis, Wixom & Tegarden Systems Analysis and Design Copyright 2001 © John Wiley & Sons, Inc. All rights reserved. Slide 1.
Normalization A337. A337 - Reed Smith2 Structure What is a database? ◦ Tables of information  Rows are referred to as records  Columns are referred.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education, Inc. publishing as Prentice Hall 4-1.
Terms - data,information, file record, table, row, column, transaction, concurrency Concepts - data integrity, data redundancy, Type of databases – single-user,
1 5 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 5 Database Design 1: Normalization.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
Slide 1 Systems Analysis and Design With UML 2.0 An Object-Oriented Approach, Second Edition Chapter 11: Data Management Layer Design Alan Dennis, Barbara.
Chapter 1 Overview of Database Concepts Oracle 10g: SQL
STORING ORGANIZATIONAL INFORMATION— DATABASES CIS 429—Chapter 7.
PowerPoint Presentation for Dennis & Haley Wixom, Systems Analysis and Design, 2 nd Edition Copyright 2003 © John Wiley & Sons, Inc. All rights reserved.
Lecture 2 An Overview of Relational Database IST 318 – DB Admin.
Avoiding Database Anomalies
Normalization A technique that organizes data attributes (or fields) such that they are grouped to form stable, flexible and adaptive entities.
Concepts of Database Management, Fifth Edition
Module III: The Normal Forms. Edgar F. Codd first proposed the process of normalization and what came to be known as the 1st normal form. The database.
BIS Database Systems School of Management, Business Information Systems, Assumption University A.Thanop Somprasong Chapter # 5 Normalization of Database.
CORE 2: Information systems and Databases NORMALISING DATABASES.
Object Persistence (Data Base) Design Chapter 13.
Object Persistence Design Chapter 13. Key Definitions Object persistence involves the selection of a storage format and optimization for performance.
Slide 1 Object Persistence Design Chapter 13 Alan Dennis, Barbara Wixom, and David Tegarden John Wiley & Sons, Inc. Slides by Fred Niederman Edited by.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 3rd Edition Copyright © 2009 John Wiley & Sons, Inc. All rights.
Competitive (Business) Intelligence Systems The Road to Denormalization (starring Charlie Sheen & other Random Celebrities)
NORMALIZATION. What is Normalization  The process of effectively organizing data in a database  Two goals  To eliminate redundant data  Ensure data.
1 CS 430 Database Theory Winter 2005 Lecture 7: Designing a Database Logical Level.
INFS 6220 Systems Analysis & Design Transactional DBs vs. Data Warehouses.
NORMALIZATION Handout - 4 DBMS. What is Normalization? The process of grouping data elements into tables in a way that simplifies retrieval, reduces data.
What Is Normalization  In relational database design, the process of organizing data to minimize redundancy  Usually involves dividing a database into.
Understanding Data Storage
INLS 623 – Database Normalization
Revised: 2 April 2004 Fred Swartz
Event-driven accounting information systems
Relational Model.
Chapter 9 Part-1: Concepts & Foreign Keys
Information Systems Today: Managing in the Digital World
CIS 155 Table Relationship
ITD1312 Database Principles Chapter 5: Physical Database Design
Normalization DBS201.
Chapter 4 Relational Databases
RELATIONAL DATABASE MODEL
Data Warehouses, Dimensional Modeling, and the Laundromat
Normalization Referential Integrity
Database Fundamentals
Chapter 9 Part-1: Concepts & Foreign Keys
Normalization A337.
Data Warehouses, Dimensional Modeling, and the Laundromat
Systems Analysis and Design
Database Design Agenda
Normalization Normalization theory is based on the observation that relations with certain properties are more effective in inserting, updating and deleting.
Databases and Information Management
The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited)
The Road to Denormalization
Relational Database Design
Chapter 17 Designing Databases
Introduction to Database Design
Database Normalization.
Chapter 4 The Relational Model and Normalization
Introduction to MySQL NELINET October 28th, 2005.
DATABASE Purpose of database
Data Warehouses, Dimensional Modeling, and the Laundromat
Presentation transcript:

INFS 3220 Systems Analysis & Design Transactional DBs vs. Data Warehouses

Relational Databases (RDBMS) Collection of linked tables Tables linked by Primary Key / Foreign Key relationships (Referential Integrity) Primary Key – column whose values make each record unique Foreign Key – value in column that links to Primary Key in another table SQL – Structured Query Language (language to access data in relational tables)

Relational DB Example Cust # Cust Name 100 Bob 101 Sue 102 Juan Order # Prod# Qty Cust# 1 QR22 1 100 2 QR22 25 100 3 SB56 3 102 CUSTOMER TABLE ORDER TABLE Primary Key Foreign Key

Database Structure & Design 2 Approaches: Conflict 1. Optimize for Data Capture i.e., Capturing Transactions 2. Optimize for Data Access i.e., Queries & Reporting

Approach #1: Optimize for Data Capture To optimize for data storage, you must: Eliminate redundancy of data (or else wasted space & processing occurs) Ensure data integrity (or else data anomalies) Ensure that changes in data (modifications, deletions, etc. only have to happen in one place) Normalization – process in which a DBMS is optimized for data storage All data “redundancy” is removed from Database Has multiple forms (0, 1st, 2nd, 3rd, et al.)

Moving from 0NF to 1NF Rule: Make a separate table for each set of related attributes, and give each table a primary key of unique values. Cust # CustName 100, 100, 101 Bob, Sue, Juan CUSTOMER TABLE ONF Cust # Cust Name 100 Bob 101 Sue 102 Juan CUSTOMER TABLE 1NF Primary Key Created with Unique values

Moving from 1NF to 2NF Rule: Eliminate any repeating values caused by a dependency on a “keyed” column (i.e., either Primary or Foreign) Cust # Cust Name Order# 100 Bob 1 100 Bob 2 101 Sue 3 TABLE X 1NF 100 Bob Dependency on Primary Key Cust # Cust Name 100 Bob 101 Sue Order # Cust# 1 100 2 100 3 101 CUSTOMER TABLE ORDER TABLE 2NF

Moving from 2NF to 3NF Rule: Eliminate any repeating values caused by a dependency on a “non-keyed” column (i.e., dependency on ANY column) Cust # City Order# ShipTime 100 PGH 1 2 days 101 PGH 2 2 days 102 LA 3 5 days TABLE X 2NF PGH 2 days Dependency b/t 2 non-key columns City # City ShipTime 10 PGH 2 days 20 LA 5 days Cust # City# 100 10 101 10 102 20 SHIP TIME TABLE CUSTOMER TABLE 3NF

Normalized DB Example

To optimize for data access, you must: Approach #2: Optimize for Data Access (in a separate, read-only Data Warehouse) To optimize for data access, you must: Allow data redundancy Reduce the number of table joins (links among tables) Denormalizing – Adding redundancy & reducing joins in a DBMS

Denormalizing – Most Common Approach Star Schema (Clustering) Fact (core or transaction) Tables in middle of star Dimensional (structural or “lookup”) Tables around “points” of star Cust # CustName 100 Bob 101 Sue 102 Juan Rep # RepName 1000 Lee 2000 James 3000 Natasha REP TABLE CUSTOMER TABLE Order # Date Cust# Prod# Rep# 1 06/15/XX 100 QR22 1000 2 07/19/XX 100 QR22 1000 3 08/30/XX 101 SR56 2000 ORDER TABLE Date Quarter 06/29/XX 2 Bob 06/30/XX 2 Sue 07/01/XX 3 Juan Prod # ProdName QR22 Rake SR56 Spade TW43 Mulch PRODUCT TABLE DATE/TIME

Denormalizing (continued) • Stars are linked via common (i. e Denormalizing (continued) • Stars are linked via common (i.e., Conformed) Dimensions to form Data Warehouse Cust # CustName 100 Bob 101 Sue 102 Juan Rep # RepName 1000 Lee 2000 James 3000 Natasha REP TABLE CUSTOMER TABLE Order # Date Cust# Prod# Rep# 1 06/15/XX 100 QR22 1000 2 07/19/XX 100 QR22 1000 3 08/30/XX 101 SR56 2000 ORDER TABLE ORDER TABLE Prod # ProdName QR22 Rake SR56 Spade TW43 Mulch PRODUCT TABLE Date Quarter 06/29/XX 2 Bob 06/30/XX 2 Sue 07/01/XX 3 Juan CUSTOMER TABLE DATE/TIME TIME Prod# ProdName Stock Date Units QR22 Rake 03/23/XX 150 TW43 Mulch 04/15/XX 1452 SR56 Spade 05/01/XX 997 INVENTORY TABLE