Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3
Chapter Overview
• Data Hierarchy
• Traditional File Environment
• Databases – Relational, Hierarchical, Network
• Design and Normalization
• Data Warehousing
Data Hierarchy
• Database
• Table, File, Relation
• Records, Rows, Tuples
• Fields, Columns, Attributes
• Bytes
• Bits
Data Hierarchy Figure 7-1
Traditional File Environment Issues:
• Data Redundancy
• Data Inconsistency
• Data Isolation
• Data Integrity
• Security
• Application / Data Dependence
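To make data redundancy and data inconsistency concrete, here is a minimal sketch in Python. The two "files", the customer ID, and the addresses are hypothetical and only illustrate what happens when each application keeps its own copy of the same data.

```python
# Two applications each keep their own file with the same customer data
# (data redundancy). All names and values here are hypothetical.
billing_file = {"C001": {"name": "Ana Diaz", "address": "12 Elm St"}}
shipping_file = {"C001": {"name": "Ana Diaz", "address": "12 Elm St"}}

# The customer moves, but only the billing application is updated.
billing_file["C001"]["address"] = "98 Oak Ave"

# The two copies now disagree (data inconsistency).
inconsistent = [
    customer_id
    for customer_id in billing_file
    if customer_id in shipping_file
    and billing_file[customer_id]["address"] != shipping_file[customer_id]["address"]
]
print(inconsistent)
```

With a single shared copy of the address, the update would only need to happen once and the two applications could not drift apart.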
Traditional File Processing Figure 7-3
Database Approach
• Minimal data redundancy
• Data consistency
• Integration of data
• Sharing of data
• Uniform security, privacy and integrity
• Data independence
Database Environment Figure 7-4
DBMS Components
• Data definition language: Specifies the content and structure of the database and defines each data element
• Data manipulation language: Manipulates data in a database
• Data dictionary: Stores definitions of data elements and data characteristics
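The three components can be seen side by side in a small sketch using Python's built-in sqlite3 module. SQLite is used here only because it ships with Python; the `student` table and its columns are hypothetical, and SQLite's catalog (`PRAGMA table_info`) stands in for a data dictionary.

```python
import sqlite3

# In-memory database; the table and column names are hypothetical.
conn = sqlite3.connect(":memory:")

# Data definition language (DDL): specify structure and define each data element.
conn.execute(
    "CREATE TABLE student ("
    " student_id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " major TEXT)"
)

# Data manipulation language (DML): add, change, and query data.
conn.execute("INSERT INTO student (student_id, name, major) VALUES (1, 'Ana', 'MIS')")
conn.execute("UPDATE student SET major = 'Accounting' WHERE student_id = 1")
rows = conn.execute("SELECT name, major FROM student").fetchall()

# Data dictionary: the stored definitions can be read back from the catalog.
# Each PRAGMA table_info row is (cid, name, type, notnull, dflt_value, pk).
columns = [(r[1], r[2]) for r in conn.execute("PRAGMA table_info(student)")]

print(rows)
print(columns)
```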
Evolution of Databases
Relational Database
• Represents data as two-dimensional tables called relations
• Relates data across tables based on a common data element
• Examples: DB2, Oracle, MS SQL Server
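As an illustration of relating tables through a common data element, the sketch below joins two hypothetical tables, `supplier` and `part`, on a shared `supplier_id` column. It uses SQLite through Python's sqlite3 module as a stand-in for the products named above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Two relations (tables); supplier_id is the common data element.
# Table names, column names, and rows are hypothetical.
conn.executescript("""
CREATE TABLE supplier (supplier_id INTEGER PRIMARY KEY, supplier_name TEXT);
CREATE TABLE part (part_id INTEGER PRIMARY KEY, part_name TEXT,
                   supplier_id INTEGER REFERENCES supplier(supplier_id));
INSERT INTO supplier VALUES (10, 'Acme'), (20, 'Globex');
INSERT INTO part VALUES (1, 'Door latch', 10), (2, 'Mirror', 20), (3, 'Seal', 10);
""")

# Relate the two tables by matching values of the common column.
joined = conn.execute(
    "SELECT p.part_name, s.supplier_name "
    "FROM part AS p JOIN supplier AS s ON p.supplier_id = s.supplier_id "
    "ORDER BY p.part_id"
).fetchall()
print(joined)
```

The supplier name is stored once in `supplier` and looked up for each part, rather than being repeated in every part row.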
Relational Database Figure 7-6
Hierarchical Database
• Organizes data in a tree-like structure
• Supports one-to-many parent-child relationships
• Prevalent in large legacy systems
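The tree-like, one-to-many structure can be sketched with a small Python class in which each record has any number of children but is reached through exactly one parent. The `Segment` class and the employee example are hypothetical and do not model any particular hierarchical DBMS.

```python
from dataclasses import dataclass, field


@dataclass
class Segment:
    """One node in the tree: a parent can have many children."""
    name: str
    children: list = field(default_factory=list)

    def add(self, child: "Segment") -> "Segment":
        self.children.append(child)
        return child


def paths(node, prefix=()):
    """Yield the path from the root to every node, parents before children."""
    here = prefix + (node.name,)
    yield here
    for child in node.children:
        yield from paths(child, here)


# Hypothetical employee hierarchy.
root = Segment("Employee")
compensation = root.add(Segment("Compensation"))
root.add(Segment("Job assignments"))
compensation.add(Segment("Salary history"))

all_paths = [" > ".join(p) for p in paths(root)]
print(all_paths)
```

Every lookup starts at the root and walks down a fixed path, which is why this model suits predictable access patterns better than ad hoc queries.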
Hierarchical Database Figure 7-8
Network Database
• Depicts data logically as many-to-many relationships
• Less flexible than an RDBMS
• Lacks support for ad hoc and English-language-like queries
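A many-to-many relationship means a record can have several parents as well as several children. The sketch below shows the idea with hypothetical students and courses linked in both directions; it illustrates the logical relationship only, not how a network DBMS stores its links.

```python
from collections import defaultdict

# Hypothetical enrollments: one student takes many courses,
# and one course has many students.
enrollments = [("Ana", "MGS351"), ("Ana", "MGS314"), ("Ben", "MGS351")]

courses_of = defaultdict(set)   # student -> courses
students_in = defaultdict(set)  # course -> students
for student, course in enrollments:
    courses_of[student].add(course)
    students_in[course].add(student)

print(sorted(courses_of["Ana"]))
print(sorted(students_in["MGS351"]))
```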
Network Database Figure 7-9
Database Design
• Conceptual / Logical Design – abstract model of the database from a business perspective
• Physical Design – shows how the database is arranged on storage devices
ER Diagram Figure 7-10
Business Intelligence
• Knowledge about your:
  – Customers
  – Competitors
  – Partners
  – Competitive environment
  – Internal operations
• Online transaction processing (OLTP) - the gathering of input information, processing that information, and updating existing information to reflect the gathered and processed information.
• Operational databases - databases that support OLTP.
• Online analytical processing (OLAP) - the manipulation of information to support decision making.
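The contrast between the two kinds of processing can be sketched against one small table. The `account` table, branch names, and balances below are hypothetical. In practice OLAP queries usually run against a separate warehouse rather than the operational database, so treat the second query only as an illustration of the analytical style.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (account_id INTEGER PRIMARY KEY, branch TEXT, balance REAL)"
)
conn.executemany(
    "INSERT INTO account VALUES (?, ?, ?)",
    [(1, "North", 500.0), (2, "North", 250.0), (3, "South", 900.0)],
)
conn.commit()

# OLTP: gather input (a 100.00 transfer), process it, and update existing
# rows as one transaction. "with conn" commits on success and rolls back on error.
with conn:
    conn.execute("UPDATE account SET balance = balance - 100 WHERE account_id = 1")
    conn.execute("UPDATE account SET balance = balance + 100 WHERE account_id = 3")

# OLAP-style: summarize many rows to support a decision.
totals = conn.execute(
    "SELECT branch, SUM(balance) FROM account GROUP BY branch ORDER BY branch"
).fetchall()
print(totals)
```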
Data Warehousing
• Multidimensional Data Models
• OLAP
• Data Marts – subset of data warehouse
• Data Mining – finding hidden patterns and trends in data
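A multidimensional data model can be pictured as a cube whose cells are addressed by one value per dimension. The sketch below uses three hypothetical dimensions (product, region, quarter) and made-up unit counts, and shows two common OLAP operations: rolling up along one dimension and slicing on one value. The function names are illustrative, not taken from any OLAP product.

```python
from collections import defaultdict

DIMENSIONS = ("product", "region", "quarter")

# Each cell of the cube is keyed by one value per dimension (hypothetical data).
facts = {
    ("Nuts", "East", "Q1"): 100,
    ("Nuts", "West", "Q1"): 80,
    ("Bolts", "East", "Q1"): 60,
    ("Nuts", "East", "Q2"): 120,
}


def roll_up(cube, dimension):
    """Total the measure for each value of one dimension."""
    axis = DIMENSIONS.index(dimension)
    totals = defaultdict(int)
    for key, units in cube.items():
        totals[key[axis]] += units
    return dict(totals)


def slice_cube(cube, dimension, value):
    """Keep only the cells where the given dimension has the given value."""
    axis = DIMENSIONS.index(dimension)
    return {key: units for key, units in cube.items() if key[axis] == value}


by_product = roll_up(facts, "product")
by_region = roll_up(facts, "region")
q2_only = slice_cube(facts, "quarter", "Q2")
print(by_product, by_region, q2_only)
```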
Data Warehousing
• Data warehouse - a logical collection of information gathered from many different operational databases (Extract, Transform, Load) used to create business intelligence that supports business analysis activities and decision-making tasks.
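The Extract, Transform, Load steps can be sketched with two hypothetical operational sources that record sales in different formats. All field names and values below are assumptions made for illustration; a real warehouse load would read from actual databases rather than in-memory lists.

```python
from datetime import datetime

# Extract: rows as they arrive from two operational systems (hypothetical).
store_sales = [{"sku": "A1", "amount_usd": "19.99", "sold_on": "2024-01-05"}]
web_orders = [{"item": "a1", "total_cents": 2500, "date": "01/06/2024"}]


def transform_store(row):
    """Convert a store row to the common warehouse format."""
    return {
        "product": row["sku"].upper(),
        "amount": float(row["amount_usd"]),
        "date": row["sold_on"],
        "source": "store",
    }


def transform_web(row):
    """Convert a web row: cents to dollars, US date to ISO date."""
    day = datetime.strptime(row["date"], "%m/%d/%Y").date().isoformat()
    return {
        "product": row["item"].upper(),
        "amount": row["total_cents"] / 100,
        "date": day,
        "source": "web",
    }


# Load: append the cleaned rows to one collection.
warehouse = []
warehouse.extend(transform_store(r) for r in store_sales)
warehouse.extend(transform_web(r) for r in web_orders)
print(warehouse)
```

After the transform step both sources share the same product codes, units, and date format, so they can be analyzed together.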
Data Warehouse
Multidimensional Data Model Figure 7-15
Data Marts
• Data mart - a subset of a data warehouse in which only a focused portion of the data warehouse information is kept.
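In its simplest form, a data mart is the warehouse filtered down to what one group needs. The sketch below assumes a hypothetical warehouse of rows tagged by department and builds a marketing mart from it; `build_data_mart` is an illustrative helper, not a standard function.

```python
# Hypothetical warehouse rows covering several departments.
warehouse = [
    {"department": "Marketing", "metric": "campaign_cost", "value": 1200},
    {"department": "Finance", "metric": "payroll", "value": 5000},
    {"department": "Marketing", "metric": "leads", "value": 340},
]


def build_data_mart(rows, department):
    """Keep only the focused portion of the warehouse for one department."""
    return [row for row in rows if row["department"] == department]


marketing_mart = build_data_mart(warehouse, "Marketing")
print(marketing_mart)
```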
Data Mining Tools
Distributed Databases Figure 7-13
Databases and the Web Figure 7-18