Data Warehouse & Data Mining Chapter 12 Data Warehouse & Data Mining Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel
In this chapter, you will learn: How operational data and decision support data differ What a data warehouse is and how data for it are prepared What star schemas are and how they are constructed What steps are required to successfully implement a data warehouse What data mining is and what role it plays in decision support Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
The Need for Data Analysis Managers must be able to track daily transactions to evaluate how the business is performing By tapping into the operational database, management can develop strategies to meet organizational goals Data analysis can provide information about short-term tactical evaluations and strategies Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Solving Business Problems and Adding Value with Data Warehouse-Based Solutions Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Solving Business Problems and Adding Value with Data Warehouse-Based Solutions (continued) Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Decision Support Systems Methodology (or series of methodologies) designed to extract information from data and to use such information as a basis for decision making Decision support system (DSS): Arrangement of computerized tools used to assist managerial decision making within a business Usually requires extensive data “massaging” to produce information Used at all levels within an organization Often tailored to focus on specific business areas Provides ad hoc query tools to retrieve data and to display data in different formats Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Decision Support Systems (continued) Composed of four main components: Data store component Basically a DSS database Data extraction and filtering component Used to extract and validate data taken from operational database and external data sources End-user query tool Used to create queries that access database End-user presentation tool Used to organize and present data Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Main Components of a Decision Support System (DSS) Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Transforming Operational Data Into Decision Support Data Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Contrasting Operational and DSS Data Characteristics Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Ten-Year Sales History for a Single Department, in Millions of Dollars Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Yearly Sales Summaries, Two Stores and Two Departments per Store, in Millions of Dollars Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
The Data Warehouse Integrated, subject-oriented, time-variant, nonvolatile database that provides support for decision making Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
A Comparison of Data Warehouse and Operational Database Characteristics Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Creating a Data Warehouse Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Online Analytical Processing Advanced data analysis environment that supports decision making, business modeling, and operations research OLAP systems share four main characteristics: Use multidimensional data analysis techniques Provide advanced database support Provide easy-to-use end-user interfaces Support client/server architecture Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Operational vs. Multidimensional View of Sales Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Integration of OLAP with a Spreadsheet Program Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
OLAP Client/Server Architecture Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
OLAP Server Arrangement Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
OLAP Server with Multidimensional Data Store Arrangement Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
OLAP Server With Local Mini Data Marts Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Bitmap Representation of REGION Values Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Typical ROLAP Client/Server Architecture Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
MOLAP Client/Server Architecture Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Relational vs. Multidimensional OLAP Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Star Schemas Data modeling technique used to map multidimensional decision support data into a relational database Creates the near equivalent of a multidimensional database schema from the existing relational database Yield an easily implemented model for multidimensional data analysis, while still preserving the relational structures on which the operational database is built Has four components: facts, dimensions, attributes, and attribute hierarchies Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Simple Star Schema Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Possible Attributes for Sales Dimensions Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Three-Dimensional View of Sales Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Slice and Dice View of Sales Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Location Attribute Hierarchy Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Attribute Hierarchies In Multidimensional Analysis Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Star Schema for Sales Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Orders Star Schema Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Normalized Dimension tables Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Multiple Fact Tables Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Implementing a Data Warehouse Numerous constraints: Available funding Management’s view of the role played by an IS department and of the extent and depth of the information requirements Corporate culture No single formula can describe perfect data warehouse development Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Factors Common to Data Warehousing Data warehouse is not a static database Dynamic framework for decision support that is always a work in progress Data warehouse data cross departmental lines and geographical boundaries Must satisfy: Data integration and loading criteria Data analysis capabilities with acceptable query performance End-user data analysis needs Apply database design procedures Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Data Warehouse Implementation Road Map Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Data Mining Tools that: Require minimal end-user intervention analyze data uncover problems or opportunities hidden in data relationships, form computer models based on their findings, and then use the models to predict business behavior Require minimal end-user intervention Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Extraction of Knowledge From Data Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Data-Mining Phases Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
A Sample of Current Data Warehousing and Data-Mining Vendors Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Summary Data analysis is used to derive and interpret information from data Decision support is a methodology designed to extract information from data and to use such information as a basis for decision making Decision support system is an arrangement of computerized tools used to assist managerial decision making within a business Data warehouse is an integrated, subject-oriented, time-variant, nonvolatile database that provides support for decision making Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Summary (continued) Online analytical processing is an advanced data analysis environment that supports decision making, business modeling, and operations research Star schema is a data-modeling technique used to map multidimensional decision support data into a relational database The implementation of any company-wide information system is subject to conflicting organizational and behavioral factors Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel
Summary (continued) Data mining automates analysis of operational data with the intention of finding previously unknown data characteristics, relationships, dependencies, and/or trends Data warehouse is storage location for decision support data Database Systems: Design, Implementation, & Management, 6th Edition, Rob & Coronel