David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Twelve: Big Data, Data Warehouses, and Business.

Slides:



Advertisements
Similar presentations
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 15-1 David M. Kroenke Database Processing Chapter 15 Business Intelligence.
Advertisements

Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Business Intelligence Systems
Database Design Chapter Five DATABASE CONCEPTS, 6th Edition
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall COS 236 Day 25.
The database approach to data management provides significant advantages over the traditional file-based approach Define general data management concepts.
Chapter 9 Business Intelligence Systems
Chapter 9 Competitive Advantage with Information Systems for Decision Making © 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke.
Chapter 3 Database Management
The Hierarchy of Data Bit (a binary digit): a circuit that is either on or off Byte: 8 bits Character: each byte represents a character; the basic building.
Database Management: Getting Data Together Chapter 14.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall COS 346 Day 26.
Chapter 14 The Second Component: The Database.
SQL Views Chapter 3A DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 5 th Edition.
Database Processing for Business Intelligence Systems
Big Data, Data Warehouses, and Business Intelligence Systems
Getting Started with Microsoft Visio 2010 Appendix G DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Getting Started Chapter One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 5 th Edition.
Getting Started with Microsoft SQL Server 2012 Express Edition Appendix A DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
David M. Kroenke and David J. Auer Database Processing—12 th Edition Fundamentals, Design, and Implementation Chapter One: Introduction KROENKE AND AUER.
Getting Started Chapter One DATABASE CONCEPTS, 7th Edition
Advanced Topics Chapter Seven DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Big Data, Data Warehouses, and Business Intelligence Systems Chapter Eight DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 7 th Edition.
Getting Started with Microsoft Access The Access Workbench: Section One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
Getting Started with Oracle Database 11g Release 2 Express Edition Appendix B DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Getting Started Chapter One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Fundamentals of Information Systems, Third Edition2 Principles and Learning Objectives The database approach to data management provides significant advantages.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Chapter 5 Lecture 2. Principles of Information Systems2 Objectives Understand Data definition language (DDL) and data dictionary Learn about popular DBMSs.
Computers Are Your Future Tenth Edition Chapter 12: Databases & Information Systems Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
Systems analysis and design, 6th edition Dennis, wixom, and roth
Chapter 9 Business Intelligence and Information Systems for Decision Making.
The McGraw-Hill Companies, Inc Information Technology & Management Thompson Cats-Baril Chapter 3 Content Management.
Fundamentals of Information Systems, Fifth Edition
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Twelfth Edition.
Advanced Topics Chapter Seven DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
Chapter 11 Business Intelligence Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall 11-1.
1 Data Warehouses BUAD/American University Data Warehouses.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Business Intelligence Systems Appendix J DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Module 5 Part 2 Introduction To Database. Module Objectives At the end of the module, students should be able to: –Describe the process of creating a.
UNIVERSITI TENAGA NASIONAL “Generates Professionals” MODULE 5 : Part 2 INTRODUCTION TO DATABASE.
Getting Started Chapter One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
Copyright 2004 John Wiley & Sons, Inc Information Technology: Strategic Decision Making For Managers Henry C. Lucas Jr. John Wiley & Sons, Inc Dinesh.
Skills for Success with Microsoft Office 2013 Volume 1 Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall. by Kris Townsend, Catherine.
Foundations of Business Intelligence: Databases and Information Management.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
+ Big Data. + Chapter Objectives Learn the basic concepts of Big Data, structured storage, and the MapReduce process Learn the basic concepts of data.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Fundamentals of Information Systems, Sixth Edition Chapter 3 Database Systems, Data Centers, and Business Intelligence.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Eight: Database Redesign.
David M. Kroenke and David J. Auer Database Processing: Fundamentals, Design, and Implementation Chapter One: Introduction.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix I: Getting Started with Web Servers, PHP and the.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix J: Business Intelligence Systems.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix A: Getting Started with Microsoft Access 2013.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix G: Data Structures for Database Processing.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
David M. Kroenke and David J. Auer Database Processing: Fundamentals, Design, and Implementation Chapter Ten: Managing Databases with SQL Server 2012,
McGraw-Hill/Irwin ©2008,The McGraw-Hill Companies, All Rights Reserved Chapter 5 Data Resource Management.
TECHNOLOGY IN ACTION. Chapter 11 Behind the Scenes: Databases and Information Systems.
Data Analytics, Data Mining, OLAP, Reporting Systems
David M. Kroenke and David J
David M. Kroenke and David J
Getting Started Chapter One DATABASE CONCEPTS, 5th Edition
Big DATA.
Getting Started Chapter One DATABASE CONCEPTS, 4th Edition
Presentation transcript:

David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Twelve: Big Data, Data Warehouses, and Business Intelligence Systems

Chapter Objectives To learn the basic concepts of Big Data, structured storage, and the MapReduce process To learn the basic concepts of data warehouses and data marts To learn the basic concepts of dimensional databases To learn the basic concepts of business intelligence (BI) systems To learn the basic concepts of OnLine Analytical Processing (OLAP) and data mining To learn the basic concepts of distributed databases To learn the basic concepts of virtual machines To learn the basic concepts of cloud computing KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-2

Big Data Big Data—the current term for the enormous datasets generated by Web applications such as search tools (for example, Google and Bing) and Web 2.0 social networks (for example, Facebook, LinkedIn, and Twitter). Although these new and very visible Web applications are highlighting the problems of dealing with large datasets, these problems were already present in other areas, such as scientific research and business operations. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-3

Storage Capacity Terms KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-4

Business Intelligence (BI) Systems Business intelligence (BI) systems are information systems that assist managers and other professionals: –To analyze current and past activities. –To predict future events. Two broad categories: –Reporting –Data mining KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-5

The Relationship of Operational and BI Systems KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-6

Data for BI Systems BI systems obtain data in three ways: –From the operational database Read and process data only DO NOT insert, modify or delete operational data –From extracts from the operational database Data is in a BI DBMS May be a different DBMS than the operations DBMS –From data purchased from data vendors KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-7

Reporting Applications Reporting system applications: –Filter –Sort –Group –Make simple calculations –Classify entities RFM Analysis –Can be performed using standard SQL –Extensions to SQL are sometimes used OnLine Analytical Processing (OLAP) –Summarize current business status –Compare current business status to past or future –Deal with critical report delivery KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-8

Data Mining Applications Data mining applications are used to: –Perform what-if analysis –Make predictions –Facilitate decision making Data mining applications use sophisticated statistical and mathematical techniques. Report delivery is not as critical. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 12-9

Characteristics of BI Applications KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Purchased Data AmeriLINK Sells Data on 230+ Million Americans KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Components of a Data Warehouse KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Data Warehouses and Data Marts: Problems with Operational Data KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Data Warehouses and Data Marts: Data Warehouse compared to Data Marts KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Characteristics of Operational and Dimensional Databases KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The Star Schema KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD Database Design KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD Database Diagram KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD-DW Star Schema KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD-DW SQL Statements KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD-DW Table Data KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD-DW SQL Query KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The HSD-DW SQL Query Results KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Two-Dimensional Matrix KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Three-Dimensional Cube KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Conformed Dimensions KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: RFM Analysis RFM Analysis analyzes and ranks customers according to purchasing patterns –R = recent (most recent order) –F = frequent (how often an order is made) –M = money (dollar amount of orders) Customers are sorted into five groups, each containing 20% of the customers. Each group is given a numerical value: –1 = top 20% –2, 3, 4 = each 20% in between top and bottom 20% –5 = bottom 20% KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: RFM Analysis KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OnLine Analytical Processing [OLAP] An OLAP report has measures and dimensions: –Measure—a data item of interest –Dimension—a characteristic of a measure OLAP cube—a presentation of a measure with associated dimensions. –An OLAP cube can have any number of axes. –The terms OLAP cube and OLAP report are synonymous. OLAP allows drill-down—a further division of the data into more detail. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports I KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports II KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports III KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports IV KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports V KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports VI KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports VII KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports VIII KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports IX KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports X KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Reports XI KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Drill Down I KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Drill Down II KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Reporting Systems: OLAP Servers and OLAP Databases KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Data Mining Applications: The Convergence of the Disciplines KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Types of Distributed Databases KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Object-Oriented Database I Object-oriented programming (OOP) is a technique for designing and writing computer programs. Objects are data structures that have: – Methods—computer programs that perform some task – Properties—data items particular to an object. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Object-Oriented Database II Storing the values of properties of an object is called object persistence. Many different techniques have been used for object persistence. One of them is to use some variation of database technology. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Object-Oriented Database III Special-purpose DBMS products for storing object data are called object- oriented DBMSs (OODBMSs) Never achieved commercial success. –Billions of bytes of data were already stored in relational DBMS format –No organization wanted to convert their data to OODBMS format to be able to use an OODBMS. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Object-Oriented Database IV The need for object persistence did not disappear. Some vendors, most notably Oracle, added features and functions to their relational database DBMS products to create object-relational databases. DBMS products for storing object data are called object-oriented DBMSs (OODBMSs). KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Virtualization Underutilization of Computer Resources KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Virtualization The Virtual Machine Environment I KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Virtualization The Virtual Machine Environment II KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Virtualization The Virtual Machine Environment III KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Cloud Computing The Cloud Computing Environment KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

Cloud Computing Microsoft SQL Server 2012 Express in the Cloud KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

The NoSQL Movement I The NoSQL movement, better described as the Not only SQL movement, is a movement to using non-relational databases. These databases are often described as structured storage. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 7-56

The NoSQL Movement II One implementation is as a distributed, replicated database that is described in this chapter. Example: Apache CassandraCassandra –Used for Facebook –Used for Twitter KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 7-57

The NoSQL Movement III Another implementation is based on XML document structures as described in this Chapter. Example: dbXMLdbXML XML database typically support: –W3C XQuery standardW3C XQuery standard –W3C XPath standardW3C XPath standard KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 7-58

Generalized Structured Storage: A Column KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 8-59

Generalized Structured Storage: A Super Column KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 8-60

Generalized Structured Storage: A Column Family KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 8-61

The MapReduce Process KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc. 8-62

HADOOP Hadoop Distributed File System (HDFS) – provides standard file services to clustered servers so their file systems can function as one distributed file system.Hadoop Distributed File System (HDFS) The Hadoop family includes a full set of applications including: –Hbase – A nonrelational data store.Hbase –Pig – A query language.Pig KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

David Kroenke and David Auer Database Processing Fundamentals, Design, and Implementation (13th Edition) End of Presentation: Chapter Twelve KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America. KROENKE AND AUER - DATABASE PROCESSING, 13th Edition © 2014 Pearson Education, Inc