Download presentation
Presentation is loading. Please wait.
Published byMyles Studdard Modified over 9 years ago
1
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 3 Databases and Data Warehouses
2
3-2 STUDENT LEARNING OUTCOMES 1.Describe business intelligence and its role in an organization. 2.Differentiate between databases and data warehouses with respect to their focus on OLTP and OLAP. 3.List and describe the key characteristics of a relational database.
3
3-3 STUDENT LEARNING OUTCOMES 4.Define the five software components of a database management system. 5.List and describe the key characteristics of a data warehouse. 6.Define the four major types of data-mining tools in a data warehouse environment. 7.List key considerations in information ownership in an organization.
4
3-4 Can Companies Keep Your Personal Information Secure and Private? Databases and data warehouses are organizational repositories of informationDatabases and data warehouses are organizational repositories of information Much of the information is personalMuch of the information is personal It must be secureIt must be secure If hackers get your personal information, you can suffer from identity theftIf hackers get your personal information, you can suffer from identity theft
5
3-5 Can Companies Keep Your Personal Information Secure and Private? Top-10 incidents of personal information loss by organizationsTop-10 incidents of personal information loss by organizations Could affect over 53 million peopleCould affect over 53 million people CardSystems lost information on 40 million customersCardSystems lost information on 40 million customers Many othersMany others
6
3-6 Can Companies Keep Your Personal Information Secure and Private? Have you been a victim of identity theft?Have you been a victim of identity theft? –What happened? –What did you do to recover? –How long did it take?
7
3-7 INTRODUCTION Businesses need business intelligence (BI)Businesses need business intelligence (BI) Business intelligence – knowledge about your customers, competitors, business partners, environment, and internal operationsBusiness intelligence – knowledge about your customers, competitors, business partners, environment, and internal operations –Enables effective decision making –Information on steroids
8
3-8 INTRODUCTION IT tools help process information to create business intelligence according to…IT tools help process information to create business intelligence according to… –OLTP (online transaction processing) –OLAP (online analytical processing)
9
3-9 INTRODUCTION OLTP – gathering and processing transaction information and updating existing information to reflect transactionOLTP – gathering and processing transaction information and updating existing information to reflect transaction –Databases support OLTP –Operational database – database that supports OLTP
10
3-10 INTRODUCTION OLAP – manipulation of information to support decision makingOLAP – manipulation of information to support decision making –Databases can help some –Data warehouses support only OLAP, not OLTP –Data warehouses – special forms of databases that support decision making
11
3-11 INTRODUCTION
12
3-12 INTRODUCTION This chapter – database and data warehouse conceptsThis chapter – database and data warehouse concepts Along with some privacy and security considerationsAlong with some privacy and security considerations
13
3-13 RELATIONAL DATABASE MODEL Database – logical collection of information you organize and access according to the logical structure of the informationDatabase – logical collection of information you organize and access according to the logical structure of the information Relational database – uses a series of two- dimensional tables or files to store information in the form of a databaseRelational database – uses a series of two- dimensional tables or files to store information in the form of a database
14
3-14 Databases Are… Collections of informationCollections of information Created with logical structuresCreated with logical structures With logical ties within the informationWith logical ties within the information With built-in integrity constraintsWith built-in integrity constraints
15
3-15 Databases – Collections of Information Databases have many tablesDatabases have many tables Solomon Enterprises as a concrete provider. Tables include:Solomon Enterprises as a concrete provider. Tables include: –Order –Customer –Concrete Type –Employee –Truck
16
3-16 Databases – Collections of Information
17
3-17 Databases – Created with Logical Structures In databases, row numbers are irrelevantIn databases, row numbers are irrelevant In databases, columns have logical names such as Order Date and Customer NameIn databases, columns have logical names such as Order Date and Customer Name Data dictionary – contains the logical structure of the information in a databaseData dictionary – contains the logical structure of the information in a database
18
3-18 Databases – Logical Ties within the Information Logical ties must exist between the tablesLogical ties must exist between the tables Logical ties are created with primary and foreign keysLogical ties are created with primary and foreign keys Primary key – field (or group of fields in some cases) that uniquely describe each recordPrimary key – field (or group of fields in some cases) that uniquely describe each record
19
3-19 Databases – Logical Ties within the Information Foreign key – primary key of one file that appears in another fileForeign key – primary key of one file that appears in another file Foreign keys help create relationships among tablesForeign keys help create relationships among tables Table = file = relation (don’t confuse yourself)Table = file = relation (don’t confuse yourself)
20
3-20 Databases – Logical Ties within the Information
21
3-21 Databases – Built-in Integrity Constraints Integrity constraint – rule that helps ensure the quality of informationIntegrity constraint – rule that helps ensure the quality of information ExamplesExamples –Primary keys must be unique –Foreign keys cannot be blank –Sales price cannot be negative –Phone numbers must have an area code
22
3-22 DBMS TOOLS Database management system (DBMS) – helps you specify the logical organization for a database and access and use the information within a databaseDatabase management system (DBMS) – helps you specify the logical organization for a database and access and use the information within a database –Word processing software = document –Spreadsheet software = workbook –DBMS software = database
23
3-23 DBMS TOOLS 5 software components5 software components 1.DBMS engine 2.Data definition subsystem 3.Data manipulation subsystem 4.Application generation subsystem 5.Data administration subsystem
24
3-24 DBMS TOOLS
25
3-25 DBMS Engine DBMS engine – accepts logical requests, converts them into their physical equivalent, and accesses the database and data dictionaryDBMS engine – accepts logical requests, converts them into their physical equivalent, and accesses the database and data dictionary DBMS engine separates the logical from the physicalDBMS engine separates the logical from the physical
26
3-26 DBMS Engine Physical view – how information is arranged, stored, and accessed on a storage devicePhysical view – how information is arranged, stored, and accessed on a storage device Logical view – how you (knowledge worker) need to arrange and access informationLogical view – how you (knowledge worker) need to arrange and access information Databases – you work only with logical viewsDatabases – you work only with logical views
27
3-27 Data Definition Subsystem Data definition subsystem – helps you create and maintain the data dictionary and define the structure of the files in a databaseData definition subsystem – helps you create and maintain the data dictionary and define the structure of the files in a database Must create data dictionary for a database before entering any informationMust create data dictionary for a database before entering any information
28
3-28 Data Manipulation Subsystem Data manipulation subsystem – helps you add, change, and delete informationData manipulation subsystem – helps you add, change, and delete information Primary interface between you and a databasePrimary interface between you and a database –Views –Report generators –QBE tools –SQL
29
3-29 Views View – allows you to see the contents of a database fileView – allows you to see the contents of a database file Similar to a spreadsheet viewSimilar to a spreadsheet view –Make changes –Sort –Query
30
3-30 Views
31
3-31 Report Generators Report generator – helps you quickly define formats of reports and what information you want to see in a reportReport generator – helps you quickly define formats of reports and what information you want to see in a report Save report formats to use laterSave report formats to use later Uses a wizard interfaceUses a wizard interface
32
3-32 Report Generators Specify the fields you want in a report Specify the layout of the report
33
3-33 Report Generators
34
3-34 QBE Tools Query-by-example (QBE) tool – helps you graphically design the answer to a questionQuery-by-example (QBE) tool – helps you graphically design the answer to a question “What driver most often delivers concrete to Triple A Homes?”“What driver most often delivers concrete to Triple A Homes?”
35
3-35 QBE Tools
36
3-36 SQL Structured query language (SQL) – standardized fourth-generation language found in most DBMSsStructured query language (SQL) – standardized fourth-generation language found in most DBMSs Performs same task as QBEPerforms same task as QBE Uses sentence structure insteadUses sentence structure instead Mostly used by IT peopleMostly used by IT people
37
3-37 Application Generation Subsystem Application generation subsystem – contains facilities to help you develop transaction- intensive applicationsApplication generation subsystem – contains facilities to help you develop transaction- intensive applications –Data entry screens (called forms in Access) –Programming languages Mostly used by IT peopleMostly used by IT people
38
3-38 Data Administration Subsystem Data administration subsystem – helps you manage the overall database environmentData administration subsystem – helps you manage the overall database environment –Backup and recovery –Security management –Query optimization –Concurrency control –Change management
39
3-39 Data Administration Subsystem Backup and recoveryBackup and recovery –Periodically back up information –Recover a database after a failure Security managementSecurity management –Who has access to what information –Who can perform CRUD tasks on information
40
3-40 Data Administration Subsystem Query optimizationQuery optimization –Restructure physical view to optimize response times to queries Concurrency controlConcurrency control –What happens if two people simultaneously try to change the same information?
41
3-41 Data Administration Subsystem Change managementChange management –What is the effect of structural changes to a database? –What if you add a new column? –What happens if you delete a column? –What happens if you change a column’s attributes?
42
3-42 DATA WAREHOUSES & DATA MINING Data warehouses support OLAP and decision makingData warehouses support OLAP and decision making Data warehouses do not support OLTPData warehouses do not support OLTP Data-mining tools are tools for working with data warehouse informationData-mining tools are tools for working with data warehouse information –DBMS software = database –Data-mining tools = data warehouse
43
3-43 What Is a Data Warehouse? Data warehouse – logical collection of information – gathered from operational databases – used to create business intelligence that supports business analysis activities and decision-making tasksData warehouse – logical collection of information – gathered from operational databases – used to create business intelligence that supports business analysis activities and decision-making tasks
44
3-44 What Is a Data Warehouse?
45
3-45 What Is a Data Warehouse? MultidimensionalMultidimensional Rows and columnsRows and columns Also layersAlso layers Many times called hypercubesMany times called hypercubes What are the dimensions in Figure 3.8 on page 97?What are the dimensions in Figure 3.8 on page 97?
46
3-46 What Are Data-Mining Tools? Data-mining tools – software tools that you use to query information in a data warehouseData-mining tools – software tools that you use to query information in a data warehouse –Query-and-reporting tools –Intelligent agents –Multidimensional analysis tools –Statistical tools
47
3-47 What Are Data-Mining Tools?
48
3-48 Query-and-Reporting Tools Query-and-reporting tools – similar to QBE tools, SQL, and report generators in the typical database environmentQuery-and-reporting tools – similar to QBE tools, SQL, and report generators in the typical database environment –Also similar to pivot tables in Excel
49
3-49 Intelligent Agents Use various AI tools such as neural networks and fuzzy logic to form the basis for “information discovery” and building BIUse various AI tools such as neural networks and fuzzy logic to form the basis for “information discovery” and building BI Help you find hidden patterns in informationHelp you find hidden patterns in information Chapter 4 focuses on theseChapter 4 focuses on these
50
3-50 Multidimensional Analysis Tools Multidimensional analysis (MDA) tools – slice-and-dice techniques that allow you to view multidimensional information from different perspectivesMultidimensional analysis (MDA) tools – slice-and-dice techniques that allow you to view multidimensional information from different perspectives –Bring new layers to the front –Reorganize rows and columns
51
3-51 Statistical Tools Help you apply various mathematical models to the information stored in a data warehouse to discover new informationHelp you apply various mathematical models to the information stored in a data warehouse to discover new information –Regression –Analysis of variance –And so on
52
3-52 Data Marts Data warehouses are organizationwideData warehouses are organizationwide Data marts have subsets of an organizationwide data warehouseData marts have subsets of an organizationwide data warehouse Data mart – subset of a data warehouse in which only a focused portion of the data warehouse information is keptData mart – subset of a data warehouse in which only a focused portion of the data warehouse information is kept
53
3-53 Data Marts
54
3-54 Data Mining as a Career Opportunity Knowledge of data mining can be a substantial career opportunity for youKnowledge of data mining can be a substantial career opportunity for you –Business Objects –SAS –Cognos –Informatica –Many others
55
3-55 Considerations in Using a Data Warehouse Do you need a data warehouse?Do you need a data warehouse? –DBMS may offer all you need Do all employees need the entire data warehouse?Do all employees need the entire data warehouse? –Consider a data mart How up-to-date must information be?How up-to-date must information be? –“Snapshot” concept What data-mining tools do you need?What data-mining tools do you need? –Training can be expensive
56
3-56 INFORMATION OWNERSHIP Strategic management supportStrategic management support The sharing of information with responsibilityThe sharing of information with responsibility Information cleanlinessInformation cleanliness
57
3-57 Strategic Management Support Chief privacy officer (CPO) – ensuring that information is used in an ethical wayChief privacy officer (CPO) – ensuring that information is used in an ethical way Chief security officer (CSO) – ensuring security of information (e.g., firewalls)Chief security officer (CSO) – ensuring security of information (e.g., firewalls) Chief information officer (CIO) – oversees every aspect of an organization’s information resourceChief information officer (CIO) – oversees every aspect of an organization’s information resource
58
3-58 Strategic Management Support Data administration – plans for, oversees the development of, and monitors the information resourceData administration – plans for, oversees the development of, and monitors the information resource Database administration – responsible for the more technical aspects and operational aspects of managing informationDatabase administration – responsible for the more technical aspects and operational aspects of managing information Both often report to the CIOBoth often report to the CIO
59
3-59 The Sharing of Information with Responsibility If you create it, you “own” itIf you create it, you “own” it You will also share it with othersYou will also share it with others Because you “own” it, you are responsible for its qualityBecause you “own” it, you are responsible for its quality
60
3-60 Information Cleanliness Database and data warehouse information must be “clean”Database and data warehouse information must be “clean” –No errors –No duplicates
61
3-61 Information Cleanliness Extraction, transformation, and loading (ETL) – what information you want from each database, how the information is associated, and what rules to follow in consolidating the information to ensure its cleanliness in a data warehouseExtraction, transformation, and loading (ETL) – what information you want from each database, how the information is associated, and what rules to follow in consolidating the information to ensure its cleanliness in a data warehouse
62
3-62 CAN YOU… 1.Describe business intelligence and its role in an organization. 2.Differentiate between databases and data warehouses with respect to their focus on OLTP and OLAP. 3.List and describe the key characteristics of a relational database.
63
3-63 CAN YOU… 4.Define the five software components of a database management system. 5.List and describe the key characteristics of a data warehouse. 6.Define the four major types of data-mining tools in a data warehouse environment. 7.List key considerations in information ownership in an organization.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.