Database Management Character, file, field, record, database??? 1.

Slides:



Advertisements
Similar presentations
Process Models: Data Flow Diagrams
Advertisements

Database Management Character, file, field, record, database??? 1.
C6 Databases.
Database Management: Getting Data Together Chapter 14.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Introduction to Database Management
BUSINESS DRIVEN TECHNOLOGY
How Come It Takes Me So Long to Get Answers to Simple Questions About My Business? Technologies for Business Intelligence.
Process Models: Data Flow Diagrams
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
ACS1803 Lecture Outline 2 DATA MANAGEMENT CONCEPTS Text, Ch. 3 How do we store data (numeric and character records) in a computer so that we can optimize.
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
Database Design - Lecture 1
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
Objectives Overview Define the term, database, and explain how a database interacts with data and information Define the term, data integrity, and describe.
STORING ORGANIZATIONAL INFORMATION— DATABASES CIS 429—Chapter 7.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Chapter 7: Database Systems Succeeding with Technology: Second Edition.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
311: Management Information Systems Database Systems Chapter 3.
Managing Information for Business Initiatives
I Information Systems Technology Ross Malaga 4 "Part I Understanding Information Systems Technology" Copyright © 2005 Prentice Hall, Inc. 4-1 DATABASE.
Chapter 3 and Module C DATABASES AND DATA WAREHOUSES Building Business Intelligence.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Storing Organizational Information - Databases
Dimu' Rumpak © 2009 by Prentice Hall 1 Getting Started Didimus Rumpak, M.Si. Database Concepts Chapter 1 1.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
McGraw-Hill/Irwin ©2009 The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 DATABASES AND DATA WAREHOUSES CHAPTER 6 DATABASES AND DATA WAREHOUSES.
Data Resource Management Application Layer TPS A RCHITECTURE Data Layer Sales/MarketingHR Finance/Accounting Operations Spreadsheet Data MS Access Accounts.
© 2017 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Intro to MIS – MGS351 Databases and Data Warehouses
Data Resource Management
CHAPTER SIX DATA Business Intelligence
Database Management:.
Data warehouse and OLAP
Chapter 1: Introduction
Database Systems: Design, Implementation, and Management Tenth Edition
Chapter Ten Managing a Database.
ACS1803 Lecture Outline 2   DATA MANAGEMENT CONCEPTS Text, Ch. 3
DATABASES AND DATA WAREHOUSES Searching for Revenue - Google
Data Resource Management
Database Management System (DBMS)
Basic Concepts in Data Management
MANAGING DATA RESOURCES
Chapter 1 Database Systems
File Systems and Databases
Chapter 9 Database and Information Management.
Data Resource Management
C.U.SHAH COLLEGE OF ENG. & TECH.
CHAPTER SIX OVERVIEW SECTION 6.1 – DATABASE FUNDAMENTALS
MANAGING DATA RESOURCES
Looking at the Quality of Data and Information
Chapter 1 Database Systems
Chapter 1: Introduction
Chapter 1: Introduction
Data Resource Management
DATABASE TECHNOLOGIES
Chapter 3 Database Management
Chapter 1: Introduction
Database & Information Systems
Valuing Organizational Information
Chapter 1: Introduction
The Database Environment
Presentation transcript:

Database Management Character, file, field, record, database??? 1

What’s “File Processing”? The “old” way of doing things; still often used in practice. Separate information stored on separate files. 2

File Processing Example: Sales Production Marketing Knows how many of Products A, B, and C have been sold. File stores Prod. Name, Production Schedule, and Sales. Knows how much of Products A, B, and C have been produced. File stores Prod. Name, Production Schedule, and Number Produced. Knows the price of Products A, B, and C. File stores Prod. Name and Product Price. 3

Any problems here? Duplication (redundancy). Inconsistency. Does anyone know how much money we made? No integration. Set format. Data dependence. Y2K!! 4

Database Management Database Management System (DBMS) Provides one integrated repository for data to be stored and queried. Standards for data can be defined and enforced. Reports and queries are easy (er). SQL, etc. 5

Database Management Ex.: Sales Production Marketing (App. Progs) DBMS Database Prod. Name Production Schedule Sales Number Produced Product Price 6

DATABASE MANAGEMENT SYSTEMS Four components of a DBMS The components of the DBMS are discussed in detail on the following slides A DBMS contains: Data definition component – helps create and maintain the data dictionary and the structure of the database Data manipulation component – allows users to create, read, update, and delete information in a database Application generation component – includes tools for creating visually appealing and easy-to-use applications Data administration component – provides tools for managing the overall database environment by providing faculties for backup, recovery, security, and performance

BUT... Expensive. Difficult. Slow / inefficient. 7

Another Look (thanks to John Gallaugher, Boston College) Server - responds to client requests DBMS - the program. Manages interaction with databases. request response Client - makes requests of the DBMS server database - the collection of data. Created and defined to meet the needs of the organization. Database a collection of related data. Usually organized according to topics: e.g. customer info, products, transactions Database Management System (DBMS) a program for creating & managing databases; ex. Oracle, MS-Access, Sybase When we talk about databases, we’re really talking about nothing more than lists of information. DBMS are used to create and manage databases (sometimes broadly called a database or database program) Show where DBMS & Databases relate to client/server computing The request in a request / response might include things like: add a record retrieve a record modify a record delete a record

A Simple Database File/Table Field/Column Record/Row Customers 5 shown: CUSTID, FIRST, LAST, CITY, STATE Record/Row 5 shown: one for each customer Why do we even need a customer ID? to uniquely identify a user - there may be two people with the same name

A More Complex Example Entry & Maintenance is complicated How many records do I have to change if Abby Johnson gets married & decides she wants to change her name? Two How many records do I change if Warren Buffet moves? Three You get the picture - there’s got to be a better way to manage this & there is - it’s called the relational model. Entry & Maintenance is complicated redundant data exists, increases chance of error, complicates updates/changes, takes up space

Normalize Data: Remove Redundancy Customer Table Transaction Table One Many Normalization is nothing more than removing redundancy. There are forma l steps & levels of normal form, but those are for a database course (a grad database elective is offered by the CS department). One normalizes to: eliminate redundancy simplify maintenance save space promote re-use Files are related via a common key. How many transactions has Buffet made? Three. How do we tell that? The unique ID

Key Terms Relational DBMS SQL - Structured Query Language manages databases as a collection of files/tables in which all data relationships are represented by common values in related tables (referred to as keys). a relational system has the flexibility to take multiple files and generate a new file from the records that meet the matching criteria (join). SQL - Structured Query Language Most popular relational database standard. Includes a language for creating & manipulating data. Note: other types of databases exist (e.g. older - hierarchical. newer - OO). SQL is by far the most popular. It is a standard that most follow. Many desktop applications can generate SQL statements using a GUI without any programming (e.g. Excel, Access). These appellations act as a client & send database access requests (e.g. get records, add record, delete record) to the database server.

Using SQL for Querying SQL (Structured Query Language) Data language English-like, nonprocedural, very user friendly language Free format Example: SELECT Name, Salary FROM Employees WHERE Salary >2000

THE VALUE OF QUALITY INFORMATION Five common characteristics of high-quality information Class Activity: Break your students into groups and ask each group to provide an additional example of each of the five common characteristics of high-quality information that is not provided in Figure 2.4 For example, Accuracy – does a purchase price on a bill match the item description on the bill? Item 1: Kids juice cup, cost $10,000. Chances are a kids juice cup would not cost $10,000 and this is an inaccurate item.

THE VALUE OF QUALITY INFORMATION Low-quality information example Walk-thru each of the six issues and have your students extrapolate a potential business problem that might be associated with each issue. The example does not state what type of database or spreadsheet this information is contained (sales, marketing, customer service, billing, etc), so allow your students use their imagination when they are extrapolating the potential business problems Issue 1: Without a first name it would be impossible to correlate this customer with customers in other databases (Sales, Marketing, Billing, Customer Service) to gain a compete customer view (CRM) Issue 2: Without a complete street address there is no possible way to communicate with this customer via mail or deliveries. An order might be sitting in a warehouse waiting for the complete address before shipping. The company has spent time and money processing an order that might never be completed Issue 3: If this is the same customer, the company will waste money sending out two sets of promotions and advertisements to the same customers. It might also send two identical orders and have to incur the expense of one order being returned Issue 4: This is a good example of where cleaning data is difficult because this may or may not be an error. There are many times when a phone and a fax have the same number. Since the phone number is also in the e-mail address field, chances are that the number is inaccurate Issue 5: The business would have no way of communicating with this customer via e-mail Issue 6: The company could determine the area code based on the customer’s address. This takes time, which costs the company money. This is a good reason to ensure that information is entered correctly the first time. All incorrect information needs to be fixed, which costs time and money

THE VALUE OF QUALITY INFORMATION The four primary sources of low-quality information include: Online customers intentionally enter inaccurate information to protect their privacy Information from different systems that have different information entry standards and formats Call center operators enter abbreviated or erroneous information by accident or to save time Third party and external information contains inconsistencies, inaccuracies, and errors Addressing the above sources of information inaccuracies will significantly improve the quality of organizational information Ask your students to determine a few additional sources of low-quality information Ans: A customer service representative could accidentally transpose a number in an address or misspell a last name

Understanding the Costs of Low-quality Information Potential business effects resulting from low-quality information Inability to accurately track customers Difficulty identifying valuable customers Inability to identify selling opportunities Marketing to nonexistent customers Difficulty tracking revenue due to inaccurate invoices Inability to build strong customer relationships – which increases buyer power Ask your students if they can list any additional business effects resulting from low-quality information Ask your students to focus on organizational strategies such as SCM, CRM, and ERP Ans: Low-quality information could cause the SCM system to order too much inventory from a supplier based on inaccurate orders Low-quality information could cause a CRM system to send an expensive promotional item (such as a fruit basket) to the wrong address of one of its best customers

Structures Hierarchical: The old way. “Tree”. Access elements by moving down tree. One-to-many. Network: Criss-cross patterns. Many-to-many. Relational: a common element relates “tables” to one another. Permits “ad hoc”. Object-oriented: “objects” have data, processes, and properties “encapsulated” in them. 8

Database Structures Relation Hierarchical Structure Network Structure Relational Structure Dept Empno Dept A 1 A B 2 B C 3 C 9

Pros and Cons Obj. Relat. Ad Hoc Flexibility ==> Net. Hier. Speed ==> 10

Data Dictionaries The Data Dictionary A reference work of data about data (metadata) compiled by the systems analyst to guide analysis and design. As a document, the data dictionary collects, coordinates, and confirms the meaning of data terms to various users throughout the organization. Uses of the Data Dictionary Documentation, Elimination of data redundancy Validate the data flow diagram for completeness and accuracy Provide a starting point for developing screens and reports Determine contents of data stored in files Develop the logic for data flow diagram processes 11

Data Flow Diagrams (“DFD”) Process File or Data Store Source or Entity

D1 Tenant File 1 Tenant Bank 2 Ext. Mgr 3 D1 Tenant File DFD Example: Apartment Rental Tenant Info D1 Tenant File Copy of lease 1 Lease New Tenant Process Tenant Receipt Bank Deposit Bank 2 Payments Notice Collection Process Cash Report Unpaid Charges Tenant Info Ext. Mgr 3 D1 Tenant File Delinquent Process Delinquency Report Delinquencies

Entity Relationship Diagrams Dept. Projects Dept. Employee * Project Deadline Resources works on * Name Title Address “zero” “one” “many”

New Names, Same Ideas Data Mining, OLAP Data Warehousing

Data Mining automated information discovery process, uncovers important patterns in existing data can use neural networks or other approaches. Requires ‘clean’, reliable, consistent data. Historical data must reflect the current environment. e.g. “What are the characteristics that identify when we are likely to lose a customer?” OLAP is user-driven discovery GIGO - garbage in, garbage out. Results are only as good as the data being mined. Data mining - knoweldge discovery rather than verification. Identifies and characterizes interrelationships among multi-variable dimensions without requiring a human to formulate specific answers. Data mining is about knowledge discover: extracting previously unknown & useful information. Usually use neural networks. Expert Systems - rules are identifiable . Neural Networks - rules are unknown & the machine discovers pattnerns. 88% of global 2000 companies believe data mining will be very important or critical to their success by the year 2000. Examples: FLEET: 11th largest bank. Testing a $35 million data warehousing project. 1TB warehouse & two 500 GB marts. Data on consumer & small businesses is fed in from 36, mostly mainframe applications. A commercial mart is created from 34 applications. Annual support costs are $6.5 million for 100 users. Expects profits of $50 million / year from the effort. Banks - notorious for having unprofitable customers & products. Also, what’s more expensive - attracting new customers or retaining existing customers? New customers - so you don’t want existing custoemrs to leave (churn, online industry faces this problem). Reducing attrition by 2% among the top 1/3 of customers would save $20 million/year! Example: FedEx saw a 300% increase in direct mail response rates. Revenue growth from identified accounts increases at an 8-1 ratio compared with the cost of other marketing campaigns.

Warehouses & Marts Data Warehouse Data Mart a database designed to support decision-making in an organization. It is batch-updated and structured for fast online queries and exploration. Data warehouses may aggregate enormous amounts of data from many different operational systems. Data Mart a database focused on addressing the concerns of a specific problem or business unit (e.g. Marketing, Engineering). Size doesn’t define data marts, but they tend to be smaller than data warehouses.

Data Warehouses & Data Marts 3rd party data Data Mart (Marketing) TPS & other operational systems Data Warehouse Data Mart (Engineering) = operational clients = query, OLAP, mining, etc.