Database Management Character, file, field, record, database??? 1.

Slides:



Advertisements
Similar presentations
Process Models: Data Flow Diagrams
Advertisements

McGraw-Hill/Irwin © The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 Valuing Organizational Information.
C6 Databases.
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Lecture-7/ T. Nouf Almujally
Chapter 3 Database Management
Database Management: Getting Data Together Chapter 14.
McGraw-Hill/Irwin ©2008 The McGraw-Hill Companies, All Rights Reserved DATABASES AND DATA WAREHOUSES Opening Case Searching for Revenue - Google DATABASES.
Organizing Data & Information
McGraw-Hill/Irwin Copyright © 2008, The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin Copyright © 2008 The McGraw-Hill Companies, Inc.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
How Come It Takes Me So Long to Get Answers to Simple Questions About My Business? Technologies for Business Intelligence.
Process Models: Data Flow Diagrams
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
McGraw-Hill/Irwin Copyright © 2008, The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin Copyright © 2008 The McGraw-Hill Companies, Inc.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Professor Michael J. Losacco CIS 1150 – Introduction to Computer Information Systems Databases Chapter 11.
Data Resource Management Chapter 5 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
ACS1803 Lecture Outline 2 DATA MANAGEMENT CONCEPTS Text, Ch. 3 How do we store data (numeric and character records) in a computer so that we can optimize.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
311: Management Information Systems Database Systems Chapter 3.
Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin CHAPTER SIX DATA: BUSINESS INTELLIGENCE.
BUSINESS DRIVEN TECHNOLOGY
McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc. All rights reserved. 6-1 BUSINESS DRIVEN TECHNOLOGY UNIT 2: Managing Information for Business.
Managing Information for Business Initiatives
I Information Systems Technology Ross Malaga 4 "Part I Understanding Information Systems Technology" Copyright © 2005 Prentice Hall, Inc. 4-1 DATABASE.
Chapter 3 and Module C DATABASES AND DATA WAREHOUSES Building Business Intelligence.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
McGraw-Hill/Irwin ©2008,The McGraw-Hill Companies, All Rights Reserved Chapter 5 Data Resource Management.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Valuing Organizational Information
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Data resource management
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
McGraw-Hill/Irwin ©2009 The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 DATABASES AND DATA WAREHOUSES CHAPTER 6 DATABASES AND DATA WAREHOUSES.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
DATA RESOURCE MANAGEMENT
Foundations of Business Intelligence: Databases and Information Management.
McGraw-Hill/Irwin © The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 Valuing Organizational Information.
Data: Business Intelligence Chapter 6 McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
Exploring Business Intelligence
Valuing Organizational Information CHAPTER 06 McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Fundamentals of Information Systems, Sixth Edition Chapter 3 Database Systems, Data Centers, and Business Intelligence.
McGraw-Hill/Irwin ©2008,The McGraw-Hill Companies, All Rights Reserved Chapter 5 Data Resource Management.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Managing Data Resources File Organization and databases for business information systems.
Intro to MIS – MGS351 Databases and Data Warehouses
Data Resource Management
CHAPTER SIX DATA Business Intelligence
Databases and Data Warehouses Chapter 3
Data Resource Management
Basic Concepts in Data Management
MANAGING DATA RESOURCES
Data Resource Management
Data Resource Management
Chapter 3 Database Management
Database Management Character, file, field, record, database??? 1.
Valuing Organizational Information
Presentation transcript:

Database Management Character, file, field, record, database??? 1

What’s “File Processing”? The “old” way of doing things; still often used in practice. Separate information stored on separate files. 2

File Processing Example: Sales Production Marketing Knows how many of Products A, B, and C have been sold. File stores Prod. Name, Production Schedule, and Sales. Knows how much of Products A, B, and C have been produced. File stores Prod. Name, Production Schedule, and Number Produced. Knows the price of Products A, B, and C. File stores Prod. Name and Product Price. 3

Any problems here? Duplication (redundancy). Inconsistency. Does anyone know how much money we made? No integration. Set format. Data dependence. Y2K!! 4

Database Management Database Management System (DBMS) Provides one integrated repository for data to be stored and queried. Standards for data can be defined and enforced. Reports and queries are easy (er). SQL, etc. 5

Database Management Ex.: Sales Production Marketing (App. Progs) DBMS Database Prod. Name Production Schedule Sales Number Produced Product Price 6

DATABASE MANAGEMENT SYSTEMS Four components of a DBMS The components of the DBMS are discussed in detail on the following slides A DBMS contains: Data definition component – helps create and maintain the data dictionary and the structure of the database Data manipulation component – allows users to create, read, update, and delete information in a database Application generation component – includes tools for creating visually appealing and easy-to-use applications Data administration component – provides tools for managing the overall database environment by providing faculties for backup, recovery, security, and performance

BUT... Expensive. Difficult. Slow / inefficient. 7

Another Look (thanks to John Gallaugher, Boston College) Server - responds to client requests DBMS - the program. Manages interaction with databases. request response Client - makes requests of the DBMS server database - the collection of data. Created and defined to meet the needs of the organization. Database a collection of related data. Usually organized according to topics: e.g. customer info, products, transactions Database Management System (DBMS) a program for creating & managing databases; ex. Oracle, MS-Access, Sybase When we talk about databases, we’re really talking about nothing more than lists of information. DBMS are used to create and manage databases (sometimes broadly called a database or database program) Show where DBMS & Databases relate to client/server computing The request in a request / response might include things like: add a record retrieve a record modify a record delete a record

A Simple Database File/Table Field/Column Record/Row Customers 5 shown: CUSTID, FIRST, LAST, CITY, STATE Record/Row 5 shown: one for each customer Why do we even need a customer ID? to uniquely identify a user - there may be two people with the same name

A More Complex Example Entry & Maintenance is complicated How many records do I have to change if Abby Johnson gets married & decides she wants to change her name? Two How many records do I change if Warren Buffet moves? Three You get the picture - there’s got to be a better way to manage this & there is - it’s called the relational model. Entry & Maintenance is complicated redundant data exists, increases chance of error, complicates updates/changes, takes up space

Normalize Data: Remove Redundancy Customer Table Transaction Table One Many Normalization is nothing more than removing redundancy. There are forma l steps & levels of normal form, but those are for a database course (a grad database elective is offered by the CS department). One normalizes to: eliminate redundancy simplify maintenance save space promote re-use Files are related via a common key. How many transactions has Buffet made? Three. How do we tell that? The unique ID

Key Terms Relational DBMS SQL - Structured Query Language manages databases as a collection of files/tables in which all data relationships are represented by common values in related tables (referred to as keys). a relational system has the flexibility to take multiple files and generate a new file from the records that meet the matching criteria (join). SQL - Structured Query Language Most popular relational database standard. Includes a language for creating & manipulating data. Note: other types of databases exist (e.g. older - hierarchical. newer - OO). SQL is by far the most popular. It is a standard that most follow. Many desktop applications can generate SQL statements using a GUI without any programming (e.g. Excel, Access). These appellations act as a client & send database access requests (e.g. get records, add record, delete record) to the database server.

Using SQL for Querying SQL (Structured Query Language) Data language English-like, nonprocedural, very user friendly language Free format Example: SELECT Name, Salary FROM Employees WHERE Salary >2000

THE VALUE OF QUALITY INFORMATION Five common characteristics of high-quality information Class Activity: Break your students into groups and ask each group to provide an additional example of each of the five common characteristics of high-quality information that is not provided in Figure 2.4 For example, Accuracy – does a purchase price on a bill match the item description on the bill? Item 1: Kids juice cup, cost $10,000. Chances are a kids juice cup would not cost $10,000 and this is an inaccurate item.

THE VALUE OF QUALITY INFORMATION Low-quality information example Walk-thru each of the six issues and have your students extrapolate a potential business problem that might be associated with each issue. The example does not state what type of database or spreadsheet this information is contained (sales, marketing, customer service, billing, etc), so allow your students use their imagination when they are extrapolating the potential business problems Issue 1: Without a first name it would be impossible to correlate this customer with customers in other databases (Sales, Marketing, Billing, Customer Service) to gain a compete customer view (CRM) Issue 2: Without a complete street address there is no possible way to communicate with this customer via mail or deliveries. An order might be sitting in a warehouse waiting for the complete address before shipping. The company has spent time and money processing an order that might never be completed Issue 3: If this is the same customer, the company will waste money sending out two sets of promotions and advertisements to the same customers. It might also send two identical orders and have to incur the expense of one order being returned Issue 4: This is a good example of where cleaning data is difficult because this may or may not be an error. There are many times when a phone and a fax have the same number. Since the phone number is also in the e-mail address field, chances are that the number is inaccurate Issue 5: The business would have no way of communicating with this customer via e-mail Issue 6: The company could determine the area code based on the customer’s address. This takes time, which costs the company money. This is a good reason to ensure that information is entered correctly the first time. All incorrect information needs to be fixed, which costs time and money

THE VALUE OF QUALITY INFORMATION The four primary sources of low-quality information include: Online customers intentionally enter inaccurate information to protect their privacy Information from different systems that have different information entry standards and formats Call center operators enter abbreviated or erroneous information by accident or to save time Third party and external information contains inconsistencies, inaccuracies, and errors Addressing the above sources of information inaccuracies will significantly improve the quality of organizational information Ask your students to determine a few additional sources of low-quality information Ans: A customer service representative could accidentally transpose a number in an address or misspell a last name

Understanding the Costs of Low-quality Information Potential business effects resulting from low-quality information Inability to accurately track customers Difficulty identifying valuable customers Inability to identify selling opportunities Marketing to nonexistent customers Difficulty tracking revenue due to inaccurate invoices Inability to build strong customer relationships – which increases buyer power Ask your students if they can list any additional business effects resulting from low-quality information Ask your students to focus on organizational strategies such as SCM, CRM, and ERP Ans: Low-quality information could cause the SCM system to order too much inventory from a supplier based on inaccurate orders Low-quality information could cause a CRM system to send an expensive promotional item (such as a fruit basket) to the wrong address of one of its best customers

Structures Hierarchical: The old way. “Tree”. Access elements by moving down tree. One-to-many. Network: Criss-cross patterns. Many-to-many. Relational: a common element relates “tables” to one another. Permits “ad hoc”. Object-oriented: “objects” have data, processes, and properties “encapsulated” in them. 8

Database Structures Relation Hierarchical Structure Network Structure Relational Structure Dept Empno Dept A 1 A B 2 B C 3 C 9

Pros and Cons Obj. Relat. Ad Hoc Flexibility ==> Net. Hier. Speed ==> 10

Data Dictionaries The Data Dictionary A reference work of data about data (metadata) compiled by the systems analyst to guide analysis and design. As a document, the data dictionary collects, coordinates, and confirms the meaning of data terms to various users throughout the organization. Uses of the Data Dictionary Documentation, Elimination of data redundancy Validate the data flow diagram for completeness and accuracy Provide a starting point for developing screens and reports Determine contents of data stored in files Develop the logic for data flow diagram processes 11

Data Flow Diagrams (“DFD”) Process File or Data Store Source or Entity

D1 Tenant File 1 Tenant Bank 2 Ext. Mgr 3 D1 Tenant File DFD Example: Apartment Rental Tenant Info D1 Tenant File Copy of lease 1 Lease New Tenant Process Tenant Receipt Bank Deposit Bank 2 Payments Notice Collection Process Cash Report Unpaid Charges Tenant Info Ext. Mgr 3 D1 Tenant File Delinquent Process Delinquency Report Delinquencies

Entity Relationship Diagrams Dept. Projects Dept. Employee * Project Deadline Resources works on * Name Title Address “zero” “one” “many”

New Names, Same Ideas Data Mining, OLAP Data Warehousing

Data Mining automated information discovery process, uncovers important patterns in existing data can use neural networks or other approaches. Requires ‘clean’, reliable, consistent data. Historical data must reflect the current environment. e.g. “What are the characteristics that identify when we are likely to lose a customer?” OLAP is user-driven discovery GIGO - garbage in, garbage out. Results are only as good as the data being mined. Data mining - knoweldge discovery rather than verification. Identifies and characterizes interrelationships among multi-variable dimensions without requiring a human to formulate specific answers. Data mining is about knowledge discover: extracting previously unknown & useful information. Usually use neural networks. Expert Systems - rules are identifiable . Neural Networks - rules are unknown & the machine discovers pattnerns. 88% of global 2000 companies believe data mining will be very important or critical to their success by the year 2000. Examples: FLEET: 11th largest bank. Testing a $35 million data warehousing project. 1TB warehouse & two 500 GB marts. Data on consumer & small businesses is fed in from 36, mostly mainframe applications. A commercial mart is created from 34 applications. Annual support costs are $6.5 million for 100 users. Expects profits of $50 million / year from the effort. Banks - notorious for having unprofitable customers & products. Also, what’s more expensive - attracting new customers or retaining existing customers? New customers - so you don’t want existing custoemrs to leave (churn, online industry faces this problem). Reducing attrition by 2% among the top 1/3 of customers would save $20 million/year! Example: FedEx saw a 300% increase in direct mail response rates. Revenue growth from identified accounts increases at an 8-1 ratio compared with the cost of other marketing campaigns.

Warehouses & Marts Data Warehouse Data Mart a database designed to support decision-making in an organization. It is batch-updated and structured for fast online queries and exploration. Data warehouses may aggregate enormous amounts of data from many different operational systems. Data Mart a database focused on addressing the concerns of a specific problem or business unit (e.g. Marketing, Engineering). Size doesn’t define data marts, but they tend to be smaller than data warehouses.

Data Warehouses & Data Marts 3rd party data Data Mart (Marketing) TPS & other operational systems Data Warehouse Data Mart (Engineering) = operational clients = query, OLAP, mining, etc.