Download presentation
Presentation is loading. Please wait.
Published byTracy Brown Modified over 9 years ago
1
Finding Information A337/A523
2
What are some of the possible problems with finding information?
3
Information is often lacks STRUCTURE ASSOCIATION between the identifying information (i.e., labels and the actual information is not always obvious) and the data CONSISTENCY is not always present. E.g., 317-274-0185 (317)274-0185 3172740185 May later need to MANIPULATE data (filter, sorting, etc.)
4
Typical “Office” Applications Word Processing Spreadsheet Database Management System (DBMS)
5
Spreadsheets and DBMSes Columns (labels) Rows (“instance” or record) Intersection (value) Information often lacks STRUCTURE ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious CONSISTENCY is not always present. E.g., 317-274-0185 (317)274-0185 3172740185 May later need to MANIPULATE data (deeper search, sorting, etc.)
6
Spreadsheets Tables in MS Excel Information often lacks STRUCTURE ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious CONSISTENCY is not always present. E.g., 317-274-0185 (317)274-0185 3172740185 May later need to MANIPULATE data (deeper search, sorting, etc.)
7
DBMSes Tables in MS Access Table is one of many objects in a database Easier to associate tables than in a spreadsheet (i.e., vlookup) Tables have several unique properties we’ll discuss later Information often lacks STRUCTURE ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious CONSISTENCY is not always present. E.g., 317-274-0185 (317)274-0185 3172740185 May later need to MANIPULATE data (deeper search, sorting, etc.)
8
ERP Systems Centralized database eliminates the need to associated data located on separate systems Information often lacks STRUCTURE ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious CONSISTENCY is not always present. E.g., 317-274-0185 (317)274-0185 3172740185 May later need to MANIPULATE data (deeper search, sorting, etc.)
9
Data Quality: What is Dirty Data? It happens when the UPC code on a package doesn't match the item. Causes? Vendor-Unique product code and cost Retailer-Unique product code and price
10
Data Quality: What is Dirty Data? Potential Problems? Inventory Reorder Profit per unit Net profit Customer Satisfaction Repeat Business Angry Bloggers Solution: Same code for vendor and retailer Data Integrity: Wal-Mart's Dirty Secret
11
Extract, Transform, Load (ETL) From Computerworld QuickStudy
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.